GO Tools: Databases

agriGO

Platform
Online tool
Developer
Bioinformatics Center, China Agricultural University, Beijing, China.
Contact
Zhen Su
Publications
PMID:20435677
License
Free for academic use
GO data updates
weekly (or more frequently)
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers

Gene Ontology (GO), the de facto standard in gene functionality description, is used widely in functional annotation and enrichment analysis. Here, we introduce agriGO, an integrated web-based GO analysis toolkit for the agricultural community, using the advantages of our previous GO enrichment tool (EasyGO), to meet analysis demands from new technologies and research objectives. EasyGO is valuable for its proficiency, and has proved useful in uncovering biological knowledge in massive data sets from high-throughput experiments. For agriGO, the system architecture and website interface were redesigned to improve performance and accessibility. The supported organisms and gene identifiers were substantially expanded (including 38 agricultural species composed of 274 data types). The requirement on user input is more flexible, in that user-defined reference and annotation are accepted. Moreover, a new analysis approach using Gene Set Enrichment Analysis strategy and customizable features is provided. Four tools, SEA (Singular enrichment analysis), PAGE (Parametric Analysis of Gene set Enrichment), BLAST4ID (Transfer IDs by BLAST) and SEACOMPARE (Cross comparison of SEA), are integrated as a toolkit to meet different demands. We also provide a cross-comparison service so that different data sets can be compared and explored in a visualized way. Lastly, agriGO functions as a GO data repository with search and download functions; agriGO is publicly accessible at http://bioinfo.cau.edu.cn/agriGO/.

  • ontology or annotation browser
  • ontology or annotation visualization
  • database or data warehouse
  • statistical analysis
  • term enrichment
  • text mining

Tool listing last updated 02 June 2010

Bioconductor

Platform
Windows compatible Mac OS X compatible Linux compatible Unix compatible
Developer
BioConductor.
Contact
Bioconductor webmaster
License
Free for academic use
GO data updates
every three months (or more frequently)
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers
Open Source (OSI) Logo

Bioconductor provides tools for the analysis and comprehension of high-throughput (microarray, sequence, flow, etc.) genomic data. Bioconductor uses the R statistical programming language, and is open source and open development. There are more than core and user-contributed 400 packages. Bioconductor packages the GO ontology into our semi-annual release, with software tools to: query; join with diverse additional gene, microarray, and sequence annotations; incorporate GO into annotation, differential expression, and gene set enrichment work flows; and visualize.

  • ontology or annotation browser
  • ontology or annotation search engine
  • ontology or annotation visualization
  • database or data warehouse
  • software library
  • statistical analysis
  • term enrichment
  • text mining
  • other analysis
  • Flexible integration of GO into statistical analysis and comprehension of high-throughput genetic data.

Tool listing last updated 04 February 2011

FunSimMat

Platform
Online tool
Developer
Department of Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarbrucken, Germany.
Contact
FunSimMat Team
Publications
PMID:17932054
PMID:19923227
License
Free for academic use
GO data updates
every three months (or more frequently)
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers

FunSimMat is a comprehensive resource of semantic and functional similarity values. It allows ranking disease candidate proteins and searching for functional similarity values for proteins (extracted from UniProt), and protein families (Pfam, SMART). FunSimMat provides several different semantic and functional similarity measures for each protein pair using the Gene Ontology annotation from UniProtKB.

  • database or data warehouse
  • functional similarity
  • semantic similarity
  • disease gene candidate prioritization

Tool listing last updated 03 June 2010

Gene Ontology For Functional Analysis (GOFFA)

Platform
Windows compatible Mac OS X compatible Linux compatible Unix compatible
Developer
National Center for Toxicological Research (NCTR), Food and Drug Administration, Jefferson, Arkansas, USA.
Contact
Don Ding
Publications
PMID:17118145
License
Free for academic use
GO data updates
monthly (or more frequently)
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers

Gene Ontology For Functional Analysis (GOFFA) is a tool developed for ArrayTrack that takes a list of genes and identifies terms in Gene Ontology associated with those genes. GOFFA provides tools to view/access the following:
* GO term hierarchy
* Full listing of GO terms annotated with the genes associated with a given term
* Fisher's exact test p-value providing the probability of identifying that many genes for a given term by chance alone
* Relative enrichment factor (E-value) giving the enrichment of a GO term for genes in the submitted list relative to the frequency of genes assigned to that term from the full set of GOFFA annotated genes for a particular species

  • ontology or annotation browser
  • ontology or annotation search engine
  • ontology or annotation visualization
  • database or data warehouse
  • term enrichment
  • text mining

Tool listing last updated 26 September 2011

GeneMANIA

Platform
Online tool Windows compatible Mac OS X compatible Linux compatible Unix compatible
Developer
GeneMania team: Bader and Morris labs, University of Toronto, Toronto, Canada.
Contact
GeneMANIA team
Publications
PMID:20576703
PMID:20926419
License
Free for academic use
GO data updates
every three months (or more frequently)
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers
Open Source (OSI) Logo

GeneMANIA helps you predict the function of your favourite genes and gene sets.
GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional association data. Association data include protein and genetic interactions, pathways, co-expression, co-localization and protein domain similarity. You can use GeneMANIA to find new members of a pathway or complex, find additional genes you may have missed in your screen or find new genes with a specific function, such as protein kinases. Your question is defined by the set of genes you input.
If members of your gene list make up a protein complex, GeneMANIA will return more potential members of the protein complex. If you enter a gene list, GeneMANIA will return connections between your genes, within the selected datasets.
GeneMANIA also performs Gene Ontology term enrichment of the query list along with the returned gene list.
GeneMANIA is also accessible via a Cytoscape plugin, designed for power users.
GeneMANIA is actively developed at the University of Toronto, in the Donnelly Centre for Cellular and Biomolecular Research, in the labs of Gary Bader and Quaid Morris, with input from an independent scientific advisory board. GeneMANIA development was funded by Genome Canada, through the Ontario Genomics Institute (2007-OGI-TD-05).

  • database or data warehouse
  • software library
  • statistical analysis
  • term enrichment
  • other analysis
  • an interaction browser and GeneMANIA suggests annotations for genes based on Gene Ontology term enrichment of highly interacting genes with the gene of interest., GeneMANIA is also a gene recommendation system

Tool listing last updated 13 January 2011

GeneTools

Platform
Online tool
Developer
Norwegian University of Science and Technology, Trondheim, Norway.
Contact
Vidar Beisvag
Publications
PMID:17062145
License
Free for academic use
GO data updates
no fixed update schedule
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers

GeneTools is a collection of web-based tools that brings together information from a broad range of resources, and provides this in a manner particularly useful for genome-wide analyses. Today, the two main tools connected to this database are the NMC Annotation Database V2.0 and eGOn V2.0 (explore Gene Ontology).

The NMC Annotation Database V2.0 provides information from UniGene, EntrezGene, SwissProt and Gene Ontology (GO). Major features are:

  • Single search/Batch search, extraction of data for single or batches of genes.
  • Manage reporter lists: in folders and share selected lists with other users.
  • Manual GO Annotation: add your own Gene Ontology (GO) annotations to genes of interest.
  • Export: to Excel, text or XML format.

eGOn V2.0 facilitates interpretation of GO annotation. GO terms are retrieved in batch modus from EntrezGene and the GO database and displayed in the GO directed acyclic hierarchical graph (DAG). Essential features of eGOn V2.0 are:

  • Visualization: gene annotations are visualized in the GO DAG or as a table view. The granularity of the GO DAG can be edited freely by the user.
  • Filtering: GO annotations can be filtered on evidence codes.
  • Include user defined GO annotations: previously added to the Annotation database.
  • Statistical analysis: Several gene lists are analyzed simultaneously to compare the distribution of the annotated genes over the GO hierarchy.

Statistical tests are implemented to allow the user to compute GO annotation dissimilarity within or between gene lists.

  • Connection to Annotation database: Links to Annotation database gene and protein information are offered directly from the GO DAG or in exported data.
  • Export: GO DAG information, statistical results and gene and protein information can be exported in excel, text or XML format.
  • ontology or annotation browser
  • ontology or annotation search engine
  • ontology or annotation visualization
  • database or data warehouse
  • statistical analysis
  • term enrichment

Tool listing last updated 18 January 2011

GO Online SQL Environment (GOOSE)

Platform
Online tool Windows compatible Mac OS X compatible Linux compatible Unix compatible
Developer
GO Consortium.
Contact
GO Helpdesk
Publications
PMID:19033274
License
Free for academic use
GO data updates
daily
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers
Open Source (OSI) Logo

The GO Online SQL Environment (GOOSE) provides a direct interface to the GO database, allowing users to run custom queries without having to install a copy of the GO database locally.

  • database or data warehouse

Tool listing last updated 10 October 2010

go-db-perl

Platform
Windows compatible Mac OS X compatible Linux compatible Unix compatible
Developer
GO Consortium.
License
Free for academic use
GO data updates
not applicable (e.g. because users provide data)

go-db-perl extends the functionality of go-perl (on which it depends) with GO Database access functionality.
go-db-perl comes bundled with various scripts and a shell command line interface that can be used as standalone tools.
Installation is more involved than for go-perl; you will need a MySQL database plus the requisite DBI and DBD Perl modules. Full installation instructions are included in the download.
go-db-perl is in use both to drive AmiGO and internally within Ensembl.

  • database or data warehouse
  • software library

Tool listing last updated 10 October 2010

GOTaxExplorer

Platform
Online tool Windows compatible Mac OS X compatible Linux compatible Unix compatible
Developer
Department of Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarbrucken, Germany.
Contact
GOTax Team
Publications
PMID:17346342
License
Free for academic use
GO data updates
every three months (or more frequently)
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers

GOTaxExplorer presents a new approach to comparative genomics that integrates functional information and families with the taxonomic classification. It integrates UniProt, Gene Ontology, NCBI Taxonomy, Pfam and SMART in one database. GOTaxExplorer provides four different query types: selection of entity sets, comparison of sets of Pfam families, semantic comparison of sets of GO terms, functional comparison of sets of gene products.
This permits to select custom sets of GO terms, families or taxonomic groups. For example, it is possible to compare arbitrarily selected organisms or groups of organisms from the taxonomic tree on the basis of the functionality of their genes. Furthermore, it enables to determine the distribution of specific molecular functions or protein families in the taxonomy. The comparison of sets of GO terms allows to assess the semantic similarity of two different GO terms. The functional comparison of gene products makes it possible to identify functionally equivalent and functionally related gene products from two organisms on the basis of GO annotations and a semantic similarity measure for GO.

  • ontology or annotation search engine
  • ontology or annotation visualization
  • database or data warehouse
  • functional similarity
  • semantic similarity
  • other analysis
  • comparative genomics analysis

Tool listing last updated 03 June 2010

Onto-Compare

Platform
Online tool
Developer
Intelligent Systems and Bioinformatics Laboratory, Wayne State University, Detroit, Michigan, USA.
Contact
Dr. Sorin Draghici
Publications
PMID:12664686
PMID:15215428
License
Free for academic use
GO data updates
every three months (or more frequently)
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers

Microarrays are at the center of a revolution in biotechnology, allowing researchers to screen tens of thousands of genes simultaneously. Typically, they have been used in exploratory research to help formulate hypotheses. In most cases, this phase is followed by a more focused, hypothesis driven stage in which certain specific biological processes and pathways are thought to be involved. Since a single biological process can still involve hundreds of genes, microarrays are still the preferred approach as proven by the availability of focused arrays from several manufacturers. Since focused arrays from different manufacturers use different sets of genes, each array will represent any given regulatory pathway to a different extent. We argue that a functional analysis of the arrays available should be the most important criterion used in the array selection. We developed Onto-Compare as a database that can provide this functionality, based on the GO nomenclature.

  • ontology or annotation browser
  • ontology or annotation search engine
  • ontology or annotation visualization
  • database or data warehouse
  • other analysis
  • compare commercially available microarrays based on GO

Tool listing last updated 29 June 2010

Onto-Design

Platform
Online tool
Developer
Intelligent Systems and Bioinformatics Laboratory, Wayne State University, Detroit, Michigan, USA.
Contact
Dr. Sorin Draghici
Publications
PMID:15215428
License
Free for academic use
GO data updates
every three months (or more frequently)
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers

Many Laboratories chose to design and print their own microarrays. At present, the choice of the genes to include on a certain microarray is a very laborious process requiring a high level of expertise. Onto-Design database is able to assist the designers of custom microarrays by providing the means to select genes based on their experiment.

  • ontology or annotation browser
  • ontology or annotation search engine
  • ontology or annotation visualization
  • database or data warehouse
  • other analysis
  • design custom microarrays based on GO terms of interest

Tool listing last updated 29 June 2010

Onto-Express (OE)

Platform
Online tool
Developer
Intelligent Systems and Bioinformatics Laboratory, Wayne State University, Detroit, Michigan, USA.
Contact
Dr. Sorin Draghici
Publications
PMID:11829497
PMID:12620386
PMID:15215428
License
Free for academic use
GO data updates
every three months (or more frequently)
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers

The typical result of a microarray experiment is a list of tens or hundreds of genes found to be differentially regulated in the condition under study. Independently of the methods used to select these genes, the common task faced by any researcher is to translate these lists of genes into a better understanding of the biological phenomena involved. Currently, this is done through a tedious combination of searches through the literature and a number of public databases. We developed Onto-Express (OE) as a novel tool able to automatically translate such lists of differentially regulated genes into functional profiles characterizing the impact of the condition studied. OE constructs functional profiles (using Gene Ontology terms) for the following categories: biochemical function, biological process, cellular role, cellular component, molecular function and chromosome location. Statistical significance values are calculated for each category. We demonstrated the validity and the utility of this comprehensive global analysis of gene function by analyzing two breast cancer data sets from two separate laboratories. OE was able to identify correctly all biological processes postulated by the original authors, as well as discover novel relevant mechanisms (Draghici et.al, Genomics, 81(2), 2003). Other results obtained with Onto-Express can be found in Khatri et.al., Genomics. 79(2), 2002.

  • ontology or annotation browser
  • ontology or annotation search engine
  • ontology or annotation visualization
  • database or data warehouse
  • statistical analysis
  • term enrichment
  • other analysis
  • custom level of abstraction of the Gene Ontology

Tool listing last updated 29 June 2010

Onto-Miner (OM)

Platform
Online tool
Developer
Intelligent Systems and Bioinformatics Laboratory, Wayne State University, Detroit, Michigan, USA.
Contact
Dr. Sorin Draghici
Publications
PMID:15215428
PMID:17584796
License
Free for academic use
GO data updates
every three months (or more frequently)
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers

Onto-Miner (OM) provide a single and convenient interface that allow the user to interrogate our databases regarding annotations of known genes. OM will return all known information about a given list of genes. Advantages or OM include the fact it allows queries with multiple genes and allows for scripting. This is unlike GenBank which uses a single gene navigation process.

  • ontology or annotation search engine
  • database or data warehouse
  • other analysis
  • scripted search of the Onto-Tools database for gene annotations

Tool listing last updated 29 June 2010

Onto-Translate

Platform
Online tool
Developer
Intelligent Systems and Bioinformatics Laboratory, Wayne State University, Detroit, Michigan, USA.
Contact
Dr. Sorin Draghici
Publications
PMID:15215428
License
Free for academic use
GO data updates
every three months (or more frequently)
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers

In the annotation world, the same piece of information can be stored and viewed differently across different databases. For instance, more than one Affymetrix probe ID can refer to the same GenBank sequence (accession number) and more than one nucleotide sequence from GenBank can be grouped in a single UniGene cluster. The result of Onto-Express depends on whether the input list contains Affymetrix probe IDs, GenBank accession numbers or UniGene cluster IDs. The user has to be aware of relations between the different forms of the data in order to interpret correctly the results. Even if the user is aware of the relationships and knows how to convert them, most existing tools allow conversions of individual genes. Onto-Translate is a tool that allows the user to perform easily such translations.

  • database or data warehouse
  • other analysis
  • Affymetrix probe IDs, etc., translate GO terms into other identifiers like GenBank accession number, Uniprot IDs

Tool listing last updated 29 June 2010

QuickGO

Platform
Online tool Windows compatible Mac OS X compatible Linux compatible Unix compatible
Developer
UniProtKB-GOA, European Bioinformatics Institute, Cambridge, UK.
Contact
Tony Sawford
Publications
PMID:19744993
PMID:20157483
License
Free for academic use
GO data updates
daily
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers
Open Source (OSI) Logo

QuickGO is a web-based tool that allows easy browsing of the Gene Ontology (GO) and all associated electronic and manual GO annotations provided by the GO Consortium annotation groups. QuickGO offers a range of facilities including bulk downloads of GO annotation data which can be extensively filtered by a range of different parameters and GO slim set generation.

  • ontology or annotation browser
  • ontology or annotation search engine
  • ontology or annotation visualization
  • database or data warehouse
  • slimmer-type tool

Tool listing last updated 21 June 2010

ToppGene Suite

Platform
Online tool
Developer
Cincinnati Childrens Hospital Medical Center, Cincinnati, Ohio, USA.
Contact
Anil Jegga
Publications
PMID:19465376
License
Free for academic use
GO data updates
no fixed update schedule
GO data used
  • terms
  • definitions and comments
  • synonyms
  • cross-references
  • relationships
  • subsets or GO slims
  • gene product data
  • taxon
  • evidence codes
  • references
  • qualifiers

ToppGene Suite is a one-stop portal for (i) gene list functional enrichment, (ii) candidate gene prioritization using either functional annotations or network analysis and (iii) identification and prioritization of novel disease candidate genes in the interactome. Functional annotation-based disease candidate gene prioritization uses a fuzzy-based similarity measure to compute the similarity between any two genes based on semantic annotations. The similarity scores from individual features are combined into an overall score using statistical meta-analysis. A P-value of each annotation of a test gene is derived by random sampling of the whole genome.

  • database or data warehouse
  • slimmer-type tool
  • term enrichment
  • protein interactions
  • functional similarity
  • other analysis
  • disease gene ranking based on functional annotation similarity

Tool listing last updated 21 January 2011

Db for Dummies!

Platform
Windows compatible
Developer
Custom Microsystems.
License
Free for academic use

Db for Dummies! is a small database that imports the Generic GO Slim. It allows data to be viewed in a tree.

  • database or data warehouse

Tool listing submitted before 2009; tool may be unsupported or inactive.

Genes2Diseases

Platform
Online tool
Developers
European Molecular Biology Laboratory.
Bork group, European Molecular Biology Laboratory.
License
Free for academic use

Genes2Diseases is a database of candidate genes for mapped inherited human diseases. The database is generated using an analysis of relations between phenotypic features and chemical objects, and from chemical objects to Gene Ontology protein function terms, based on the whole MEDLINE and RefSeq databases. Can be used to view all GO terms associated with a particular genetically inherited disease.

  • database or data warehouse

Tool listing submitted before 2009; tool may be unsupported or inactive.