GO Tools: Databases

agriGO*
Bioconductor*
FunSimMat*
Gene Ontology For Functional Analysis*
GeneMANIA*
GeneTools*
GO Online SQL Environment*
go-db-perl*
GOTaxExplorer*
Onto-Compare*
Onto-Design*
Onto-Express*
Onto-Miner*
Onto-Translate*
QuickGO*
ToppGene Suite*
Db for Dummies!
Genes2Diseases

agriGO

Platform

Developer

Bioinformatics Center, China Agricultural University, Beijing, China.

Contact

Zhen Su

Publications

PMID:20435677

License

Free for academic use

GO data updates

weekly (or more frequently)

GO data used

terms
definitions and comments
synonyms
cross-references
relationships
subsets or GO slims
gene product data
taxon
evidence codes
references
qualifiers

Gene Ontology (GO), the de facto standard in gene functionality description, is used widely in functional annotation and enrichment analysis. Here, we introduce agriGO, an integrated web-based GO analysis toolkit for the agricultural community, using the advantages of our previous GO enrichment tool (EasyGO), to meet analysis demands from new technologies and research objectives. EasyGO is valuable for its proficiency, and has proved useful in uncovering biological knowledge in massive data sets from high-throughput experiments. For agriGO, the system architecture and website interface were redesigned to improve performance and accessibility. The supported organisms and gene identifiers were substantially expanded (including 38 agricultural species composed of 274 data types). The requirement on user input is more flexible, in that user-defined reference and annotation are accepted. Moreover, a new analysis approach using Gene Set Enrichment Analysis strategy and customizable features is provided. Four tools, SEA (Singular enrichment analysis), PAGE (Parametric Analysis of Gene set Enrichment), BLAST4ID (Transfer IDs by BLAST) and SEACOMPARE (Cross comparison of SEA), are integrated as a toolkit to meet different demands. We also provide a cross-comparison service so that different data sets can be compared and explored in a visualized way. Lastly, agriGO functions as a GO data repository with search and download functions; agriGO is publicly accessible at http://bioinfo.cau.edu.cn/agriGO/.

ontology or annotation browser
ontology or annotation visualization
database or data warehouse
statistical analysis
term enrichment
text mining

Tool listing last updated 02 June 2010

Bioconductor

Platform

Developer

BioConductor.

Contact

Bioconductor webmaster

License

Free for academic use

GO data updates

every three months (or more frequently)

GO data used

terms
definitions and comments
synonyms
cross-references
relationships
subsets or GO slims
gene product data
taxon
evidence codes
references
qualifiers

Bioconductor provides tools for the analysis and comprehension of high-throughput (microarray, sequence, flow, etc.) genomic data. Bioconductor uses the R statistical programming language, and is open source and open development. There are more than core and user-contributed 400 packages. Bioconductor packages the GO ontology into our semi-annual release, with software tools to: query; join with diverse additional gene, microarray, and sequence annotations; incorporate GO into annotation, differential expression, and gene set enrichment work flows; and visualize.

ontology or annotation browser
ontology or annotation search engine
ontology or annotation visualization
database or data warehouse
software library
statistical analysis
term enrichment
text mining
other analysis
Flexible integration of GO into statistical analysis and comprehension of high-throughput genetic data.

Tool listing last updated 04 February 2011

FunSimMat

Platform

Developer

Department of Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarbrucken, Germany.

Contact

FunSimMat Team

Publications

PMID:17932054

PMID:19923227

License

Free for academic use

GO data updates

every three months (or more frequently)

GO data used

terms
definitions and comments
synonyms
cross-references
relationships
subsets or GO slims
gene product data
taxon
evidence codes
references
qualifiers

FunSimMat is a comprehensive resource of semantic and functional similarity values. It allows ranking disease candidate proteins and searching for functional similarity values for proteins (extracted from UniProt), and protein families (Pfam, SMART). FunSimMat provides several different semantic and functional similarity measures for each protein pair using the Gene Ontology annotation from UniProtKB.

database or data warehouse
functional similarity
semantic similarity
disease gene candidate prioritization

Tool listing last updated 03 June 2010

Gene Ontology For Functional Analysis (GOFFA)

Platform

Developer

National Center for Toxicological Research (NCTR), Food and Drug Administration, Jefferson, Arkansas, USA.

Contact

Don Ding

Publications

PMID:17118145

License

Free for academic use

GO data updates

monthly (or more frequently)

GO data used

terms
definitions and comments
synonyms
cross-references
relationships
subsets or GO slims
gene product data
taxon
evidence codes
references
qualifiers

Gene Ontology For Functional Analysis (GOFFA) is a tool developed for ArrayTrack that takes a list of genes and identifies terms in Gene Ontology associated with those genes. GOFFA provides tools to view/access the following:
* GO term hierarchy
* Full listing of GO terms annotated with the genes associated with a given term
* Fisher's exact test p-value providing the probability of identifying that many genes for a given term by chance alone
* Relative enrichment factor (E-value) giving the enrichment of a GO term for genes in the submitted list relative to the frequency of genes assigned to that term from the full set of GOFFA annotated genes for a particular species

ontology or annotation browser
ontology or annotation search engine
ontology or annotation visualization
database or data warehouse
term enrichment
text mining

Tool listing last updated 26 September 2011

GeneMANIA

Platform

Developer

GeneMania team: Bader and Morris labs, University of Toronto, Toronto, Canada.

Contact

GeneMANIA team

Publications

PMID:20576703

PMID:20926419

License

Free for academic use

GO data updates

every three months (or more frequently)

GO data used

terms
definitions and comments
synonyms
cross-references
relationships
subsets or GO slims
gene product data
taxon
evidence codes
references
qualifiers

GeneMANIA helps you predict the function of your favourite genes and gene sets.
GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional association data. Association data include protein and genetic interactions, pathways, co-expression, co-localization and protein domain similarity. You can use GeneMANIA to find new members of a pathway or complex, find additional genes you may have missed in your screen or find new genes with a specific function, such as protein kinases. Your question is defined by the set of genes you input.
If members of your gene list make up a protein complex, GeneMANIA will return more potential members of the protein complex. If you enter a gene list, GeneMANIA will return connections between your genes, within the selected datasets.
GeneMANIA also performs Gene Ontology term enrichment of the query list along with the returned gene list.
GeneMANIA is also accessible via a Cytoscape plugin, designed for power users.
GeneMANIA is actively developed at the University of Toronto, in the Donnelly Centre for Cellular and Biomolecular Research, in the labs of Gary Bader and Quaid Morris, with input from an independent scientific advisory board. GeneMANIA development was funded by Genome Canada, through the Ontario Genomics Institute (2007-OGI-TD-05).

database or data warehouse
software library
statistical analysis
term enrichment
other analysis
an interaction browser and GeneMANIA suggests annotations for genes based on Gene Ontology term enrichment of highly interacting genes with the gene of interest., GeneMANIA is also a gene recommendation system

Tool listing last updated 13 January 2011

GeneTools

Platform

Developer

Norwegian University of Science and Technology, Trondheim, Norway.

Contact

Vidar Beisvag

Publications

PMID:17062145

License

Free for academic use

GO data updates

no fixed update schedule

GO data used

terms
definitions and comments
synonyms
cross-references
relationships
subsets or GO slims
gene product data
taxon
evidence codes
references
qualifiers

GeneTools is a collection of web-based tools that brings together information from a broad range of resources, and provides this in a manner particularly useful for genome-wide analyses. Today, the two main tools connected to this database are the NMC Annotation Database V2.0 and eGOn V2.0 (explore Gene Ontology).

The NMC Annotation Database V2.0 provides information from UniGene, EntrezGene, SwissProt and Gene Ontology (GO). Major features are:

Single search/Batch search, extraction of data for single or batches of genes.
Manage reporter lists: in folders and share selected lists with other users.
Manual GO Annotation: add your own Gene Ontology (GO) annotations to genes of interest.
Export: to Excel, text or XML format.

eGOn V2.0 facilitates interpretation of GO annotation. GO terms are retrieved in batch modus from EntrezGene and the GO database and displayed in the GO directed acyclic hierarchical graph (DAG). Essential features of eGOn V2.0 are:

Visualization: gene annotations are visualized in the GO DAG or as a table view. The granularity of the GO DAG can be edited freely by the user.
Filtering: GO annotations can be filtered on evidence codes.
Include user defined GO annotations: previously added to the Annotation database.
Statistical analysis: Several gene lists are analyzed simultaneously to compare the distribution of the annotated genes over the GO hierarchy.

Statistical tests are implemented to allow the user to compute GO annotation dissimilarity within or between gene lists.

Connection to Annotation database: Links to Annotation database gene and protein information are offered directly from the GO DAG or in exported data.
Export: GO DAG information, statistical results and gene and protein information can be exported in excel, text or XML format.

ontology or annotation browser
ontology or annotation search engine
ontology or annotation visualization
database or data warehouse
statistical analysis
term enrichment

Tool listing last updated 18 January 2011

GO Online SQL Environment (GOOSE)

Platform

Developer

GO Consortium.

Contact

GO Helpdesk

Publications

PMID:19033274

License

Free for academic use

GO data updates

daily

GO data used

terms
definitions and comments
synonyms
cross-references
relationships
subsets or GO slims
gene product data
taxon
evidence codes
references
qualifiers

The GO Online SQL Environment (GOOSE) provides a direct interface to the GO database, allowing users to run custom queries without having to install a copy of the GO database locally.

database or data warehouse

Tool listing last updated 10 October 2010

go-db-perl

Platform
Developer: GO Consortium.
License: Free for academic use
GO data updates: not applicable (e.g. because users provide data)

go-db-perl extends the functionality of go-perl (on which it depends) with GO Database access functionality.
go-db-perl comes bundled with various scripts and a shell command line interface that can be used as standalone tools.
Installation is more involved than for go-perl; you will need a MySQL database plus the requisite DBI and DBD Perl modules. Full installation instructions are included in the download.
go-db-perl is in use both to drive AmiGO and internally within Ensembl.

database or data warehouse
software library

Tool listing last updated 10 October 2010

GOTaxExplorer

Platform

Developer

Department of Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarbrucken, Germany.

Contact

GOTax Team

Publications

PMID:17346342

License

Free for academic use

GO data updates

every three months (or more frequently)

GO data used

terms
definitions and comments
synonyms
cross-references
relationships
subsets or GO slims
gene product data
taxon
evidence codes
references
qualifiers

GOTaxExplorer presents a new approach to comparative genomics that integrates functional information and families with the taxonomic classification. It integrates UniProt, Gene Ontology, NCBI Taxonomy, Pfam and SMART in one database. GOTaxExplorer provides four different query types: selection of entity sets, comparison of sets of Pfam families, semantic comparison of sets of GO terms, functional comparison of sets of gene products.
This permits to select custom sets of GO terms, families or taxonomic groups. For example, it is possible to compare arbitrarily selected organisms or groups of organisms from the taxonomic tree on the basis of the functionality of their genes. Furthermore, it enables to determine the distribution of specific molecular functions or protein families in the taxonomy. The comparison of sets of GO terms allows to assess the semantic similarity of two different GO terms. The functional comparison of gene products makes it possible to identify functionally equivalent and functionally related gene products from two organisms on the basis of GO annotations and a semantic similarity measure for GO.

ontology or annotation search engine
ontology or annotation visualization
database or data warehouse
functional similarity
semantic similarity
other analysis
comparative genomics analysis

Tool listing last updated 03 June 2010

Onto-Compare

Platform

Developer

Intelligent Systems and Bioinformatics Laboratory, Wayne State University, Detroit, Michigan, USA.

Contact

Dr. Sorin Draghici

Publications

PMID:12664686

PMID:15215428

License

Free for academic use

GO data updates

every three months (or more frequently)

GO data used

terms
definitions and comments
synonyms
cross-references
relationships
subsets or GO slims
gene product data
taxon
evidence codes
references
qualifiers

Microarrays are at the center of a revolution in biotechnology, allowing researchers to screen tens of thousands of genes simultaneously. Typically, they have been used in exploratory research to help formulate hypotheses. In most cases, this phase is followed by a more focused, hypothesis driven stage in which certain specific biological processes and pathways are thought to be involved. Since a single biological process can still involve hundreds of genes, microarrays are still the preferred approach as proven by the availability of focused arrays from several manufacturers. Since focused arrays from different manufacturers use different sets of genes, each array will represent any given regulatory pathway to a different extent. We argue that a functional analysis of the arrays available should be the most important criterion used in the array selection. We developed Onto-Compare as a database that can provide this functionality, based on the GO nomenclature.

ontology or annotation browser
ontology or annotation search engine
ontology or annotation visualization
database or data warehouse
other analysis
compare commercially available microarrays based on GO

Tool listing last updated 29 June 2010

Onto-Design

Platform

Developer

Intelligent Systems and Bioinformatics Laboratory, Wayne State University, Detroit, Michigan, USA.

Contact

Dr. Sorin Draghici

Publications

PMID:15215428

License

Free for academic use

GO data updates

every three months (or more frequently)

GO data used

terms
definitions and comments
synonyms
cross-references
relationships
subsets or GO slims
gene product data
taxon
evidence codes
references
qualifiers

Many Laboratories chose to design and print their own microarrays. At present, the choice of the genes to include on a certain microarray is a very laborious process requiring a high level of expertise. Onto-Design database is able to assist the designers of custom microarrays by providing the means to select genes based on their experiment.

ontology or annotation browser
ontology or annotation search engine
ontology or annotation visualization
database or data warehouse
other analysis
design custom microarrays based on GO terms of interest

Tool listing last updated 29 June 2010

Onto-Express (OE)

Platform

Developer

Intelligent Systems and Bioinformatics Laboratory, Wayne State University, Detroit, Michigan, USA.

Contact

Publications

License

Free for academic use

GO data updates

every three months (or more frequently)

GO data used

terms
definitions and comments
synonyms
cross-references
relationships
subsets or GO slims
gene product data
taxon
evidence codes
references
qualifiers

The typical result of a microarray experiment is a list of tens or hundreds of genes found to be differentially regulated in the condition under study. Independently of the methods used to select these genes, the common task faced by any researcher is to translate these lists of genes into a better understanding of the biological phenomena involved. Currently, this is done through a tedious combination of searches through the literature and a number of public databases. We developed Onto-Express (OE) as a novel tool able to automatically translate such lists of differentially regulated genes into functional profiles characterizing the impact of the condition studied. OE constructs functional profiles (using Gene Ontology terms) for the following categories: biochemical function, biological process, cellular role, cellular component, molecular function and chromosome location. Statistical significance values are calculated for each category. We demonstrated the validity and the utility of this comprehensive global analysis of gene function by analyzing two breast cancer data sets from two separate laboratories. OE was able to identify correctly all biological processes postulated by the original authors, as well as discover novel relevant mechanisms (Draghici et.al, Genomics, 81(2), 2003). Other results obtained with Onto-Express can be found in Khatri et.al., Genomics. 79(2), 2002.

ontology or annotation browser
ontology or annotation search engine
ontology or annotation visualization
database or data warehouse
statistical analysis
term enrichment
other analysis
custom level of abstraction of the Gene Ontology

Tool listing last updated 29 June 2010

Onto-Miner (OM)

Platform

Developer

Intelligent Systems and Bioinformatics Laboratory, Wayne State University, Detroit, Michigan, USA.

Contact

Dr. Sorin Draghici

Publications

PMID:15215428

PMID:17584796

License

Free for academic use

GO data updates

every three months (or more frequently)

GO data used

terms
definitions and comments
synonyms
cross-references
relationships
subsets or GO slims
gene product data
taxon
evidence codes
references
qualifiers

Onto-Miner (OM) provide a single and convenient interface that allow the user to interrogate our databases regarding annotations of known genes. OM will return all known information about a given list of genes. Advantages or OM include the fact it allows queries with multiple genes and allows for scripting. This is unlike GenBank which uses a single gene navigation process.

ontology or annotation search engine
database or data warehouse
other analysis
scripted search of the Onto-Tools database for gene annotations

Tool listing last updated 29 June 2010

Onto-Translate

Platform

Developer

Intelligent Systems and Bioinformatics Laboratory, Wayne State University, Detroit, Michigan, USA.

Contact

Dr. Sorin Draghici

Publications

PMID:15215428

License

Free for academic use

GO data updates

every three months (or more frequently)

GO data used

terms
definitions and comments
synonyms
cross-references
relationships
subsets or GO slims
gene product data
taxon
evidence codes
references
qualifiers

In the annotation world, the same piece of information can be stored and viewed differently across different databases. For instance, more than one Affymetrix probe ID can refer to the same GenBank sequence (accession number) and more than one nucleotide sequence from GenBank can be grouped in a single UniGene cluster. The result of Onto-Express depends on whether the input list contains Affymetrix probe IDs, GenBank accession numbers or UniGene cluster IDs. The user has to be aware of relations between the different forms of the data in order to interpret correctly the results. Even if the user is aware of the relationships and knows how to convert them, most existing tools allow conversions of individual genes. Onto-Translate is a tool that allows the user to perform easily such translations.

database or data warehouse
other analysis
Affymetrix probe IDs, etc., translate GO terms into other identifiers like GenBank accession number, Uniprot IDs

Tool listing last updated 29 June 2010

QuickGO

Platform

Developer

UniProtKB-GOA, European Bioinformatics Institute, Cambridge, UK.

Contact

Tony Sawford

Publications

PMID:19744993

PMID:20157483

License

Free for academic use

GO data updates

daily

GO data used

terms
definitions and comments
synonyms
cross-references
relationships
subsets or GO slims
gene product data
taxon
evidence codes
references
qualifiers

QuickGO is a web-based tool that allows easy browsing of the Gene Ontology (GO) and all associated electronic and manual GO annotations provided by the GO Consortium annotation groups. QuickGO offers a range of facilities including bulk downloads of GO annotation data which can be extensively filtered by a range of different parameters and GO slim set generation.

ontology or annotation browser
ontology or annotation search engine
ontology or annotation visualization
database or data warehouse
slimmer-type tool

Tool listing last updated 21 June 2010

ToppGene Suite

Platform

Developer

Cincinnati Childrens Hospital Medical Center, Cincinnati, Ohio, USA.

Contact

Anil Jegga

Publications

PMID:19465376

License

Free for academic use

GO data updates

no fixed update schedule

GO data used

terms
definitions and comments
synonyms
cross-references
relationships
subsets or GO slims
gene product data
taxon
evidence codes
references
qualifiers

ToppGene Suite is a one-stop portal for (i) gene list functional enrichment, (ii) candidate gene prioritization using either functional annotations or network analysis and (iii) identification and prioritization of novel disease candidate genes in the interactome. Functional annotation-based disease candidate gene prioritization uses a fuzzy-based similarity measure to compute the similarity between any two genes based on semantic annotations. The similarity scores from individual features are combined into an overall score using statistical meta-analysis. A P-value of each annotation of a test gene is derived by random sampling of the whole genome.

database or data warehouse
slimmer-type tool
term enrichment
protein interactions
functional similarity
other analysis
disease gene ranking based on functional annotation similarity

Tool listing last updated 21 January 2011

Db for Dummies!

Platform
Developer: Custom Microsystems.
License: Free for academic use

Db for Dummies! is a small database that imports the Generic GO Slim. It allows data to be viewed in a tree.

database or data warehouse

Tool listing submitted before 2009; tool may be unsupported or inactive.

Genes2Diseases

Platform
Developers: European Molecular Biology Laboratory.; Bork group, European Molecular Biology Laboratory.
License: Free for academic use

Genes2Diseases is a database of candidate genes for mapped inherited human diseases. The database is generated using an analysis of relations between phenotypic features and chemical objects, and from chemical objects to Gene Ontology protein function terms, based on the whole MEDLINE and RefSeq databases. Can be used to view all GO terms associated with a particular genetically inherited disease.

database or data warehouse

Tool listing submitted before 2009; tool may be unsupported or inactive.