Files in this directory are used for carrying out checks for violations between the implicit species specificity of GO terms and the species of origin of the annotated gene products. * report-*.txt - per database report of all taxon violations (see below) * ncbi_taxon_slim.obo - The NCBI taxonomy slim that we use in-house for trigger editing purposes. * taxon_go_triggers.obo - The list of GO terms, and the taxonomic groupings that we suggest that they should be used for. * taxon_union_terms.obo - The file of in-house taxonomic groupings that are unions of two or more NCBI taxonomy groupings. * taxon_union_materialized.obo - taxon_union_terms.obo, with is_a relationship materialized * taxon_go_imports.obo - The file that can be called by OBO-Edit to automatically load all of the other files for editing or browsing. * taxon_violations.txt - The file listing annotations that currently show inconsistencies (DEPRECATED) See http://gocwiki.geneontology.org/index.php/Taxon_Main_Page See the Makefile for the scripts used to generate the files. Report format: The columns are 1. GO ID (col 5 in original GAF). E.g. GO:0000311 2. GO Label. E.g. plastid large ribosomal subunit 3. Relationship between GO ID and Taxon ID (e.g. only_in_taxon, never_in_taxon) 4. Taxon ID (e.g. NCBITaxon:33090) 5. Taxon Label (e.g. Viridiplantae) 6. Error class (for now always INVALID) 7. Annotated Taxon ID (e.g. taxon:5833) 8. Annotated Object DB (e.g. GeneDB_Pfalciparum) 9. Annotated Object Local ID 10. PMID from GAF 11. Evidence code from GAF Cols 1-5 show the taxonomic restriction that was violated (e.g. plastid large ribosomal subunit only_in_taxon viridiplantae) Cols 6-11 give the important information from the GAF line. ---------- ## What follows below is the procedure that was outlined by Chris in October 2012, but on Oct. 25th 2012 we decided to use the perl script assign-relation-ids.pl rather than manually entering GOTAX IDs. (See http://wiki.geneontology.org/index.php/Ontology_meeting_2012-10-25#FOLLOW-UP:_Taxon_triggers) We still want to keep a copy of this file because it contains useful information; just ignore the section about requesting/using ID ranges. Prefix: dc: Prefix: rdfs: Prefix: idrange:= Prefix: allocatedto:= Prefix: iddigits:= Prefix: idprefix:= Prefix: idsfor:= Prefix: comment:= Ontology: Annotations: idprefix: "http://purl.obolibrary.org/obo/GOTAX_", iddigits: 7, idsfor: "GOTAX", dc:title "GO taxon constraints editor ID ranges and protocol documentation"^^xsd:string, dc:creator "Chris Mungall"^^xsd:string, dc:description " ## Protocol for editing GOTAX In order to add entries with a GOTAX ID you MUST have an idrange specified in this file. Request an idrange from one of the editors listed in this document. Ranges are allocated in blocks. An editor may have multiple blocks. ## Adding new constraints Every constraint must have a unique GOTAX ID. IDs must never be resued. Currently there is no automated checking of this, so it relies on editors checking themselves. ## Removing constraints A constraint can never be removed. Previously the protocol was to comment out using obo comments. For example: ``` ![Term] !id: GO:0007595 !name: lactation !relationship: only_in_taxon NCBITaxon:40674 {id=GOTAX:0000104} ! Mammalia ``` Notw the protocol is to ALSO put the uncommented version in taxon_go_triggers_obsoleted.obo. In addition, the commented stanza should remain in the main obo file as a clear visual indication to editors that the constraint is obsoleted, and to avoid GOTAX id reuse. ## Syntax Follow previous examples. Make sure you don't introduce additional characters or whitespace, or remove any. This can break parsing. ## Guidelines Every constraint MUST have a GOTAX id. Every constraint SHOULD have some piece of provenance - e.g. a PMID or indication of where the constraint comes from. These are added as trailing qualifiers, which are axiom annotations in OWL. ## Possible future changes Currently the GOTAX file is a hand-edited obo bridge file. The x-taxon.owl file is automatically derived from this obo file. In future, the source file may become OWL. ## Protocol for adding new idranges This file can be edited in Protege or a text editor "^^xsd:string AnnotationProperty: allocatedto: AnnotationProperty: idprefix: AnnotationProperty: iddigits: AnnotationProperty: idsfor: AnnotationProperty: comment: AnnotationProperty: dc:title AnnotationProperty: dc:creator AnnotationProperty: dc:description Datatype: idrange:1 Annotations: allocatedto: "GOC:ecd" EquivalentTo: xsd:integer[> 1 , <= 1000] Datatype: idrange:2 Annotations: allocatedto: "GOC:tb" EquivalentTo: xsd:integer[> 1001 , <= 2000] Datatype: idrange:3 Annotations: allocatedto: "GOC:pr" EquivalentTo: xsd:integer[> 2001 , <= 3000] Datatype: idrange:4 Annotations: allocatedto: "GOC:dph" EquivalentTo: xsd:integer[> 3001 , <= 4000] Datatype: idrange:5 Annotations: allocatedto: "GOC:jl" EquivalentTo: xsd:integer[> 4001 , <= 5000]