GO Ontology Relations

The following page documents the relations used in the filtered GO ontology. For information on how relations are represented in OBO format, see the OBO File Format Guide. For relations in the full GO file, which includes inter-ontology links and extra data, see the extended GO ontology relations documentation.

The occurs in relationship

occurs in relationships are only used in the extended GO file at present.

The occurs in relation is used to specify the location in which a biological process or a molecular function takes place. Whereas part of and has part are used to link two entities (cellular components) or two occurrents (functions or processes), occurs in is used for linking an occurrent—a function or process—to a location. As with part of and has part, a relationship is only created if the process always occurs in that location, i.e. process A necessarily occurs in component B. If process A occurs, component B must exist; however, if component B exists, we cannot say for certain that process A occurs.

Many of the process terms linked by occurs in relations mention the location in the term name.

diagram to illustrate necessarily occurs

i.e. all process A occur in B; some B are the location of occurrence of A.

For example:

mitochondrial DNA replication necessarily occurs in mitochondrion

mitochondrial DNA replication necessarily occurs in mitochondrion: all mitochondrial DNA replication takes place in some mitochondria; some mitochondria are the location of mitochondrial DNA replication.

Reasoning over occurs in

occurs in relation composes with is aoccurs in

If A occurs in B, and B is a C, A occurs in C. For example:

A occurs in B is a C therefore A occurs in C
occurs in plus is a = occurs in

mitochondrial DNA replication occurs in mitochondrion, and mitochondrion is a intracellular membrane-bounded organelle
therefore mitochondrial DNA replication occurs in intracellular membrane-bounded organelle

is a relation composes with occurs inoccurs in

If the order of the relationships is reversed, the result is the same; if A is a B, and B occurs in C, A occurs in C.

A is a B occurs in C therefore A occurs in C

For example:

is a plus occurs in = occurs in

mitochondrial DNA repair is a mitochondrial DNA metabolic process and mitochondrial DNA metabolic process occurs in mitochondrion
therefore mitochondrial DNA repair occurs in mitochondrion

occurs in relation composes with part ofoccurs in

A occurs in B is a C therefore A occurs in C

For example:

occurs in plus part of = occurs in

mitochondrial DNA replication occurs in mitochondrion and mitochondrion part of cell
therefore mitochondrial DNA replication occurs in mitochondrion

part of relation composes with occurs inoccurs in

A is a B occurs in C therefore A occurs in C

For example:

part of plus occurs in = occrus in

transcription elongation from mitochondrial promoter part of termination of mitochondrial transcription and termination of mitochondrial transcription occurs in mitochondrion
therefore transcription elongation from mitochondrial promoter occurs in mitochondrion

occurs in relation composes with has part → no inference

A occurs in B is a C therefore A occurs in C

has part relation composes with occurs inpartially occurs in

A is a B occurs in C therefore A occurs in C

For example:

is a plus has part = has part

has part and occurs in
therefore partially occurs in

has part AND occurs in

A is a B occurs in C therefore A occurs in C

This situation is where the term B has both a has part child and an occurs in child. Process A is a necessary subprocess of B, so we can infer that because B occurs in C, A must occur in C.

occurs in relation composes with regulatesINVALID

The subject of a regulates relationship—the activity that does the regulating—must be an occurrent, not an entity; therefore the object of an occurs in relationship, which must always be an entity, cannot be the subject of a regulates relationship. The same applies for positively regulates and negatively regulates.

A regulates B is a C therefore A occurs in C

regulates relation composes with occurs in???

We cannot make an inference

A is a B regulates C therefore A occurs in C

Back to top

Gene Product Relations

In addition to specifying relations between terms, we can also name the relationships between gene products and terms. Informally, we say "gene product X is annotated to term Y", but it can be stated more precisely using the relations localizes to and has function in.

Note that these relations are only used to describe the connection between a gene product and an ontology term.

Back to top

The localizes to relation

localizes to describes the relationship of a gene product to a cellular component; for example: gene product X localizes to cell membrane; gene product Y localizes to phosphopyruvate hydratase complex

localizes to is only used for terms in the cellular component ontology.

Reasoning over localizes to

Just as we can infer relationships between terms, we can also combine the relationship between a gene product and a term with those of the term to its ancestors. As with inference of relations between terms, we proceed iteratively through the ontology, following the direction of the relationships.

localizes to relation composes with is alocalizes to

localizes to is transitive over is a, so we can infer that any gene product that localizes to a certain cellular component also localizes to all is a ancestors of the term. For example, knowing a gene product that localizes to a mitochondrion, which is an organelle, allows us to infer that it localizes to an organelle.

localizes to is transitive over is a

localizes to relation composes with part oflocalizes to

localizes to is also transitive over part of; a gene product that localizes to a cellular component also localizes to all is a ancestors of the term. In the example below, mitochondrion is part of the cytoplasm, so the gene product localizes to the cytoplasm.

localizes to is transitive over part of

localizes to relation composes with localizes toINVALID

Note that because the subject of the localizes to relationship (i.e. the X in the statement ‘X localizes to Y’) must be a gene product and the object (Y) must be a cellular component term, the combination of two localizes to relationships would not occur with the set of relationships currently used in the Gene Ontology. Similarly, localizes to would not be combined with has function in relationships would not occur with the set of relationships currently used in the Gene Ontology.

localizes to relation composes with regulatesINVALID

Cellular component terms represent locations, and as such cannot have a regulates relationship with other terms.

Back to top

The has function in relation

A gene product has function in a biological process or a molecular function. For example:

  • xxx has function in homeostasis
  • xxx has function in regulation of cell size
  • xxx has function in UDP-glucose 6-dehydrogenase activity

has function in is only used for terms in the biological process and molecular function ontologies.

Reasoning over has function in

has function in relation composes with is ahas function in

has function in is transitive over is a, and a gene product that has function in a biological process or molecular function has function in all the is a ancestors of that process or function. In the example, mitotic telophase is a cell cycle phase, so we can infer that a gene product annotated to mitotic telophase has function in a cell cycle phase.

has function in is transitive over is a

has function in relation composes with part ofhas function in

has function in is also transitive over part of; mitotic telophase is part of mitosis, and a gene product annotated to mitotic telophase has function in mitosis.

has function in is transitive over part of

has function in relation composes with regulatesregulator of

Combining a has function in relation with regulates results in a regulator of relationship; for example, a gene product that has function in regulation of mitosis, which regulates mitosis would be a regulator of mitosis.

has function in plus regulates results in regulator of

Similarly, a gene product that has function in a process that positively regulates or negatively regulates a second process is a positive regulator of or negative regulator of the second process.

has function in plus positively or negatively regulates equivalent to positive or negative regulator of

Back to top

Annotation Relationship Summary

localizes to

  • localizes to relation composes with is ainferred localizes to
  • localizes to relation composes with part ofinferred localizes to

has function in

  • has function in relation composes with is ainferred has function in
  • has function in relation composes with part ofinferred has function in
  • has function in relation composes with regulatesinferred regulator of
  • has function in relation composes with positively regulatesinferred positive regulator of
  • has function in relation composes with negatively regulatesinferred negative regulator of

Back to top

Annotations and other relationships

Problems with inferring over localizes to

The relationship between gene products and classes in the cellular component ontology is localizes to. Its usage is such that it means some gene product X localize to some cellular component.

full ontology

Set up: we have the above ontology, and we want to annotate two gene products; one is a neuronal GP, and the other is a GP from the hippocampus. Both sharks and humans have neurons in their brains, but Wikipedia reliably informs me that only mammals have a hippocampus, so the poor old sharks are bereft of the second gene product.

This would be the ontology structure with inferred relations if we were to use the standard GO practice of creating specific child terms.

Annotating the neuronal GP

relations for a neuronal GP

Annotating the hippocampus GP

relations for a hippocampus GP

In the spirit of global cutbacks, we want to get rid of those extra terms, so we make has part links and do away with the old terms.

Let's see what we can infer from the graph now. Here's the neuronal GP:

relations for a neuronal GP

So far so good... now let's check out the hippocampus GP

relations for a hippocampus GP

Oh crap! Now what do we do?! It turns out that we probably shouldn't have inferred that localized to was transitive over has part. We do still want to be able to capture the fact that the hippocampus GP localizes to humans, though. How can we do that? Well, how about a relation that states that in a certain species—humans, in this case—a certain class always has certain annotatable entity in it? Or, to say that in English, why not say that the neuronal GP is integral to brains, i.e. an all-some localization relation from brain to the neuronal GP? Let's give it a go:

relations for a neuronal GP

Not bad. Let's check out our finished graph:

The graph you've all been waiting for!

As you can see, this method of representation loses us the hippocampus-brain connection, but if sharks don't need 'em, nor do we!

localizes to

The relationship between gene products and classes in the cellular component ontology is localizes to. Its usage is such that it means some gene product X localize to some cellular component.

is a

Given this statement:

gene product X localizes to human brain is a brain

what do we know?

  • some instances of X localize to some human brains
  • all human brains are a type of brain
  • therefore some Xs localize to some brains
  • some human brains, and therefore some brains, have Xs localized in them
  • the existence of human brains or of brains does not imply the existence of X
  • the existence of X does not imply the existence of human brains or brains

Inferences:

part of

gene product X localizes to human brain part of human

  • all human brains are part of some human, so the existence of human brains implies the existence of humans
  • some Xs localize to some human brains, so some Xs localize to some humans
  • some human brains, and therefore some humans, have Xs localized in them
  • the existence of human brains or of humans does not imply the existence of X
  • the existence of X does not imply the existence of human brains or humans

Inferences:

has part

gene product X localizes to human has part brain

  • all humans have some brains as a part [theoretically, anyway]
  • some brains are part of humans, but in an open world (such as that in which we do GO), we cannot infer the existence of humans from the brains as the brains may be part of other entities (e.g. sharks)
  • the existence of X does not imply the existence of humans or of brains
  • however, an instance of X localized to humans implies that humans exist, and therefore brains exist
  • we cannot infer where the gene product localizes to from the current graph

gene product X localizes to brain
human has part brain

  • all humans have some brains as a part; if humans exist, brains exist
  • the existence of brains does not imply the existence of humans
  • the existence of X does not imply the existence of humans or of brains
  • an instance of the localization of X to brains means that brains exist, but we cannot be sure that humans exist

localized in (integral to)

More powerful reasoning can be accomplished by stating that gene product X is integral to cellular component Y, i.e. all cellular component Y have some X localized in them. Now our annotations make the following statements:

  • SOME gene product X localizes to cellular component Y
  • ALL cellular component Y have SOME gene product X localized in them

is a

  • some instances of X localize to some human brain
  • all human brains are a type of brain
  • therefore some Xs localize to some brains
  • all human brains have X localized in them
  • some brains have X localized in them
  • the existence of human brains implies the existence of X
  • the existence of brains does not imply the existence of X (might not be any human brains)
  • the existence of X does not imply the existence of human brains or brains

part of

  • all human brains are part of some human, so the existence of human brains implies the existence of humans
  • some Xs localize to some human brains, so some Xs localize to some humans
  • all human brains have X localized to them, therefore so do all humans
  • the existence of human brains implies the existence of X
  • the existence of humans implies the existence of X
  • the existence of X does not imply the existence of human brains or humans

has part

  • all humans have some brains as a part
  • all humans have some X localized in them
  • the existence of humans implies the existence of brains and of X
  • the existence of X does not imply the existence of humans
  • the existence of brains does not imply the existence of humans
  • an instance of X localized in a human implies the existence of humans and brains
  • all humans have some brains as a part
  • all brains have some X localized in them; the existence of brains implies the existence of X
  • therefore all humans have X localized in them; the existence of humans implies the existence of brains and of X
  • the existence of X does not imply the existence of humans or of brains

Back to top