Max-Planck-Institut für Informatik
max planck institut
informatik
mpii logo Minerva of the Max Planck Society
 

User manual, GOTaxExplorer, Version 1.0

Query language

The query language provides support for four basic query types.


Result

The <result> can be composed of one or more of the following types: GO, PFAM, SMART, TAX, LTAX, GENE. Different results have to be separated with a comma ",". LTAX returns the lowest common taxon in the taxonomic tree that satisfies a given condition.

Condition

The condition is composed of boolean operators and entity definitions. The operators "AND", "OR" and "NOT" are available. Round brackets "(...)" can be used to group parts of the condition. An entity is always defined with a keyword that indicates the data source followed by a colon ":" and the accession number from the source database, e.g. "GO:GO:0000015" or "PFAM:PF02811". It is possible to use an asterisk as wildcard for all entities of a specific type, e.g. "SMART:*" for all SMART domains. The following keywords are defined:

Limit

Limits are only applicable to GO and taxonomy results. They are ignored in all other cases. The taxonomy queries can be limited to return only species by adding "SPECIES" to the <limit>. GO results can be limited to ontologies using "TT:ontology". Here, "ontology" can either be "biological_process", "molecular_function" or "cellular_component". Adding "EXTT" to the <limit> leads to the exclusion of the specified ontologies. Similar to this, it is possible to restrict the results to mappings with selected evidence codes. The codes are added with "EC:ec" where "ec" is a valid evidence code. The specified evidence codes can be excluded from the computation by adding "EXEC" to <limit>.

Group

Grouping is only applicable to GO results. The keyword is ignored in other cases. The number of GO terms satisfying the query and being a descendant of is calculated for the terms in the list after the GROUP keyword. The results are summarized in a table in the Results frame and visualized as histogram in a new window.


Sample queries

A query that selects all human proteins is


  GENE WHERE TAX:9606

A more complex query that selects all species with proteins belonging to the PHP domain (Pfam accession: PF02811) and involved in "DNA repair" (GO:0006281) reads


  TAX WHERE PFAM:PF02811 AND GO:GO:0006281 RESTRICT SPECIES

The query


  GO WHERE TAX:4751 RESTRICT TT:biological_process
    SIM GO WHERE TAX:9606 RESTRICT TT:biological_process

compares all GO biological process from fungi (Taxonomy accession: 4751) with the processes from human (Taxonomy accession: 9606). Modifying the query to compare only processes from fungi that are not in human leads to


  GO WHERE TAX:4751 AND NOT TAX:9606 RESTRICT 
  TT:biological_process SIM GO WHERE TAX:9606 
  RESTRICT TT:biological_process

Performing a functional equivalence comparison between Mycobacterium paratuberculosis and human is possible with the following query


  GENE WHERE TAX:1770 FUNSIM GENE WHERE TAX:9606