Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
Last revision Both sides next revision
biogrid_projectindex [2018/09/13 00:59]
biogridadmin created
biogrid_projectindex [2018/09/13 01:36]
biogridadmin
Line 4: Line 4:
 [[downloads|Back to BioGRID Download Formats]] [[downloads|Back to BioGRID Download Formats]]
  
-==== How to Detect a BioGRID ​CHEMTAB ​file ==== +==== How to Detect a BioGRID ​PROJECTINDEX ​file ==== 
-BioGRID ​CHEMTAB ​files are denoted by the extension **.chemtab.txt** or **.chemtab.zip**+BioGRID ​PROJECTINDEX ​files are denoted by the extension **.projectindex.txt** or **.projectindex.zip**
  
 ==== Header Definitions ==== ==== Header Definitions ====
-The first line of a BioGRID ​CHEMTAB ​file is the heading line and starts with a hash (#). This line is purely for informational purposes and gives a brief description of the content contained in each column. If you are scripting the use of this file, you can simply ignore it.+The first line of a BioGRID ​PROJECTINDEX ​file is the heading line and starts with a hash (#). This line is purely for informational purposes and gives a brief description of the content contained in each column. If you are scripting the use of this file, you can simply ignore it.
  
 ==== Column Definitions ==== ==== Column Definitions ====
-The column contents of BioGRID ​CHEMTAB ​files should ​be as follows:+The column contents of BioGRID ​PROJECTINDEX ​files will always contain the following columns: 
 + 
 +  - **BioGRID ID** is the identifier in the BioGRID database that corresponds to the gene. These identifiers are best used for creating links to the BioGRID from your own websites or applications. To link to a page within our site, simply append the URL: http://​www.thebiogrid.org/​ID/​ to each ID. For example, http://​www.thebiogrid.org/​31623/​. 
 +  - **Entrez Gene ID** is the identifier from the Entrez-Gene database that corresponds to the interacting gene. If no Entrez Gene ID is available, this will be a "​-"​. 
 +  - **Systematic name** is a plain text systematic name if known for the gene. Will be a "​-"​ if no name is available. 
 +  - **Official symbol** is a common gene name/​official symbol for the gene. Will be a "​-"​ if no name is available. 
 +  - **Synonyms/​Aliases** is a "​|"​ separated list of alternate identifiers for the gene. Will be "​-"​ if no aliases are available. 
 +  - **Organism ID** is the NCBI taxonomy ID for the gene. 
 +  - **Organism Name** is the official name of the organism for the gene. 
 +  - **Interaction Count** is the number of interactions in the BioGRID for this gene. 
 +  - **PTM Count** is the number if post translational modifications in the BioGRID for this gene. 
 +  - **Chemical Interaction Count** is the number of chemical interactions in the BioGRID for this gene. 
 +  - **Source** is the source database for the curation of this gene within this project. 
 + 
 +=== Additional Column Definitions (Repeating) === 
 +In addition to above, PROJECTINDEX files can contain any number of additional columns, depending on the project being represented. Since these columns may differ significantly from one project to the next, it's best to examine the project page within BioGRID for more detailed information about the project specific columns. Any additional columns will come in the following sets of 7 columns which are repeated N times for each unique dataset: 
 +  - **<​COLUMN_NAME>​_values** - This column contains the values used on the web version for the corresponding column on our project pages. This will contain one or more values (separated by "​|"​) and are usually a type of classification or ontology reference. 
 +  - **<​COLUMN_NAME>​_ids** - If the value entered up above also has an accompanying official id, this column will contain matching "​|"​ separate ID values associated with those in the <​COLUMN_NAME>​_values column. For example, if <​COLUMN_NAME>​_values contains Gene Ontology Terms, this column would contain corresponding Gene Ontology Term IDs. If not applicable, this column will contain "​-"​ 
 +  - **<​COLUMN_NAME>​_tags** - Tags are another level of associated qualifications on an original V 
 +  - **<​COLUMN_NAME>​_evidence_values** - This is easy to read common reference values for this column in "​|"​ separated format for multiple entries. These are usually terms that classify or annotate a gene. Examples: Group Classifications,​ Ontology Terms etc. This column will contain "​-"​ if not applicable. 
 +  - **<​COLUMN_NAME>​_evidence_ids** - These are ids that correspond to the values in <​COLUMN_NAME>​_values. For example, if <​COLUMN_NAME>​_values contained the gene ontology term: nucleus, this column may contain the corresponding GO ID of GO:0005634. This column will contain "​-"​ if not applicable. 
 +  - **<​COLUMN_NAME>​_evidence_classes** -  
 +  - **<​COLUMN_NAME>​_evidence_methods** - 
  
-  - **BioGRID Chemical Interaction ID**. A unique identifier for each interaction within the BioGRID database. 
-  - **BioGRID ID** for Gene Interactor. The identifier in the BioGRID database that corresponds to the interacting gene. These identifiers are best used for creating links to the BioGRID from your own websites or applications. To link to a page within our site, simply append the URL: http://​www.thebiogrid.org/​ID/​ to each ID. For example, http://​www.thebiogrid.org/​31623/​. 
-  - **Entrez Gene ID** for Gene Interactor. The identifier from the Entrez-Gene database that corresponds to the interacting gene. If no Entrez Gene ID is available, this will be a "​-"​. 
-  - **Systematic name** for Gene Interactor. A plain text systematic name if known for the interacting gene. Will be a "​-"​ if no name is available. 
-  - **Official symbol** for Gene Interactor. A common gene name/​official symbol for the interacting gene. Will be a "​-"​ if no name is available. 
-  - **Synonyms/​Aliases** for Gene Interactor. A "​|"​ separated list of alternate identifiers for the interacting gene. Will be "​-"​ if no aliases are available. 
-  - **Organism ID** for Gene Interactor. An NCBI taxonomy ID for the interacting gene. 
-  - **Organism Name** for Gene Interactor. The official name of the organism for the interacting gene. 
-  - **Action**. Chemical action for this interaction. 
-  - **Interaction Type**. Methods such as target, enzyme, carrier, transporter,​ etc. 
-  - **First author surname** of the publication in which the interaction has been shown, optionally followed by additional indicators, e.g. Stephenson A (2005) 
-  - **Pubmed ID** of the publication in which the interaction has been shown. ​ 
-  - **BioGRID Publication ID**. This is the internal BioGRID reference id for this publication. Useful for linking to BioGRID publication pages, such as: http://​thebiogrid.org/​101382/​publication/​ 
-  - **BioGRID Chemical ID**. The internal BioGRID identifier for the interacting chemical. 
-  - **Chemical Name**. Common name for the Chemical. 
-  - **Chemical Synonyms**. A "​|"​ separated list of synonyms for this chemical. Will be "​-"​ if non-exist. 
-  - **Chemical Brands**. A "​|"​ separated list of brand names for this chemical. Will be "​-"​ if non-exist. 
-  - **Molecular Formula**. A molecular formula for this chemical, will be "​-"​ if unknown. 
-  - **Chemical Type**. Will be "​-"​ if unknown. 
-  - **Chemical ATC Codes**. A "​|"​ separated list of ATC codes for the chemical. Will be "​-"​ if unknown. 
-  - **Chemical CAS Number**. Will be "​-"​ if unknown. 
-  - **Curated By**. Name of the database interaction was curated by. 
  
 **All columns are mandatory so columns with no values are filled with "​-"​** **All columns are mandatory so columns with no values are filled with "​-"​**
  
 [[downloads|Back to BioGRID Download Formats]] [[downloads|Back to BioGRID Download Formats]]
 
biogrid_projectindex.txt · Last modified: 2018/09/13 01:47 by biogridadmin