BioGRID ORCS - Matrix Tab (*.matrix.tab.txt)

The BioGRID ORCS Matrix Tab file contains all of the genes and score values associated with a multiple screens in a single file. It is a tab-delimited file and the columns contained are outlined below.

Back to BioGRID ORCS Download Formats

How to detect a BioGRID ORCS Matrix File

BioGRID ORCS Screen files are denoted by the extension .matrix.tab.txt

Header Definitions

The first line of a BioGRID ORCS Matrix file is the heading line and starts with a hash (#). This line is purely for informational purposes and gives a brief description of the content contained in each column. If you are scripting the use of this file, you can simply ignore it.

Matrix Column Definitions

The column contents of BioGRID ORCS Matrix Files should be as follows:

  1. Screen ID - A unique identifier in the BioGRID ORCS database representing this screen. You can match this value with the similarly named value provided in the screen index file
  2. Identifier ID - An identifier for the gene represented by the row. This is generally an ENTREZ_GENE ID, but will sometimes contain an alternative ID when the original dataset was ambiguous or we were unable to map the name used by the original authors.
  3. Identifier Type - This will be the name of the source database, but can also be AMBIGUOUS or UNKNOWN if we were unable to definitively map the original authors dataset.
  4. Official Symbol - The official name for the gene
  5. Aliases - List of common aliases for the gene
  6. Organism ID - The NCBI Taxonomy ID of the organism used in this screen (example: 9606 for Human)
  7. Organism Official - The NCBI official name for the organism used in this screen (example: Homo sapiens for Human)
Repeated Columns

The following columns will be repeated N times, where N represents the number of screens in the matrix dataset. Each column will be prefixed with the screen id. For example 255-Score.1, 255-Score.2, 255-HIT etc.

  1. <SCREEN_ID>-Score.1 - The first score value, the type of score presented will be provided in the screen index file
  2. <SCREEN_ID>-Score.2 - The second score value (if necessary), the type of score presented will be provided in the screen index file
  3. <SCREEN_ID>-Score.3 - The third score value (if necessary), the type of score presented will be provided in the screen index file
  4. <SCREEN_ID>-Score.4 - The fourth score value (if necessary), the type of score presented will be provided in the screen index file
  5. <SCREEN_ID>-Score.5 - The fifth score value (if necessary), the type of score presented will be provided in the screen index file
  6. <SCREEN_ID>-Hit - This will be YES, NO, or N/A to indicate whether or not this gene is considered a “HIT” by the original authors.

All columns are mandatory so columns with no values are filled with “-“

 
orcs/downloads/matrix.txt · Last modified: 2018/08/29 14:14 by biogridadmin