Download IRView data

Dataset

The PSI-MITAB25 formatted file of the current dataset can be downloaded from the following link:

irview_1.0.mitab.gz

The PSI-MITAB25 format is a tab delimited format for the molecular interaction data exchange and only describes binary interactions, one pair of interactors per row. It has been derived from the tabular format provided by the BioBRID.

Column definitions

Following 1-15 columns has been defined for the PSI-MITAB 1.0 format by the BioGRID:

  1. Unique identifier for interactor A, represented as databaseName:ac, where databaseName is the name of the corresponding database as defined in the PSI-MI controlled vocabulary, and ac is the unique primary identifier of the molecule in the database. Identifiers from multiple databases can be separated by "|". It is recommended that proteins be identified by stable identifiers such as their UniProtKB or RefSeq accession number.
  2. Unique identifier for interactor B.
  3. Alternative identifier for interactor A, for example the official gene symbol as defined by a recognised nomenclature committee. Representation as databaseName:identifier. Multiple identifiers separated by "|".
  4. Alternative identifier for interactor B.
  5. Aliases for A, separated by "|". Representation as databaseName:identifier. Multiple identifiers separated by "|".
  6. Aliases for B.
  7. Interaction detection methods, taken from the corresponding PSI-MI controlled Vocabulary, and represented as darabaseName:identifier(methodName), separated by "|".
  8. First author surname(s) of the publication(s) in which this interaction has been shown, optionally followed by additional indicators, e.g. "Doe-2005-a". Separated by "|".
  9. Identifier of the publication in which this interaction has been shown. Database name taken from the PSI-MI controlled vocabulary, represented as databaseName:identifier. Multiple identifiers separated by "|".
  10. NCBI Taxonomy identifier for interactor A. Database name for NCBI taxid taken from the PSI-MI controlled vocabulary, represented as databaseName:identifier (typicaly databaseName is set to 'taxid'). Multiple identifiers separated by "|". Note: In this column, the databaseName:identifier(speciesName) notation is only there for consistency. Currently no taxonomy identifiers other than NCBI taxid are anticipated, apart from the use of -1 to indicate "in vitro", -2 to indicate "chemical synthesis", -3 indicates "unknown", -4 indicates "in vivo" and -5 indicates "in silico".
  11. NCBI Taxonomy identifier for interactor B.
  12. Interaction types, taken from the corresponding PSI-MI controlled vocabulary, and represented as dataBaseName:identifier(interactionType), separated by "|".
  13. Source databases and identifiers, taken from the corresponding PSI-MI controlled vocabulary, and represented as databaseName:identifier(sourceName). Multiple source databases can be separated by "|".
  14. Interaction identifier(s) in the corresponding source database, represented by databaseName:identifier
  15. Confidence score. Denoted as scoreType:value. There are many different types of confidence score, but so far no controlled vocabulary. Thus the only current recommendation is to use score types consistently within one source. Multiple scores separated by "|".

Following columns (16-19) have been originally added by IRView:

  1. Experimental role(s) of interactor A, taken from the corresponding PSI-MI controlled Vocabulary, and represented as darabaseName:identifier(methodName), separated by "|".
  2. Experimental role(s) of interactor B.
  3. Sufficient region(s) of A to bind to interactor B, represented as start..end in the protein sequence specified in the 1st column.
  4. Sufficient region(s) of B to bind to interactor A, represented as start..end in the protein sequence specified in the 2nd column.
  5. Source database link to access original data.