InterPro: IPR001413 Dopamine 1A receptor

User manual: (open in help window)

The Signatures field lists the protein signature matches. For each protein signature, the member database, signature ID, signature name and number of matching proteins are given. The member database names link to their respective home pages, and the signature IDs link to the corresponding entry information pages.


The PARENT/CHILD relationship is used to indicate true protein family/subfamily relationships. If an InterPro parent has more than one InterPro child, a protein sequence cannot appear in the match table of more than one of these children. If one InterPro entry is described as the child of another, the child entry is more specific than the parent, and a protein sequence match to the child entry always implies a match to the parent. Signatures for the parent and child entries must overlap. Parent/child relationships can be displayed through the link to a Parent/Child tree.
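The two rules in the paragraph above (a child match implies a parent match, and sibling children do not share proteins) can be checked mechanically. The following is a minimal sketch, assuming each entry's matches are available as a set of protein accessions; the function name, the data layout and the accessions in the example are hypothetical and are not part of InterPro.

```python
from itertools import combinations


def check_parent_child(parent_proteins, children):
    """Return a list of violations of the two PARENT/CHILD rules.

    parent_proteins: set of protein accessions matched by the parent entry.
    children: dict mapping child entry accession -> set of protein accessions.
    """
    problems = []
    # Rule 1: every protein matched by a child must also match the parent.
    for child, proteins in sorted(children.items()):
        missing = proteins - parent_proteins
        if missing:
            problems.append(f"{child}: not in parent: {sorted(missing)}")
    # Rule 2: a protein may appear in at most one child of the same parent.
    for (a, pa), (b, pb) in combinations(sorted(children.items()), 2):
        shared = pa & pb
        if shared:
            problems.append(f"{a} and {b} share: {sorted(shared)}")
    return problems


# Hypothetical accessions, for illustration only.
parent = {"P1", "P2", "P3"}
consistent = {"IPR900002": {"P1"}, "IPR900003": {"P2"}}
inconsistent = {"IPR900002": {"P1", "P9"}, "IPR900003": {"P1"}}

print(check_parent_child(parent, consistent))    # []
for line in check_parent_child(parent, inconsistent):
    print(line)
```

An empty list means the hypothetical data satisfies both rules; the second call reports one protein missing from the parent and one protein shared by two children.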

The CONTAINS/FOUND IN relationship is used to indicate domain composition. Some domains can be found in more than one type of protein or family of proteins, but are not subtypes in the family sense. The domain is a functional entity, which can be found in proteins with different domain organisations. The CONTAINS/FOUND IN relationship does not imply that a protein always contains the domain, only that it may. It is useful for linking InterPro entries that are often associated but are not in a PARENT/CHILD relationship. Domain composition relations are shown within the Parent/Child tree as a bracketed list after each entry.


Functional classification of the entry is given by listing associated GO terms. The Gene Ontology project (GO) http://www.geneontology.org/ is a dynamic controlled vocabulary defined in three ontologies: molecular function, biological process and cellular component.

Molecular function is the action characteristic of a gene product.
Biological process describes a phenomenon marked by changes that lead to a particular result, mediated by one or more gene products.
Cellular component is the part of a cell of which a gene product is a component; GO includes the extracellular environment of cells; a gene product may be a component of one or more parts of a cell.

For each associated term, the name of the term and the GO accession number are given. GO terms were assigned to InterPro entries manually, by reading the abstract of each entry and the annotation of the proteins in its protein match table. An appropriate GO term for an entry is one that applies to the whole protein. The GO terms associated with an InterPro entry apply to all proteins with true hits to the signatures in that entry. The assignments are incomplete and ongoing because of the dynamic nature of the GO project. Some entries could be mapped to very low level (specific) GO terms, while entries describing wider families or common domains were mapped to higher level terms or could not be mapped at all. The GO terms and mappings can be found using the EBI QuickGO browser.

It is important to remember that these mappings provide useful predictions of GO assignments for the corresponding proteins; however, biological exceptions, such as inactivated enzymes, may occur.


The Abstract describes the signatures in the entry, the protein matches and the taxonomic range, and provides references. Where possible, a functional inference is made.

As above, but only proteins with a known structure in the PDB are displayed.


Database links include cross-references to:

The BLOCKS database; it contains multiple alignments of conserved regions in protein families
The IntEnz database; EC numbers, systematic and common name, synonyms, function and links to other databases: BRENDA, EXPASY, GO, KEGG, WIT
PROSITE documents; e.g. PDOC00020
Preliminary PROSITE documents; e.g. QDOC50218
The Carbohydrate-Active EnZymes database; CAZy describes families of related catalytic and carbohydrate-binding modules of enzymes that act on glycosidic bonds
The IUPHAR Receptor Database
COMe database, a bioinorganic motif database
MEROPS, a database of peptidases and peptidase inhibitors
PANDIT, a database of multiple sequence alignments and phylogenetic trees based on Pfam signatures.
MSDsite, provides a link to the PROSITE ligand statistics page of MSD.
CluSTr, a database containing protein clusters from more than 190 organisms with completely sequenced genomes. InterPro entries are linked to protein clusters only where >= 70% of the CluSTr members occur in the InterPro entry. Please note that the clustering is continually recalculated as the protein set is revised, so the links may change at any time.


The Taxonomy Display aims to provide an 'at a glance' view of the taxonomic range of the sequences associated with each InterPro entry and the number of sequences associated with each lineage. The numbers associated with each taxonomic lineage are 'clickable' and link to the protein overview matches for the selected taxonomy, with the species sorted and displayed alphabetically. Full taxonomic information for a species can be retrieved from the Newt taxonomy browser by clicking on the taxonomic id number next to the species name on the display. Both the protein accession number and the protein overview match are clickable and return the detailed matches view for the protein. For proteins with a known structure, a link to the MSD is provided in the InterPro name column.

The lineages were carefully selected to provide a view of the major groups of organisms. The circular display has the taxonomy-tree root as its centre. The selected model organisms populate the outermost circle. Nodes of the taxonomy-tree are placed on the inner circles. Radial lines lead to the description for each node. No significance is attached to the position of a node on a particular inner circle, other than convenience, though some attempt has been made to group nodes. The nodes themselves are either true taxonomy nodes, which have an NCBI taxonomy number, or artificial nodes created for this display, of which there are three: 'Unclassified', 'Other Eukaryota (Non-Metazoa)' and the 'Plastid Group'.

Artificial Taxon: 'Unclassified' contains the following NCBI taxon groups:

Taxonomy:12884 Viroids
Taxonomy:12908 unclassified
Taxonomy:28384 other

The Eukaryota (TAXONOMY:2759) comprise 29 taxa, which have been grouped into two artificial taxa and one existing taxon:

Fungi/Metazoa (TAXONOMY:33154); Node 'Metazoa'

Artificial Taxon: 'Plastid Group' contains the following NCBI taxon groups:

TAXONOMY:33090 Viridiplantae
TAXONOMY:3027 Cryptophyta
TAXONOMY:2763 Rhodophyta
TAXONOMY:2830 Haptophyceae
TAXONOMY:33630 Alveolata
TAXONOMY:33634 Stramenopiles
TAXONOMY:33682 Euglenozoa
TAXONOMY:38254 Glaucocystophyceae
TAXONOMY:136419 Cercozoa
TAXONOMY:42452 unclassified eukaryotes

Each taxonomic group within this artificial taxon contains organisms that have a plastid.

Artificial Taxon: 'Other Eukaryota (Non-Metazoa)' comprises the following NCBI taxon groups:

TAXONOMY:207245 Diplomonadida
TAXONOMY:5719 Parabasalidea
TAXONOMY:5752 Heterolobosea
TAXONOMY:29177 Granuloreticulosea
TAXONOMY:33084 Entamoebidae
TAXONOMY:33677 Acanthamoebidae
TAXONOMY:37357 Plasmodiophorida
TAXONOMY:65574 Acantharea
TAXONOMY:65582 Polycystinea
TAXONOMY:66288 Oxymonadida
TAXONOMY:95225 Lobosea
TAXONOMY:98369 Paramyxea
TAXONOMY:136087 Malawimonadidae
TAXONOMY:142796 Mycetozoa
TAXONOMY:143015 Jakobidae
TAXONOMY:172820 Apusomonadidae
TAXONOMY:173726 Pelobiontida
TAXONOMY:193537 Centroheliozoa

This artificial taxon contains the remaining taxonomic groups of NCBI taxon 2759 (Eukaryota), i.e. those that are neither in the Plastid Group nor in Fungi/Metazoa (TAXONOMY:33154).
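The grouping rules listed above can be expressed as a small lookup. The following is a minimal sketch, assuming a sequence's lineage is available as a list of NCBI taxon ids from the root down to the species; the function name, the return values and the abbreviated example lineages are assumptions made for illustration, and the taxon id sets are copied from the lists above.

```python
# Taxon id sets copied from the lists above.
UNCLASSIFIED = {12884, 12908, 28384}
PLASTID_GROUP = {33090, 3027, 2763, 2830, 33630, 33634,
                 33682, 38254, 136419, 42452}
EUKARYOTA = 2759
FUNGI_METAZOA = 33154


def display_group(lineage):
    """Return the display grouping for a lineage (list of NCBI taxon ids).

    Returns None when the lineage falls under none of the groupings
    described here, i.e. when it is shown under a true taxonomy node.
    """
    ids = set(lineage)
    if ids & UNCLASSIFIED:
        return "Unclassified"
    if EUKARYOTA in ids:
        if FUNGI_METAZOA in ids:
            return "Fungi/Metazoa"
        if ids & PLASTID_GROUP:
            return "Plastid Group"
        # Everything else under Eukaryota is the remainder group.
        return "Other Eukaryota (Non-Metazoa)"
    return None


# Abbreviated lineages, for illustration only.
print(display_group([2759, 33154, 33208, 9606]))   # Fungi/Metazoa
print(display_group([2759, 33090, 3702]))          # Plastid Group
print(display_group([2759, 142796]))               # Other Eukaryota (Non-Metazoa)
print(display_group([12884]))                      # Unclassified
```

Treating 'Other Eukaryota (Non-Metazoa)' as the fall-through case mirrors the definition above, where that group is defined as whatever remains of Eukaryota after the other two groupings.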

Note that many UniProt proteins do not have a database cross-reference to InterPro (DR line); therefore, not all sequence records associated with an InterPro entry can be recovered using the InterPro accession number and the taxonomy group as search terms with SRS. In addition, some PROSITE signatures give false positives, which could result in a misleading taxonomy display. Some protein records may have more than one taxonomy, for example where a mouse and a human sequence have been merged; this results in multiple taxonomy counts for a protein.


This section displays entries that share more than 70% of their proteins. Such overlaps define Parent/Child and Contains/Found In relationships between InterPro entries.

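[Figure: example overlap display for IPR000001 and IPR008293. Left: Venn diagram of the numbers of overlapping proteins. Right: bar diagram of the average numbers of overlapping amino acids.]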

In the above example, InterPro entry IPR008293 contains proteins which are also found in IPR000001 as a result of the protein signatures of the two entries overlapping.

The two entries have been compared firstly by counting the number of proteins which are common to both, the results of which are displayed in the Venn diagram on the left, and secondly by calculating the average overlap of the protein signatures, in amino acids, with the results displayed in the bar diagram on the right.

Venn diagram display of the overlap of proteins common to both entries:

The purple intersection contains the number of overlapping proteins common to both IPR000001 and IPR008293, which is 10 in this case.
The pink section on the left is the number of proteins found in IPR000001 but not IPR008293, which is 236.
The blue section on the right is the number of proteins found in IPR008293 but not IPR000001, which is 0; i.e. all proteins associated with IPR008293 occur in IPR000001.

Bar diagram display of the average amino acid overlap between the protein signatures:

The average number of amino acids overlapping in the sequences of the 10 proteins common to both entries is then calculated, with the results displayed in the bar diagram on the right. The bar diagram display is only shown for 'Domain - Domain' relationships.

The purple segment in the middle shows the average number of amino acids overlapping between IPR000001 and IPR008293 for the 10 proteins, in this case 86.
The pink segment shows the average number of amino acids found in IPR000001, but not IPR008293, for the 10 proteins, which is 0.
The blue segment shows the average number of amino acids found in IPR008293, but not IPR000001, for the 10 proteins, which is 347.

The results of these comparisons are used to calculate the percentage overlap score, with all scores greater than 70% displayed on the InterPro pages. In this example, since all proteins found in IPR008293 are also found in IPR000001, and all the amino acids from IPR000001 overlap with those from IPR008293, the percentage overlap score is 100%.
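The numbers in this example can be reproduced with simple set and interval arithmetic. The following is a minimal sketch: the text does not give the exact formula for the percentage overlap score, so `percent_shared` below uses one plausible reading (shared proteins as a fraction of the smaller entry) and is an assumption, as are all function names, the placeholder protein accessions and the match coordinates. The bar-diagram calculation is shown for a single protein, whereas the display averages over all shared proteins.

```python
def venn_counts(a, b):
    """Return (only in a, in both, only in b) for two sets of proteins."""
    return len(a - b), len(a & b), len(b - a)


def percent_shared(a, b):
    """Assumed score: shared proteins as a percentage of the smaller entry."""
    smaller = min(len(a), len(b))
    if smaller == 0:
        return 0.0
    return 100.0 * len(a & b) / smaller


def segment_lengths(a, b):
    """Amino acid counts (only a, shared, only b) for two matches.

    a and b are (start, end) positions on the same protein, inclusive.
    """
    shared = max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)
    len_a = a[1] - a[0] + 1
    len_b = b[1] - b[0] + 1
    return len_a - shared, shared, len_b - shared


# Placeholder accessions sized to match the example: 246 proteins in
# IPR000001, of which 10 are also the only members of IPR008293.
ipr000001 = {f"P{i}" for i in range(246)}
ipr008293 = {f"P{i}" for i in range(10)}

print(venn_counts(ipr000001, ipr008293))      # (236, 10, 0)
print(percent_shared(ipr000001, ipr008293))   # 100.0

# Hypothetical coordinates chosen to give the segment sizes in the example.
print(segment_lengths((100, 185), (50, 482)))  # (0, 86, 347)
```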


The protein entries in the examples have a match status of TRUE and illustrate, as far as possible, the diversity in structure and function of the proteins in the InterPro entry. For each example protein, the accession number, the UniProt name and a compact view of the matches are given.


The reference field provides a list of publications associated with each InterPro entry. The list is often derived from the reference lists of the member databases.


InterPro protein matches are now calculated for all UniProt proteins, which are a combination of UniProt/Swiss-Prot, UniProt/TrEMBL and PIR proteins. For more information go to the UniProt home page.

Match lists give a number of different views of the signature matches on the sequences in each InterPro entry. Match information includes the protein sequence accession number, the accession number of the signature (PROSITE, PRINTS, Pfam, ProDom, SMART, TIGRFAMs, PIRSF, SUPERFAMILY, Gene3D and PANTHER), the position of the signature on the protein sequence and the status of the match (true, false positive, false negative or unknown).
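The match information described above maps naturally onto a small record type. The following is a minimal sketch, not InterPro's actual data model: the class names, field names and example accessions are hypothetical, and only the four status values are taken from the text.

```python
from dataclasses import dataclass
from enum import Enum


class MatchStatus(Enum):
    """The four match statuses named in the text."""
    TRUE = "true"
    FALSE_POSITIVE = "false positive"
    FALSE_NEGATIVE = "false negative"
    UNKNOWN = "unknown"


@dataclass(frozen=True)
class SignatureMatch:
    protein_acc: str     # protein sequence accession number
    signature_acc: str   # accession of the member database signature
    start: int           # first matched residue on the protein
    end: int             # last matched residue on the protein
    status: MatchStatus


def count_true_proteins(matches):
    """Count distinct proteins with at least one match of status TRUE."""
    return len({m.protein_acc for m in matches
                if m.status is MatchStatus.TRUE})


# Hypothetical records, for illustration only.
matches = [
    SignatureMatch("PROT0001", "SIG00001", 12, 240, MatchStatus.TRUE),
    SignatureMatch("PROT0001", "SIG00002", 30, 95, MatchStatus.TRUE),
    SignatureMatch("PROT0002", "SIG00001", 5, 210, MatchStatus.FALSE_POSITIVE),
    SignatureMatch("PROT0003", "SIG00002", 41, 110, MatchStatus.UNKNOWN),
]
print(count_true_proteins(matches))  # 1
```

Counting distinct protein accessions rather than match rows matters because one protein can be matched by several signatures of the same entry.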


Accession numbers provide a stable way of identifying InterPro entries from release to release. When the signatures in an InterPro entry are split or merged to give new or modified entries, then the accession number of the original InterPro entry becomes the secondary accession number in the new or modified InterPro entry.

Following a recent change, accession numbers are now linked to methods, so any accession number that has been associated with a method becomes a secondary accession number in the entry in which the method currently appears. This makes it possible to trace the movement of methods through the splitting and merging of entries.
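The bookkeeping for secondary accession numbers during a merge can be illustrated as follows. This is a minimal sketch under assumed data structures: entries are held in a plain dictionary, and the function names, accessions and method identifiers are hypothetical rather than taken from InterPro.

```python
def merge(entries, old_accs, new_acc):
    """Merge the entries in old_accs into a new entry new_acc, in place.

    entries maps an accession to {"methods": set, "secondary": set}.
    Each retired accession, and everything it had already inherited,
    becomes a secondary accession of the new entry.
    """
    methods, secondary = set(), set()
    for acc in old_accs:
        old = entries.pop(acc)
        methods |= old["methods"]
        secondary |= old["secondary"] | {acc}
    entries[new_acc] = {"methods": methods, "secondary": secondary}


def resolve(entries, acc):
    """Return the current accession(s) for a possibly retired accession."""
    if acc in entries:
        return [acc]
    return sorted(a for a, e in entries.items() if acc in e["secondary"])


# Hypothetical accessions and method identifiers, for illustration only.
entries = {
    "IPR900001": {"methods": {"M1"}, "secondary": set()},
    "IPR900002": {"methods": {"M2"}, "secondary": {"IPR900000"}},
}
merge(entries, ["IPR900001", "IPR900002"], "IPR900003")

print(resolve(entries, "IPR900000"))          # ['IPR900003']
print(sorted(entries["IPR900003"]["methods"]))  # ['M1', 'M2']
```

`resolve` returns a list because a split, unlike a merge, can leave one retired accession as a secondary accession of several current entries.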


Every InterPro entry has an accession number of the form IPRXXXXXX, where each X is a digit. Accession numbers are stable from release to release and therefore allow unambiguous citation of database entries.
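The accession format described above (the prefix IPR followed by six digits) is easy to validate before using a string as a search term. The following is a minimal sketch; the function name is hypothetical, and the check is deliberately case-sensitive, which is an assumption.

```python
import re

# "IPR" followed by exactly six digits, as described in the text.
_IPR_PATTERN = re.compile(r"IPR\d{6}")


def is_interpro_accession(text):
    """Return True if text has the form IPRXXXXXX, where X is a digit."""
    return _IPR_PATTERN.fullmatch(text) is not None


print(is_interpro_accession("IPR001413"))   # True
print(is_interpro_accession("IPR1413"))     # False (too few digits)
print(is_interpro_accession("ipr001413"))   # False (lower-case prefix)
```

A well-formed accession is not necessarily an existing one; this only checks the shape of the string.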

The InterPro entry Name describes the InterPro entry and should give an idea of the type of protein matches for that entry.


Type defines the entry as a Family, Domain, Repeat or Site. Sites are further classified as PTM (post-translational modification), AS (active site) or BS (binding site).

An InterPro family is a group of evolutionarily related proteins that share similar domain (or repeat) architecture. One or more signatures may define an InterPro Family and a single signature may not necessarily cover the whole protein. A signature may also define a group of proteins with more than one function - a superfamily. A list of the current Families in InterPro is available: Family List.

An InterPro domain is an independent structural unit, which can be found alone or in conjunction with other domains or repeats. Domains are evolutionarily related. An InterPro entry of Type=Domain is diagnostic for a domain but does not necessarily define the domain boundaries exactly. A list of the current Domains in InterPro is available: Domain List.

An InterPro repeat is a region that is not expected to fold into a globular domain on its own. For example, 6-8 copies of the WD40 repeat are needed to form a single globular domain. Many other short repeat motifs that probably do not form a globular fold also have Type=Repeat. A list of the current Repeats in InterPro is available: Repeat List.

A post-translational modification modifies the primary protein structure. Examples include glycosylation, phosphorylation, sulphation and splicing. The modification may be permanent or reversible, and it may be required for activation or deactivation of function. To be recognised in InterPro, the sequence signature must be described. Many of the PTM sites have low specificity, and the number of proteins recognised by the sequence signatures cannot be displayed. Such signatures also group together many functionally unrelated proteins. A list of the current PTMs in InterPro is available: PTM List.

An InterPro Binding site binds chemical compounds that are not themselves substrates for a reaction. The bound compound may be a required co-factor for a chemical reaction, be involved in electron transport or be involved in protein structure modification. The binding is reversible, and the amino acids involved in binding must be described for an entry to qualify as a binding site. A list of the current Binding Sites in InterPro is available: Binding Site List.

Active sites are best known as the catalytic pockets of enzymes where a substrate is bound and converted to a product, which is then released. Distant parts of a protein's primary structure may be involved in the formation of the catalytic pocket. Therefore, to describe an active site, different signatures will be needed to cover the active site residues. A list of the current Active Sites in InterPro is available: Active Site List.


In some cases no matches are shown for an InterPro entry because of the low specificity of its signature(s): the number of hits is excessive and includes many false positives. For some PTMs, the signatures are either general rules or weak patterns, resulting in a large number of matches. The InterPro entries affected are:

IPR000042
IPR000134
IPR000220
IPR000338
IPR000430
IPR000437
IPR000694
IPR000865
IPR001230
IPR001430
IPR001472
IPR001495
IPR001687
IPR001833
IPR001918
IPR002032
IPR002158
IPR002179



The short name is a short, concise name unique to each InterPro entry.

The number of proteins, with match status TRUE, matching one or more of the entry signatures is displayed next to the short name.


Publications
1.	Attwood T.K., Findlay J.B.C. Fingerprinting G-protein-coupled receptors. Protein Eng. 7: 195-203 (1994) [PubMed: 8170923]
2.	Birnbaumer L. G-proteins in signal transduction. Annu. Rev. Pharmacol. Toxicol. 30: 675-705 (1990) [PubMed: 2111655]
3.	Casey P.J., Gilman A.G. G-protein involvement in receptor-effector coupling. J. Biol. Chem. 263: 2577-2580 (1988) [PubMed: 2830256]
4.	Attwood T.K., Findlay J.B.C. Design of a discriminating fingerprint for G-protein-coupled receptors. Protein Eng. 6: 167-176 (1993) [PubMed: 8386361]
5.	Grandy D.K., Marchionni M.A., Makam H., Stofko R.E., Alfano M., Frothingham L., Fischer J.B., Burke-Howie K.J., Bunzow J.R., Server A.C., Civelli O. Cloning of the cDNA and gene for a human D2 dopamine receptor. Proc. Natl. Acad. Sci. U.S.A. 86: 9762-9766 (1989) [PubMed: 2532362]
6.	Watson S., Arkinstall S. Dopamine. The G-protein Linked Receptor Factsbook: 96-110 (1994)