Forkhead-associated domain containing protein 1 (FHAD1) is a protein encoded by the FHAD1 gene.
As the name suggests, it has a forkhead-associated domain and an extensive coiled coil structure. It is predicted to have a function related to DNA transcription. It is localized to the nucleus and has a nuclear localization signal.
Gene
editLocus and Size
editIn humans, the FHAD1 gene is located on chromosome 1 (1p36.21) and the genomic sequence is on the plus strand starting from 15236559 bp and ending at 15400283 bp.[1] There are 3 main genes around FHAD1, out of which 2 encode proteins with known functions. Two genes, EFHD2 and Chymotrypsin-C (CTRC) lie downstream of FHAD1 on the plus strand.[1] TMEM51 lies upstream of FHAD1.[1]
FHAD1 is 163,682 bases long and contains 43 exons.
Common Aliases
editFHAD1 has 4 aliases, Forkhead associated phosphopeptide binding domain 1, Forkhead-associated (FHA) phosphopeptide binding domain 1, FHA Domain-Containing Protein 1, and KIAA1937.[2]
mRNA
editThe mRNA transcript of FHAD1 5138 bp long. The gene has 30 isoforms based on NCBI gene data.
Protein
editThe FHAD 1 protein is 1412 aa long, weighs 16.2 kDa and has an isoelectric point of 6.52.[3] It has 3 isoforms, namely 1, 3 and 4, but only isoform 1 is supported by experimental evidence. It consists of 1 glutamic acid rich region and 1 proline rich region.
Domains and Motifs
editForehead-associated domain
editThe FHA domain extends from 18 - 84 aa in the protein. It can recognize and bind to phosphorylation sites, specifically pSer, pThr and pTyr. The exact mechanism and function of this domain still being studied, but it is found in proteins performing many different functions, mainly DNA repair and transduction.[4]
Smc region
editFHAD1 contains one Smc (Structural maintenance of chromosomes) region from 275 - 1401 aa. This region encodes Smc proteins that are involved in cell cycle control, cell division and chromosome separation.[5]
TMPIT-like protein, pfam07851
editThis region extends from 394 - 494 aa in FHAD1. The proteins encoded by the TMPIT proteins are predicted to be transmembrane proteins.[6] However, there is lack of literature to support this.
DUF342
editThis domain extends from 694 - 777 aa in FHAD1. It encodes a protein from a family of bacterial proteins with no known function.[7]
Structure
editFHAD1 contains the forkhead-associated domain that consists of beta sheets. Based on structure prediction software, the rest of the protein consists of alpha helices and random coils. Overall, FHAD1 has a coiled coil structure as shown in the figure.
Post-translational modifications
editFHAD1 is predicted to undergo multiple different types of post-translational modifications based on prediction software.
- Glycosylation: There were 101 possible glycosylation sites on FHAD1 and consisted mainly of amino acids involved in O-linked glycosyaltion.
- Phosphorylation: The protein was predicted to have a large number of phosphorylation sites, at least more than 100.
- Glycation: Multiple lysine residues of FHAD1 were predicted for glycation of their
ε amino groups. - SUMOylation: 4 SUMOylation consensus sequences and 3 interaction sites were predicted on FHAD1.
- O-GlcNAc sites: 6 sites for O-GlcNAc glycosylation were predicted on FHAD1. Research has shown that this specific type of glycosylation is most abundant in nucleocytoplasmic proteins.[8]
Subcellular localization
editFHAD1 has been predicted to be a nuclear protein with 94.1% reliability. It also contains possible nuclear localization signal sequences between 1100 - 1107 aa. Two pat4 and one pat7 sequences were predicted. Pat4 and pat7 are consensus sequences consisting of clusters of lysine or arginine residues.
Expression
editIn humans, FHAD1 is expressed in testis, fallopian tube and uterine tissues in females, nasopharynx and bronchi of lungs based on studies found on the Human Protein Atlas.[9] NCBI's EST Profile also showed that FHAD1 is highly expressed in the testis, with some expression in the trachea and esophagus. In mice, the gene was also expressed in the testis, along with the pituitary gland, lung and brain.
Regulation of expression
editFHAD1 has a promoter that extends from 15246234 – 15247380 bp and is 1147 bp long. It includes an initial part of the 5' UTR of FHAD1. Some transcription factors predicted to bind to this promoter are:
- MAX binding protein - This protein is likely a transcriptional repressor from the E-box binding factors family[10]
- TR4/TR2 - These proteins are part of a family of nuclear receptors and bind to DR1 (direct repeat) elements of promoters. They act as anchors to recruit other corepressors[11]
- Kaiso - This transcriptional regulator is encoded by the ZBTB33 gene and is involved in response to DNA damage by interacting with p53[12]
- LYL1-E12 - This transcriptional factor is a dimer of two proteins, LYL1 and E12, where E12 is an E-box binding protein. LYL1 is also involved in some leukemias and is a possible oncogenic factor.[13]
- Nur 77 - This protein is also known as NGFIB (Nerve growth factor IB) and belongs to a family of nuclear receptors. It is involved in apoptosis and cell growth pathways.[14]
In the 5' UTR and 3' UTR of FHAD1, multiple stem loops are predicted to form .
Function
editFHAD1 can be involved in transcriptional regulation through interaction with other transcriptional regulators.
Protein interactions
editFHAD1 was found to be a binding partner for GTF2IRD1 (GTF2I repeat domain containing protein 1) via a yeast 2 hybrid screen.[15] GTF2I is a gene that encodes the general transcription factor II-1. This specific study showed that GTF2IRD1 is a nuclear protein that is involved transcriptional regulation through chromatin modification. The fact that it exists in the nucleus and was found in neuronal cells correlates with the localization and functional data for FHAD1. Additionally, FHAD1 and GTF2IRD1 interacted through RD2 (repeat domain 2) of GTF2IRD1. RD2 has shown some level of DNA binding activity.
FHAD1 was found to interact (colocalization) with 14-3-3 protein epsilon via cosedimentation. This protein binds to a number of binding partners, mostly by recognizing phosphothreonine or phosphoserine motifs.[16]
Clinical Significance
editFHAD1 showed differential expression in patients diagnosed with endometriosis and obesity.[17]
Homology and Evolution
editFHAD1 has no known paralogs. It has orthologs in the organisms in the following classes: Mammalia, Reptilia, Aves, Sarcopterygii, Actinopterygii, Gastropoda and Lingulata. There was significant conservation in the FHA domain in all the organisms in the table below.
The rate of evolution of FHAD1 was compared with that of fibrinogen and cytochrome c and it showed that FHAD1 is a rapidly evolving gene.
Common Name | Time of divergence (mya) | Sequence identity |
---|---|---|
Black fruit bat | 96 | 78% |
Goat | 96 | 75% |
Killer whale | 96 | 75% |
Giant Panda | 96 | 75% |
Sea otter | 96 | 72% |
Guinea Pig | 90 | 71% |
Great roundleaf bat | 96 | 71% |
European hedgehog | 96 | 67% |
Mongolian gerbil | 90 | 63% |
Green sea turtle | 312 | 43% |
Chinese soft-shell turtle | 312 | 40% |
Emperor penguin | 312 | 39% |
American alligator | 312 | 38% |
Brown mesite | 312 | 38% |
Pigeon | 312 | 36% |
West Indian Ocean coelacanth | 413 | 33% |
Atlantic salmon | 435 | 29% |
Red bellied piranha | 435 | 29% |
California sea hare/slug | 797 | 28% |
References
edit- ^ a b c "FHAD1 forkhead associated phosphopeptide binding domain 1 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2018-04-30.
- ^ Database, GeneCards Human Gene. "FHAD1 Gene - GeneCards | FHAD1 Protein | FHAD1 Antibody". www.genecards.org. Retrieved 2018-04-30.
- ^ "ExPASy - Compute pI/Mw tool". web.expasy.org. Retrieved 2018-05-06.
- ^ Weng JH, Hsieh YC, Huang CC, Wei TY, Lim LH, Chen YH, Ho MR, Wang I, Huang KF, Chen CJ, Tsai MD (October 2015). "Uncovering the Mechanism of Forkhead-Associated Domain-Mediated TIFA Oligomerization That Plays a Central Role in Immune Responses". Biochemistry. 54 (40): 6219–29. doi:10.1021/acs.biochem.5b00500. ISSN 0006-2960. PMID 26389808.
- ^ group, NIH/NLM/NCBI/IEB/CDD. "NCBI CDD Conserved Protein Domain Smc". www.ncbi.nlm.nih.gov. Retrieved 2018-05-06.
{{cite web}}
:|last=
has generic name (help) - ^ group, NIH/NLM/NCBI/IEB/CDD. "NCBI CDD Conserved Protein Domain TMPIT". www.ncbi.nlm.nih.gov. Retrieved 2018-05-06.
{{cite web}}
:|last=
has generic name (help) - ^ group, NIH/NLM/NCBI/IEB/CDD. "NCBI CDD Conserved Protein Domain DUF342". www.ncbi.nlm.nih.gov. Retrieved 2018-05-06.
{{cite web}}
:|last=
has generic name (help) - ^ Hart, Gerald W.; Akimoto, Yoshihiro (2009). Varki, Ajit; Cummings, Richard D.; Esko, Jeffrey D.; Freeze, Hudson H.; Stanley, Pamela; Bertozzi, Carolyn R.; Hart, Gerald W.; Etzler, Marilynn E. (eds.). Essentials of Glycobiology (2nd ed.). Cold Spring Harbor (NY): Cold Spring Harbor Laboratory Press. ISBN 9780879697709. PMID 20301273.
- ^ "Tissue expression of FHAD1 - Summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2018-05-05.
- ^ "MNT MAX network transcriptional repressor [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2018-05-06.
- ^ Shi, Lihong; Sierant, M. C.; Gurdziel, Katherine; Zhu, Fan; Cui, Shuaiying; Kolodziej, Katarzyna E.; Strouboulis, John; Guan, Yuanfang; Tanabe, Osamu (2014-05-08). "Biased, Non-equivalent Gene-Proximal and -Distal Binding Motifs of Orphan Nuclear Receptor TR4 in Primary Human Erythroid Cells". PLOS Genetics. 10 (5): e1004339. doi:10.1371/journal.pgen.1004339. ISSN 1553-7404. PMC 4014424. PMID 24811540.
- ^ Koh, Dong-In; Han, Dohyun; Ryu, Hoon; Choi, Won-Il; Jeon, Bu-Nam; Kim, Min-Kyeong; Kim, Youngsoo; Kim, Jin Young; Parry, Lee (2014-10-21). "KAISO, a critical regulator of p53-mediated transcription of CDKN1A and apoptotic genes". Proceedings of the National Academy of Sciences. 111 (42): 15078–15083. Bibcode:2014PNAS..11115078K. doi:10.1073/pnas.1318780111. ISSN 0027-8424. PMC 4210320. PMID 25288747.
- ^ "WikiGenes - Collaborative Publishing". WikiGenes - Collaborative Publishing. Retrieved 2018-05-06.
- ^ Song KH, Park JI, Lee MO, Soh J, Lee K, Choi HS (December 2001). "LH induces orphan nuclear receptor Nur77 gene expression in testicular Leydig cells". Endocrinology. 142 (12): 5116–23. doi:10.1210/endo.142.12.8525. ISSN 0013-7227. PMID 11713204.
- ^ Carmona-Mora, Paulina; Widagdo, Jocelyn; Tomasetig, Florence; Canales, Cesar P.; Cha, Yeojoon; Lee, Wei; Alshawaf, Abdullah; Dottori, Mirella; Whan, Renee M. (2015-08-15). "The nuclear localization pattern and interaction partners of GTF2IRD1 demonstrate a role in chromatin regulation". Human Genetics. 134 (10): 1099–1115. doi:10.1007/s00439-015-1591-0. ISSN 0340-6717. PMID 26275350. S2CID 15601752.
- ^ "Ywhae - 14-3-3 protein epsilon - Mus musculus (Mouse) - Ywhae gene & protein". www.uniprot.org. Retrieved 2018-05-07.
- ^ geo. "GEO DataSet Browser". www.ncbi.nlm.nih.gov. Retrieved 2018-05-07.