MIPOL1
MIPOL1 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | MIPOL1, CCDC193, mirror-image polydactyly 1 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | OMIM: 606850; MGI: 1920740; HomoloGene: 16340; GeneCards: MIPOL1; OMA:MIPOL1 - orthologs | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
MIPOL1 (Mirror Image Polydactyly 1), also known as CCDC193 (Coiled-coil domain containing 193), is a protein that in humans is encoded by the MIPOL1 gene.[5][6] Mutation of this gene is associated with mirror-image polydactyly (also known as Laurin-Sandrow syndrome.[7]) in humans, which is a rare genetic condition characterized by mirror-image duplication of digits.[8]
Gene
MIPOL1 is also known as CCDC193 (Coiled-coil domain containing 193).
Locus
The MIPOL1 gene is located at 14q13.3-q21.1 on the plus strand, spanning base pairs 37,197,888 to 37,579,207 (in the human GRCh38 primary assembly, length: 381,320 base pairs), consisting of 15 exons and 11 introns. Some notable genes in its neighborhood include SLC25A21 (mutation of this gene causes synpolydactyly[10]) and FOXA1.
mRNA
MIPOL1 has at least 15 known splice isoforms produced by alternative splicing.[11]
Protein
Properties
The unmodified MIPOL1 protein isoform 1 in humans has an isoelectric point of 5.6 and molecular weight 51.5 kDa.[12] Relative to other human proteins, MIPOL1 consists of unusually low amounts of Proline and Glycine and higher amounts of Glutamic acid and Glutamine.[13]
Isoforms
There are at least three known isoforms of this protein in humans produced by alternative splicing: isoform 1, of length 442 amino acids, isoform 2 of length 261 amino acids and isoform 3 of length 169 amino acids.[5]
Domains and motifs
MIPOL1 contains two coiled-coil domains in its C-terminus at positions 107 – 212 and 253 – 435[5] (shown in Fig.1). A bipartite nuclear localization signal is predicted at position 128 – 143.[16]
Post-translational modifications
The following post-translational modifications are predicted using bioinformatics tools for MIPOL1.[17] Multiple phosphorylation sites are predicted for this protein, that are conserved in close orthologs, including a Casein kinase 1 (CK1) site, three Casein kinase 2 (CK2) sites, and three NEK2 sites.[18]
Post-translational modification | Amino acid site | Prediction tool |
---|---|---|
Phosphorylation | Ser (37, 42, 69, 75, 105, 126, 205, 275, 344, 350, 364, 412), Thr (80, 251, 259, 338, 365, 396, 435, 437, 440), Tyr (77, 83) | MyHits,[16] NetPhos[19] |
O-linked glycosylation | Ser (34, 105, 294, 344, 412), Thr (155, 293) | NetOGlyc[20] |
O-GlcNAcylation | Thr (104) | YinOYang[21] |
Glycation | Lys (6, 41, 133, 207, 347, 421) | NetGlycate[22] |
SUMOylation | Lys (136, 147) | GPS-SUMO[23] |
Ubiquitination | Lys (22, 47, 133, 162, 314, 418) | BDM-PUB[24] |
Structure
The exact structure of the MIPOL1 has not yet been characterized. Homology-based and de novo predictions of its tertiary structure suggest that it may consist of inter-twined alpha helices, forming coiled-coil domains (see Fig.4.).[26][25]
Sub-cellular localization
Immunofluorescence imaging in the human U2OS cell line (bone Osteosarcoma epithelial cells) shows localization in the cytosol.[27] Immunohistochemistry imaging of human prostate tissue also suggests cytosolic localization.[28] A bipartite nuclear localization signal is predicted at position 128 – 143, which is highly conserved in mammalian orthologs (see Fig.2.), indicating possible localization in the nucleus.[16]
Gene regulation
The predicted promoter sequence for this gene spans from base pair 37196852 to 37198126 (1,275 bp) and has multiple predicted binding sites for transcription factors such as GATA binding factors, SMAD3, TP63 and NRF1.[29]
Gene Expression
MIPOL1 is ubiquitously expressed at low levels in humans, with highest expression in the prostate.[5]
Transcript regulation
The RNA secondary structure is stabilized by multiple stem loops that have been predicted (using bioinformatics tools[30]), and conserved across closely related species. Multiple binding targets are found for microRNAs such as MIR3163 and MIR190a, that could silence these regions on the mRNA and inhibit translation.[31]
Clinical significance
The MIPOL1 gene is an autosomal dominant gene.[32] It is one of six genes in humans causing non-syndromic polydactyly (i.e. polydactyly occurring as a separate event with no other associated anomalies).[33] Mutation of this gene is associated with mirror-image polydactyly (also known as Laurin-Sandrow syndrome[32]) in humans, which is a rare genetic condition characterized by mirror-image duplication of digits in hands and feet.[8]
This gene has also been associated with central nervous system development, and the loss of this gene can cause craniofacial defects and agenesis of the corpus callosum.[34]
The gene is shown to function as a tumor suppressor in nasopharyngeal carcinoma (NPC), through the up-regulation of the p21 (WAF1/CIP1) and p27 (proteins that are both cyclin-dependent kinases that are linked with tumor suppression via cell cycle arrest) pathways.[35] Another study investigating the role of MIPOL1 gene in cancer progression reported that MIPOL1 was downregulated in NPC tumor tissues, and that artificially re-expressing the gene caused tumor suppression by down-regulating angiogenic factors and reducing the phosphorylation of metastasis associated proteins like AKT, p65 and FAK14.[36] MIPOL1 interacts with another well-known tumor-suppressing gene, RhoB and this interaction was confirmed to enhance RhoB activity.
In a study of pediatric high grade glioma (pHGG), MIPOL1 gene was found to be down-regulated 2.4-fold in the high vascularity tumors[37]
The protein is known to interact with Replicase polyprotein 1ab in SARS-CoV2, which is a protein involved in the transcription and replication of viral RNAs.[38]
Interacting proteins
This protein is known to interact with multiple human proteins, verified via two-hybrid screening. A few notable examples include:
LATS2: Negatively regulates YAP1 in the Hippo signaling pathway that plays a pivotal role in organ size control and tumor suppression by restricting cell proliferation and promoting apoptosis.[39]
ZGPAT (Zinc finger CCCH-type with G patch domain-containing protein): A transcription repressor that negatively regulates expression of EGFR, a gene involved in cell proliferation, survival and migration, suggesting that it may act as a tumor suppressor.[40]
RCOR3 (REST Corepressor 3): A protein that may act as a component of a co-repressor complex that represses transcription[41]
It also interacts with viral proteins such as:
Replicase polyprotein 1ab (SARS-CoV2): A multifunctional protein involved in the transcription and replication of viral RNAs.[38]
Protein E7 (Human Papillomavirus): Plays a role in viral genome replication by driving entry of quiescent cells into the cell cycle.[42]
Origin and evolution
The earliest known ortholog of this protein appeared around 948 million years ago in Trichoplax adhaerens in phylum Placozoa in kingdom Animalia. The next most distant orthologs appear in phylum Cnidaria, around 824 million years ago.
Sequence Homology
The MIPOL1 protein has no known paralogs in humans and other species for which orthologs have been found, therefore, it is the only member of its gene family.
There are more than 300 known orthologs of the MIPOL1 protein in Animalia, ranging from primates to corals and sea anemones in phylum Cnidaria.[43] Orthologs of the protein were found in species as distant as Trichoplax adhaerens, a simple primitive invertebrate species. Table 2 shows a sample of the ortholog space.
Closely related orthologs are found in chordates such as mammals, reptiles, birds and amphibians, with sequence similarities greater than 70%. Sequence lengths of orthologs were similar to the human MIPOL1 protein, with no significant gene duplication observed.
Organisms with sequence similarities in the 55-70% range (moderately related orthologs) were found in bony fish, cartilaginous fish and coelacanths. Sequence length is generally longer in these species, with a longer amino acid sequence in the N-terminus (alignment with human protein occurs around amino acid 100).
Distantly related orthologs with similarities less than 50% (around 30 – 40%) are found in hemichordates, echinoderms, arthropods, molluscs, cnidaria and placozoa. Multiple sequence alignment with distant orthologs indicates poor alignment in the N-terminus of the protein.
Two COG (Clusters of Orthologous Groups of proteins) domains were found in this protein (see Fig.3): COG1196 at position 106 - 340 (Chromosome segregation ATPase[44]) and COG4372 at 259 - 431 (uncharacterized conserved protein containing a DUF3084 domain[45])[46]
Phylogenetics
Using a linear regression analysis on a plot of corrected percent divergence (amino acid changes per 100 amino acids) as a function of date of divergence from humans for different MIPOL1 orthologs (see Fig.5), it is estimated that a 1% change in amino acids in the MIPOL1 protein takes 5.68 million years. MIPOL1 protein is evolving at a moderate rate relative to fast evolving protein such as fibrinogen alpha, and slow evolving proteins such as cytochrome C.
References
- ^ a b c GRCh38: Ensembl release 89: ENSG00000151338 – Ensembl, May 2017
- ^ a b c GRCm38: Ensembl release 89: ENSMUSG00000047022 – Ensembl, May 2017
- ^ "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
- ^ "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
- ^ a b c d NCBI Gene Mirror-Image Polydactyly 1. Retrieved 27 July 2020.
- ^ "MIPOL1 - Mirror-image polydactyly gene 1 protein - Homo sapiens (Human) - MIPOL1 gene & protein". www.uniprot.org. Uniprot. Retrieved 2020-08-02.
- ^ OMIM Entry on Laurin-Sandrow Syndrome (Mirror-Image Polydactyly). Retrieved 27 July 2020.
- ^ a b Kondoh S, Sugawara H, Harada N, et al. A novel gene is disrupted at a 14q13 breakpoint of t(2;14) in a patient with mirror-image polydactyly of hands and feet. J Hum Genet. 2002;47(3):136-139. doi:10.1007/s1003802000 15
- ^ Hubbard T, Barker D, Birney E, et al. "The Ensembl genome database project." Nucleic Acids Res. 2002;30(1):38-41. doi:10.1093/nar/30.1.38
- ^ Meyertholen K, Ravnan JB, Matalon R. Identification of a Novel 14q13.3 Deletion Involving the SLC25A21 Gene Associated with Familial Synpolydactyly. Mol Syndromol. 2012;3(1):25-29. doi:10.1159/000339177
- ^ Ensembl Entry on MIPOL1. Retrieved 30 July 2020
- ^ ExPASy Compute pI/MW tool. Retrieved 27 July 2020.
- ^ SAPS (Statistical Analysis of Protein Sequences) - Compositional Analysis. Retrieved 27 July 2020
- ^ Prosite MyDomains tool. Retrieved 27 July 2020.
- ^ Nardozzi, J. D., Lott, K., & Cingolani, G. (2010). Phosphorylation meets nuclear import: a review. Cell communication and signaling : CCS, 8, 32. https://doi.org/10.1186/1478-811X-8-32
- ^ a b c Swiss Institute of Bioinformatics MyHits Motif Scan. Retrieved 27 July 2020.
- ^ Swiss Institute of Bioinformatics ExPASy Portal. Retrieved 27 July 2020
- ^ Eukaryotic Linear Motif resource. Retrieved 27 July 2020.
- ^ NetPhos 3.1. Retrieved 27 July 2020.
- ^ NetOGlyc. Retrieved 27 July 2020
- ^ YinOYang. Retrieved 27 July 2020
- ^ NetGlycate. Retrieved 27 July 2020
- ^ GPS-SUMO. Retrieved 27 July 2020
- ^ BDM-PUB. Retrieved 27 July 2020
- ^ a b I-TASSER Server for protein structure and function prediction. Retrieved 20 July 2020
- ^ Coiled-coil prediction. Retrieved 27 July 2020
- ^ Thermo Fisher Scientific anti-MIPOL1 Antibody produced in rabbit (PA5-65599). Retrieved 27 July 2020.
- ^ Sigma Aldrich anti-MIPOL1 polyclonal antibody produced in rabbit (HPA002893). Retrieved 27 July 2020.
- ^ ElDorado Genomatix regulatory analysis tools. Retrieved 13 July 2020.
- ^ RNA secondary structure prediction. Retrieved 13 July 2020.
- ^ miRDB microRNA database. Retrieved 2 July 2020.
- ^ a b OMIM Entry on Laurin-Sandrow Syndrome (Mirror-Image Polydactyly). Retrieved 27 July 2020
- ^ Umair M, Ahmad F, Bilal M, Ahmad W, Alfadhel M. Clinical Genetics of Polydactyly: An Updated Review. Front Genet. 2018;9:447. Published 2018 Nov 6. doi:10.3389/fgene.2018.00447
- ^ Shaffer JR, Orlova E, Lee MK, et al. Genome-Wide Association Study Reveals Multiple Loci Influencing Normal Human Facial Morphology. PLoS Genet. 2016;12(8):e1006149. Published 2016 Aug 25. doi:10.1371/journal.pgen.1006149
- ^ Cheung AK, Lung HL, Ko JM, et al. Chromosome 14 transfer and functional studies identify a candidate tumor suppressor gene, mirror image polydactyly 1, in nasopharyngeal carcinoma. Proc Natl Acad Sci U S A. 2009;106(34):14478-14483. doi:10.1073/pnas.0900198106
- ^ Leong MML, Cheung AKL, Kwok TCT, Lung ML. Functional characterization of a candidate tumor suppressor gene, Mirror Image Polydactyly 1, in nasopharyngeal carcinoma. Int J Cancer. 2020;146(10):2891-2900. doi:10.1002/ijc.32732
- ^ Smith SJ, Tilly H, Ward JH, et al. CD105 (Endoglin) exerts prognostic effects via its role in the microvascular niche of paediatric high grade glioma. Acta Neuropathol. 2012;124(1):99-110. doi:10.1007/s00401-012-0952-1
- ^ a b UniProt entry on Replicase 1ab. Retrieved 27 July 2020.
- ^ UniProt entry on LATS2. Retrieved 27 July 2020.
- ^ UniProt entry on ZGPAT. Retrieved 27 July 2020.
- ^ UniProt entry on RCOR3. Retrieved 27 July 2020.
- ^ UniProt entry on protein E7. Retrieved 27 July 2020.
- ^ NCBI entry on MIPOL1 orthologs. Retrieved 30 June 2020
- ^ NCBI Conserved Protein Domain Family entry on COG1196. Retrieved 10 June 2020.
- ^ NCBI Conserved Protein Domain Family entry on COG4372. Retrieved 10 June 2020
- ^ Tatusov, R. L., Galperin, M. Y., Natale, D. A., & Koonin, E. V. (2000). The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic acids research, 28(1), 33–36. https://doi.org/10.1093/nar/28.1.33
- ^ Time tree: Approximate divergence between two taxa. Retrieved 30 June 2020.