Chromosome 12 open reading frame 71
C12orf71 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | C12orf71, chromosome 12 open reading frame 71 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1920594; HomoloGene: 53485; GeneCards: C12orf71; OMA:C12orf71 - orthologs | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Chromosome 12 open reading frame 71 (c12orf71) is a protein which in humans is encoded by c12orf71 gene. The protein is also known by the alias LOC728858.[5]
Gene
[edit]The gene is located on the minus strand of chromosome 12 (12p11.23).[6][7] The DNA sequence of the c12orf71 gene is 3071 base pairs long and 8 significant structural variations have been identified including deletions, duplications, gain- and loss-of-function mutations.[8] c12orf71 gene was determined to be altered (gain of 21 Mb) in the chromosomal region 12p11.21-p13.3 of a male patient with chromosomal aberrations and in a duplication (gain of 411 kb) at chromosome 12p11.23 along with c12orf70, the coding regions of STK38L and ARNTL2 and a portion of PPFIBP1.[9][10] Manual inspection of alignments, has determined that c12orf71 gene is mammalian specific.[11] Furthermore, genome-wide screening has identified c12orf71 as one of 1000 disrupted genes that are positively selected by cisplatin, a chemotherapy drug.[12]
RNA
[edit]c12orf71 transcript variant 1 mRNA is 1022 nucleotides long and consists of 2 exons.[6] There is one more, slightly longer transcript variant of c12orf71, with a length of 1087 nucleotides.[13] The mRNA sequence of c12orf71 transcript variant 1 consists of a coding sequence that spans over two exons and 2 poly-A signal sequences. [6]
Expression
[edit]In humans c12orf71 has shown an intermediate expression level in testis and low expression in the bone marrow, skin, spleen, lymph node and liver. Human c12orf71 is expressed after the fetal-development stage.[14] RNA-sequencing analysis has revealed that c12orf71 was expressed at a very low level or not expressed at all in osteoarthritis and non-osteoarthritis hip cartilage. [15] A genome engineering study that studied mice knock-outs has found that c12orf71 has a decreased expression in humans compared to mouse testis, however the absence of the c12orf71 had no effect on mouse fertilization.[16]
Protein
[edit]c12orf71 protein is 269 amino acids long and the unmodified precursor protein has a predicted molecular weight of 30.4 kDa and a theoretical isoelectric point of 5.21.[17] Additionally, the protein is rich in Serine and Aspartic Acid and has a relatively low amount of Valine and Tyrosine. [18]
Cellular localization
[edit]Cellular localization analysis showed that human c12orf71 protein is found in the cytoplasm of the cell. All of the orthologs of the protein were also localized to the cytoplasm.[19] Immunohistochemistry with polyclonal antibody for c12orf71 localized the protein in the cytosol of the cell.[20]
Domains
[edit]The first 21 amino acids of the coding sequence are comprising a disordered region, followed by a domain of unknown function (DUF4640) which spans almost the whole coding sequence.[21] Additionally, the human protein also contains a vacuolar domain, which is mammal specific and may be modulated by phosphorylation.[7]
Post-translation modifications
[edit]c12orf71 protein has multiple predicted phosphorylation sites,[19] which can have an impact on the protein interactions and sub-cellular localization as well as affect the protein's stability and activity. The protein has one predicted SUMOylation[22] and one ubiquitination predicted site, which can influence many biological functions of the protein, such as cellular response to stress and degradation, respectively. Five different Lysine acetylation[23] sites were predicted, which can neutralize the positive charge on the Lysine, but at the same time the transfer of acetyl group can increase the expression of the protein. 2 N-glycosylation,[19] multiple O-glycosylation[19] and O-linked-N-acetylglucosaminylation[19] sites were predicted, which could potentially affect the protein stability. There is a competition for Lysine-acetylation and ubiquitination at K130, suggesting that a deacetylase enzyme is acting at this site.
Interacting proteins
[edit]There is a direct interaction between c12orf71 and AP2B1, with a moderate confidence level. Adaptor related protein complex 2 subunit beta (AP2B1) helps establish a link between clathrin and receptors in coated vesicles.[24] c12orf71 protein has been found to be present in a protein-protein interaction (PPI) network of the Carboxypeptidase M (CPM) gene, along with nine more genes.[25]
Protein | Function |
YEATS4 | YEATS domain-containing protein 4; Component of the NuA4 histone acetyltransferase (HAT) complex which is involved in transcriptional activation of select genes principally by acetylation of nucleosomal histones H4 and H2A. |
CPM | Carboxypeptidase M; Specifically removes C-terminal basic residues (Arg or Lys) from peptides and proteins. It is believed to play important roles in the control of peptide hormone and growth factor activity at the cell surface, and in the membrane-localized degradation of extracellular proteins; Belongs to the peptidase M14 family |
ZNF215 | Zinc finger protein 215; May be involved in transcriptional regulation; SCAN domain containing |
SPACA5B | Sperm acrosome associated 5B; Enable lysozyme activity |
SPACA5 | Sperm acrosome-associated protein 5; Belongs to the glycosyl hydrolase 22 family |
LYZL4 | Lysozyme-like protein 4; May be involved in fertilization (By similarity). Has no detectable bacteriolytic and lysozyme activities in vitro (By similarity); Belongs to the glycosyl hydrolase 22 family |
LYZL6 | Lysozyme-like protein 6; May be involved sperm-egg plasma membrane adhesion and fusion during fertilization. Exhibits bacteriolytic activity in vitro against Micrococcus luteus and Staphylococcus aureus. Shows weak bacteriolytic activity against Gram-positive bacteria at physiological pH. Bacteriolytic activity is pH- dependent, with a maximum at around pH 5.6; Lysozymes, c-type |
NUP107 | Nuclear pore complex protein Nup107; Plays a role in the nuclear pore complex (NPC) assembly and/or maintenance. Required for the assembly of peripheral proteins into the NPC. May anchor NUP62 to the NPC; Belongs to the nucleoporin Nup84/Nup107 family |
SPRYD4 | Spry domain-containing protein 4; SPRY domain containing 4 |
CPSF6 | Cleavage and polyadenylation specificity factor subunit 6; Component of the cleavage factor Im complex (CFIm) that plays a key role in pre-mRNA 3'-processing. Involved in association with NUDT21/CPSF5 in pre-MRNA 3'-end poly(A) site cleavage and poly(A) addition. CPSF6 binds to cleavage and polyadenylation RNA substrates and promotes RNA looping; RNA binding motif containing |
Structure
[edit]Homology and evolution
[edit]Orthologs of the c12orf71 gene have been found only in mammals, in particular Theria (marsupials and placentals). No orthologs in monotremes, birds or reptiles, amphibians, fish, invertebrates, fungi, plants, bacteria, and viruses[26]
Species | Common name | Order | Date of divergence (MYA) | Percent identity (%) | Percent similarity (%) | Length (amino acids) | Accession number | |
---|---|---|---|---|---|---|---|---|
Primate mammals | Homo sapiens | Human | Primates | 0 | 100.0 | 100.0 | 269 | NP_001073875.1 |
Pan paniscus | Pygmy chimpanzee | Primates | 6.4 | 99.3 | 100.0 | 269 | XP_003828900.1 | |
Gorilla gorilla gorilla | Gorilla | Primates | 8.6 | 96.3 | 98.1 | 269 | XP_004052946.1 | |
Pongo abelii | Sumatran orangutan | Primates | 15.2 | 95.9 | 97.0 | 269 | XP_002823096.1 | |
Nomascus leucogenys | Gibbon | Primates | 19.6 | 93.7 | 95.5 | 269 | XP_003265679.1 | |
Placental mammals | Equus quagga | Zebra | Perissodactyla | 87 | 59.9 | 72.0 | 279 | XP_046519988.1 |
Loxodonta africana | African savannah elephant | Proboscidea | 87 | 61.3 | 73.5 | 279 | XP_023410800.1 | |
Mus musculus | Mouse | Rhodentia | 94 | 38.2 | 51.0 | 300 | NP_001157708.1 | |
Bos taurus | Cattle | Artiodactyla | 94 | 45.6 | 56.4 | 347 | XP_010803693.1 | |
Panthera uncia | Snow leopard | Carnivora | 94 | 49.7 | 62.9 | 325 | XP_049483174.1 | |
Vulpes vulpes | Red fox | Carnivora | 94 | 55.0 | 69.1 | 281 | XP_025851313.1 | |
Leptonychotes weddellii | Weddell seal | Carnivora | 94 | 55.0 | 70.9 | 281 | XP_006740901.1 | |
Orycteropus afer afer | Aardvark | Tubulidentata | 94 | 55.6 | 69.9 | 277 | XP_007947992.1 | |
Pteropus alecto | Black flying fox | Chiroptera | 94 | 55.9 | 72.2 | 278 | XP_024903180.1 | |
Canis lupus familiaris | Dog | Carnivora | 99 | 49.4 | 62.2 | 319 | XP_005637089.1 | |
Marsupials | Gracilinanus agilis | Agile gracile opossum | Didelphimorphia | 160 | 26.4 | 43.0 | 341 | XP_044534595.1 |
Trichosurus vulpecula | Common brushtail possum | Diprotodontia | 160 | 30.1 | 44.2 | 320 | XP_036616685.1 | |
Sarcophilus harrisii | Tasmanian devil | Dasyuromorphia | 160 | 30.2 | 45.9 | 318 | XP_003772759.2 | |
Vombatus ursinus | Common wombat | Diprotodontia | 160 | 31.7 | 44.6 | 321 | XP_027700114.1 | |
Dromiciops gliroides | Colocolo opossum | Microbiotheria | 160 | 32.3 | 45.9 | 317 | XP_043823568.1 |
Evolutionary history
[edit]It has been estimated that c12orf71 gene first appeared in marsupials approximately 160 million years ago. Among the marsupial species, based on the sequence similarity, the gene has first appeared in species from the Microbotheria taxonomic group, represented by the Dromiciops Gliroides (Colocolo opossum) species. Only one isoform of the c12orf71 protein has been found in this species.
Clinical association
[edit]There is only one disease associated with c12orf71 gene, common warts.[7] A study of global gene methylation of common warts caused by HPV infection found that c12orf71 gene is differentially methylated in Arab male patients with common warts. In particular, c12orf71 is hypomethylated in skin infected with common warts compared to normal skin.[27] 10 SNPs from the GWAS catalog associate c12orf71 gene with obsolete and androgenic alopecia, healing of bone mineral density and educational attainment.[8]
References
[edit]- ^ a b c GRCh38: Ensembl release 89: ENSG00000214700 – Ensembl, May 2017
- ^ a b c GRCm38: Ensembl release 89: ENSMUSG00000040163 – Ensembl, May 2017
- ^ "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
- ^ "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
- ^ "C12orf71 Gene - GeneCards | CL071 Protein | CL071 Antibody". www.genecards.org. Retrieved 2022-12-16.
- ^ a b c "Homo sapiens chromosome 12 open reading frame 71 (C12orf71), transcript variant 1, mRNA". Nucleotide. 2022-06-09.
- ^ a b c "AceView: Gene:C12orf71, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView". www.ncbi.nlm.nih.gov. Retrieved 2022-09-30.
- ^ a b "C12orf71 Gene - GeneCards | CL071 Protein | CL071 Antibody". www.genecards.org. Retrieved 2022-09-30.
- ^ Gomes, Alexandra Galvão (2014-06-26). Caracterização citogenômica de aberrações cromossômicas (text thesis) (in Brazilian Portuguese). Universidade de São Paulo.
- ^ Pyatt RE, Astbury C (December 2011). "Interpretation of copy number alterations identified through clinical microarray-comparative genomic hybridization". Clinics in Laboratory Medicine. 31 (4): 565–80, viii. doi:10.1016/j.cll.2011.08.007. PMID 22118737.
- ^ Cañas, Villanueva (2015-11-20). Insights into mammalian adaptive evolution through genomics data (Ph.D. Thesis thesis). Universitat Pompeu Fabra. hdl:10803/397756.
- ^ Ko, Tengyu (2018). Genome-Wide Screening Identifies Genes and Biological Processes Implicated in Chemoresistance and Oncogene-Induced Apoptosis (Thesis). ProQuest 2665131757.
- ^ "Homo sapiens chromosome 12 open reading frame 71 (C12orf71), transcript variant 2, mRNA". Nucleotide. 2022-08-19.
- ^ "C12orf71 chromosome 12 open reading frame 71 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-09-30.
- ^ Johnson, Katherine (2016). Functional analysis of the osteoarthritis susceptibility loci marked by the polymorphisms rs10492367 and rs9350591 (Thesis thesis). Newcastle University. hdl:10443/3247.
- ^ Miyata H, Castaneda JM, Fujihara Y, Yu Z, Archambeault DR, Isotani A, et al. (July 2016). "Genome engineering uncovers 54 evolutionarily conserved and testis-enriched genes that are not required for male fertility in mice". Proceedings of the National Academy of Sciences of the United States of America. 113 (28): 7704–7710. Bibcode:2016PNAS..113.7704M. doi:10.1073/pnas.1608458113. PMC 4948324. PMID 27357688.
- ^ "Compute pI/MW - SIB Swiss Institute of Bioinformatics | Expasy". www.expasy.org. Retrieved 2022-12-08.
- ^ "SAPS < Sequence Statistics < EMBL-EBI". www.ebi.ac.uk. Retrieved 2022-12-08.
- ^ a b c d e "Services". DTU Health Tech. Retrieved 2022-12-08.
- ^ "C12orf71 protein expression summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2022-12-08.
- ^ "uncharacterized protein C12orf71 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-09-30.
- ^ "GPS-SUMO: Prediction of SUMOylation Sites & SUMO-interaction Motifs". sumosp.biocuckoo.org. Archived from the original on 2013-05-10. Retrieved 2022-12-08.
- ^ "GPS-PAIL 2.0 - Prediction of Acetylation on Internal Lysines". pail.biocuckoo.org. Retrieved 2022-12-08.
- ^ "PSICQUIC View". www.ebi.ac.uk. Retrieved 2022-12-08.
- ^ a b Asghari Alashti F, Goliaei B, Minuchehr Z (2022-01-15). "Analyzing large scale gene expression data in colorectal cancer reveals important clues; CLCA1 and SELENBP1 downregulated in CRC not in normal and not in adenoma". American Journal of Cancer Research. 12 (1): 371–380. PMC 8822279. PMID 35141024.
- ^ "C12orf71 orthologs". NCBI. Retrieved 2022-09-30.
- ^ Alghamdi MA, Al-Eitan LN, Tarkhan AH, Al-Qarqaz FA (January 2021). "Global gene methylation profiling of common warts caused by human papillomaviruses infection". Saudi Journal of Biological Sciences. 28 (1): 612–622. doi:10.1016/j.sjbs.2020.10.050. PMC 7783806. PMID 33424347. S2CID 228813393.