User:Gturns/sandbox
C2orf72
Chromosome 2 open reading frame 72 (C2orf72) is a protein that in humans is encoded by the C2orf72 gene. The function of this protein is currently unknown.
Gene
This gene consists of 3,629 nucleotides and contains 3 exons and 2 introns; the protein it encodes is 295 amino acids long.[1][2]
Orthologs are found in many organisms, including mice, dogs, fish, frogs, chickens, and snakes. C2orf72 has no known isoforms.
Location
This gene is located in the human genome at 2q37.1.[2] Its transcript contains 3,629 nucleotides and 3 exons.
Expression
On average, C2orf72 expression is highest in the liver and brain, and expression in the placenta can also be quite high. During development, expression is highest in the intestine and stomach.[3] In the brain, it is found primarily in the cingulate cortex.[4]
Protein
Primary Sequences
The C2orf72 protein is 295 amino acids long. It has an experimental molecular weight of 38 kDa.[5] It is predicted to be located in the cytoplasm.[6] It is rich in alanine and arginine relative to other proteins: these residues make up 22.4% and 11.5% of its amino acids, respectively.[6]
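Composition figures such as the alanine and arginine percentages above come from counting residues in the one-letter protein sequence. The following is a minimal sketch of that calculation, not the SAPS tool cited in the text. The function name `residue_percentages` and the toy peptide are hypothetical and chosen only for illustration; the toy peptide is not the C2orf72 sequence.

```python
from collections import Counter


def residue_percentages(sequence: str) -> dict[str, float]:
    """Return the percent composition of each residue in a one-letter protein sequence."""
    # Drop whitespace and line breaks (e.g. from a pasted FASTA body) and normalise case.
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("sequence is empty")
    counts = Counter(seq)
    return {residue: 100.0 * n / len(seq) for residue, n in counts.items()}


# Hypothetical toy peptide for illustration only; NOT the C2orf72 sequence.
toy = "MAARAAGRLA"
composition = residue_percentages(toy)
print(f"Ala: {composition['A']:.1f}%  Arg: {composition['R']:.1f}%")
```

Applied to the full 295-residue sequence from the NCBI Protein record, the same count would be expected to give values close to those reported, although dedicated tools may also test whether a residue is enriched relative to a reference set of proteins.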
Post-Translational Modifications
The protein contains both a globular domain and a region of high disorder. It contains no transmembrane regions.[7]
Secondary Structure and Tertiary Structure
The predicted secondary structure of C2orf72 consists primarily of alpha helices, with significant coiling in the final 30 amino acids.
Homology and Evolution
Paralogs
C2orf72 has no known paralogs. One paralogous domain is found in hCG1777421.[10]
Orthologs
Genus and species | Common name | Taxonomic group | Estimated date of divergence from human lineage (MYA) | Accession number | Sequence length (aa) | Sequence identity to human protein | Sequence similarity to human protein |
Homo sapiens | Human | Primate | 0 | NP_001138466.1 | 295 | 100% | 100% |
Urocitellus parryii | Arctic ground squirrel | Rodent | 90 | XP_026252042.1 | 286 | 82% | 85% |
Microtus ochrogaster | Prairie vole | Rodent | 90 | XP_026642192.1 | 298 | 68% | 75% |
Canis lupus dingo | Dingo | Canids | 96 | XP_025319191.1 | 284 | 80% | 83% |
Eumetopias jubatus | Steller sea lion | Seal | 96 | XP_027960872.1 | 281 | 79% | 83% |
Phyllostomus discolor | Pale spear-nosed bat | Chiroptera | 96 | XP_028364834.1 | 289 | 73% | 75% |
Delphinapterus leucas | Beluga whale | Cetacean | 96 | XP_022423643.1 | 383 | 58% | 60% |
Ornithorhynchus anatinus | Platypus | Monotreme | 177 | XP_028928459.1 | 387 | 26% | 34% |
Terrapene carolina triunguis | Three-toed box turtle | Reptile | 312 | XP_026505723.1 | 269 | 40% | 50% |
Gallus gallus | Red junglefowl | Bird | 312 | XP_004943655.1 | 244 | 38% | 47% |
Anas platyrhynchos | Mallard | Bird | 312 | XP_027320562.1 | 251 | 36% | 48% |
Pseudonaja textilis | Eastern brown snake | Reptile | 312 | XP_026577460.1 | 272 | 33% | 43% |
Pelodiscus sinensis | Chinese soft-shelled turtle | Reptile | 312 | XP_025038651.1 | 252 | 28% | 40% |
Python bivittatus | Burmese python | Reptile | 312 | XP_025023716.1 | 279 | 24% | 37% |
Xenopus tropicalis | Western clawed frog | Amphibian | 351.8 | XP_002937397.2 | 291 | 30% | 41% |
Nanorana parkeri | High Himalaya frog | Amphibian | 351.8 | XP_018432004.1 | 304 | 27% | 39% |
Scleropages formosus | Asian arowana | Fish | 435 | XP_029110182.1 | 311 | 26% | 38% |
Erpetoichthys calabaricus | Reedfish | Fish | 435 | XP_028650671.1 | 280 | 24% | 38% |
Kryptolebias marmoratus | Mangrove rivulus | Fish | 435 | XP_017268814.1 | 317 | 22% | 37% |
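The identity column in the table above is a percentage of matching residues in a pairwise alignment with the human protein. The source does not state which alignment tool or denominator was used, so the following is only a sketch under stated assumptions: the two sequences are already aligned to equal length with `-` marking gaps, and identity is counted over gap-free columns. The function name `percent_identity` and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over gap-free columns of two pre-aligned sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Assumption: columns containing a gap in either sequence are excluded.
    # Other tools divide by the full alignment length or the shorter sequence instead.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != "-" and b != "-"]
    if not columns:
        raise ValueError("no gap-free columns to compare")
    matches = sum(a == b for a, b in columns)
    return 100.0 * matches / len(columns)


# Hypothetical toy alignment for illustration only; not real ortholog sequences.
identity = percent_identity("MAAR-GL", "MAKRSGL")
print(f"{identity:.1f}% identity")
```

Because the denominator convention differs between tools, values computed this way may not match the table exactly, even for the same pair of sequences.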
Distant Homologs
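The most distant homolog identified is found in the mangrove rivulus (Kryptolebias marmoratus), a fish whose lineage diverged from the human lineage an estimated 435 million years ago. Its protein shares 22% sequence identity with the human protein.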
Interacting Proteins
Abbreviated Name | Full Name | Aliases | Info & Function |
CD81 | Cluster of differentiation 81 | CVID6, S5.7, TAPA1, TSPAN28 | Cell surface glycoprotein; complexes with integrins; appears to promote muscle cell fusion and support myotube maintenance; may be involved in signal transduction |
KRAS | Kirsten rat sarcoma viral oncogene homolog | C-K-RAS, CFC2, K-RAS2A, K-RAS2B, K-RAS4A, K-RAS4B, KI-RAS, KRAS1, KRAS2, NS, NS3, RALD, RASK2, K-ras, KRAS proto-oncogene, GTPase, c-Ki-ras2 | Acts as a molecular on/off switch; upon activation, it recruits and activates proteins necessary for the propagation of growth factors, as well as other cell signaling receptors |
NRAS | Neuroblastoma RAS viral (v-ras) oncogene homolog | ALPS4, CMNS, N-ras, NCMS, NRAS1, NS6, Neuroblastoma RAS viral oncogene homolog, NRAS proto-oncogene, GTPase | Involved in regulating cell division |
SPATA3 | Spermatogenesis-associated protein 3 | --- | Associated with spermatogenesis |
PSMD1 | 26S proteasome non-ATPase regulatory subunit 1 | P112, Rpn2, S1, proteasome 26S subunit, non-ATPase 1 | --- |
GPR55 | G-protein coupled receptor 55 | --- | --- |
- ^ "uncharacterized protein C2orf72 [Homo sapiens]". NCBI Protein. Retrieved 07/02/2019.
- ^ a b "C2orf72 chromosome 2 open reading frame 72 [ Homo sapiens (human) ]". NCBI Gene. Retrieved 07/02/2019.
- ^ "C2orf72 chromosome 2 open reading frame 72 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2019-08-04.
- ^ "GDS596 / 213143_at". www.ncbi.nlm.nih.gov. Retrieved 2019-08-04.
- ^ "C2orf72 Antibody". www.thermofisher.com. Retrieved 2019-08-04.
- ^ a b "SAPS < Sequence Statistics < EMBL-EBI". www.ebi.ac.uk. Retrieved 2019-08-04.
- ^ "ELM - Search the ELM resource". elm.eu.org. Retrieved 2019-08-04.
- ^ "CFSSP: Chou & Fasman Secondary Structure Prediction Server". www.biogem.org. Retrieved 2019-08-04.
- ^ "CFSSP: Chou & Fasman Secondary Structure Prediction Server". www.biogem.org. Retrieved 2019-08-04.
- ^ "hCG1777421 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2019-07-30.