C1orf159: Difference between revisions
m Dating maintenance tags: {{Cn}} |
consistent citation formatting |
||
Line 43: | Line 43: | ||
=== Composition === |
=== Composition === |
||
C1orf159 protein is a [[proline|proline-]] and [[arginine]]-rich, and a [[Lysine|lysine-]] and [[Glutamic acid|glutamic acid-]] poor protein. The [[isoelectric point]] of the human C1orf159 protein is 10.07,<ref>{{Cite web |title=SIB Swiss Institute of Bioinformatics {{!}} Expasy |url=https://www.expasy.org/ |access-date=2022-07-28 |website=www.expasy.org}}</ref> which is more [[Base (chemistry)|basic]] than the average human [[Proteomics|proteomic]] protein pI of 7.36.<ref>{{ |
C1orf159 protein is a [[proline|proline-]] and [[arginine]]-rich, and a [[Lysine|lysine-]] and [[Glutamic acid|glutamic acid-]] poor protein. The [[isoelectric point]] of the human C1orf159 protein is 10.07,<ref>{{Cite web |title=SIB Swiss Institute of Bioinformatics {{!}} Expasy |url=https://www.expasy.org/ |access-date=2022-07-28 |website=www.expasy.org}}</ref> which is more [[Base (chemistry)|basic]] than the average human [[Proteomics|proteomic]] protein pI of 7.36.<ref>{{cite journal | vauthors = Kurotani A, Tokmakov AA, Sato KI, Stefanov VE, Yamada Y, Sakurai T | title = Localization-specific distributions of protein pI in human proteome are governed by local pH and membrane charge | journal = BMC Molecular and Cell Biology | volume = 20 | issue = 1 | pages = 36 | date = August 2019 | pmid = 31429701 | pmc = 6701068 | doi = 10.1186/s12860-019-0221-4 }}</ref> |
||
=== Domain === |
=== Domain === |
||
Line 59: | Line 59: | ||
=== Orthologs === |
=== Orthologs === |
||
[[Sequence homology|Orthologs]] of human C1orf159 are found in vertebrates including [[Mammal|mammals]], [[Bird|birds]], [[Reptile|reptiles]], [[Amphibian|amphibians]], and [[fish]]<ref name=":5">{{Cite web |title=C1orf159 orthologs |url=https://www.ncbi.nlm.nih.gov/gene/54991/ortholog/ |access-date=2022-07-28 |website=NCBI |language=en}}</ref> with the most distantly related group of organisms being [[Chondrichthyes|cartilaginous fish]], with a date of divergence of approximately 450 million years ago.<ref>{{ |
[[Sequence homology|Orthologs]] of human C1orf159 are found in vertebrates including [[Mammal|mammals]], [[Bird|birds]], [[Reptile|reptiles]], [[Amphibian|amphibians]], and [[fish]]<ref name=":5">{{Cite web |title=C1orf159 orthologs |url=https://www.ncbi.nlm.nih.gov/gene/54991/ortholog/ |access-date=2022-07-28 |website=NCBI |language=en}}</ref> with the most distantly related group of organisms being [[Chondrichthyes|cartilaginous fish]], with a date of divergence of approximately 450 million years ago.<ref>{{cite journal | vauthors = Redmond AK, Macqueen DJ, Dooley H | title = Phylotranscriptomics suggests the jawed vertebrate ancestor could generate diverse helper and regulatory T cell subsets | journal = BMC Evolutionary Biology | volume = 18 | issue = 1 | pages = 169 | date = November 2018 | pmid = 30442091 | doi = 10.1186/s12862-018-1290-2 | pmc = 6238376 }}</ref> Orthologs are not found in [[Agnatha|jawless fish]] or [[Invertebrate|invertebrates]].<ref name=":5" /> |
||
{| class="wikitable mw-collapsible" |
{| class="wikitable mw-collapsible" |
||
|+Orthologs of C1orf159 Protein |
|+Orthologs of C1orf159 Protein |
Revision as of 13:45, 30 July 2022
This article has no lead section. (July 2022) |
Chromosome 1 Open Reading Frame 159 (C1orf159)
C1orf159 is a protein that in human is encoded by the C1orf159 gene located on chromosome 1.[1][2] This gene is also found to be an unfavorable prognosis marker for renal and liver cancer, and a favorable prognosis marker for urothelial cancer.[3]
Gene
The Homo sapiens C1orf159 gene (UniProt ID: Q96HA4) is a gene located on the long arm of chromosome 1 at locus 1p36.33.[2] The gene is 34,544 base pairs in length, located at Chromosome 1 position 1,081,818 to 1,116,361 on the reverse strand.[4]
Transcript
The longest variant of human C1orf159 gene encodes an mRNA that is 2,432 nucleotides in length with 12 exons.[5] A promoter region was predicted using UCSC Genome Browser,[6] which is 762 nucleotides long, including a 434 base-pair region upstream of the transcriptional start site, exon 1, and a 298 base-pair region of intron 1.[citation needed]
Protein
Size and Isoforms
Alternative splicing of the gene creates 5 protein isoforms.[1] The longest isoform is 380 amino acids in length with a molecular mass of 40.382 kDa.[1]
Isoform | UniProt[7] ID | Length (aa) |
---|---|---|
1 | Q96HA4-1 | 380 |
2 | Q96HA4-2 | 185 |
3 | Q96HA4-3 | 189 |
4 | Q96HA4-4 | 198 |
5 | Q96HA4-5 | 254 |
Composition
C1orf159 protein is a proline- and arginine-rich, and a lysine- and glutamic acid- poor protein. The isoelectric point of the human C1orf159 protein is 10.07,[8] which is more basic than the average human proteomic protein pI of 7.36.[9]
Domain
The human C1orf159 protein contains a domain of unknown function DUF4501.[1] Although the exact function of the domain is not clear, it is thought to be a single pass-membrane protein with highly conserved cysteine residues.[citation needed]
The protein also contains a transmembrane domain at positions 144-169[7] and a signal peptide at positions 1-18.[5][7]
Structure
Alphafold predicts the structure of human C1orf159 protein to be mainly composed of alpha-helices.[10]
Post-translational Modification
The predicted post-translational modifications of the C1orf159 protein includes N-linked glycosylation on asparagine at positions 104, 111, and 128.[1][11]
Homology/Evolution
Orthologs
Orthologs of human C1orf159 are found in vertebrates including mammals, birds, reptiles, amphibians, and fish[12] with the most distantly related group of organisms being cartilaginous fish, with a date of divergence of approximately 450 million years ago.[13] Orthologs are not found in jawless fish or invertebrates.[12]
Species | Group | Taxonomic
Group |
NCBI Protein Accession Number | Protein Sequence
Similarity (% Relative to Human Protein) |
Human | Mammals | Primates | NP_001317235.1 | 100.0 |
Chimpanzee | Primates | XP_024204744.1 | 98.4 | |
Bonobo | Primates | XP_008975653.2 | 88.9 | |
House Mouse | Rodentia | NP_796179.1 | 40.9 | |
Cattle | Artiodactyla | NP_001026925.1 | 36.6 | |
Sunda Flying Lemur | Dermoptera | XP_008567908.1 | 39.4 | |
Chinese Tree Shrew | Scandentia | XP_027622332.1 | 35.8 | |
Cougar | Carnivora | XP_025768111.1 | 41.7 | |
Chicken | Birds | Galliformes | XP_024998437.2 | 32.8 |
Rock Pigeon | Columbiformes | XP_013226562.2 | 35.7 | |
Hooded Crow | Passeriformes | XP_039420032.1 | 29.9 | |
Golden-collared Manakin | Passeriformes | XP_017934783.1 | 36.5 | |
Gharial | Reptiles | Crocodilia | XP_019367354.1 | 36.8 |
Leatherback Sea Turtle | Testudines | XP_027584571.1 | 35.9 | |
Chinese Softshell Turtle | Testudines | XP_006127168.1 | 35.2 | |
Western Clawed Frog | Anura | NP_001039047.1 | 34.6 | |
Two-lined Caecilian | Amphibians | Gymnophiona | XP_029433955.1 | 33.9 |
Asiatic Toad | Anura | XP_044137731.1 | 31.6 | |
Zebrafish | Fish | Cypriniformes | NP_001313355.1 | 26.4 |
Sterlet | Acipenseriformes | XP_034760226.1 | 32.8 | |
Reedfish | Polypteriformes | XP_028663678.1 | 32.9 | |
Small-spotted Catshark | Carcharhiniformes | XP_038629468.1 | 28.0 | |
Whale Shark | Orectolobiformes | XP_020381962.1 | 32.9 |
Evolutionary History
When compared with the evolution rate with cytochrome c and fibrinogen alpha, the C1orf159 protein has a similar evolutionary rate of change to the fast-evolving fibrinogen alpha protein, C1orf159 protein has a relatively fast evolution rate.[citation needed]
Clinical Significance
The Human Protein Atlas shows that C1orf159 is an unfavorable prognosis marker for renal and liver cancer, and a favorable prognosis marker for urothelial cancer, indicating that a high expression of C1orf159 is associated with a lower survival probability for patients with renal and liver cancer, and is associated with a higher survival probability for patients with urothelial cancer.[3]
References
- ^ a b c d e "C1orf159 Gene - GeneCards | CA159 Protein | CA159 Antibody". www.genecards.org. Retrieved 2022-06-20.
- ^ a b "Gene symbol report | HUGO Gene Nomenclature Committee". www.genenames.org. Retrieved 2022-06-20.
- ^ a b "Expression of C1orf159 in cancer - Summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2022-06-20.
- ^ "Gene: C1orf159 (ENSG00000131591) - Summary - Homo_sapiens - Ensembl genome browser 107". uswest.ensembl.org. Retrieved 2022-07-27.
- ^ a b "Homo sapiens chromosome 1 open reading frame 159 (C1orf159), transcript variant 1, mRNA". 2022-04-17.
{{cite journal}}
: Cite journal requires|journal=
(help) - ^ "UCSC Genome Browser Home". genome.ucsc.edu. Retrieved 2022-07-28.
- ^ a b c "UniProt". www.uniprot.org. Retrieved 2022-07-28.
- ^ "SIB Swiss Institute of Bioinformatics | Expasy". www.expasy.org. Retrieved 2022-07-28.
- ^ Kurotani A, Tokmakov AA, Sato KI, Stefanov VE, Yamada Y, Sakurai T (August 2019). "Localization-specific distributions of protein pI in human proteome are governed by local pH and membrane charge". BMC Molecular and Cell Biology. 20 (1): 36. doi:10.1186/s12860-019-0221-4. PMC 6701068. PMID 31429701.
{{cite journal}}
: CS1 maint: unflagged free DOI (link) - ^ "AlphaFold Protein Structure Database". alphafold.ebi.ac.uk. Retrieved 2022-07-28.
- ^ "C1orf159 - Proteomics". www.nextprot.org. Retrieved 2022-07-30.
- ^ a b "C1orf159 orthologs". NCBI. Retrieved 2022-07-28.
- ^ Redmond AK, Macqueen DJ, Dooley H (November 2018). "Phylotranscriptomics suggests the jawed vertebrate ancestor could generate diverse helper and regulatory T cell subsets". BMC Evolutionary Biology. 18 (1): 169. doi:10.1186/s12862-018-1290-2. PMC 6238376. PMID 30442091.
{{cite journal}}
: CS1 maint: unflagged free DOI (link) - ^ "Phylogeny.fr: "One Click" Mode". www.phylogeny.fr. Retrieved 2022-07-28.