C1orf159: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
AnomieBOT (talk | contribs)
m Dating maintenance tags: {{Cn}}
consistent citation formatting
Line 43: Line 43:


=== Composition ===
=== Composition ===
C1orf159 protein is a [[proline|proline-]] and [[arginine]]-rich, and a [[Lysine|lysine-]] and [[Glutamic acid|glutamic acid-]] poor protein. The [[isoelectric point]] of the human C1orf159 protein is 10.07,<ref>{{Cite web |title=SIB Swiss Institute of Bioinformatics {{!}} Expasy |url=https://www.expasy.org/ |access-date=2022-07-28 |website=www.expasy.org}}</ref> which is more [[Base (chemistry)|basic]] than the average human [[Proteomics|proteomic]] protein pI of 7.36.<ref>{{Cite journal |last=Kurotani |first=Atsushi |last2=Tokmakov |first2=Alexander A. |last3=Sato |first3=Ken-Ichi |last4=Stefanov |first4=Vasily E. |last5=Yamada |first5=Yutaka |last6=Sakurai |first6=Tetsuya |date=2019-08-20 |title=Localization-specific distributions of protein pI in human proteome are governed by local pH and membrane charge |url=https://doi.org/10.1186/s12860-019-0221-4 |journal=BMC Molecular and Cell Biology |volume=20 |issue=1 |pages=36 |doi=10.1186/s12860-019-0221-4 |issn=2661-8850 |pmc=PMC6701068 |pmid=31429701}}</ref>
C1orf159 protein is a [[proline|proline-]] and [[arginine]]-rich, and a [[Lysine|lysine-]] and [[Glutamic acid|glutamic acid-]] poor protein. The [[isoelectric point]] of the human C1orf159 protein is 10.07,<ref>{{Cite web |title=SIB Swiss Institute of Bioinformatics {{!}} Expasy |url=https://www.expasy.org/ |access-date=2022-07-28 |website=www.expasy.org}}</ref> which is more [[Base (chemistry)|basic]] than the average human [[Proteomics|proteomic]] protein pI of 7.36.<ref>{{cite journal | vauthors = Kurotani A, Tokmakov AA, Sato KI, Stefanov VE, Yamada Y, Sakurai T | title = Localization-specific distributions of protein pI in human proteome are governed by local pH and membrane charge | journal = BMC Molecular and Cell Biology | volume = 20 | issue = 1 | pages = 36 | date = August 2019 | pmid = 31429701 | pmc = 6701068 | doi = 10.1186/s12860-019-0221-4 }}</ref>


=== Domain ===
=== Domain ===
Line 59: Line 59:


=== Orthologs ===
=== Orthologs ===
[[Sequence homology|Orthologs]] of human C1orf159 are found in vertebrates including [[Mammal|mammals]], [[Bird|birds]], [[Reptile|reptiles]], [[Amphibian|amphibians]], and [[fish]]<ref name=":5">{{Cite web |title=C1orf159 orthologs |url=https://www.ncbi.nlm.nih.gov/gene/54991/ortholog/ |access-date=2022-07-28 |website=NCBI |language=en}}</ref> with the most distantly related group of organisms being [[Chondrichthyes|cartilaginous fish]], with a date of divergence of approximately 450 million years ago.<ref>{{Cite journal |last=Redmond |first=Anthony K. |last2=Macqueen |first2=Daniel J. |last3=Dooley |first3=Helen |date=2018-11-15 |title=Phylotranscriptomics suggests the jawed vertebrate ancestor could generate diverse helper and regulatory T cell subsets |url=https://doi.org/10.1186/s12862-018-1290-2 |journal=BMC Evolutionary Biology |volume=18 |issue=1 |pages=169 |doi=10.1186/s12862-018-1290-2 |issn=1471-2148}}</ref> Orthologs are not found in [[Agnatha|jawless fish]] or [[Invertebrate|invertebrates]].<ref name=":5" />
[[Sequence homology|Orthologs]] of human C1orf159 are found in vertebrates including [[Mammal|mammals]], [[Bird|birds]], [[Reptile|reptiles]], [[Amphibian|amphibians]], and [[fish]]<ref name=":5">{{Cite web |title=C1orf159 orthologs |url=https://www.ncbi.nlm.nih.gov/gene/54991/ortholog/ |access-date=2022-07-28 |website=NCBI |language=en}}</ref> with the most distantly related group of organisms being [[Chondrichthyes|cartilaginous fish]], with a date of divergence of approximately 450 million years ago.<ref>{{cite journal | vauthors = Redmond AK, Macqueen DJ, Dooley H | title = Phylotranscriptomics suggests the jawed vertebrate ancestor could generate diverse helper and regulatory T cell subsets | journal = BMC Evolutionary Biology | volume = 18 | issue = 1 | pages = 169 | date = November 2018 | pmid = 30442091 | doi = 10.1186/s12862-018-1290-2 | pmc = 6238376 }}</ref> Orthologs are not found in [[Agnatha|jawless fish]] or [[Invertebrate|invertebrates]].<ref name=":5" />
{| class="wikitable mw-collapsible"
{| class="wikitable mw-collapsible"
|+Orthologs of C1orf159 Protein
|+Orthologs of C1orf159 Protein

Revision as of 13:45, 30 July 2022

Chromosome 1 Open Reading Frame 159 (C1orf159)

C1orf159 is a protein that in human is encoded by the C1orf159 gene located on chromosome 1.[1][2] This gene is also found to be an unfavorable prognosis marker for renal and liver cancer, and a favorable prognosis marker for urothelial cancer.[3]

Gene

The Homo sapiens C1orf159 gene (UniProt ID: Q96HA4) is a gene located on the long arm of chromosome 1 at locus 1p36.33.[2] The gene is 34,544 base pairs in length, located at Chromosome 1 position 1,081,818 to 1,116,361 on the reverse strand.[4]

Transcript

The longest variant of human C1orf159 gene encodes an mRNA that is 2,432 nucleotides in length with 12 exons.[5] A promoter region was predicted using UCSC Genome Browser,[6] which is 762 nucleotides long, including a 434 base-pair region upstream of the transcriptional start site, exon 1, and a 298 base-pair region of intron 1.[citation needed]

Protein

Size and Isoforms

Alternative splicing of the gene creates 5 protein isoforms.[1] The longest isoform is 380 amino acids in length with a molecular mass of 40.382 kDa.[1]

Isoforms of human C1orf159 protein
Isoform UniProt[7] ID Length (aa)
1 Q96HA4-1 380
2 Q96HA4-2 185
3 Q96HA4-3 189
4 Q96HA4-4 198
5 Q96HA4-5 254

Composition

C1orf159 protein is a proline- and arginine-rich, and a lysine- and glutamic acid- poor protein. The isoelectric point of the human C1orf159 protein is 10.07,[8] which is more basic than the average human proteomic protein pI of 7.36.[9]

Domain

The human C1orf159 protein contains a domain of unknown function DUF4501.[1] Although the exact function of the domain is not clear, it is thought to be a single pass-membrane protein with highly conserved cysteine residues.[citation needed]

The protein also contains a transmembrane domain at positions 144-169[7] and a signal peptide at positions 1-18.[5][7]

Structure

Alphafold predicts the structure of human C1orf159 protein to be mainly composed of alpha-helices.[10]

Post-translational Modification

The predicted post-translational modifications of the C1orf159 protein includes N-linked glycosylation on asparagine at positions 104, 111, and 128.[1][11]

Homology/Evolution

Orthologs

Orthologs of human C1orf159 are found in vertebrates including mammals, birds, reptiles, amphibians, and fish[12] with the most distantly related group of organisms being cartilaginous fish, with a date of divergence of approximately 450 million years ago.[13] Orthologs are not found in jawless fish or invertebrates.[12]

Orthologs of C1orf159 Protein
Species Group Taxonomic

Group

NCBI Protein Accession Number Protein Sequence

Similarity

(% Relative to Human Protein)

Human Mammals Primates NP_001317235.1 100.0
Chimpanzee Primates XP_024204744.1 98.4
Bonobo Primates XP_008975653.2 88.9
House Mouse Rodentia NP_796179.1 40.9
Cattle Artiodactyla NP_001026925.1 36.6
Sunda Flying Lemur Dermoptera XP_008567908.1 39.4
Chinese Tree Shrew Scandentia XP_027622332.1 35.8
Cougar Carnivora XP_025768111.1 41.7
Chicken Birds Galliformes XP_024998437.2 32.8
Rock Pigeon Columbiformes XP_013226562.2 35.7
Hooded Crow Passeriformes XP_039420032.1 29.9
Golden-collared Manakin Passeriformes XP_017934783.1 36.5
Gharial Reptiles Crocodilia XP_019367354.1 36.8
Leatherback Sea Turtle Testudines XP_027584571.1 35.9
Chinese Softshell Turtle Testudines XP_006127168.1 35.2
Western Clawed Frog Anura NP_001039047.1 34.6
Two-lined Caecilian Amphibians Gymnophiona XP_029433955.1 33.9
Asiatic Toad Anura XP_044137731.1 31.6
Zebrafish Fish Cypriniformes NP_001313355.1 26.4
Sterlet Acipenseriformes XP_034760226.1 32.8
Reedfish Polypteriformes XP_028663678.1 32.9
Small-spotted Catshark Carcharhiniformes XP_038629468.1 28.0
Whale Shark Orectolobiformes XP_020381962.1 32.9
Unrooted phylogenetic tree of C1orf159 orthologs generated by Phylogeny.fr.[14]

Evolutionary History

When compared with the evolution rate with cytochrome c and fibrinogen alpha, the C1orf159 protein has a similar evolutionary rate of change to the fast-evolving fibrinogen alpha protein, C1orf159 protein has a relatively fast evolution rate.[citation needed]

Evolutionary change of C1orf159 protein compared to the change of Cytochrome C and Fibrinogen Alpha. m in the vertical axis is defined as the total number of amino acid changes occurred in a 100-amino acid segment of a protein.

Clinical Significance

The Human Protein Atlas shows that C1orf159 is an unfavorable prognosis marker for renal and liver cancer, and a favorable prognosis marker for urothelial cancer, indicating that a high expression of C1orf159 is associated with a lower survival probability for patients with renal and liver cancer, and is associated with a higher survival probability for patients with urothelial cancer.[3]

References

  1. ^ a b c d e "C1orf159 Gene - GeneCards | CA159 Protein | CA159 Antibody". www.genecards.org. Retrieved 2022-06-20.
  2. ^ a b "Gene symbol report | HUGO Gene Nomenclature Committee". www.genenames.org. Retrieved 2022-06-20.
  3. ^ a b "Expression of C1orf159 in cancer - Summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2022-06-20.
  4. ^ "Gene: C1orf159 (ENSG00000131591) - Summary - Homo_sapiens - Ensembl genome browser 107". uswest.ensembl.org. Retrieved 2022-07-27.
  5. ^ a b "Homo sapiens chromosome 1 open reading frame 159 (C1orf159), transcript variant 1, mRNA". 2022-04-17. {{cite journal}}: Cite journal requires |journal= (help)
  6. ^ "UCSC Genome Browser Home". genome.ucsc.edu. Retrieved 2022-07-28.
  7. ^ a b c "UniProt". www.uniprot.org. Retrieved 2022-07-28.
  8. ^ "SIB Swiss Institute of Bioinformatics | Expasy". www.expasy.org. Retrieved 2022-07-28.
  9. ^ Kurotani A, Tokmakov AA, Sato KI, Stefanov VE, Yamada Y, Sakurai T (August 2019). "Localization-specific distributions of protein pI in human proteome are governed by local pH and membrane charge". BMC Molecular and Cell Biology. 20 (1): 36. doi:10.1186/s12860-019-0221-4. PMC 6701068. PMID 31429701.{{cite journal}}: CS1 maint: unflagged free DOI (link)
  10. ^ "AlphaFold Protein Structure Database". alphafold.ebi.ac.uk. Retrieved 2022-07-28.
  11. ^ "C1orf159 - Proteomics". www.nextprot.org. Retrieved 2022-07-30.
  12. ^ a b "C1orf159 orthologs". NCBI. Retrieved 2022-07-28.
  13. ^ Redmond AK, Macqueen DJ, Dooley H (November 2018). "Phylotranscriptomics suggests the jawed vertebrate ancestor could generate diverse helper and regulatory T cell subsets". BMC Evolutionary Biology. 18 (1): 169. doi:10.1186/s12862-018-1290-2. PMC 6238376. PMID 30442091.{{cite journal}}: CS1 maint: unflagged free DOI (link)
  14. ^ "Phylogeny.fr: "One Click" Mode". www.phylogeny.fr. Retrieved 2022-07-28.