Hp53int1: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
added Category:Human proteins; removed {{uncategorized}} using HotCat
Citation bot (talk | contribs)
Alter: template type. Add: doi, doi-access, pmc, s2cid, pmid, authors 1-1. Removed proxy/dead URL that duplicated identifier. Removed parameters. Some additions/deletions were parameter name changes. | Use this bot. Report bugs. | Suggested by Corvus florensis | #UCB_webform 228/559
Line 6: Line 6:
== Gene ==
== Gene ==
[[File:Hp53int1 location.png|thumb|454x454px|'''Figure 1:''' The location of Hp53int1 protein on chromosome 17p.13, region 7,685,260 bp-7,686,371 bp]]
[[File:Hp53int1 location.png|thumb|454x454px|'''Figure 1:''' The location of Hp53int1 protein on chromosome 17p.13, region 7,685,260 bp-7,686,371 bp]]
The Hp53int1 gene is located on chromosome 17p.13, encoded by a DNA sequence 1125 base pairs in length, covering region 7,685,260 bp-7,686,371 bp.<ref name=":0">{{Cite web |title=Hp53int1 human unknown protein |url=https://www.ncbi.nlm.nih.gov/nuccore/U58658.1 |access-date=17 December 2022 |website=NCBI Protein BLAST}}</ref> The Hp53int1 gene has two aliases, WRAP53int1 and TP53int1, in accordance to its overlap with the WRAP53 and TP53int1 genes on chromosome 17.<ref name=":1">{{Cite journal |last=Altschul |first=Stephen F. |last2=Gish |first2=Warren |last3=Miller |first3=Webb |last4=Myers |first4=Eugene W. |last5=Lipman |first5=David J. |date=October 1990 |title=Basic local alignment search tool |url=http://dx.doi.org/10.1016/s0022-2836(05)80360-2 |journal=Journal of Molecular Biology |volume=215 |issue=3 |pages=403–410 |doi=10.1016/s0022-2836(05)80360-2 |issn=0022-2836}}</ref> Hp53int1 is located downstream of the p53p2 start site.<ref>{{Cite journal |last=Reisman |first=David |last2=Bálint |first2=Éva |last3=Loging |first3=W.Troy |last4=Rotter |first4=Varda |last5=Almon |first5=Einat |date=December 1996 |title=A Novel Transcript Encoded within the 10-kb First Intron of the Human p53 Tumor Suppressor Gene (D17S2179E) Is Induced during Differentiation of Myeloid Leukemia Cells |url=https://linkinghub.elsevier.com/retrieve/pii/S0888754396906394 |journal=Genomics |language=en |volume=38 |issue=3 |pages=364–370 |doi=10.1006/geno.1996.0639}}</ref>
The Hp53int1 gene is located on chromosome 17p.13, encoded by a DNA sequence 1125 base pairs in length, covering region 7,685,260 bp-7,686,371 bp.<ref name=":0">{{Cite web |title=Hp53int1 human unknown protein |url=https://www.ncbi.nlm.nih.gov/nuccore/U58658.1 |access-date=17 December 2022 |website=NCBI Protein BLAST}}</ref> The Hp53int1 gene has two aliases, WRAP53int1 and TP53int1, in accordance to its overlap with the WRAP53 and TP53int1 genes on chromosome 17.<ref name=":1">{{Cite journal |last1=Altschul |first1=Stephen F. |last2=Gish |first2=Warren |last3=Miller |first3=Webb |last4=Myers |first4=Eugene W. |last5=Lipman |first5=David J. |date=October 1990 |title=Basic local alignment search tool |url=http://dx.doi.org/10.1016/s0022-2836(05)80360-2 |journal=Journal of Molecular Biology |volume=215 |issue=3 |pages=403–410 |doi=10.1016/s0022-2836(05)80360-2 |pmid=2231712 |s2cid=14441902 |issn=0022-2836}}</ref> Hp53int1 is located downstream of the p53p2 start site.<ref>{{Cite journal |last1=Reisman |first1=David |last2=Bálint |first2=Éva |last3=Loging |first3=W.Troy |last4=Rotter |first4=Varda |last5=Almon |first5=Einat |date=December 1996 |title=A Novel Transcript Encoded within the 10-kb First Intron of the Human p53 Tumor Suppressor Gene (D17S2179E) Is Induced during Differentiation of Myeloid Leukemia Cells |url=https://linkinghub.elsevier.com/retrieve/pii/S0888754396906394 |journal=Genomics |language=en |volume=38 |issue=3 |pages=364–370 |doi=10.1006/geno.1996.0639}}</ref>


Hp53int1 does not have multiple exons, and therefore has no isoforms.
Hp53int1 does not have multiple exons, and therefore has no isoforms.


=== Important Relationship to TP53 ===
=== Important Relationship to TP53 ===
The Hp53int1 gene is projected in the same direction as TP53 and is present in myeloid leukemia cells HL-60 and U937.<ref>{{Cite journal |last=Ortiz-Cuaran |first=Sandra |last2=Cox |first2=David |last3=Villar |first3=Stéphanie |last4=Friesen |first4=Marlin D. |last5=Durand |first5=Geoffroy |last6=Chabrier |first6=Amélie |last7=Khuhaprema |first7=Thiravud |last8=Sangrajrang |first8=Suleeporn |last9=Ognjanovic |first9=Simona |last10=Groopman |first10=John D. |last11=Hainaut |first11=Pierre |last12=Le Calvez-Kelm |first12=Florence |date=October 2013 |title=Association between TP53 R249S mutation and polymorphisms in TP53 intron 1 in hepatocellular carcinoma: TP53 R249S Associated with Intron 1 Polymorphisms |url=https://onlinelibrary.wiley.com/doi/10.1002/gcc.22086 |journal=Genes, Chromosomes and Cancer |language=en |volume=52 |issue=10 |pages=912–919 |doi=10.1002/gcc.22086}}</ref> This suggests a strong relationship to the TP53 gene and subsequent protein, including a share of transcription factors, promoter signals, tissue expression, and subcellular localization. While these genes are not identical, this relationship may give clues to function, structure, and expression.
The Hp53int1 gene is projected in the same direction as TP53 and is present in myeloid leukemia cells HL-60 and U937.<ref>{{Cite journal |last1=Ortiz-Cuaran |first1=Sandra |last2=Cox |first2=David |last3=Villar |first3=Stéphanie |last4=Friesen |first4=Marlin D. |last5=Durand |first5=Geoffroy |last6=Chabrier |first6=Amélie |last7=Khuhaprema |first7=Thiravud |last8=Sangrajrang |first8=Suleeporn |last9=Ognjanovic |first9=Simona |last10=Groopman |first10=John D. |last11=Hainaut |first11=Pierre |last12=Le Calvez-Kelm |first12=Florence |date=October 2013 |title=Association between TP53 R249S mutation and polymorphisms in TP53 intron 1 in hepatocellular carcinoma: TP53 R249S Associated with Intron 1 Polymorphisms |url=https://onlinelibrary.wiley.com/doi/10.1002/gcc.22086 |journal=Genes, Chromosomes and Cancer |language=en |volume=52 |issue=10 |pages=912–919 |doi=10.1002/gcc.22086|pmid=23836507 |s2cid=21587166 }}</ref> This suggests a strong relationship to the TP53 gene and subsequent protein, including a share of transcription factors, promoter signals, tissue expression, and subcellular localization. While these genes are not identical, this relationship may give clues to function, structure, and expression.


== mRNA Transcript ==
== mRNA Transcript ==
[[File:Hp53int1 5' UTR.png|thumb|294x294px|'''5' UTR Analysis'''<ref>{{Cite web |url=http://www.unafold.org/ |access-date=2022-12-16 |website=www.unafold.org}}</ref>''':'''''The consensus folding of the 5’ UTR of Hp53int1. Two loops are formed from amino acids 1-9 and 12-21 respectively. The connecting amino acids of both loops are rich in G’s and C’s.'']]Hp53int1 is encoded by a polyadenylated transcript '''1125 base pairs''' in length. There is a repeat sequence between base pairs 633…926 and a regulatory Poly A tail sequence between base pairs 496…1000.<ref name=":0" /> The repeat sequence between base pairs 633 and 926 is noted to be similar to the ''Alu SC subfamily repeat'', a sequence that characterizes the most abundant repeat sequences in humans and primates, and which likely diverged from other Alu subfamily complexes around 32 million years ago.<ref>{{Cite journal |last=Price |first=Alkes L. |last2=Eskin |first2=Eleazar |last3=Pevzner |first3=Pavel A. |date=November 2004 |title=Whole-genome analysis of Alu repeat elements reveals complex evolutionary history |url=http://genome.cshlp.org/lookup/doi/10.1101/gr.2693004 |journal=Genome Research |language=en |volume=14 |issue=11 |pages=2245–2252 |doi=10.1101/gr.2693004 |issn=1088-9051}}</ref>
[[File:Hp53int1 5' UTR.png|thumb|294x294px|'''5' UTR Analysis'''<ref>{{Cite web |url=http://www.unafold.org/ |access-date=2022-12-16 |website=www.unafold.org}}</ref>''':'''''The consensus folding of the 5’ UTR of Hp53int1. Two loops are formed from amino acids 1-9 and 12-21 respectively. The connecting amino acids of both loops are rich in G’s and C’s.'']]Hp53int1 is encoded by a polyadenylated transcript '''1125 base pairs''' in length. There is a repeat sequence between base pairs 633…926 and a regulatory Poly A tail sequence between base pairs 496…1000.<ref name=":0" /> The repeat sequence between base pairs 633 and 926 is noted to be similar to the ''Alu SC subfamily repeat'', a sequence that characterizes the most abundant repeat sequences in humans and primates, and which likely diverged from other Alu subfamily complexes around 32 million years ago.<ref>{{Cite journal |last1=Price |first1=Alkes L. |last2=Eskin |first2=Eleazar |last3=Pevzner |first3=Pavel A. |date=November 2004 |title=Whole-genome analysis of Alu repeat elements reveals complex evolutionary history |journal=Genome Research |language=en |volume=14 |issue=11 |pages=2245–2252 |doi=10.1101/gr.2693004 |pmid=15520288 |pmc=525682 |issn=1088-9051}}</ref>


=== Promoter Sequence ===
=== Promoter Sequence ===
The promoter sequence of the Hp53int1 gene was shown to be 540 base pairs in length.<ref name=":2">{{Cite journal |last=Kent |first=W. James |last2=Sugnet |first2=Charles W. |last3=Furey |first3=Terrence S. |last4=Roskin |first4=Krishna M. |last5=Pringle |first5=Tom H. |last6=Zahler |first6=Alan M. |last7=Haussler |first7=and David |date=2002-05-16 |title=The Human Genome Browser at UCSC |url=http://dx.doi.org/10.1101/gr.229102 |journal=Genome Research |volume=12 |issue=6 |pages=996–1006 |doi=10.1101/gr.229102 |issn=1088-9051}}</ref> Transcription factors '''SMAD2''', '''SRY''', and '''ETV7''' had Basewise Conservation Scores of 2.99, 2.73, and 3.03 respectively, with value of 4 indicating the highest conservation and -0.5 the lowest. KLF2 and KLF5 had scores of 3.73 each. The species being compared were elephant, dog, rhesus macaque, and chicken.
The promoter sequence of the Hp53int1 gene was shown to be 540 base pairs in length.<ref name=":2">{{Cite journal |last1=Kent |first1=W. James |last2=Sugnet |first2=Charles W. |last3=Furey |first3=Terrence S. |last4=Roskin |first4=Krishna M. |last5=Pringle |first5=Tom H. |last6=Zahler |first6=Alan M. |last7=Haussler |first7=and David |date=2002-05-16 |title=The Human Genome Browser at UCSC |url=http://dx.doi.org/10.1101/gr.229102 |journal=Genome Research |volume=12 |issue=6 |pages=996–1006 |doi=10.1101/gr.229102 |pmid=12045153 |pmc=186604 |issn=1088-9051}}</ref> Transcription factors '''SMAD2''', '''SRY''', and '''ETV7''' had Basewise Conservation Scores of 2.99, 2.73, and 3.03 respectively, with value of 4 indicating the highest conservation and -0.5 the lowest. KLF2 and KLF5 had scores of 3.73 each. The species being compared were elephant, dog, rhesus macaque, and chicken.


== Protein ==
== Protein ==
The Hp53int1 protein has a molecular weight of 13.3 kdal and is 118 amino acids in length.<ref name=":1" /><ref name="Brendel 2002–2006">{{Cite journal |last=Brendel |first=V |last2=Bucher |first2=P |last3=Nourbakhsh |first3=I R |last4=Blaisdell |first4=B E |last5=Karlin |first5=S |date=1992-03-15 |title=Methods and algorithms for statistical analysis of protein sequences. |url=http://dx.doi.org/10.1073/pnas.89.6.2002 |journal=Proceedings of the National Academy of Sciences |volume=89 |issue=6 |pages=2002–2006 |doi=10.1073/pnas.89.6.2002 |issn=0027-8424}}</ref> It is a basic protein.<ref name="Brendel 2002–2006"/> There is one highly conserved amino acid sequence between 102-110.<ref>{{Cite web |last=not provided |first=Bioline |date=2016-10-09 |title=Clean-up using AMPure XP beads v1 |url=http://dx.doi.org/10.17504/protocols.io.f3ebqje |access-date=2022-12-16 |website=dx.doi.org}}</ref> Serine site distribution is significantly higher than Arginine and Tyrosine sites.<ref name=":3">{{Cite web |title=GPS 5.0 - Kinase-specific Phosphorylation Site Prediction |url=http://gps.biocuckoo.cn/wsresult.php?p=1 |access-date=2022-12-16 |website=gps.biocuckoo.cn}}</ref> There is a possible Casein Kinase II phosphorylation site between amino acids 22-25.<ref>{{Cite web |title=Motif Scan |url=https://myhits.sib.swiss/cgi-bin/motif_scan |access-date=2022-12-16 |website=myhits.sib.swiss |language=en}}</ref> Casein Kinase II phosphorylation is involved in cell proliferation. Three regions of disorder are predicted for the Hp53int1 protein.<ref name=":3" />
The Hp53int1 protein has a molecular weight of 13.3 kdal and is 118 amino acids in length.<ref name=":1" /><ref name="Brendel 2002–2006">{{Cite journal |last1=Brendel |first1=V |last2=Bucher |first2=P |last3=Nourbakhsh |first3=I R |last4=Blaisdell |first4=B E |last5=Karlin |first5=S |date=1992-03-15 |title=Methods and algorithms for statistical analysis of protein sequences. |journal=Proceedings of the National Academy of Sciences |volume=89 |issue=6 |pages=2002–2006 |doi=10.1073/pnas.89.6.2002 |pmid=1549558 |issn=0027-8424|doi-access=free }}</ref> It is a basic protein.<ref name="Brendel 2002–2006"/> There is one highly conserved amino acid sequence between 102-110.<ref>{{Cite journal |last=not provided |first=Bioline |date=2016-10-09 |title=Clean-up using AMPure XP beads v1 |url=http://dx.doi.org/10.17504/protocols.io.f3ebqje |access-date=2022-12-16 |website=dx.doi.org|doi=10.17504/protocols.io.f3ebqje }}</ref> Serine site distribution is significantly higher than Arginine and Tyrosine sites.<ref name=":3">{{Cite web |title=GPS 5.0 - Kinase-specific Phosphorylation Site Prediction |url=http://gps.biocuckoo.cn/wsresult.php?p=1 |access-date=2022-12-16 |website=gps.biocuckoo.cn}}</ref> There is a possible Casein Kinase II phosphorylation site between amino acids 22-25.<ref>{{Cite web |title=Motif Scan |url=https://myhits.sib.swiss/cgi-bin/motif_scan |access-date=2022-12-16 |website=myhits.sib.swiss |language=en}}</ref> Casein Kinase II phosphorylation is involved in cell proliferation. Three regions of disorder are predicted for the Hp53int1 protein.<ref name=":3" />
[[File:Hp53int1 Web Logo.png|thumb|''Web Logo''<ref>{{Cite web |title=WebLogo - Create Sequence Logos |url=https://weblogo.berkeley.edu/logo.cgi |access-date=2022-12-16 |website=weblogo.berkeley.edu}}</ref> ''results using a multiple sequence alignment between Hp53int1 and its orthologs.'']]
[[File:Hp53int1 Web Logo.png|thumb|''Web Logo''<ref>{{Cite web |title=WebLogo - Create Sequence Logos |url=https://weblogo.berkeley.edu/logo.cgi |access-date=2022-12-16 |website=weblogo.berkeley.edu}}</ref> ''results using a multiple sequence alignment between Hp53int1 and its orthologs.'']]


Line 26: Line 26:
There is a possible '''O-ß-GlcNAc''' site between amino acids '''14-20'''.<ref name="Services">{{Cite web |title=Services |url=https://services.healthtech.dtu.dk/ |access-date=2022-12-16 |website=healthtech.dtu.dk |language=en}}</ref> There are no transmembrane domains nor signal peptides.
There is a possible '''O-ß-GlcNAc''' site between amino acids '''14-20'''.<ref name="Services">{{Cite web |title=Services |url=https://services.healthtech.dtu.dk/ |access-date=2022-12-16 |website=healthtech.dtu.dk |language=en}}</ref> There are no transmembrane domains nor signal peptides.


There is a conserved '''phosphorylation site''' at S5, S15, and S20.<ref>{{Cite journal |last=Hornbeck |first=Peter V. |last2=Zhang |first2=Bin |last3=Murray |first3=Beth |last4=Kornhauser |first4=Jon M. |last5=Latham |first5=Vaughan |last6=Skrzypek |first6=Elzbieta |date=2014-12-16 |title=PhosphoSitePlus, 2014: mutations, PTMs and recalibrations |url=http://dx.doi.org/10.1093/nar/gku1267 |journal=Nucleic Acids Research |volume=43 |issue=D1 |pages=D512–D520 |doi=10.1093/nar/gku1267 |issn=1362-4962}}</ref>
There is a conserved '''phosphorylation site''' at S5, S15, and S20.<ref>{{Cite journal |last1=Hornbeck |first1=Peter V. |last2=Zhang |first2=Bin |last3=Murray |first3=Beth |last4=Kornhauser |first4=Jon M. |last5=Latham |first5=Vaughan |last6=Skrzypek |first6=Elzbieta |date=2014-12-16 |title=PhosphoSitePlus, 2014: mutations, PTMs and recalibrations |url=http://dx.doi.org/10.1093/nar/gku1267 |journal=Nucleic Acids Research |volume=43 |issue=D1 |pages=D512–D520 |doi=10.1093/nar/gku1267 |issn=1362-4962}}</ref>
[[File:Yin O Yang results for Hp53int1.png|thumb|269x269px|Predictions for O-ß-GlcNAc in the Hp53int1 protein given by Yin O Yang.<ref name="Services"/> The green bars represent the potential for O-ß-GlcNAc attachment sites in Hp53int1, and must reach a value > 0.50 to be considered a likely site. There is one predicted attachment site for Hp53int1.]]
[[File:Yin O Yang results for Hp53int1.png|thumb|269x269px|Predictions for O-ß-GlcNAc in the Hp53int1 protein given by Yin O Yang.<ref name="Services"/> The green bars represent the potential for O-ß-GlcNAc attachment sites in Hp53int1, and must reach a value > 0.50 to be considered a likely site. There is one predicted attachment site for Hp53int1.]]
Line 51: Line 51:
Using NCBI BLAST<ref name=":0" /> sequence analysis and Clustal W<ref name=":5">{{Cite web |title=Clustal Omega < Multiple Sequence Alignment < EMBL-EBI |url=https://www.ebi.ac.uk/Tools/msa/clustalo/ |access-date=2022-12-16 |website=www.ebi.ac.uk}}</ref> a multiple sequence alignment was formulated for Hp53int1 and twenty top BLAST hits.
Using NCBI BLAST<ref name=":0" /> sequence analysis and Clustal W<ref name=":5">{{Cite web |title=Clustal Omega < Multiple Sequence Alignment < EMBL-EBI |url=https://www.ebi.ac.uk/Tools/msa/clustalo/ |access-date=2022-12-16 |website=www.ebi.ac.uk}}</ref> a multiple sequence alignment was formulated for Hp53int1 and twenty top BLAST hits.


While the objective of this research was to find several orthologs across a variety of mammals, vertebrates, and invertebrates, there were only results in two classes: '''primates''' and '''bacteria.''' Furthermore, there was only one amino acid sequence within Hp53int1 and its orthologs of a general 20 amino acids in length that directly aligned (see Web Logo results in Figure *****). The primate genes are located on chromosome 13 while Hp53int1 is located on chromosome 17.<ref>{{Cite journal |last=Kent |first=W. J. |date=2002-03-20 |title=BLAT---The BLAST-Like Alignment Tool |journal=Genome Research |volume=12 |issue=4 |pages=656–664 |doi=10.1101/gr.229202 |issn=1088-9051}}</ref> There is also evidence for subcellular localization of the primate proteins within the nuclear envelope, while evidence for Hp53int1 suggests it is in the cytoplasm.<ref name=":4" /> Thus, it is reasonable to assume that the '''primates are not strict orthologs'''. A possible explanation for why strict Hp53int1 alignments are only found in bacteria is a '''possible crossing over event''' shared between bacteria and a common ancestor of primates. [[File:Hp53int1 ortholog table.png|center|thumb|1196x1196px]]
While the objective of this research was to find several orthologs across a variety of mammals, vertebrates, and invertebrates, there were only results in two classes: '''primates''' and '''bacteria.''' Furthermore, there was only one amino acid sequence within Hp53int1 and its orthologs of a general 20 amino acids in length that directly aligned (see Web Logo results in Figure *****). The primate genes are located on chromosome 13 while Hp53int1 is located on chromosome 17.<ref>{{Cite journal |last=Kent |first=W. J. |date=2002-03-20 |title=BLAT---The BLAST-Like Alignment Tool |journal=Genome Research |volume=12 |issue=4 |pages=656–664 |doi=10.1101/gr.229202 |pmid=11932250 |pmc=187518 |issn=1088-9051}}</ref> There is also evidence for subcellular localization of the primate proteins within the nuclear envelope, while evidence for Hp53int1 suggests it is in the cytoplasm.<ref name=":4" /> Thus, it is reasonable to assume that the '''primates are not strict orthologs'''. A possible explanation for why strict Hp53int1 alignments are only found in bacteria is a '''possible crossing over event''' shared between bacteria and a common ancestor of primates. [[File:Hp53int1 ortholog table.png|center|thumb|1196x1196px]]
[[File:Phylogenetic tree for Hp53int1.png|thumb|276x276px|''The phylogenetic tree of the Hp53int1 distant orthologs in the radial format''<ref name=":5" />''.'' |center]]
[[File:Phylogenetic tree for Hp53int1.png|thumb|276x276px|''The phylogenetic tree of the Hp53int1 distant orthologs in the radial format''<ref name=":5" />''.'' |center]]


== Function ==
== Function ==
The Hp53int1 protein is likely involved in the '''regulation of cellular proliferation and apoptosis'''. This is indicated by the use of apoptosis-regulator and ubiquitination regulator interacting proteins, its share of transcription factors with TP53 (especially the '''SMAD2''', '''SRY''', and '''ETV7''' transcription factors), its location downstream of the '''p53p2 start site''', its '''casein kinase II phosphorylation site,''' and its '''ubiquitous expression across tissues'''. The Hp53int1 gene expression is increased in environments when proteins that reduce cell proliferation are overexpressed, suggesting its requirement in cellular environments that need to arrest cell growth.<ref name="Home - GEO Profiles - NCBI"/> Also, quantitave analysis shows that the general hp53 protein is absolutely required for the activation of cellular response to DNA damage.<ref>{{Cite journal |last=Gotoh |first=Tetsuya |last2=Vila-Caballer |first2=Marian |last3=Liu |first3=Jingjing |last4=Schiffhauer |first4=Samuel |last5=Finkielstein |first5=Carla V. |date=2015-01-15 |editor-last=Solomon |editor-first=Mark J. |title=Association of the circadian factor Period 2 to p53 influences p53's function in DNA-damage signaling |url=https://www.molbiolcell.org/doi/10.1091/mbc.E14-05-0994 |journal=Molecular Biology of the Cell |language=en |volume=26 |issue=2 |pages=359–372 |doi=10.1091/mbc.E14-05-0994 |issn=1059-1524}}</ref>
The Hp53int1 protein is likely involved in the '''regulation of cellular proliferation and apoptosis'''. This is indicated by the use of apoptosis-regulator and ubiquitination regulator interacting proteins, its share of transcription factors with TP53 (especially the '''SMAD2''', '''SRY''', and '''ETV7''' transcription factors), its location downstream of the '''p53p2 start site''', its '''casein kinase II phosphorylation site,''' and its '''ubiquitous expression across tissues'''. The Hp53int1 gene expression is increased in environments when proteins that reduce cell proliferation are overexpressed, suggesting its requirement in cellular environments that need to arrest cell growth.<ref name="Home - GEO Profiles - NCBI"/> Also, quantitave analysis shows that the general hp53 protein is absolutely required for the activation of cellular response to DNA damage.<ref>{{Cite journal |last1=Gotoh |first1=Tetsuya |last2=Vila-Caballer |first2=Marian |last3=Liu |first3=Jingjing |last4=Schiffhauer |first4=Samuel |last5=Finkielstein |first5=Carla V. |date=2015-01-15 |editor-last=Solomon |editor-first=Mark J. |title=Association of the circadian factor Period 2 to p53 influences p53's function in DNA-damage signaling |journal=Molecular Biology of the Cell |language=en |volume=26 |issue=2 |pages=359–372 |doi=10.1091/mbc.E14-05-0994 |pmid=25411341 |pmc=4294682 |issn=1059-1524}}</ref>


=== Interacting Proteins ===
=== Interacting Proteins ===
Line 65: Line 65:
== Clinical Significance ==
== Clinical Significance ==


There is evidence that the Hp53int1 gene is involved in '''tumor suppression'''. Hp53int1 overlaps with TP53 exon 1 ('''TAD1''').<ref>{{Cite journal |last=Hainaut |first=Pierre |last2=Pfeifer |first2=Gerd P. |date=November 2016 |title=Somatic TP53 Mutations in the Era of Genome Sequencing |url=https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5088513/ |journal=Cold Spring Harbor Perspectives in Medicine |volume=6 |issue=11 |pages=a026179 |doi=10.1101/cshperspect.a026179 |issn=2157-1422 |pmc=5088513 |pmid=27503997}}</ref> Mutations within this region can result in alternative exon expression or incomplete splicing and a loss of tumor suppressor function. Regarding osteosarcoma, these rearrangements within TAD1 have been detected in ∼20% of the cases involving incomplete splicing of TP53. These rearrangements are located across the full sequence of TP53 intron 1, but the majority of them will cluster in a domain of the Hp53int1 transcript, suggesting that these rearrangements may be facilitated by chromatin conformation within this locus<ref>{{Cite journal |last=Chen |first=Xiang |last2=Bahrami |first2=Armita |last3=Pappo |first3=Alberto |last4=Easton |first4=John |last5=Dalton |first5=James |last6=Hedlund |first6=Erin |last7=Ellison |first7=David |last8=Shurtleff |first8=Sheila |last9=Wu |first9=Gang |last10=Wei |first10=Lei |last11=Parker |first11=Matthew |last12=Rusch |first12=Michael |last13=Nagahawatte |first13=Panduka |last14=Wu |first14=Jianrong |last15=Mao |first15=Shenghua |date=April 2014 |title=Recurrent Somatic Structural Variations Contribute to Tumorigenesis in Pediatric Osteosarcoma |url=http://dx.doi.org/10.1016/j.celrep.2014.03.003 |journal=Cell Reports |volume=7 |issue=1 |pages=104–112 |doi=10.1016/j.celrep.2014.03.003 |issn=2211-1247}}</ref>
There is evidence that the Hp53int1 gene is involved in '''tumor suppression'''. Hp53int1 overlaps with TP53 exon 1 ('''TAD1''').<ref>{{Cite journal |last1=Hainaut |first1=Pierre |last2=Pfeifer |first2=Gerd P. |date=November 2016 |title=Somatic TP53 Mutations in the Era of Genome Sequencing |journal=Cold Spring Harbor Perspectives in Medicine |volume=6 |issue=11 |pages=a026179 |doi=10.1101/cshperspect.a026179 |issn=2157-1422 |pmc=5088513 |pmid=27503997}}</ref> Mutations within this region can result in alternative exon expression or incomplete splicing and a loss of tumor suppressor function. Regarding osteosarcoma, these rearrangements within TAD1 have been detected in ∼20% of the cases involving incomplete splicing of TP53. These rearrangements are located across the full sequence of TP53 intron 1, but the majority of them will cluster in a domain of the Hp53int1 transcript, suggesting that these rearrangements may be facilitated by chromatin conformation within this locus<ref>{{Cite journal |last1=Chen |first1=Xiang |last2=Bahrami |first2=Armita |last3=Pappo |first3=Alberto |last4=Easton |first4=John |last5=Dalton |first5=James |last6=Hedlund |first6=Erin |last7=Ellison |first7=David |last8=Shurtleff |first8=Sheila |last9=Wu |first9=Gang |last10=Wei |first10=Lei |last11=Parker |first11=Matthew |last12=Rusch |first12=Michael |last13=Nagahawatte |first13=Panduka |last14=Wu |first14=Jianrong |last15=Mao |first15=Shenghua |date=April 2014 |title=Recurrent Somatic Structural Variations Contribute to Tumorigenesis in Pediatric Osteosarcoma |url=http://dx.doi.org/10.1016/j.celrep.2014.03.003 |journal=Cell Reports |volume=7 |issue=1 |pages=104–112 |doi=10.1016/j.celrep.2014.03.003 |pmid=24703847 |pmc=4096827 |issn=2211-1247}}</ref>


== References ==
== References ==

Revision as of 01:05, 26 January 2023

Human protein 53 intron 1 (Hp53int1) is a protein encoded by the Hp53int1 gene in humans.

Gene

Figure 1: The location of Hp53int1 protein on chromosome 17p.13, region 7,685,260 bp-7,686,371 bp

The Hp53int1 gene is located on chromosome 17p.13, encoded by a DNA sequence 1125 base pairs in length, covering region 7,685,260 bp-7,686,371 bp.[1] The Hp53int1 gene has two aliases, WRAP53int1 and TP53int1, in accordance to its overlap with the WRAP53 and TP53int1 genes on chromosome 17.[2] Hp53int1 is located downstream of the p53p2 start site.[3]

Hp53int1 does not have multiple exons, and therefore has no isoforms.

Important Relationship to TP53

The Hp53int1 gene is projected in the same direction as TP53 and is present in myeloid leukemia cells HL-60 and U937.[4] This suggests a strong relationship to the TP53 gene and subsequent protein, including a share of transcription factors, promoter signals, tissue expression, and subcellular localization. While these genes are not identical, this relationship may give clues to function, structure, and expression.

mRNA Transcript

5' UTR Analysis[5]:The consensus folding of the 5’ UTR of Hp53int1. Two loops are formed from amino acids 1-9 and 12-21 respectively. The connecting amino acids of both loops are rich in G’s and C’s.

Hp53int1 is encoded by a polyadenylated transcript 1125 base pairs in length. There is a repeat sequence between base pairs 633…926 and a regulatory Poly A tail sequence between base pairs 496…1000.[1] The repeat sequence between base pairs 633 and 926 is noted to be similar to the Alu SC subfamily repeat, a sequence that characterizes the most abundant repeat sequences in humans and primates, and which likely diverged from other Alu subfamily complexes around 32 million years ago.[6]

Promoter Sequence

The promoter sequence of the Hp53int1 gene was shown to be 540 base pairs in length.[7] Transcription factors SMAD2, SRY, and ETV7 had Basewise Conservation Scores of 2.99, 2.73, and 3.03 respectively, with value of 4 indicating the highest conservation and -0.5 the lowest. KLF2 and KLF5 had scores of 3.73 each. The species being compared were elephant, dog, rhesus macaque, and chicken.

Protein

The Hp53int1 protein has a molecular weight of 13.3 kdal and is 118 amino acids in length.[2][8] It is a basic protein.[8] There is one highly conserved amino acid sequence between 102-110.[9] Serine site distribution is significantly higher than Arginine and Tyrosine sites.[10] There is a possible Casein Kinase II phosphorylation site between amino acids 22-25.[11] Casein Kinase II phosphorylation is involved in cell proliferation. Three regions of disorder are predicted for the Hp53int1 protein.[10]

Web Logo[12] results using a multiple sequence alignment between Hp53int1 and its orthologs.

Post Translational Modifications

There is a possible O-ß-GlcNAc site between amino acids 14-20.[13] There are no transmembrane domains nor signal peptides.

There is a conserved phosphorylation site at S5, S15, and S20.[14]

Predictions for O-ß-GlcNAc in the Hp53int1 protein given by Yin O Yang.[13] The green bars represent the potential for O-ß-GlcNAc attachment sites in Hp53int1, and must reach a value > 0.50 to be considered a likely site. There is one predicted attachment site for Hp53int1.

Secondary Structure

Hp53int1 protein is predicted to contain one alpha helix and five beta sheets.[15] There are eight possible protein binding sites.

Tertiary Structure

The predicted tertiary structure of the Hp53int1 protein as constructed by UnitProt Alpha Fold[16].

Subcellular Localization

There is a 56.5% likelihood that the Hp53int1 protein is localized within the cytoplasm.[17]

Tissue Expression

There is low tissue specificity for the Hp53int1 gene.[18] When comparing various samples of adult human tissues there is high expression in the thymus, ovaries, lymph nodes, and white blood cells. There was a high expression score for both RNA expression and protein expression in proximal digestive tract, gastrointestinal tract, and male/female tissues. During fetal development between weeks 10 and 16, there is elevated expression in the lung and heart tissues.

Hp53int1 is differentially expressed under conditions that require cell proliferation or apoptosis, in accordance to its congruence with the TP53 tumor suppressor gene[19]

Homology and Evolution

Paralogs

The Hp53int1 gene was found to have no paralogs.[1]

Orthologs

Using NCBI BLAST[1] sequence analysis and Clustal W[20] a multiple sequence alignment was formulated for Hp53int1 and twenty top BLAST hits.

While the objective of this research was to find several orthologs across a variety of mammals, vertebrates, and invertebrates, there were only results in two classes: primates and bacteria. Furthermore, there was only one amino acid sequence within Hp53int1 and its orthologs of a general 20 amino acids in length that directly aligned (see Web Logo results in Figure *****). The primate genes are located on chromosome 13 while Hp53int1 is located on chromosome 17.[21] There is also evidence for subcellular localization of the primate proteins within the nuclear envelope, while evidence for Hp53int1 suggests it is in the cytoplasm.[17] Thus, it is reasonable to assume that the primates are not strict orthologs. A possible explanation for why strict Hp53int1 alignments are only found in bacteria is a possible crossing over event shared between bacteria and a common ancestor of primates.

The phylogenetic tree of the Hp53int1 distant orthologs in the radial format[20].

Function

The Hp53int1 protein is likely involved in the regulation of cellular proliferation and apoptosis. This is indicated by the use of apoptosis-regulator and ubiquitination regulator interacting proteins, its share of transcription factors with TP53 (especially the SMAD2, SRY, and ETV7 transcription factors), its location downstream of the p53p2 start site, its casein kinase II phosphorylation site, and its ubiquitous expression across tissues. The Hp53int1 gene expression is increased in environments when proteins that reduce cell proliferation are overexpressed, suggesting its requirement in cellular environments that need to arrest cell growth.[19] Also, quantitave analysis shows that the general hp53 protein is absolutely required for the activation of cellular response to DNA damage.[22]

Interacting Proteins

The following interacting proteins[23] were found for TP53, but can be applied to Hp53int1 due to their chromosomal relationship. Important to note: the functions of these interacting proteins are directly involved in cell regulation (ubiquitination, transcriptional regulator, apoptosis, and protein-tyrosine kinase).

SNPs

There are two identified SNPS for Hp53int1.[7] SNP 15 falls at position 32 in a sequence of 43 residues located upstream of the largest ORF of the TP53 promoter. SNP 20 is then located 45 bp downstream of the 3’ end of the 1125 bp cDNA (Hp53int1).

Clinical Significance

There is evidence that the Hp53int1 gene is involved in tumor suppression. Hp53int1 overlaps with TP53 exon 1 (TAD1).[24] Mutations within this region can result in alternative exon expression or incomplete splicing and a loss of tumor suppressor function. Regarding osteosarcoma, these rearrangements within TAD1 have been detected in ∼20% of the cases involving incomplete splicing of TP53. These rearrangements are located across the full sequence of TP53 intron 1, but the majority of them will cluster in a domain of the Hp53int1 transcript, suggesting that these rearrangements may be facilitated by chromatin conformation within this locus[25]

References

  1. ^ a b c d "Hp53int1 human unknown protein". NCBI Protein BLAST. Retrieved 17 December 2022.
  2. ^ a b Altschul, Stephen F.; Gish, Warren; Miller, Webb; Myers, Eugene W.; Lipman, David J. (October 1990). "Basic local alignment search tool". Journal of Molecular Biology. 215 (3): 403–410. doi:10.1016/s0022-2836(05)80360-2. ISSN 0022-2836. PMID 2231712. S2CID 14441902.
  3. ^ Reisman, David; Bálint, Éva; Loging, W.Troy; Rotter, Varda; Almon, Einat (December 1996). "A Novel Transcript Encoded within the 10-kb First Intron of the Human p53 Tumor Suppressor Gene (D17S2179E) Is Induced during Differentiation of Myeloid Leukemia Cells". Genomics. 38 (3): 364–370. doi:10.1006/geno.1996.0639.
  4. ^ Ortiz-Cuaran, Sandra; Cox, David; Villar, Stéphanie; Friesen, Marlin D.; Durand, Geoffroy; Chabrier, Amélie; Khuhaprema, Thiravud; Sangrajrang, Suleeporn; Ognjanovic, Simona; Groopman, John D.; Hainaut, Pierre; Le Calvez-Kelm, Florence (October 2013). "Association between TP53 R249S mutation and polymorphisms in TP53 intron 1 in hepatocellular carcinoma: TP53 R249S Associated with Intron 1 Polymorphisms". Genes, Chromosomes and Cancer. 52 (10): 912–919. doi:10.1002/gcc.22086. PMID 23836507. S2CID 21587166.
  5. ^ www.unafold.org http://www.unafold.org/. Retrieved 2022-12-16. {{cite web}}: Missing or empty |title= (help)
  6. ^ Price, Alkes L.; Eskin, Eleazar; Pevzner, Pavel A. (November 2004). "Whole-genome analysis of Alu repeat elements reveals complex evolutionary history". Genome Research. 14 (11): 2245–2252. doi:10.1101/gr.2693004. ISSN 1088-9051. PMC 525682. PMID 15520288.
  7. ^ a b Kent, W. James; Sugnet, Charles W.; Furey, Terrence S.; Roskin, Krishna M.; Pringle, Tom H.; Zahler, Alan M.; Haussler, and David (2002-05-16). "The Human Genome Browser at UCSC". Genome Research. 12 (6): 996–1006. doi:10.1101/gr.229102. ISSN 1088-9051. PMC 186604. PMID 12045153.
  8. ^ a b Brendel, V; Bucher, P; Nourbakhsh, I R; Blaisdell, B E; Karlin, S (1992-03-15). "Methods and algorithms for statistical analysis of protein sequences". Proceedings of the National Academy of Sciences. 89 (6): 2002–2006. doi:10.1073/pnas.89.6.2002. ISSN 0027-8424. PMID 1549558.
  9. ^ not provided, Bioline (2016-10-09). "Clean-up using AMPure XP beads v1". dx.doi.org. doi:10.17504/protocols.io.f3ebqje. Retrieved 2022-12-16.
  10. ^ a b "GPS 5.0 - Kinase-specific Phosphorylation Site Prediction". gps.biocuckoo.cn. Retrieved 2022-12-16.
  11. ^ "Motif Scan". myhits.sib.swiss. Retrieved 2022-12-16.
  12. ^ "WebLogo - Create Sequence Logos". weblogo.berkeley.edu. Retrieved 2022-12-16.
  13. ^ a b "Services". healthtech.dtu.dk. Retrieved 2022-12-16.
  14. ^ Hornbeck, Peter V.; Zhang, Bin; Murray, Beth; Kornhauser, Jon M.; Latham, Vaughan; Skrzypek, Elzbieta (2014-12-16). "PhosphoSitePlus, 2014: mutations, PTMs and recalibrations". Nucleic Acids Research. 43 (D1): D512–D520. doi:10.1093/nar/gku1267. ISSN 1362-4962.
  15. ^ "Welcome to PredictProtein!". predictprotein.org. Retrieved 2022-12-16.
  16. ^ "AlphaFold Protein Structure Database". alphafold.ebi.ac.uk. Retrieved 2022-12-16.
  17. ^ a b "PSORT Prediction Tool". PSORT II. Retrieved 2022-12-12.
  18. ^ "TP53 protein expression summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2022-12-16.
  19. ^ a b "Home - GEO Profiles - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-12-16.
  20. ^ a b "Clustal Omega < Multiple Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2022-12-16.
  21. ^ Kent, W. J. (2002-03-20). "BLAT---The BLAST-Like Alignment Tool". Genome Research. 12 (4): 656–664. doi:10.1101/gr.229202. ISSN 1088-9051. PMC 187518. PMID 11932250.
  22. ^ Gotoh, Tetsuya; Vila-Caballer, Marian; Liu, Jingjing; Schiffhauer, Samuel; Finkielstein, Carla V. (2015-01-15). Solomon, Mark J. (ed.). "Association of the circadian factor Period 2 to p53 influences p53's function in DNA-damage signaling". Molecular Biology of the Cell. 26 (2): 359–372. doi:10.1091/mbc.E14-05-0994. ISSN 1059-1524. PMC 4294682. PMID 25411341.
  23. ^ "tp53 Interactants [Xenopus] - Xenbase Gene Catalog". www.xenbase.org. Retrieved 2022-12-16.
  24. ^ Hainaut, Pierre; Pfeifer, Gerd P. (November 2016). "Somatic TP53 Mutations in the Era of Genome Sequencing". Cold Spring Harbor Perspectives in Medicine. 6 (11): a026179. doi:10.1101/cshperspect.a026179. ISSN 2157-1422. PMC 5088513. PMID 27503997.
  25. ^ Chen, Xiang; Bahrami, Armita; Pappo, Alberto; Easton, John; Dalton, James; Hedlund, Erin; Ellison, David; Shurtleff, Sheila; Wu, Gang; Wei, Lei; Parker, Matthew; Rusch, Michael; Nagahawatte, Panduka; Wu, Jianrong; Mao, Shenghua (April 2014). "Recurrent Somatic Structural Variations Contribute to Tumorigenesis in Pediatric Osteosarcoma". Cell Reports. 7 (1): 104–112. doi:10.1016/j.celrep.2014.03.003. ISSN 2211-1247. PMC 4096827. PMID 24703847.