PRR23A
PRR23A | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | PRR23A, proline rich 23A | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 3645937; HomoloGene: 67036; GeneCards: PRR23A; OMA:PRR23A - orthologs | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Proline-Rich Protein 23A is a protein that is encoded by the Proline-Rich 23A (PRR23A) gene.
Gene
[edit]Locus
[edit]The human PRR23A gene is on chromosome 3 at position 3q23 and is located on the antisense strand.[5] The gene is encoded from position 139,003,962 to 139,006,268. It consists on 1 exon and spans 2,307 base pairs. Other genes in the neighborhood include: FOXL2NB, FOXL2, PRR23B, and PRR23C. The FOXL2NB gene has tissue enriched expression in ovaries, and PRR23A has demonstrated expression in the ovary as well.[6][7] Aliases for PRR23A include: Proline-Rich 23A, Proline-Rich Protein 23A, and UPF0572 Protein ENSP00000372650.[8] Two of those genes, PRR23B and PRR23C, are paralogs to PRR23A.[9]
Gene expression
[edit]PRR23A is primarily expressed at low levels in the brain and testis.[7][10] There were also very low levels of PRR23A expression detected in ovary and bone marrow tissue.[7] Genes typically show high expression in the testis during RNA sequencing since it is a highly transcriptionally active tissue due to its function of sperm production.[11] However, some researchers have noted that this testis tissue expression of PRR23A may be legitimate since PRR23 family genes are thought to play a role in male reproduction.[9][12] Furthermore, brain and testis tissue share biochemical characteristics and express a large number of common genes.[13] This may also explain why PRR23A expression has been found at similar levels within the brain and testis.
Tissue | Cerebellum | Choroid Plexus | White Matter | Cerebral Cortex | Medulla Oblongata | Pons | Thalamus | Amygdala | Hippocampal Formation | Midbrain | Basal Ganglia | Hypothalamus | Spinal Cord | Testis | Ovary | Bone Marrow |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
RNA Abundance (nTPM) | 2.4 | 1.7 | 1.7 | 1.6 | 1.6 | 1.6 | 1.5 | 1.4 | 1.3 | 1.3 | 1.2 | 1.2 | 1.2 | 1.5 | 0.1 | 0.1 |
mRNA transcript
[edit]Since PRR23A consists of 1 exon, there are no alternative splicing products.[5] This also means that the 1 known isoform in humans has an mRNA sequence of 2,307 nucleotides which matches the length of the PRR23A gene.[14] mRNA typically contain a 5' UTR with a median length of 170 nucleotides in humans, but the human PRR23A mRNA sequence does not contain a 5' UTR.[14][15] Instead, the FASTA sequence for human PRR23A begins with the start codon ATG.[14] Although, the 5' UTR is not translated, it plays a major regulatory role for the translation of coding sequence nucleotides to their amino acids that go on to form a protein structure.[16] Therefore, it is unlikely that human PRR23A does not have this region, and that its upstream 5' UTR sequence could be obtained through further sequencing research.
Protein
[edit]Human PRR23A consists of 266 amino acids, has a predicted molecular weight of 28.2 kDal, and a predicted basal isoelectric point of 4.57.[17] PRR23A, as its name implies, is enriched with the amino acid proline. Therefore, PRR23A belongs to the category of proteins called proline-rich proteins. PRR23A contains less asparagine, threonine, and lysine compared to other human proteins.[18][17] This protein composition for PRR23A is generally conserved across species.[18][19]
Amino Acid | Pro
(P) |
Ala
(A) |
Leu
(L) |
Ser
(S) |
Glu
(E) |
Gly
(G) |
Val
(V) |
Arg
(R) |
Asp
(D) |
Phe
(F) |
Gln
(Q) |
Ile
(I) |
Cys
(C) |
His
(H) |
Thr
(T) |
Lys
(K) |
Met
(M) |
Try
(Y) |
Trp
(W) |
Asn
(N) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Composition of Human PRR23A | 16.2% | 10.5% | 10.5% | 9.8% | 9.0% | 7.5% | 7.1% | 6.4% | 5.3% | 3.4% | 3.0% | 2.3% | 1.5 | 1.5 | 1.5 | 1.1 | 1.1% | 1.1 | 0.80% | 0.4 |
Composition of PRR23A Compared to Other Human Proteins | very rich | average | average | average | average | average | average | average | average | average | average | average | average | average | very poor | poor | average | average | average | very poor |
Secondary and tertiary structure
[edit]Human PRR23A is mainly a disordered protein with small stretches of beta strands and alpha helices forming.[17][20][21][22][23][24] There are 2 known disordered regions at the beginning and the end of the protein.[17] There are 6 regions from the beginning-middle of the protein sequence that are predicted to form beta strands, and when folded into the tertiary structure are in the middle of the predicted protein structure. There is 1 possible transmembrane domain that is located in 1 of these beta strands.[18][17][20][21] Some proteins can create transmembrane beta barrels when a beta sheet curls on itself to make a tube that goes through a membrane, so the PRR23A could exhibit this phenomenon[25]. There are 2 regions towards the end of the protein sequence that are predicted to form alpha helices, and when folded into a tertiary structure are in the middle of the predicted protein structure.
Subcellular localization
[edit]Antibody detection in human stomach cells has shown that PRR23A localizes in the membrane and cytoplasm.[26] Further investigation of the PRR23A protein sequence has also identified a small transmembrane region towards the beginning of the protein, and signal sequences for the ER membrane, nucleus, and mitochondria.[18][21]
Interacting proteins
[edit]PRR23A does not have very many known interactions. The most significant protein interactions for human PRR23A are DEFB106A and DEFB107A which have been determined though co-expression data and textmining.[27] Co-expression data has also shown that DEFB106A and DEFB107A interact with one another. This means that PRR23A, DEFB106A, and DEFB107A have been observed to be correlated in expression across a large number of experiments. DEFB105A, DEFB106B, IQCJ, FAM90A10P, SPAG11B, PRSS22, USP17L4, and USP17L7 are also thought to interact with PRR23A. The basis of these interactions were determined through textmining, so further experiments such as the yeast two-hybrid assay should be conducted to increase the confidence of these protein interactions.
Name | Basis | Function |
DEFB106A | Textmining and Co-expression | Belongs to the defensin family which are antimicrobial and cytotoxic peptides made by neutrophils. Associated with Diamond-Blackfan Anemia15, Mandibulofacial with Dysostosis, and Keratomalacia. Enables lipopolysaccharide binding, protein binding, heparin binding and CCR2 chemokine receptor binding |
DEFB107A | Textmining and Co-expression | Belongs to the defensin family which are antimicrobial and cytotoxic peptides made by neutrophils. Enables lipid binding |
DEFB105A | Textmining | Belongs to the defensin family which are antimicrobial and cytotoxic peptides made by neutrophils. Associated with Familial Hypertrophic Cardiomyopathy 12, and Familial Hypertrophic Cardiomyopathy 6 |
DEFB106B | Textmining | Belongs to the defensin family which are antimicrobial and cytotoxic peptides made by neutrophils. Associated with Diamond-Blackfan Anemia15, Mandibulofacial with Dysostosis. Enables lipopolysaccharide binding, protein binding, heparin binding and CCR2 chemokine receptor binding |
IQCJ | Textmining | NA |
FAM90A10P | Textmining | Belongs to the FAM90 family |
SPAG11B | Textmining | Encodes several androgen-dependent, epididymis-specific secretory proteins. Thought to be involved in sperm maturation. Associated with Small Intestine Lymphoma and Herpes Zoster Oticus |
PRSS22 | Textmining | Gene encodes a member of the trypsin family of serine-proteases. Preferentially cleaves the synthetic substrate H-D-Leu-Thr-Arg-pNA compared to tosyl-Gly-Pro-Arg-pNA. Enables serine-type endopeptidase activity and peptidase activator activity |
USP17L4 | Textmining | Predicted to enable cysteine-type endopeptidase activity and thiol-dependent deubiquitinase |
USP17L7 | Textmining | Predicted to enable cysteine-type endopeptidase activity and thiol-dependent deubiquitinase |
Post-translational modifications
[edit]Post-translational modifications for human PRR23A include: phosphorylation,[29][30][31][32] acetylation,[33] myristoylation,[32] sulfonation,[34] SUMOylation,[35] and glycosylation.[36][37] The glycosylation site supports the identified transmembrane region and ER membrane subcellular localization of PRR23A since proteins that are glycosylated are typically membrane bound and expressed in the ER.[18][21][38]
Homology
[edit]Paralogs
[edit]There are 2 known paralogs of PRR23A: PRR23B and PRR23C.[9][17][39]
Protein | Accession Number | Sequence Length (Amino Acids) | E-value | Sequence Identity to Human PRR23A Protein (%) | Sequence Similarity to Human PRR23A Protein (%) |
---|---|---|---|---|---|
PRR23A | NP_001128131 | 266 | 0 | 100 | 100 |
PRR23B | NP_001013672 | 265 | 5e-146 | 89.8 | 91.4 |
PRR23C | NP_001128129 | 262 | 1e-116 | 84.6 | 86.8 |
Orthologs
[edit]PRR23A orthologs are only found in placental mammals.[17][39] No PRR23A orthologs have been identified in marsupials, monotremes, birds, reptiles, amphibians, fish, or invertebrates.
Genus and Species | Common Name | Taxonomic Group | Median Date of Divergence (Millions of Years Ago) | Accession Number | Sequence Length (Amino Acids) | Sequence Identity to Human Protein (%) | Sequence Similarity to Human Protein (%) |
---|---|---|---|---|---|---|---|
Homo sapiens | Human | Primates | 0 | NP_001128131 | 266 | 100 | 100 |
Macaca mulatta | Rhesus Monkey | Primates | 28.8 | XP_001113833 | 282 | 83.3 | 85.5 |
Mus musculus | Mouse | Rodentia | 87 | NP_001128132 | 258 | 41.3 | 53.1 |
Ochotona princeps | American Pika | Lagomorpha | 87 | XP_058513009 | 276 | 39.7 | 50.8 |
Pteropus alecto | Black Flying Fox | Chiroptera | 94 | XP_006907257 | 273 | 63 | 69.2 |
Sturnira hondurensis | Honduran Yellow-Shouldered Bat | Chiroptera | 94 | XP_036917059 | 270 | 56.9 | 64.1 |
Leptonychotes weddellii | Weddell Seal | Carnivora | 94 | XP_006734260 | 260 | 53.1 | 61.7 |
Canis lupus familiaris | Dog | Carnivora | 94 | XP_038288414 | 243 | 53 | 59.8 |
Neophocaena asiaeorientalis asiaeorientalis | Yangtze Finless Porpoise | Cetacea | 94 | XP_024606498 | 261 | 53 | 61.9 |
Hippopotamus amphibius kiboko | East African Hippopotamus | Hippopotamidae | 94 | XP_057593770 | 263 | 51.9 | 60.5 |
Equus caballus | Horse | Perissodactyla | 94 | XP_023477063 | 279 | 59.6 | 68.1 |
Camelus ferus | Wild Bactrian Camel | Tylopoda | 94 | XP_032343448 | 258 | 56.4 | 63.5 |
Phacochoerus africanus | Common Warthog | Suiformes | 94 | XP_047608137 | 262 | 47.3 | 57.4 |
Moschus berezovskii | Dwarf Musk Deer | Ruminantia | 94 | XP_055276284 | 291 | 45.2 | 54.2 |
Talpa occidentalis | Spanish Mole | Talpidae | 94 | XP_037382562 | 264 | 54.1 | 64.4 |
Sorex araneus | Common Shrew | Soricidae | 94 | XP_054980762 | 217 | 41.6 | 46.8 |
Erinaceus europaeus | European Hedgehog | Erinaceidae | 94 | XP_060039908 | 274 | 27.8 | 40.6 |
Elephas maximus indicus | Indian Elephant | Proboscidea | 99 | XP_049726169 | 235 | 53.4 | 58.6 |
Loxodonta africana | African Savanna Elephant | Proboscidea | 99 | XP_003420004 | 235 | 53.4 | 58.6 |
Orycteropus afer afer | Aardvark | Tubulidentata | 99 | XP_007945733 | 243 | 52.9 | 58 |
Evolution
[edit]PRR23A first appeared within placental mammals which evolved 78-129 million years ago.[39][44] Then, placental mammals began to diversify into two the major lineages of Atlantogenata and Boreoeutheria which emerged 90-100 million years ago.[45] PRR23A orthologs can be found within both of these major lineages, and several subgroups that evolved as well.[39][46] Despite PRR23A's recent emergence in the long run of evolutionary history, it is evolving at a very rapid rate.[47][48][19]
Function
[edit]PRR23A has demonstrated gene expression within the testis through increased mRNA levels, and so have the other PRR23 family genes.[9] This expression indicates that PRR23A may have a role within the male reproductive system. The larger family of proline-rich proteins have a large range of functions including: energy provisions, antistress responses, calcium binding in saliva, structure support, and many others.[49][50] One subgroup called small proline-rich proteins (SPRRs) are antimicrobial proteins that direct bacterial membrane disruption.[51]
Clinical significance and pathology
[edit]Epigenetic modifications of PRR23A have been shown to impact maternal early-pregnancy serum ferritin concentrations.[12] 2 CpG sites within human PRR23A have been identified: cg02806645 and cg06322988. When these locations are methylated, a decrease in serum ferritin concentrations during early-pregnancy was observed. Low levels of ferritin are a sign of iron deficiency which is especially important to monitor during pregnancy.[52] Therefore, the decreased expression of PRR23A though methylation silencing is associated with iron deficiency.
References
[edit]- ^ a b c GRCh38: Ensembl release 89: ENSG00000206260 – Ensembl, May 2017
- ^ a b c GRCm38: Ensembl release 89: ENSMUSG00000063058 – Ensembl, May 2017
- ^ "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
- ^ "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
- ^ a b c "PRR23A proline rich 23A [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2023-12-15.
- ^ "FOXL2NB FOXL2 neighbor [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2023-12-15.
- ^ a b c d "PRR23A protein expression summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2023-12-15.
- ^ "PRR23A Gene - Proline Rich 23A". GeneCards. Retrieved 2023-12-15.
- ^ a b c d Robertson MJ, Kent K, Tharp N, Nozawa K, Dean L, Mathew M, et al. (August 2020). "Large-scale discovery of male reproductive tract-specific genes through analysis of RNA-seq datasets". BMC Biology. 18 (1): 103. doi:10.1186/s12915-020-00826-z. PMC 7436996. PMID 32814578.
- ^ "Microarray Data :: Allen Brain Atlas: Human Brain". human.brain-map.org. Retrieved 2023-12-15.
- ^ Witt E, Benjamin S, Svetec N, Zhao L (August 2019). "Testis single-cell RNA-seq reveals the dynamics of de novo gene transcription and germline mutational bias in Drosophila". eLife. 8: e47138. doi:10.7554/eLife.47138. PMC 6697446. PMID 31418408.
- ^ a b Taeubert MJ, de Prado-Bert P, Geurtsen ML, Mancano G, Vermeulen MJ, Reiss IK, et al. (May 2022). "Maternal iron status in early pregnancy and DNA methylation in offspring: an epigenome-wide meta-analysis". Clinical Epigenetics. 14 (1): 59. doi:10.1186/s13148-022-01276-w. PMC 9066980. PMID 35505416.
- ^ Matos B, Publicover SJ, Castro LF, Esteves PJ, Fardilha M (June 2021). "Brain and testis: more alike than previously thought?". Open Biology. 11 (6): 200322. doi:10.1098/rsob.200322. PMC 8169208. PMID 34062096.
- ^ a b c "Homo sapiens proline rich 23A (PRR23A), mRNA". www.ncbi.nlm.nih.gov. 21 December 2022. Retrieved 2023-12-15.
- ^ Clancy S, Brown W (2008). "Translation: DNA to mRNA to Protein" (PDF). Nature Education. Retrieved 2023-12-15.
- ^ Ryczek N, Łyś A, Makałowska I (February 2023). "The Functional Meaning of 5'UTR in Protein-Coding Genes". International Journal of Molecular Sciences. 24 (3): 2976. doi:10.3390/ijms24032976. PMC 9917990. PMID 36769304.
- ^ a b c d e f g h i j k "proline-rich protein 23A [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2023-12-15.
- ^ a b c d e f "SAPS < Sequence Statistics < EMBL-EBI". www.ebi.ac.uk. Retrieved 2023-12-15.
- ^ a b c "PRR23A - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2023-12-15.
- ^ a b c d "iCn3D: Web-based 3D Structure Viewer". www.ncbi.nlm.nih.gov. Retrieved 2023-12-15.
- ^ a b c d e "PSORT II Prediction". psort.hgc.jp. Retrieved 2023-12-15.
- ^ "AlphaFold Protein Structure Database". alphafold.ebi.ac.uk. Retrieved 2023-12-15.
- ^ "I-TASSER server for protein structure and function prediction". zhanggroup.org. Retrieved 2023-12-15.
- ^ "PHYRE2 Protein Fold Recognition Server". www.sbg.bio.ic.ac.uk. Retrieved 2023-12-15.
- ^ Vorobieva AA, White P, Liang B, Horne JE, Bera AK, Chow CM, et al. (February 2021). "De novo design of transmembrane β barrels". Science. 371 (6531): eabc8182. doi:10.1126/science.abc8182. PMC 8064278. PMID 33602829.
- ^ "PRR23A Polyclonal Antibody (PA5-83867)". www.thermofisher.com. Retrieved 2023-12-15.
- ^ a b "PRR23A protein (human) - STRING interaction network". string-db.org. Retrieved 2023-12-15.
- ^ "Explore a Gene". GeneCards. Retrieved 2023-12-15.
- ^ "PRR23A (human)". www.phosphosite.org. Retrieved 2023-12-15.
- ^ "ELM - unknown". elm.eu.org. Retrieved 2023-12-15.
- ^ "GPS-PBS - PPBDs–specific binding p-site prediction". pbs.biocuckoo.cn. Retrieved 2023-12-15.
- ^ a b "Motif Scan". myhits.sib.swiss. Retrieved 2023-12-15.
- ^ "The CUCKOO Workgroup". pail.biocuckoo.org. Retrieved 2023-12-15.
- ^ "Expasy - Sulfinator". web.expasy.org. Retrieved 2023-12-15.
- ^ "GPS-SUMO: Prediction of SUMOylation Sites & SUMO-interaction Motifs". sumo.biocuckoo.cn. Retrieved 2023-12-15.
- ^ "NetNGlyc 1.0 - DTU Health Tech - Bioinformatic Services". services.healthtech.dtu.dk. Retrieved 2023-12-15.
- ^ "NetOGlyc 4.0 - DTU Health Tech - Bioinformatic Services". services.healthtech.dtu.dk. Retrieved 2023-12-15.
- ^ Eichler J (April 2019). "Protein glycosylation". Current Biology. 29 (7): R229–R231. Bibcode:2019CBio...29.R229E. doi:10.1016/j.cub.2019.01.003. PMID 30939300.
- ^ a b c d e f g "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2023-12-15.
- ^ a b "EMBOSS Needle < Pairwise Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2023-12-15.
- ^ "proline-rich protein 23B [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2023-12-15.
- ^ "proline-rich protein 23C [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2023-12-15.
- ^ "TimeTree :: The Timescale of Life". timetree.org. Retrieved 2023-12-15.
- ^ Wible JR, Rougier GW, Novacek MJ, Asher RJ (June 2007). "Cretaceous eutherians and Laurasian origin for placental mammals near the K/T boundary". Nature. 447 (7147): 1003–1006. Bibcode:2007Natur.447.1003W. doi:10.1038/nature05854. PMID 17581585. S2CID 4334424.
- ^ Eizirik E (2016-01-01), "Mammalian Diversification", in Kliman RM (ed.), Encyclopedia of Evolutionary Biology, Oxford: Academic Press, pp. 418–423, ISBN 978-0-12-800426-5, retrieved 2023-12-15
- ^ a b c Asher RJ, Bennett N, Lehmann T (August 2009). "The new framework for understanding placental mammal evolution". BioEssays. 31 (8): 853–864. doi:10.1002/bies.200900053. PMID 19582725.
- ^ a b "cytochrome c - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2023-12-15.
- ^ a b "fibrinogen alpha - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2023-12-15.
- ^ Patriarca EJ, Cermola F, D'Aniello C, Fico A, Guardiola O, De Cesare D, Minchiotti G (2021). "The Multifaceted Roles of Proline in Cell Behavior". Frontiers in Cell and Developmental Biology. 9: 728576. doi:10.3389/fcell.2021.728576. PMC 8397452. PMID 34458276.
- ^ Bennick A (June 1982). "Salivary proline-rich proteins". Molecular and Cellular Biochemistry. 45 (2): 83–99. doi:10.1007/BF00223503. PMID 6810092. S2CID 31373141.
- ^ Zhang C, Hu Z, Lone AG, Artami M, Edwards M, Zouboulis CC, et al. (March 2022). Kim BS, Zaidi M (eds.). "Small proline-rich proteins (SPRRs) are epidermally produced antimicrobial proteins that defend the cutaneous barrier by direct bacterial membrane disruption". eLife. 11: e76729. doi:10.7554/eLife.76729. PMC 8912919. PMID 35234613.
- ^ Daru J, Colman K, Stanworth SJ, De La Salle B, Wood EM, Pasricha SR (December 2017). "Serum ferritin as an indicator of iron status: what do we need to know?". The American Journal of Clinical Nutrition. 106 (Suppl 6): 1634S–1639S. doi:10.3945/ajcn.117.155960. PMC 5701723. PMID 29070560.