Glycoproteins are proteins which contain oligosaccharide chains (glycans) covalently attached to amino acid side-chains. The carbohydrate is attached to the protein in a cotranslational or posttranslational modification. This process is known as glycosylation. Secreted extracellular proteins are often glycosylated.
In proteins that have segments extending extracellularly, the extracellular segments are also often glycosylated. Glycoproteins are also often important integral membrane proteins, where they play a role in cell–cell interactions. It is important to distinguish endoplasmic reticulum-based glycosylation of the secretory system from reversible cytosolic-nuclear glycosylation. Glycoproteins of the cytosol and nucleus can be modified through the reversible addition of a single GlcNAc residue. This modification is considered reciprocal to phosphorylation, and its function is likely to be an additional regulatory mechanism that controls phosphorylation-based signalling. In contrast, classical secretory glycosylation can be structurally essential. For example, inhibition of asparagine-linked (N-linked) glycosylation can prevent proper glycoprotein folding, and full inhibition can be toxic to an individual cell. Perturbation of glycan processing (enzymatic removal or addition of carbohydrate residues to the glycan), which occurs in both the endoplasmic reticulum and the Golgi apparatus, is by comparison dispensable for isolated cells (as evidenced by survival in the presence of glycosidase inhibitors) but can lead to human disease (congenital disorders of glycosylation) and can be lethal in animal models. It is therefore likely that the fine processing of glycans is important for endogenous functionality, such as cell trafficking, but that this is likely to have been secondary to its role in host-pathogen interactions. A famous example of this latter effect is the ABO blood group system.
Though there are different types of glycoproteins, the most common are N-linked and O-linked glycoproteins. The two types are named for the atom, nitrogen or oxygen, through which the glycan is attached to the protein. Glycoproteins vary greatly in composition and include compounds as different as antibodies and hormones. Because of this array of functions within the body, interest in glycoprotein synthesis for medical use has increased. There are now several methods to synthesize glycoproteins, including recombinant production and chemical glycosylation of proteins.
Types of glycosylation
There are several types of glycosylation, although the first two are the most common.
- In N-glycosylation, sugars are attached to nitrogen, typically on the amide side-chain of asparagine.
- In O-glycosylation, sugars are attached to oxygen, typically on serine or threonine, but also on tyrosine or non-canonical amino acids such as hydroxylysine and hydroxyproline.
- In P-glycosylation, sugars are attached to phosphorus on a phosphoserine.
- In C-glycosylation, sugars are attached directly to carbon, such as in the addition of mannose to tryptophan.
- In S-glycosylation, a beta-GlcNAc is attached to the sulfur atom of a cysteine residue.
- In glypiation, a GPI glycolipid is attached to the C-terminus of a polypeptide, serving as a membrane anchor.
- In glycation, also known as non-enzymatic glycosylation, sugars are covalently bonded to a protein or lipid molecule, without the controlling action of an enzyme, but through a Maillard reaction.
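The atom-based naming in the list above can be captured as a small lookup table. The sketch below is illustrative only: the class and function names are hypothetical, the residues are just the ones named in the list, and glypiation and glycation are left out because they are not named after an attachment atom.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GlycosylationType:
    """One entry of the list above (hypothetical helper type)."""
    name: str
    atom: str             # atom on the protein side that carries the sugar
    typical_sites: tuple  # residues named in the list above, lower case


GLYCOSYLATION_TYPES = (
    GlycosylationType("N-glycosylation", "nitrogen", ("asparagine",)),
    GlycosylationType(
        "O-glycosylation",
        "oxygen",
        ("serine", "threonine", "tyrosine", "hydroxylysine", "hydroxyproline"),
    ),
    GlycosylationType("P-glycosylation", "phosphorus", ("phosphoserine",)),
    GlycosylationType("C-glycosylation", "carbon", ("tryptophan",)),
    GlycosylationType("S-glycosylation", "sulfur", ("cysteine",)),
)


def types_for_residue(residue):
    """Return the names of the glycosylation types listed for a residue."""
    wanted = residue.strip().lower()
    return [t.name for t in GLYCOSYLATION_TYPES if wanted in t.typical_sites]


print(types_for_residue("Serine"))
print(types_for_residue("tryptophan"))
```

A residue that the list does not mention, such as glycine, simply returns an empty list; that reflects the scope of the table, not a claim that the residue can never be modified.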
Monosaccharides commonly found in eukaryotic glycoproteins include:
|N-Acetylneuraminic acid||Aminononulosonic acid|
The sugar group(s) can assist in protein folding, improve protein stability, and take part in cell signalling.
The critical structural element of all glycoproteins is having oligosaccharides bonded covalently to a protein. There are 10 common monosaccharides in mammalian glycans: glucose (Glc), fucose (Fuc), xylose (Xyl), mannose (Man), galactose (Gal), N-acetylglucosamine (GlcNAc), glucuronic acid (GlcA), iduronic acid (IdoA), N-acetylgalactosamine (GalNAc), and the sialic acid 5-N-acetylneuraminic acid (Neu5Ac). These glycans are attached at specific sites in the amino acid chain of the protein.
The two most common linkages in glycoproteins are N-linkages and O-linkages. In an N-linked glycoprotein, the glycan is bonded to the nitrogen atom of an asparagine side chain within the protein sequence. In an O-linked glycoprotein, the sugar is bonded to an oxygen atom of a serine or threonine residue in the protein.
Glycoprotein size and composition vary widely, with carbohydrate making up from 1% to 70% of the total mass of the glycoprotein. In the body, glycoproteins appear in the blood, in the extracellular matrix, and on the outer surface of the plasma membrane, and they make up a large portion of the proteins secreted by eukaryotic cells. They have a broad range of functions, acting as anything from antibodies to hormones.
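The carbohydrate share of a glycoprotein's mass is a simple ratio of glycan mass to total mass. The sketch below shows the arithmetic; the function name and the example masses are hypothetical and are not taken from any measured glycoprotein.

```python
def carbohydrate_mass_fraction(protein_mass_da, glycan_masses_da):
    """Fraction of the total glycoprotein mass contributed by glycans."""
    glycan_total = sum(glycan_masses_da)
    total = protein_mass_da + glycan_total
    if total <= 0:
        raise ValueError("total mass must be positive")
    return glycan_total / total


# Hypothetical example: a 30 kDa polypeptide carrying three 2 kDa glycans.
fraction = carbohydrate_mass_fraction(30000.0, [2000.0, 2000.0, 2000.0])
print(f"{fraction:.1%}")  # 6000 / 36000, prints 16.7%
```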
Glycomics is the study of the carbohydrate components of cells. Though not exclusive to glycoproteins, it can reveal more information about different glycoproteins and their structure. One of the purposes of this field of study is to determine which proteins are glycosylated and where in the amino acid sequence the glycosylation occurs. Historically, mass spectrometry has been used to identify the structure of glycoproteins and to characterize the attached carbohydrate chains.
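A common first step in locating possible N-glycosylation sites is to scan the amino acid sequence for the N-X-S/T consensus motif, where X is any residue except proline. That motif is an assumption brought in from general biochemistry rather than something stated in the text above, and the function name and example sequence are hypothetical. A match only marks a candidate site; it does not show that the site is actually glycosylated, which still has to be established experimentally.

```python
import re

# A lookahead is used so that overlapping motifs (e.g. "NNSS") are all found.
SEQUON = re.compile(r"(?=N[^P][ST])")


def candidate_n_glycosylation_sites(sequence):
    """Return 1-based positions of Asn residues inside N-X-S/T motifs (X != Pro)."""
    seq = sequence.upper()
    return [match.start() + 1 for match in SEQUON.finditer(seq)]


# Hypothetical one-letter sequence, not a real protein.
example = "MKNGSAANPTLLNNSSQ"
print(candidate_n_glycosylation_sites(example))  # [3, 13, 14]
```

In the example, the asparagine at position 8 is skipped because it is followed by proline.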
The interactions of the oligosaccharide chains serve several purposes. First, they aid in quality control by helping to identify misfolded proteins. The oligosaccharide chains also change the solubility and polarity of the proteins to which they are bonded. For example, if the oligosaccharide chains are negatively charged and sufficiently dense around the protein, they can repel proteolytic enzymes from the bonded protein. This diversity in interactions gives rise to different types of glycoproteins with different structures and functions.
One example of glycoproteins found in the body is mucins, which are secreted in the mucus of the respiratory and digestive tracts. The sugars when attached to mucins give them considerable water-holding capacity and also make them resistant to proteolysis by digestive enzymes.
Examples of glycoproteins involved in the immune system include:
- molecules such as antibodies (immunoglobulins), which interact directly with antigens.
- molecules of the major histocompatibility complex (or MHC), which are expressed on the surface of cells and interact with T cells as part of the adaptive immune response.
- sialyl Lewis X antigen on the surface of leukocytes.
- H antigen of the ABO blood compatibility antigens.
Other examples of glycoproteins include:
- gonadotropins (luteinizing hormone and follicle-stimulating hormone)
- glycoprotein IIb/IIIa, an integrin found on platelets that is required for normal platelet aggregation and adherence to the endothelium.
- components of the zona pellucida, which surrounds the oocyte, and is important for sperm-egg interaction.
- structural glycoproteins, which occur in connective tissue. These help bind together the fibers, cells, and ground substance of connective tissue. They may also help components of the tissue bind to inorganic substances, such as calcium in bone.
- Glycoprotein-41 (gp41) and glycoprotein-120 (gp120) are HIV viral coat proteins.
- Miraculin is a glycoprotein extracted from Synsepalum dulcificum, a berry, which alters human tongue receptors to recognize sour foods as sweet.
Variable surface glycoproteins allow the sleeping sickness Trypanosoma parasite to escape the immune response of the host.
The viral spike of the human immunodeficiency virus is heavily glycosylated. Glycans account for approximately half the mass of the spike, and they limit antibody recognition because they are assembled by the host cell and so are largely 'self'. Over time, some patients can develop antibodies that recognise the HIV glycans, and almost all so-called 'broadly neutralising antibodies' (bnAbs) recognise some glycans. This is possible mainly because the unusually high density of glycans hinders normal glycan maturation, so the glycans are trapped in the premature, high-mannose state. This provides a window for immune recognition. In addition, as these glycans are much less variable than the underlying protein, they have emerged as promising targets for vaccine design.
P-glycoprotein is important in antitumor research because of its ability to block the effects of antitumor drugs. P-glycoprotein, or multidrug transporter (MDR1), is a type of ABC transporter that transports compounds out of cells. The compounds it exports include drugs intended for delivery to the cell, which decreases drug effectiveness. For example, P-glycoprotein decreases the accumulation of anti-cancer drugs within tumor cells, limiting the effectiveness of chemotherapies used to treat cancer. Inhibiting this behavior would reduce P-glycoprotein interference in drug delivery, making it an important topic in drug discovery.
Hormones that are glycoproteins include:
- Follicle-stimulating hormone
- Luteinizing hormone
- Thyroid-stimulating hormone
- Human chorionic gonadotropin
- Erythropoietin (EPO)
Distinction between glycoproteins and proteoglycans
Quoting from the IUPAC recommendations:
A glycoprotein is a compound containing carbohydrate (or glycan) covalently linked to protein. The carbohydrate may be in the form of a monosaccharide, disaccharide(s), oligosaccharide(s), polysaccharide(s), or their derivatives (e.g. sulfo- or phospho-substituted). One, a few, or many carbohydrate units may be present. Proteoglycans are a subclass of glycoproteins in which the carbohydrate units are polysaccharides that contain amino sugars. Such polysaccharides are also known as glycosaminoglycans.
Some functions served by glycoproteins
|Function||Glycoproteins|
|Lubricant and protective agent||Mucins|
|Transport molecule||Transferrin, ceruloplasmin|
|Immunologic molecule||Immunoglobulins, histocompatibility antigens|
|Hormone||Human chorionic gonadotropin (HCG), thyroid-stimulating hormone (TSH)|
|Enzyme||Various, e.g., alkaline phosphatase, patatin|
|Cell attachment-recognition site||Various proteins involved in cell–cell (e.g., sperm–oocyte), virus–cell, bacterium–cell, and hormone–cell interactions|
|Antifreeze protein||Certain plasma proteins of coldwater fish|
|Interact with specific carbohydrates||Lectins, selectins (cell adhesion lectins), antibodies|
|Receptor||Various proteins involved in hormone and drug action|
|Affect folding of certain proteins||Calnexin, calreticulin|
|Regulation of development||Notch and its analogs, key proteins in development|
|Hemostasis (and thrombosis)||Specific glycoproteins on the surface membranes of platelets|
Some methods used to study glycoproteins
|Method||Use|
|Periodic acid-Schiff stain||Detects glycoproteins as pink bands after electrophoretic separation.|
|Incubation of cultured cells with a radioactive sugar||Leads to detection of glycoproteins as radioactive bands after electrophoretic separation.|
|Treatment with appropriate endo- or exoglycosidase or phospholipases||Resultant shifts in electrophoretic migration help distinguish among proteins with N-glycan, O-glycan, or GPI linkages and also between high mannose and complex N-glycans.|
|Agarose-lectin column chromatography, lectin affinity chromatography||To purify glycoproteins or glycopeptides that bind the particular lectin used.|
|Lectin affinity electrophoresis||Resultant shifts in electrophoretic migration help distinguish and characterize glycoforms, i.e. variants of a glycoprotein differing in carbohydrate.|
|Compositional analysis following acid hydrolysis||Identifies sugars that the glycoprotein contains and their stoichiometry.|
|Mass spectrometry||Provides information on molecular mass, composition, sequence, and sometimes branching of a glycan chain. It can also be used for site-specific glycosylation profiling.|
|NMR spectroscopy||To identify specific sugars, their sequence, linkages, and the anomeric nature of glycosidic chain.|
|Multi-angle light scattering||In conjunction with size-exclusion chromatography, UV/Vis absorption and differential refractometry, provides information on molecular mass, protein-carbohydrate ratio, aggregation state, size, and sometimes branching of a glycan chain. In conjunction with composition-gradient analysis, analyzes self- and hetero-association to determine binding affinity and stoichiometry with proteins or carbohydrates in solution without labeling.|
|Dual Polarisation Interferometry||Measures the mechanisms underlying the biomolecular interactions, including reaction rates, affinities and associated conformational changes.|
|Methylation (linkage) analysis||To determine linkage between sugars.|
|Amino acid or cDNA sequencing||Determination of amino acid sequence.|
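In mass spectrometry, a measured glycan mass is often compared with the mass expected for a proposed monosaccharide composition. The sketch below sums standard monoisotopic residue masses and adds one water for the free reducing end. The names and the example composition are illustrative assumptions; a real analysis would also account for adducts, charge state, and any derivatisation.

```python
# Monoisotopic masses (Da) of monosaccharide residues, i.e. the free sugar
# minus one water, as it occurs inside a glycan chain.
RESIDUE_MASS = {
    "Hex": 162.0528,     # hexose, e.g. mannose, galactose, glucose
    "HexNAc": 203.0794,  # N-acetylhexosamine, e.g. GlcNAc, GalNAc
    "dHex": 146.0579,    # deoxyhexose, e.g. fucose
    "Neu5Ac": 291.0954,  # N-acetylneuraminic acid
}
WATER = 18.0106


def glycan_monoisotopic_mass(composition):
    """Mass of a free, underivatised glycan from its residue counts."""
    mass = WATER  # one water completes the reducing and non-reducing ends
    for residue, count in composition.items():
        if count < 0:
            raise ValueError(f"negative count for {residue}")
        mass += RESIDUE_MASS[residue] * count
    return mass


# Hypothetical query: a high-mannose glycan with five mannoses and two GlcNAc.
man5 = {"Hex": 5, "HexNAc": 2}
print(f"{glycan_monoisotopic_mass(man5):.2f} Da")  # about 1234.43 Da
```

Because isomeric sugars such as mannose and galactose share one residue mass, a composition match of this kind does not by itself identify the sugars or their linkages, which is why the table lists complementary methods such as NMR spectroscopy and methylation analysis.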
The glycosylation of proteins has an array of effects, from influencing cell-to-cell communication to changing the thermal stability and folding of proteins. Because of these properties, glycoproteins can be used in many therapies. A better understanding of glycoproteins and their synthesis allows them to be produced to treat cancer, Crohn's disease, high cholesterol, and other conditions.
Glycosylation, the covalent attachment of a carbohydrate to a protein, occurs either during translation (cotranslationally) or after the protein has been produced (posttranslationally). Roughly half of all human proteins undergo glycosylation, and it heavily influences their properties and functions. Within the cell, glycosylation of secretory proteins begins in the endoplasmic reticulum, and the glycans are processed further in the Golgi apparatus.
There are several techniques for the assembly of glycoproteins. One technique is recombinant production. The first consideration for this method is the choice of host, because factors such as cost, the host environment, and the efficacy of the process all influence the success of recombinant glycoprotein production. Examples of host cells include E. coli, yeast, plant cells, insect cells, and mammalian cells. Of these options, mammalian cells are the most common because they avoid challenges that other host cells face, such as different glycan structures, shorter half-life, and potential unwanted immune responses in humans. Among mammalian cells, the most common cell line used for recombinant glycoprotein production is the Chinese hamster ovary line. However, as technologies develop, the most promising cell lines for recombinant glycoprotein production are human cell lines.
The formation of the link between the glycan and the protein is a key element of the synthesis of glycoproteins. The most common method of glycosylation for N-linked glycoproteins is the reaction between a protected glycan and a protected asparagine. Similarly, an O-linked glycoprotein can be formed by reacting a glycosyl donor with a protected serine or threonine. These two methods are examples of natural linkages. There are also methods that produce unnatural linkages, including ligation and a reaction between a serine-derived sulfamidate and thiohexoses in water. Once the linkage is complete, the amino acid sequence can be extended using solid-phase peptide synthesis.