Drug design, sometimes referred to as rational drug design or simply rational design, is the inventive process of finding new medications based on the knowledge of a biological target. The drug is most commonly an organic small molecule that activates or inhibits the function of a biomolecule such as a protein, which in turn results in a therapeutic benefit to the patient. In the most basic sense, drug design involves the design of molecules that are complementary in shape and charge to the biomolecular target with which they interact and therefore will bind to it. Drug design frequently but not necessarily relies on computer modeling techniques. This type of modeling is often referred to as computer-aided drug design. Finally, drug design that relies on the knowledge of the three-dimensional structure of the biomolecular target is known as structure-based drug design. In addition to small molecules, biopharmaceuticals and especially therapeutic antibodies are an increasingly important class of drugs and computational methods for improving the affinity, selectivity, and stability of these protein-based therapeutics have also been developed.
The phrase "drug design" is to some extent a misnomer. A more accurate term is ligand design (i.e., design of a molecule that will bind tightly to its target). Although modeling techniques for prediction of binding affinity are reasonably successful, there are many other properties, such as bioavailability, metabolic half-life, side effects, etc., that first must be optimized before a ligand can become a safe and efficacious drug. These other characteristics are often difficult to predict using computer-aided drug design techniques. Nevertheless due to high attrition rates, especially during clinical phases of drug development, more attention is being focused early in the drug design process on selecting candidate drugs whose physicochemical properties are predicted to result in fewer complications during development and hence more likely to lead to an approved, marketed drug. Furthermore in vitro experiments complemented with computation methods are increasingly used in early drug discovery to select compounds with more favorable ADME (absorption, distribution, metabolism, and excretion) and toxicological profiles.
A biomolecular target (most commonly a protein or nucleic acid) is a key molecule involved in a particular metabolic or signaling pathway that is associated with a specific disease condition or pathology or to the infectivity or survival of a microbial pathogen. Potential drug targets are not necessarily disease causing but must by definition be disease modifying. In some cases, small molecules will be designed to enhance or inhibit the target function in the specific disease modifying pathway. Small molecules (for example receptor agonists, antagonists, inverse agonists, or modulators; enzyme activators or inhibitors; or ion channel openers or blockers) will be designed that are complementary to the binding site site of target. Small molecules (drugs) can be designed so as not to affect any other important "off-target" molecules (often referred to as antitargets) since drug interactions with off-target molecules may lead to undesirable side effects. Due to similarities in binding sites, closely related targets identified through sequence homology have the highest chance of cross reactivity and hence highest side effect potential.
Most commonly, drugs are organic small molecules produced through chemical synthesis, but biopolymer-based drugs (also known as biopharmaceuticals) produced through biological processes are becoming increasingly more common. In addition, mRNA-based gene silencing technologies may have therapeutic applications.
Rational drug discovery
In contrast to traditional methods of drug discovery (known as forward pharmacology), which rely on trial-and-error testing of chemical substances on cultured cells or animals, and matching the apparent effects to treatments, rational drug design (also called reverse pharmacology) begins with a hypothesis that modulation of a specific biological target may have therapeutic value. In order for a biomolecule to be selected as a drug target, two essential pieces of information are required. The first is evidence that modulation of the target will be disease modifying. This knowledge may come from, for example, disease linkage studies that show an association between mutations in the biological target and certain disease states. The second is that the target is "druggable". This means that it is capable of binding to a small molecule and that its activity can be modulated by the small molecule.
Once a suitable target has been identified, the target is normally cloned and expressed. The expressed target is then used to establish a screening assay. In addition, the three-dimensional structure of the target may be determined.
The search for small molecules that bind to the target is begun by screening libraries of potential drug compounds. This may be done by using the screening assay (a "wet screen"). In addition, if the structure of the target is available, a virtual screen may be performed of candidate drugs. Ideally the candidate drug compounds should be "drug-like", that is they should possess properties that are predicted to lead to oral bioavailability, adequate chemical and metabolic stability, and minimal toxic effects. Several methods are available to estimate druglikeness such as Lipinski's Rule of Five and a range of scoring methods such as lipophilic efficiency. Several methods for predicting drug metabolism have also been proposed in the scientific literature.
Due to the large number of drug properties that must be simultaneously optimized during the design process, multi-objective optimization techniques are sometimes employed. Finally because of the limitations in the current methods for prediction of activity, drug design is still very much reliant on serendipity and bounded rationality.
Computer-aided drug design 
The most fundamental goal in drug design is to predict whether a given molecule will bind to a target and if so how strongly. Molecular mechanics or molecular dynamics are most often used to predict the conformation of the small molecule and to model conformational changes in the biological target that may occur when the small molecule binds to it. Semi-empirical, ab initio quantum chemistry methods, or density functional theory are often used to provide optimized parameters for the molecular mechanics calculations and also provide an estimate of the electronic properties (electrostatic potential, polarizability, etc.) of the drug candidate that will influence binding affinity.
Molecular mechanics methods may also be used to provide semi-quantitative prediction of the binding affinity. Also, knowledge-based scoring function may be used to provide binding affinity estimates. These methods use linear regression, machine learning, neural nets or other statistical techniques to derive predictive binding affinity equations by fitting experimental affinities to computationally derived interaction energies between the small molecule and the target.
Ideally, the computational method will be able to predict affinity before a compound is synthesized and hence in theory only one compound needs to be synthesized, saving enormous time and cost. The reality is that present computational methods are imperfect and provide, at best, only qualitatively accurate estimates of affinity. In practice it still takes several iterations of design, synthesis, and testing before an optimal drug is discovered. Computational methods have accelerated discovery by reducing the number of iterations required and have often provided novel structures.
Drug design with the help of computers may be used at any of the following stages of drug discovery:
- hit identification using virtual screening (structure- or ligand-based design)
- hit-to-lead optimization of affinity and selectivity (structure-based design, QSAR, etc.)
- lead optimization optimization of other pharmaceutical properties while maintaining affinity
In order to overcome the insufficient prediction of binding affinity calculated by recent scoring functions, the protein-ligand interaction and compound 3D structure information are used for analysis. For structure-based drug design, several post-screening analyses focusing on protein-ligand interaction have been developed for improving enrichment and effectively mining potential candidates:
- Consensus scoring
- Selecting candidates by voting of multiple scoring functions
- May lose the relationship between protein-ligand structural information and scoring criterion
- Cluster analysis
- Represent and cluster candidates according to protein-ligand 3D information
- Needs meaningful representation of protein-ligand interactions.
There are two major types of drug design. The first is referred to as ligand-based drug design and the second, structure-based drug design.
Ligand-based drug design (or indirect drug design) relies on knowledge of other molecules that bind to the biological target of interest. These other molecules may be used to derive a pharmacophore model that defines the minimum necessary structural characteristics a molecule must possess in order to bind to the target. In other words, a model of the biological target may be built based on the knowledge of what binds to it, and this model in turn may be used to design new molecular entities that interact with the target. Alternatively, a quantitative structure-activity relationship (QSAR), in which a correlation between calculated properties of molecules and their experimentally determined biological activity, may be derived. These QSAR relationships in turn may be used to predict the activity of new analogs.
Structure-based drug design (or direct drug design) relies on knowledge of the three dimensional structure of the biological target obtained through methods such as x-ray crystallography or NMR spectroscopy. If an experimental structure of a target is not available, it may be possible to create a homology model of the target based on the experimental structure of a related protein. Using the structure of the biological target, candidate drugs that are predicted to bind with high affinity and selectivity to the target may be designed using interactive graphics and the intuition of a medicinal chemist. Alternatively various automated computational procedures may be used to suggest new drug candidates.
Current methods for structure-based drug design can be divided roughly into three main categories. The first method is identification of new ligands for a given receptor by searching large databases of 3D structures of small molecules to find those fitting the binding pocket of the receptor using fast approximate docking programs. This method is known as virtual screening. A second category is de novo design of new ligands. In this method, ligand molecules are built up within the constraints of the binding pocket by assembling small pieces in a stepwise manner. These pieces can be either individual atoms or molecular fragments. The key advantage of such a method is that novel structures, not contained in any database, can be suggested. A third method is the optimization of known ligands by evaluating proposed analogs within the binding cavity.
Binding site identification
Binding site identification is the first step in structure based design. If the structure of the target or a sufficiently similar homolog is determined in the presence of a bound ligand, then the ligand should be observable in the structure in which case location of the binding site is trivial. However there may be unoccupied allosteric binding sites that may be of interest. Furthermore it may be that only apoprotein (protein without ligand) structures are available and the reliable identification of unoccupied sites that have the potential to bind ligands with high affinity is non-trivial. In brief, binding site identification usually relies on identification of concave surfaces on the protein that can accommodate drug sized molecules that also possess appropriate "hot spots" (hydrophobic surfaces, hydrogen bonding sites, etc.) that drive ligand binding.
Structure-based drug design attempts to use the structure of proteins as a basis for designing new ligands by applying the principles of molecular recognition. Selective high affinity binding to the target is generally desirable since it leads to more efficacious drugs with fewer side effects. Thus, one of the most important principles for designing or obtaining potential new ligands is to predict the binding affinity of a certain ligand to its target (and known antitargets) and use the predicted affinity as a criterion for selection.
- ΔG0 – empirically derived offset that in part corresponds to the overall loss of translational and rotational entropy of the ligand upon binding.
- ΔGhb – contribution from hydrogen bonding
- ΔGionic – contribution from ionic interactions
- ΔGlip – contribution from lipophilic interactions where |Alipo| is surface area of lipophilic contact between the ligand and receptor
- ΔGrot – entropy penalty due to freezing a rotatable in the ligand bond upon binding
A more general thermodynamic "master" equation is as follows:
- desolvation – enthalpic penalty for removing the ligand from solvent
- motion – entropic penalty for reducing the degrees of freedom when a ligand binds to its receptor
- configuration – conformational strain energy required to put the ligand in its "active" conformation
- interaction – enthalpic gain for "resolvating" the ligand with its receptor
The basic idea is that the overall binding free energy can be decomposed into independent components that are known to be important for the binding process. Each component reflects a certain kind of free energy alteration during the binding process between a ligand and its target receptor. The Master Equation is the linear combination of these components. According to Gibbs free energy equation, the relation between dissociation equilibrium constant, Kd, and the components of free energy was built.
Various computational methods are used to estimate each of the components of the master equation. For example, the change in polar surface area upon ligand binding can be used to estimate the desolvation energy. The number of rotatable bonds frozen upon ligand binding is proportional to the motion term. The configurational or strain energy can be estimated using molecular mechanics calculations. Finally the interaction energy can be estimated using methods such as the change in non polar surface, statistically derived potentials of mean force, the number of hydrogen bonds formed, etc. In practice, the components of the master equation are fit to experimental data using multiple linear regression. This can be done with a diverse training set including many types of ligands and receptors to produce a less accurate but more general "global" model or a more restricted set of ligands and receptors to produce a more accurate but less general "local" model.
A particular example of rational drug design involves the use of three-dimensional information about biomolecules obtained from such techniques as X-ray crystallography and NMR spectroscopy. Computer-aided drug design in particular becomes much more tractable when there is a high-resolution structure of a target protein bound to a potent ligand. This approach to drug discovery is sometimes referred to as structure-based drug design. The first unequivocal example of the application of structure-based drug design leading to an approved drug is the carbonic anhydrase inhibitor dorzolamide, which was approved in 1995.
Another important case study in rational drug design is imatinib, a tyrosine kinase inhibitor designed specifically for the bcr-abl fusion protein that is characteristic for Philadelphia chromosome-positive leukemias (chronic myelogenous leukemia and occasionally acute lymphocytic leukemia). Imatinib is substantially different from previous drugs for cancer, as most agents of chemotherapy simply target rapidly dividing cells, not differentiating between cancer cells and other tissues.
Additional examples include:
- Many of the atypical antipsychotics
- Cimetidine, the prototypical H2-receptor antagonist from which the later members of the class were developed
- Selective COX-2 inhibitor NSAIDs
- Enfuvirtide, a peptide HIV entry inhibitor
- Nonbenzodiazepines like zolpidem and zopiclone
- Raltegravir, an HIV integrase inhibitor
- SSRIs (selective serotonin reuptake inhibitors), a class of antidepressants
- Zanamivir, an antiviral drug
- 5-HT3 antagonists
- Acetylcholine receptor agonists
- Angiotensin receptor antagonists
- Bcr-Abl tyrosine-kinase inhibitors
- Cannabinoid receptor antagonists
- CCR5 receptor antagonists
- Cyclooxygenase 2 inhibitors
- Dipeptidyl peptidase-4 inhibitors
- HIV protease inhibitors
- NK1 receptor antagonists
- Non-nucleoside reverse transcriptase inhibitors
- PDE5 inhibitors
- Proton pump inibitors
- Renin inhibitors
- TRPV1 antagonists
- c-Met inhibitors
It has been argued that the highly rigid and focused nature of rational drug design suppresses serendipity in drug discovery. Because many of the most significant medical discoveries have been inadvertent, the recent focus on rational drug design may limit the progress of drug discovery. Furthermore, the rational design of a drug may be limited by a crude or incomplete understanding of the underlying molecular processes of the disease it is intended to treat.
- Madsen U, Krogsgaard-Larsen P, Liljefors T (2002). Textbook of Drug Design and Discovery. Washington, DC: Taylor & Francis. ISBN 0-415-28288-8.
- Reynolds CH, Merz KM, Ringe D, eds. (2010). Drug Design: Structure- and Ligand-Based Approaches (1 ed.). Cambridge, UK: Cambridge University Press. ISBN 978-0521887236.
- Shirai H, Prades C, Vita R, Marcatili P, Popovic B, Xu J et al. (Nov 2014). "Antibody informatics for drug discovery". Biochimica Et Biophysica Acta 1844 (11): 2002–2015. doi:10.1016/j.bbapap.2014.07.006. PMID 25110827.
- Tollenaere JP (Apr 1996). "The role of structure-based ligand design and molecular modelling in drug discovery". Pharmacy World & Science 18 (2): 56–62. doi:10.1007/BF00579706. PMID 8739258.
- Waring MJ, Arrowsmith J, Leach AR, Leeson PD, Mandrell S, Owen RM et al. (2015). "An analysis of the attrition of drug candidates from four major pharmaceutical companies". Nature Reviews Drug Discovery. doi:10.1038/nrd4609. PMID 26091267.
- Yu H, Adedoyin A (Sep 2003). "ADME-Tox in drug discovery: integration of experimental and computational technologies". Drug Discovery Today 8 (18): 852–61. doi:10.1016/S1359-6446(03)02828-9. PMID 12963322.
- Dixon SJ, Stockwell BR (Dec 2009). "Identifying druggable disease-modifying gene products". Current Opinion in Chemical Biology 13 (5-6): 549–55. doi:10.1016/j.cbpa.2009.08.003. PMC 2787993. PMID 19740696.
- Imming P, Sinning C, Meyer A (Oct 2006). "Drugs, their targets and the nature and number of drug targets". Nature Reviews. Drug Discovery 5 (10): 821–34. doi:10.1038/nrd2132. PMID 17016423.
- Anderson AC (Sep 2003). "The process of structure-based drug design". Chemistry & Biology 10 (9): 787–97. PMID 14522049.
- Recanatini M, Bottegoni G, Cavalli A (Dec 2004). "In silico antitarget screening". Drug Discovery Today. Technologies 1 (3): 209–15. doi:10.1016/j.ddtec.2004.10.004. PMID 24981487.
- Wu-Pong S, Rojanasakul Y (2008). Biopharmaceutical drug design and development (2nd ed.). Totowa, NJ Humana Press: Humana Press. ISBN 978-1-59745-532-9.
- Scomparin A, Polyak D, Krivitsky A, Satchi-Fainaro R (Apr 2015). "Achieving successful delivery of oligonucleotides - From physico-chemical characterization to in vivo evaluation". Biotechnology Advances. doi:10.1016/j.biotechadv.2015.04.008. PMID 25916823.
- Ganellin CR, Jefferis R, Roberts SM (2013). "The small molecule drug discovery process — from target selection to candidate selection". Introduction to Biological and Small Molecule Drug Research and Development: theory and case studies. Elsevier. ISBN 9780123971760.
- Yuan Y, Pei J, Lai L (Dec 2013). "Binding site detection and druggability prediction of protein targets for structure-based drug design". Current Pharmaceutical Design 19 (12): 2326–33. PMID 23082974.
- Rishton GM (Jan 2003). "Nonleadlikeness and leadlikeness in biochemical screening". Drug Discovery Today 8 (2): 86–96. PMID 12565011.
- Hopkins AL (2011). "Chapter 25: Pharmacological space". In Wermuth CG. The Practice of Medicinal Chemistry (3 ed.). Academic Press. pp. 521–527. ISBN 978-0-12-374194-3.
- Kirchmair J (2014). Drug Metabolism Prediction. Wiley's Methods and Principles in Medicinal Chemistry 63. Wiley-VCH. ISBN 978-3-527-67301-8.
- Nicolaou CA, Brown N (Sep 2013). "Multi-objective optimization methods in drug design". Drug Discovery Today. Technologies 10 (3): 427–35. doi:10.1016/j.ddtec.2013.02.001. PMID 24050140.
- Ban TA. "The role of serendipity in drug discovery". Dialogues in Clinical Neuroscience 8 (3): 335–44. PMID 17117615.
- Ethiraj SK, Levinthal D (Sep 2004). "Bounded Rationality and the Search for Organizational Architecture: An Evolutionary Perspective on the Design of Organizations and Their Evolvability". Administrative Science Quarterly (Sage Publications, Inc. on behalf of the Johnson Graduate School of Management, Cornell University) 49 (3): 404–437. JSTOR 4131441.
- Lewis RA (2011). "Chapter 4: The Development of Molecular Modelling Programs: The Use and Limitations of Physical Models". In Gramatica P, Livingstone DJ, Davis AM. Drug Design Strategies: Quantitative Approaches. Royal Society of Chemistry. pp. 88–107. doi:10.1039/9781849733410-00088. ISBN 978-1849731669.
- Rajamani R, Good AC (May 2007). "Ranking poses in structure-based lead discovery and optimization: current trends in scoring function development". Current Opinion in Drug Discovery & Development 10 (3): 308–15. PMID 17554857.
- de Azevedo WF, Dias R (Dec 2008). "Computational methods for calculation of ligand-binding affinity". Current Drug Targets 9 (12): 1031–9. doi:10.2174/138945008786949405. PMID 19128212.
- Singh J, Chuaqui CE, Boriack-Sjodin PA, Lee WC, Pontz T, Corbley MJ et al. (Dec 2003). "Successful shape-based virtual screening: the discovery of a potent inhibitor of the type I TGFbeta receptor kinase (TbetaRI)". Bioorganic & Medicinal Chemistry Letters 13 (24): 4355–9. doi:10.1016/j.bmcl.2003.09.028. PMID 14643325.
- Becker OM, Dhanoa DS, Marantz Y, Chen D, Shacham S, Cheruku S et al. (Jun 2006). "An integrated in silico 3D model-driven discovery of a novel, potent, and selective amidosulfonamide 5-HT1A agonist (PRX-00023) for the treatment of anxiety and depression". Journal of Medicinal Chemistry 49 (11): 3116–35. doi:10.1021/jm0508641. PMID 16722631.
- Liang S, Meroueh SO, Wang G, Qiu C, Zhou Y (May 2009). "Consensus scoring for enriching near-native structures from protein-protein docking decoys". Proteins 75 (2): 397–403. doi:10.1002/prot.22252. PMC 2656599. PMID 18831053.
- Oda A, Tsuchida K, Takakura T, Yamaotsu N, Hirono S (2006). "Comparison of consensus scoring strategies for evaluating computational models of protein-ligand complexes". Journal of Chemical Information and Modeling 46 (1): 380–91. doi:10.1021/ci050283k. PMID 16426072.
- Deng Z, Chuaqui C, Singh J (Jan 2004). "Structural interaction fingerprint (SIFt): a novel method for analyzing three-dimensional protein-ligand binding interactions". Journal of Medicinal Chemistry 47 (2): 337–44. doi:10.1021/jm030331x. PMID 14711306.
- Amari S, Aizawa M, Zhang J, Fukuzawa K, Mochizuki Y, Iwasawa Y et al. (2006). "VISCANA: visualized cluster analysis of protein-ligand interaction based on the ab initio fragment molecular orbital method for virtual ligand screening". Journal of Chemical Information and Modeling 46 (1): 221–30. doi:10.1021/ci050262q. PMID 16426058.
- Guner OF (2000). Pharmacophore Perception, Development, and use in Drug Design. La Jolla, Calif: International University Line. ISBN 0-9636817-6-1.
- Tropsha A (2010). "QSAR in Drug Discovery". In Reynolds CH, Merz KM, Ringe D. Drug Design: Structure- and Ligand-Based Approaches (1 ed.). Cambridge, UK: Cambridge University Press. pp. 151–164. ISBN 978-0521887236.
- Leach, Andrew R.; Harren, Jhoti (2007). Structure-based Drug Discovery. Berlin: Springer. ISBN 1-4020-4406-2.
- Mauser H, Guba W (May 2008). "Recent developments in de novo design and scaffold hopping". Current Opinion in Drug Discovery & Development 11 (3): 365–74. PMID 18428090.
- Klebe G (2000). "Recent developments in structure-based drug design". Journal of Molecular Medicine 78 (5): 269–81. PMID 10954199.
- Wang R, Gao Y, Lai L (2000). "LigBuilder: A Multi-Purpose Program for Structure-Based Drug Design". Journal of Molecular Modeling 6 (7–8): 498–516. doi:10.1007/s0089400060498.
- Schneider G, Fechner U (Aug 2005). "Computer-based de novo design of drug-like molecules". Nature Reviews. Drug Discovery 4 (8): 649–63. doi:10.1038/nrd1799. PMID 16056391.
- Jorgensen WL (Mar 2004). "The many roles of computation in drug discovery". Science 303 (5665): 1813–8. Bibcode:2004Sci...303.1813J. doi:10.1126/science.1096361. PMID 15031495.
- Leis S, Schneider S, Zacharias M (2010). "In silico prediction of binding sites on proteins". Current Medicinal Chemistry 17 (15): 1550–62. PMID 20166931.
- Warren GL, Warren SD (2011). "Chapter 16: Scoring Drug-Receptor Interactions". In Gramatica P, Livingstone DJ, Davis AM. Drug Design Strategies: Quantitative Approaches. Royal Society of Chemistry. p. 440–457. doi:10.1039/9781849733410-00440. ISBN 978-1849731669.
- Böhm HJ (Jun 1994). "The development of a simple empirical scoring function to estimate the binding constant for a protein-ligand complex of known three-dimensional structure". Journal of Computer-Aided Molecular Design 8 (3): 243–56. Bibcode:1994JCAMD...8..243B. doi:10.1007/BF00126743. PMID 7964925.
- Liu J, Wang R (23 March 2015). "Classification of Current Scoring Functions". Journal of Chemical Information and Modeling 55 (3): 475–482. doi:10.1021/ci500731a.
- Ajay, Murcko MA (1995). "Computational methods to predict binding free energy in ligand-receptor complexes". J. Med. Chem. 38 (26): 4953–67. doi:10.1021/jm00026a001. PMID 8544170.
- Gramatica P (2011). "Chapter 17: Modeling Chemicals in the Environment". In Gramatica P, Livingstone DJ, Davis AM. Drug Design Strategies: Quantitative Approaches. Royal Society of Chemistry. p. 466. doi:10.1039/9781849733410-00458. ISBN 978-1849731669.
- Greer J, Erickson JW, Baldwin JJ, Varney MD (Apr 1994). "Application of the three-dimensional structures of protein target molecules in structure-based drug design". Journal of Medicinal Chemistry 37 (8): 1035–54. doi:10.1021/jm00034a001. PMID 8164249.
- Timmerman H, Gubernator K, Böhm HJ, Mannhold R, Kubinyi H (1998). Structure-based Ligand Design (Methods and Principles in Medicinal Chemistry). Weinheim: Wiley-VCH. ISBN 3-527-29343-4.
- Capdeville R, Buchdunger E, Zimmermann J, Matter A (Jul 2002). "Glivec (STI571, imatinib), a rationally developed, targeted anticancer drug". Nature Reviews. Drug Discovery 1 (7): 493–502. doi:10.1038/nrd839. PMID 12120256.
- "AutoDock's role in Developing the First Clinically-Approved HIV Integrase Inhibitor". Press Release. The Scripps Research Institute. 2007-12-17.
- Klein DF (Mar 2008). "The loss of serendipity in psychopharmacology". Jama 299 (9): 1063–5. doi:10.1001/jama.299.9.1063. PMID 18319418.