1 / 63

Insilico drug designing

Insilico drug designing. Dinesh Gupta Structural and Computational Biology Group ICGEB. Modern drug discovery process. Target identification. Target validation. Lead identification. Lead optimization. Preclinical phase. Drug discovery. 6-9 years. 2-5 years.

rainer
Download Presentation

Insilico drug designing

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Insilico drug designing Dinesh Gupta Structural and Computational Biology Group ICGEB

  2. Modern drug discovery process Target identification Target validation Lead identification Lead optimization Preclinical phase Drug discovery 6-9 years 2-5 years • Drug discovery is an expensive process involving high R & D cost and extensive clinical testing • A typical development time is estimated to be 10-15 years.

  3. Drug discovery technologies • Target identification • Genomics, gene expression profiling and proteomics • Target Validation • Gene knock-out, inhibition assay • Lead Identification • High throughput screening, fragment based screening, combinatorial libraries • Lead Optimization • Medicinal chemistry driven optimization, X-ray crystallography, QSAR, ADME profiling (bioavailability) • Pre Clinical Phase • Pharmacodynamics (PD), Pharmacokinetics (PK), ADME, and toxicity testing through animals • Clinical Phase • Human trials

  4. Rational Approach to Drug Discovery Identify and validate target Clone gene encoding target Express target Crystal structures/MM of target and target/inhibitor complexes Identify lead compounds Synthesize modified lead compounds Toxicity & pharmacokinetic studies Preclinical trials

  5. Bioinformatics tools in DD • Comparison of Sequences: Identify targets • Homology modelling: active site prediction • Systems Biology: Identify targets • Databases: Manage information • In silico screening (Ligand based, receptor based): Iterative steps of Molecular docking. • Pharmacogenomic databases: assist safety related issues

  6. Currently used drug targets J. Drews Science 287, 1960 -1964 (2000) This information is used by bioinformaticians to narrow the search in the groups Published by AAAS

  7. Insilico methods in Drug Discovery • Molecular docking • Virtual High through put screening. • QSAR (Quantitative structure-activity relationship) • Pharmacophore mapping • Fragment based screening

  8. L L R R Molecular Docking • Docking is the computational determination of binding affinity between molecules (protein structure and ligand). • Given a protein and a ligand find out the binding free energy of the complex formed by docking them.

  9. Molecular Docking: classification • Docking or Computer aided drug designing can be broadly classified • Receptor based methods- make use of the structure of the target protein. • Ligand based methods- based on the known inhibitors

  10. Receptor based methods • Uses the 3D structure of the target receptor to search for the potential candidate compounds that can modulate the target function. • These involve molecular docking of each compound in the chemical database into the binding site of the target and predicting the electrostatic fit between them. • The compounds are ranked using an appropriate scoring function such that the scores correlate with the binding affinity. • Receptor based method has been successfully applied in many targets

  11. Ligand based strategy • In the absence of the structural information of the target, ligand based method make use of the information provided by known inhibitors for the target receptor. • Structures similar to the known inhibitors are identified from chemical databases by variety of methods, • Some of the methods widely used are similarity and substructure searching, pharmacophore matching or 3D shape matching. • Numerous successful applications of ligand based methods have been reported

  12. database known actives structures found Ligand based strategy Search for similar compounds

  13. Binding free energy • Binding free energy is calculated as the sum of the following energies • - Electrostatic Energy • - Vander waals Energy • - Internal Energy change due to flexible deformations • - Translational and rotational energy • Lesser the binding free energy of a complex the more stable it is

  14. Basic binding mechanism • Complementarities between the ligand and the binding site: • Steric complementarities, i.e. the shape of the ligand is mirrored in the shape of the binding site. • Physicochemical complementarities

  15. Components of molecular docking • A) Search algorithm • To find the best conformation of the ligand • and the protein system. • Rigid and flexible docking • B) Scoring function • Rank the ligands according to the interaction energy. • Based on the energy force-field function.

  16. Success with vHTS • Dihydrofolate reductase inhibitor (1992) • HIV-protease (1992) • Phospholypase A2 (1994) • Thrombine (1996) • Carbonic anhydrase inhibitors(2002)

  17. Virtual High Throughput Screening • Less expensive than High Throughput Screening • Faster than conventional screening • Scanning a large number of potential drug like molecules in very less time. • HTS itself is a trial and error approach but can be better complemented by virtual screening.

  18. QSAR • QSAR is statistical approach that attempts to relate physical and chemical properties of molecules to their biological activities. • Various descriptors like molecular weight, number of rotatable bonds LogP etc. are commonly used. • Many QSAR approaches are in practice based on the data dimensions. • It ranges from 1D QSAR to 6D QSAR.

  19. Pharmacophore mapping • It is a 3D description of a pharmacophore, developed by specifying the nature of the key pharmacophoric features and the 3D distance map among all the key features. • A Pharmacophore map can be generated by superposition of active compounds to identify their common features. • Based on the pharmacophore map either de novo design or 3D database searching can be carried out.

  20. Modeling and informatics in drug design

  21. Increased application of structure based drug designing is facilitated by: • Growth of targets number • Growth of 3D structures determination (PDB database) • Growth of computing power • Growth of prediction quality of protein-compound interactions

  22. Summary: role of Bioinformatics? • Identification of homologs of functional proteins (motif, protein families, domains) • Identification of targets by cross species examination • Visualization of molecular models • Docking, vHTS • QSAR, Pharmacophore mapping

  23. Example: use of Bioinformatics in Drug discoveryIdentification of novel drug targets against human malaria

  24. Malaria – A global problem! • Malaria causes at least 500 million clinical cases and more than one million deaths each year. • A child dies of malaria every 30 seconds. • Out of four Plasmodium species causing human malaria, P.falciparum poses most serious threat: because of its virulence, prevalence and drug resistance. • Malaria takes an economic toll - cutting economic growth rates by as much as 1.3% in countries with high disease rates. • There are four types of human malaria: • Plasmodium falciparum • Plasmodium vivax • Plasmodium malariae • Plasmodium ovale.

  25. Approximately half of the world's population is at risk of malaria, particularly those living in lower-income countries. • Today, there are 109 malaria affected countries in 4 regions

  26. Chemical structures of drugs in widely used for treatment of Malaria a) Chloroquine b) Quinine c) Artemether d) Sodium artesunate e) Dihydroartemisinin f) Pyrimethamine g) Sulfadoxine h) Mefloquine i) Halofantrine j) Primaquine k) Tafenoquine l) Chlorproguanil m) Dapsone

  27. http://malaria.who.int/docs/adpolicy_tg2003.pdf

  28. Problems with the existing drugs • Drug resistance is most common problem • Adverse effects (Shock and cardiac arrhythmias caused by Chloroquine) • Poor patient compliance (Quinine tastes very unpleasant, causes dizziness, nausea etc.) • High cost of production for some effective drugs (Atovaquine). • Urgent need for identification of novel drug targets which are effective and affordable.

  29. Strategies for drug target identification in P. falciparum • Parasite culture for functional assays are difficult and expensive. Making computational approaches more relevant. • Malaria remains a neglected disease- very few stake holders! • Availability of the genomic data of P.falciparum and H.sapiens has facilitated the effective application of comparative genomics. • Comparative genomics helps in the identification and exploitation of different characteristic features in host and the parasite. • Identification of specific metabolic pathways in P. falciparum and targeting the crucial proteins is an attractive approach of target based drug discovery.

  30. A. gambiae P. falciparum Predicted proteome H. sapiens Comparison of proteomes helps in identifying important indispensible parasite proteins • Out of 5334 predicted proteins in P. falciparum, 60% didn’t show any similarity to known proteins. • Hence assigning a physiological functional role to these hypothetical proteins using bioinformatics approach still remains a challenge.

  31. Novel drug target identification in P.falciparum Comparative genomics studies ~40% identity threshold for three-dimensional modeling Relational Database of homology models 476 P.falciparum proteins BlastP Human proteome Large set of proteins with no/low similarity Literature search for all these proteins Check for physiological and biochemical functions; etc .. Putative drug targets in P.falciparum Proteasome machinery (ClpQY and ClpAP) in P.falciparum

  32. Targets identified by comparison of proteins models • Identification of two proteasomal proteins of prokaryotic origin, not present in hosts. • The protein degradation is an important process in parasite development inside host RBCs.

  33. Eukaryotic and prokaryotic proteasome machinery • 26S proteasome: eukaryotic type • 19S regulatory + 20S proteolytic particle • Present only in Eukaryotes and archae • Degrades ubiquitinated proteins • > 20 different proteins involved • 20S proteasome • ClpQY system: prokaryotic type • ClpY cap + ClpQ core particle • Present only in prokaryotes • No ubiquitination in prokaryote • Substrate specificity is not known • Only two proteins ClpQ & ClpY Substrate protein ClpY ClpQ ClpY Peptides

  34. ATP Dependent Protease MachineryClpQY (PfHslUV system) • The HslUV complex in prokaryotes is composed of an HslV threonine protease and HslU ATP-dependent protease, a chaperone of Clp/Hsp100 family. • HslV (ClpQ) subunits are arranged in form of two-stacked hexameric rings and are capped by two HslU (ClpY) hexamers at both ends. • HslU (ClpY) hexamer recognizes and unfold peptide substrates with an ATP dependent process, and translocates them into HslV for degradation.

  35. Crystal structure of HslUV complex in H. influenzae PfClpQY complex model in P. falciparum

  36. ATP Dependent Protease machineriesClpQY (PfHslUV system) • The HslUV complex in prokaryotes is composed of an HslV threonine protease and ATP-dependent protease HslU, a chaperone of clp/Hsp100 family. • HslV subunits are arranged in the form of two-stacked hexameric rings and are capped by two HslU hexamers at both ends. • In an ATP dependent process, HslU hexamer recognizes and unfold peptide substrates and translocate them into HslV for degradation.

  37. PfClpQ component MFIRNFVNIIGSQKSITKTIARNYFSDNSKLIIPRHGTTILCVRKNN EVCLIGDGMVSQGTMIVKGNAKKIRRLKDNILMGFAGATADCFTLLDKFETKIDEYPNQL LRSCVELAKLWRTDRYLRHLEAVLIVADKDILLEVTGNGDVLEPSGNVLGTGSGGPYAMA AARALYDVENLSAKDIAYKAMNIAADMCCHTNNNFICETL • For full length & matured active protein • Length : 207 aa (170) • Pro domain : 37aa • Important motifs found: • TT at N terminal in mature protein • GSGG common chymotrypsin protease signal. • Lys(28) and Arg(35) are two conserved amino acids play some role in the activity.

  38. Homologs of PfClpQ protein in other Plasmodium spp PK_ClpQ TTILCVRKNNEVCLIGDGMVSQGTMIVKGNAKKIRRLKDNILMGFAGATADCFTLLDKFE PV_ClpQ TTILCVRKNNEVCLIGDGMVSQGTMIVKGNAKKIRRLKDNILMGFAGATADCFTLLDKFE PF_ClpQ TTILCVRKNNEVCLIGDGMVSQGTMIVKGNAKKIRRLKDNILMGFAGATADCFTLLDKFE PY_ClpQ TTILCVRKNNEVCLIGDGMVSQGTMIVKGNAKKIRRLKDNILMGFAGATADCFTLLDKFE PB_ClpQ TTILCVRKNNEVCLIGDGMVSQGTMIVKGNAKKIRRLKDNILMGFAGATADCFTLLDKFE ************************************************************ PK_ClpQ TKIDEYPDQLLRSCVELAKLWRTDRYLRHLEAVLIVADKDVLLEVTGNGDVLEPSGNVLG PV_ClpQ TKIDEYPDQLLRSCVELAKLWRTDRYLRHLEAVLIVADKDVLLEVTGNGDVLEPSGNVLG PF_ClpQ TKIDEYPNQLLRSCVELAKLWRTDRYLRHLEAVLIVADKDILLEVTGNGDVLEPSGNVLG PY_ClpQ TKIDEYPDQLLRSCVELAKLWRTDRYLRHLEAVLIVADKDTLLEVTGNGDVLEPSGNVLG PB_ClpQ TKIDEYPDQLLRSCVELAKLWRTDRYLRHLEAVLIVADKDTLLEVTGNGDVLEPSGNVLG *******:******************************** ******************* PK_ClpQ TGSGGPYAIAAARALYDVENLSAKDIAYKAMNIAADMCCHTNNNFICETL PV_ClpQ TGSGGPYAIAAARALYDVENLSAKDIAYKAMNIAADMCCHTNNNFICETL PF_ClpQ TGSGGPYAMAAARALYDVENLSAKDIAYKAMNIAADMCCHTNNNFICETL PY_ClpQ TGSGGPYAMAAARALYDIENLSAKDIAYKAMNIAADMCCHTNHNFICETL PB_ClpQ TGSGGPYAIAAARALYDIENLSAKDIAYKAMNIAADMCCHTNHNFICETL ********:********:************************:*******

  39. Homology modeling of PfClpQ PfClpQ 1kyi Conservation of catalytic residues S125-G45-T1-K33 Structural alignment of PfClpQ and HslV (H.influenzae)

  40. E. coli E. coli E. coli S. enterica S. enterica S. enterica H. influenzae H. influenzae H. influenzae X. campestris X. campestris X. campestris W. pipientis W. pipientis W. pipientis P. falciparum P. falciparum P. falciparum T. brucei T. brucei T. brucei T. cruzi T. cruzi T. cruzi L. infantum L. infantum L. infantum Homology Modeling of PfClpQ • Most of the conserved residues in different bacterial species were either identical or similar in PfClpQ

  41. Protease Fluorogenic peptide substrate Fluorescence 150 500 150 400 100 100 300 200 50 50 100 0 0 0 1h 2h 3h 4h 5h 6h 1h 2h 3h 4h 5h 6h 30 60 90 120 150 180 Time Time in minutes Biochemical characterization of PfClpQ protein Activity assay for PfClpQ protein Threonine protease like Substrate: Inhibitor: Chymotrypsin like Suc-LLVY-AMC chymostatin Peptidyl glutamyl hydrolase Z-LLE-AMC MG132 Cbz-GGL-AMC Lactacystin AMC released (m moles) AMC released (m moles) AMC released (m moles) Time Substrate conc (mM) Substrate conc (mM) Substrate conc (mM) Km =19.18 mM Km = 58.22 mM Km =37.79 mM

  42. Insilico identification of novel inhibitors against PfClpQ , a novel drug target of P.falciparum byhigh throughput docking Drug-like compound library (1,000,00) PfclpQ Molecular docking Ligand docked into protein’s active site Top 100 solutions Out of top 40 only 10 compounds available for purchase

  43. ClpQ interaction with ligand identified by virtual screening Phe46 Gly49 Gly48 Arg36 Thr2 Thr50 Val21 Ser22 Crystal structure of HslV complexed with a vinyl sulfone inhibitor

  44. I-Domain N-Domain C-Domain Identification of P. falciparum ClpY (PfClpY) gene • A regulatory component of ClpQY system • Recognizes the substrate; unfolds the substrate; feeds it into the degradation machine (ClpQ) • Belongs to AAA+ family of proteins ClpY ClpQ ClpY ATPase domain Walker B Walker A PfClpY DOMAINS N I N C • ~1.3 kb • Contain all the three ClpY domains- N, I and C

More Related