1 / 34

Protein Structure Prediction and Homology Modeling

Learn about the importance of protein structures, their role in identifying active sites and protein mechanisms, and how protein structure prediction methods help bridge the gap between protein sequences and experimentally determined structures. Explore homology modeling, a technique that extrapolates protein structures based on similarities in sequence and how it can be used to design mutants and facilitate drug development.

donnah
Download Presentation

Protein Structure Prediction and Homology Modeling

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Protein Structure Prediction and Protein Homology modeling Dinesh Gupta ICGEB, New Delhi 11/16/2019 5:28 AM

  2. Why protein structures • Identifying active and binding sites • Determine protein’s mechanism (catalysis & interactions) • Searching ligands for a binding site • Understanding the molecular basis of diseases • Designing mutants • Drug design

  3. Why protein structure prediction? • There is a discrepancy in number of protein sequences and number of experimentally determined structures. • Release 2017_01 of 18th-Jan-17 of UniProtKB/Swiss- Prot contains 5,53,474 sequence entries (www.uniprot.org) whereas only 40094 distinct protein structures in PDB database (www.rcsb.org, 24th-Jan-17). • Experimental determination: slow, expensive- failed in certain cases! • Hence methods which can reasonably predict protein structures are important. 11/16/2019 5:28 AM

  4. 11/16/2019 5:28 AM

  5. Protein structure prediction • Methods: – Homology (comparative) modeling – Threading – Ab-initio 11/16/2019 5:28 AM

  6. Protein Homology modeling • Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template. • Basis: proteins with similar sequences are likely to assume same folding • Certain proteins with as low as 25% similarity have been observed to assume same 3D structure 11/16/2019 5:28 AM

  7. The accuracy of modeling is proportional to the similarity in primary sequences 11/16/2019 5:28 AM

  8. Homology modeling consists of the following steps Template recognition • Database search using BLAST Model generation Sequence alignment • Pair-wise • Multiple alignment Model validation • Statistical potentials • Physics-based energy calculations

  9. Template recognition The sequence of the target protein is first scanned against the sequences of known structures using pair wise sequence alignment program like BLAST or FASTA. However more sensitive methods based on multiple sequence alignment like PSI-BLAST are preferred for remote homologs.  Selection of best template is decided by several factors like, high sequence identity (> 30%), better coverage of the aligned region, quality of the template structure etc..

  10. Model generation Once an initial target-template alignment is built, a variety of methods are available to construct a 3D model. The most commonly used optimization techniques to satisfy spatial restraints obtained from template-target alignment (Modeller). one is distance geometry and Spatial restraints are extracted from two sources: first, homology- derived restraints on the distances and dihedral angles in the target sequence are extracted from its alignment with the template structures.  Second, stereo chemical restraints such as bond length and bond angle preferences are mechanics force field of CHARMM-22 obtained from the molecular

  11. Model validation In general the homology model is susceptible to error and needs to be assessed for quality by either statistical potentials or physics-based energy calculations. Energy-based calculations checks if the bond-length and bond-angles are with normal ranges, and if there are lot of bumps in the model which corresponds to high vanderwaals energy (PROCHECK). Statistical potentials are empirical methods based on observed residue- residue contact frequencies among proteins of known structure in the PDB. They assign a probability or energy score to each possible pairwise interaction between amino acids and combine these pair wise interaction scores into a single score for the entire model (Prosa and DOPE).

  12. Software for homology molecular modeling • Molecular graphics: PyMol, RasMol, etc. • Freeware: available for all OS – Downloadable • Modeller (Sali, 1998) https://salilab.org/modeller/ • DeepView (SwissPDB viewer) http://spdbv.vital-it.ch • YASARA (WHATIF) http://yasara.org – Web based: • SWISS MODEL server (https://swissmodel.expasy.org/interactive) • CPH model server (http://www.cbs.dtu.dk/services/CPHmodels) • Phyre2 (http://www.sbg.bio.ic.ac.uk/phyre2/html/page.cgi?id=index) 11/16/2019 5:28 AM

  13. Common errors in homology modeling • Errors in modeling side chains • Errors in modeling sequence segments without a template (e.g., variable and/or loop regions) • Errors due to selecting the wrong template for modeling 11/16/2019

  14. 11/16/2019 5:28 AM Petry, D. and Honig, B. Molecular Cell, Vol. 20, 811–819, December 22, 2005

  15. 11/16/2019 5:28 AM

  16. 11/16/2019 5:28 AM

  17. 11/16/2019

  18. 11/16/2019

  19. 11/16/2019

  20. 11/16/2019

  21. 11/16/2019

  22. Databases of protein models 11/16/2019

  23. 11/16/2019

  24. 11/16/2019

  25. Homology modeling summary • Homology modeling methods can efficiently predict structure if the target protein has similarity to known folds • The greater the sequence identity to template, the better is the quality of the model • Homology modeling can yield high accuracy and high resolution atomic models • With increasing numbers of unique structures and solved experimentally homology modeling methods should become more powerful 11/16/2019

  26. Protein structure prediction • Methods: – Homology (comparative) modelling – Threading – Ab-initio 11/16/2019 5:28 AM

  27. Threading • Structure prediction method that picks up where homology modelling leaves off. – Homology Modeling: Align sequence to sequence – Threading: Align sequence to structure • Threading is based on the fact that only limited number of protein folds are possible in nature. • Threading recognizes folds in proteins having no similarity to known proteins structures • Very approximate models • Check by forcing a sequence of structure into known folds checking the packing of aa residues, including sides chains, in each fold. Scoring functions. 11/16/2019 5:28 AM

  28. 11/16/2019 5:28 AM

  29. Threading software • I-TASSER http://zhanglab.ccmb.med.umich.edu/I-TASSER/ • RaptorX http://raptorx.uchicago.edu 11/16/2019 5:28 AM

  30. 11/16/2019 5:28 AM

  31. Protein structure prediction • Methods: – Homology (comparative) modelling – Threading – Ab-initio 11/16/2019 5:28 AM

  32. Ab initio structure prediction • Still experimental • Less accurate • ROSETTA (David Baker) – http://robetta.bakerlab.org/ 11/16/2019 5:28 AM

  33. Energy minimization (Molecular Mechanics, MM) • Heart of any protein modeling technique • Energy minimization techniques try to minimize energies of molecules with the help of equations also called force fields which represent energies of molecular systems • Assumption: Molecular systems try to achieve lowest energies in equilibrium. • MM could be used to calculate large scale conformational changes over long periods of time, but currently computationally infeasible. 11/16/2019 5:28 AM

  34. What do Force Fields represent? • How do atoms stretch, vibrate, rotate, etc.? • Represent the constraints on atomic motion (e.g. van der Waals, electrostatic, bonds, etc.) • Must also represent solvation effects etc. • Quantum solutions exist, but are too complex to calculate for such large systems • Empirical (approximate) energy functions must be used. No single best function exists. 11/16/2019 5:28 AM

More Related