Taking a numeric path
Download
1 / 59

taking a numeric path - PowerPoint PPT Presentation


  • 207 Views
  • Updated On :

Taking a Numeric Path. Idan Szpektor. The Input. A partial description of a molecule: The atoms The bonds The bonds lengths and angles Spatial constraints on some of the atoms of a molecule. The Questions. Is there a conformation of the molecule that answers the spatial constraints?

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about 'taking a numeric path' - arleen


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
Taking a numeric path l.jpg

Taking a Numeric Path

Idan Szpektor


The input l.jpg
The Input

  • A partial description of a molecule:

    • The atoms

    • The bonds

    • The bonds lengths and angles

  • Spatial constraints on some of the atoms of a molecule.


The questions l.jpg
The Questions

  • Is there a conformation of the molecule that answers the spatial constraints?

  • Provide explicit description of such conformations.

  • How many “really” different conformations are there? Which one is the best?


Hard life l.jpg
Hard life…

Torsion angles are (usually) unconstrained

a large number of degrees of freedom

The answers to the questions are not easy to find analytically. They are also resource consuming (running time).


A numeric path l.jpg
A Numeric Path

  • Two algorithms take different approaches to limit the space of possible conformations to search in:

    • Numeric randomized algorithm

    • Semi numeric algorithm on a simpler problem


Slide6 l.jpg

A Randomized Kinematics-Based Approach to Pharmacophore-Constrained Conformational Search and Database Screening

S. M. Lavalle, P. W. Finn, L. E. Kavraki, J. C. Latombe (2000)


The pharmacophore problem l.jpg
The Pharmacophore Problem Pharmacophore-Constrained Conformational Search and Database Screening

  • The problem: to find a molecule conformation that satisfies a set of constraints out of a database of flexible molecules (e.g. for ligand docking).

  • Constraints:

    • Different atoms and their features in the molecule (types, charges etc.)

    • The bond length and angles between bonds

    • Static bond torsion angles

    • Relative locations of some atoms from an anchor atom


The molecule model no rings l.jpg
The Molecule Model – no rings Pharmacophore-Constrained Conformational Search and Database Screening


The molecule model cont l.jpg
The Molecule Model (Cont…) Pharmacophore-Constrained Conformational Search and Database Screening

  • An atom ai carries standard information

  • A bond bi carries the following information:

    • li – the bond length

    • αi – the angle from the previous bond

    • The set of possible torsion angles θi [0, 2π)

  • All info besides the torsion angles is fixed.

  • Θis the m dimensional vector of variables that defines the conformation


The pharmacophore model l.jpg
The Pharmacophore Model Pharmacophore-Constrained Conformational Search and Database Screening

  • A finite set of features corresponding to a subset of atoms

  • Constraints on the relative positions between features (the atoms), when one of the features is designated as aanch, the origin of a global xyz coordinate


The kinematic model l.jpg
The Kinematic Model Pharmacophore-Constrained Conformational Search and Database Screening

  • The bond length, angles and the torsion angles Θ can be used to give the positions of all of the atoms, relative to aanch

  • We look at each atom center as a local coordinate frame. We would like to use the transformation from one coordinate frame to the another.


The kinematic model cont l.jpg
The Kinematic Model (Cont…) Pharmacophore-Constrained Conformational Search and Database Screening


The kinematic model cont13 l.jpg
The Kinematic Model (Cont…) Pharmacophore-Constrained Conformational Search and Database Screening

  • The homogeneous transformation is:


The kinematic model cont14 l.jpg
The Kinematic Model (Cont…) Pharmacophore-Constrained Conformational Search and Database Screening

  • The xyz position of atom ai is given by:


The kinematic model cont15 l.jpg
The Kinematic Model (Cont…) Pharmacophore-Constrained Conformational Search and Database Screening

The coordinate frame of the molecule could be rotated with respect to the global coordinate frame of the pharmacophore feature positions

Another global coordinate frame transformation is needed


The kinematic model cont16 l.jpg
The Kinematic Model (Cont…) Pharmacophore-Constrained Conformational Search and Database Screening

  • Global rotation transformation based on Euler angles γ,φ,ψ:


The kinematic model cont17 l.jpg
The Kinematic Model (Cont…) Pharmacophore-Constrained Conformational Search and Database Screening

  • The complete xyz position of atom ai is given by:


The kinematic error function l.jpg
The Kinematic Error Function Pharmacophore-Constrained Conformational Search and Database Screening

  • Given Θ, γ,φ and ψ, the total amount of error between the requested feature positions G and the actual feature positions g can be measured as:


The energy function l.jpg
The Energy Function Pharmacophore-Constrained Conformational Search and Database Screening

  • In a sense, the energy function measures the likelihood that the molecule will achieve a conformation in nature.

  • An example for an energy function:


Two questions l.jpg
Two Questions Pharmacophore-Constrained Conformational Search and Database Screening

  • For a given molecule from a database and a pharmacophore:

  • Can the molecule achieve a low-energy conformation that satisfies the given pharmacophore?

  • What are the “distinct” low-energy conformations that satisfy the pharmacophore?


Randomized conformation search with constraints the motivation l.jpg
Randomized Conformation Search with Constraints – The Motivation

  • For the problem, the system of equations is generally under constrained, which leads to a complicated multidimensional solution set.

  • This consideration, and the need for efficiency, led to the choice of a numerical randomized technique.



The search gradient descent l.jpg
The Search – Gradient Descent Approach

  • Randomly sample the neighborhood of Pi

  • Search for a point Pi+1 such that f(Pi+1) < f(Pi)

  • If such Pi+1 is found, move to Pi+1 and repeat the search




Integrating into a database search l.jpg
Integrating into a Database Search Approach

  • A small set (~several hundreds) of candidate molecules (configurations) are chosen from a database (using 2D information)

  • Kinematics-based conformational search is performed to further reduce the set of candidates


Search in a set of candidates l.jpg
Search in a Set of Candidates Approach

  • Each time a sample conformation fails to match, the likelihood that the molecule will ever succeed decrease

  • On the other hand, after any number of ‘fail’ iterations, it is impossible to conclude that the molecule will never succeed


Search in a set of candidates cont l.jpg
Search in a Set of Candidates (cont…) Approach

  • Perform one attempt (random sample + match search) per molecule

  • Go through all the molecules in the set and repeat

  • Stop when a requested number of matches was found or a maximum number of iterations was reached


Conformation clustering l.jpg
Conformation Clustering Approach

  • In general, having alternative low-energy conformations is useful because in many cases it is not the lowest-energy conformation that results in docking.


Conformation clustering cont l.jpg
Conformation Clustering (cont...) Approach

  • Use a metric m(θ1, θ2), such as RMS of the displacements of the atoms between two conformations

  • A threshold mmax is the maximum distance that still regards two conformations as identical (in the same cluster)


Clustering algorithm idea l.jpg
Clustering Algorithm – Idea Approach

  • Always keep one representative for each cluster

  • The representative is the conformation with the lowest energy


Clustering algorithm detailed l.jpg
Clustering Algorithm – Detailed Approach

  • for a new conformation θi:

    • if exists another conformation θk such that m(θk, θi) ≤ mmax and e(θk) ≤ e(θi), discard θi

    • otherwise:

      • add θi as a new cluster

      • remove all θk such that m(θk, θi) ≤ mmax


Experiments l.jpg
Experiments Approach

  • 2 different pharmacophores, for ACE and Thermolysin inhibitors

  • 6 different molecules for each pharmacophore, kept in the database in randomly picked conformations

  • The docked conformations are known


Experiments cont l.jpg
Experiments (Cont…) Approach

  • Cluster distance mmax=1.5 Ǻ

  • 20 iterations for the different candidates


Experiments cont35 l.jpg
Experiments (Cont…) Approach

  • In general, conformations’ energies are within 2–7 kcal/mol of the energy of the known docked conformation.

  • The RMS distance of the conformations from the known docked conformations:

    • Thermolysin inhibitors – 0.50, 2.96, 0.59, 0.81, 2.40, and 2.56 Å.

    • ACE inhibitors – 1.26, 1.79, 0.94, 2.03, 1.87, and 1.98 Å.


Experiments cont36 l.jpg
Experiments (Cont…) Approach

  • A sufficient clustering record for a single molecule required about 5–20 min.

  • A claim: “Our previous work with randomized techniques has shown that if we continue iterating our algorithm we increase our chances of covering the conformational space of the molecule, and hence, our chances of providing exhaustive information about the constrained conformations of the molecule.”



Cyclic coordinate descent a robotics algorithm for protein loop closure l.jpg

Cyclic coordinate descent: A robotics algorithm for protein loop closure

A. A. Canutescu and R. L. Dunbrack Jr. (2003)


The loop closure problem l.jpg
The Loop Closure Problem loop closure

  • The problem: matching a given loop to a given backbone (e.g. for Homology modeling).

  • Constraint: connecting the two protein segments on either end of the loop, termed N and C-terminal anchors.



Previous solutions l.jpg
Previous Solutions loop closure

  • Number of available conformation is enormous.

  • Analytical: for 6 degrees of freedom.

  • Numerical: changing all torsion angles at once to the next “best” position.

  • Numerical methods are computationally expensive and sometimes unstable.


Cyclic coordinate descent ccd l.jpg
Cyclic Coordinate Descent (CCD) loop closure

  • Originally developed for robotics.

  • An iterative relaxation algorithm.

  • Adjust only one degree of freedom at a time.


Ccd algorithm l.jpg
CCD Algorithm loop closure

  • Proceed in an iterative fashion along the chain of degrees of freedom.

  • Modify each torsion angle so that the end of the loop gets as close as possible to the desired position.


Ccd simplicity l.jpg
CCD Simplicity loop closure

  • One equation in one unknown for each degree of freedom.

  • The equation provides:

    • Optimum setting for the variable

    • First and second derivatives


The main equation l.jpg
The Main Equation loop closure



The main equation cont47 l.jpg
The Main Equation (Cont...) loop closure

  • Multiplying the last two terms by:

  • Defining:

  • We get:



Ccd benefits l.jpg
CCD Benefits loop closure

  • Computationally Fast.

  • Analytically simple - no singularities.

  • Constraints can be placed on any degree of freedom.

  • Using derivatives, small increments in change can be done in preference to of large changes.


Test 1 success percentage l.jpg
Test 1 – Success Percentage loop closure

  • 2752 different loops.

  • 100 randomly different starting conformations for each loop.

  • A match (closed loop) is when distance from the terminals is less than 0.08 Ǻ.

  • Maximum 5000 iterative cycles (through all torsion angles) per search.


Test 1 cont l.jpg
Test 1 (cont…) loop closure

  • Two test frameworks:

    • No constraints

    • Using a Ramachandran Map

  • When using the map:

    • for a new proposed angle θnew, find (ψnew,θnew) in the map and compare to (ψold,θold)

    • Change to θnew with probability:


Results for test 1 l.jpg
Results for Test 1 loop closure


Analysis of test 1 l.jpg
Analysis of Test 1 loop closure

  • Loops that fail to converge arrive to a local minimum (usage of Monte Carlo method to escape local minimum).

  • CCD favors large changes in the first residues.

  • Loops that failed to converge are usually extended loops.


Test 2 minimum rms to original l.jpg
Test 2 – Minimum RMS to Original loop closure

  • CCD is meant to be a component of a matching algorithm that should also include loop generation and energy function.

  • How well CCD alone converge to the original loop conformation.

  • 30 original loops. For each loop, 5000 random conformations were generated and closed.


Results for test 2 l.jpg
Results for Test 2 loop closure



Test 3 structure convergence l.jpg
Test 3 – Structure Convergence loop closure

  • Using Ramachandran map and starting with the same loop conformation.

  • Each time using different random generated numbers.

  • 500 tests. do all final conformations are the same?

  • Compared to 500 closures of the same loop with different starting conformations.


Test 3 results l.jpg
Test 3 – Results loop closure


Comparison to random tweak l.jpg
Comparison to Random TWEAK loop closure

  • 5000 trails (10 loops, 500 initial configurations each). Closure when RMS < 0.08 Ǻ.

  • CCD – closed 5000. ~7 min running time

  • TWEAK – closed 4841. ~40 min running time


ad