440 likes | 469 Views
Explore the SimRNA method for RNA structure modeling, including non-canonical base pairings, Leontis/Westhof classification, and energy function components. Learn about RNA stabilization factors and SimRNA's Monte Carlo simulation approach. Witness folding results and expert predictions in RNA Puzzles. Dive into the energy-RMSD plots and folding pathways in biomolecular dynamics.
E N D
SimRNA: a coarse-grained method for RNA 3D structure modeling___ new ideas accounting for non-canonical base pairing ___ Mathematics and Computer Science in Modeling and Understanding of Structure and Dynamics of Biomolecules Michał Boniecki, 10.08.2019 International Institute of Molecular and Cell Biology (Janusz Bujnicki lab)
We want to fold biomolecues This is an example of simple 3D RNA structure 3D structure is a result of (base-base) interactions. What are non-canonicalbase pairings?
Just RNA ... Just make a polymer of this, with 4 different side groups ...You have an RNA
sequence >>> structure secondary structure CGCAUUGCG(((...)))
Factors of RNA structure stabilization canonical base-base contacts: non–canonical base-base contacts: Leontis/Westhof base pairing classification
What really matters C – G G – C A – U U – A wobble: G – U U – G ?
How it looks like in SimRNA Representation of RNA chainpdb_id: 1zih – example of small RNA structure
REPRESENTATION simplified (coarse grained) • MODEL OF INTERACTIONS (energy function) statistical potential – derived from pdb • CONFORMATIONAL SAMPLING METHOD Monte Carlo methods: Metropolis algorithm, REMC Introduction of SimRNA (SimRNP) three major assumptions:
SimRNA all–atom representations of RNA chain
SimRNA Point of Interaction Level 1 Level 2 Level 3
We are deriving energy function(that allows to calculate energy for any 3D conformation) We are scanning data set of structures which are solved experimentally (crystal structures)We rely on what we see !We don’t have knowledge about physical interactions, how total energy is decomposed. ... and so on ...
SimRNA energy function componentsspecificity originates from contacts Energy function can be divided into short range (flat angles and distances) terms and long range (base-base pairing) terms. Allshort rangeterms are sequence independent, sequence specificity originates from long range interactions (base–base).
SimRNA specific energy terms 3D grid for AU middle point of U RNA: base – base
A reminder canonical base-base contacts: non–canonical base-base contacts: Leontis/Westhof base pairing classification
SimRNA representation of RNA chainpdb_id: 157d CGCGAAUUAGCG CGCGAAUUAGCG (((.((((.((( ))).)))).))) ...<....<.......>....>...
SimRNA representation of RNA chainpdb_id: 1a51 GGCCGAUGGUAGUGUGGGGUCUCCCCAUGCGAGAGUAGGCC ((((.......(((((((.....))))))).......)))) ....<<<<<<<...................>>>>>>>....
SimRNA representation of RNA chainpdb_id: 1cq5 GGCGUUUACCAGGUCAGGUCCGGAAGGAAGCAGCCAAGGCGCC ((((.......(((....(((....)))....)))....))))
SimRNA representation of RNA chainpdb_id: 1fqz GCCGAGUAGUGUUGGGUCGCGAAAGGC (((.....(((......)))....)))
SimRNA specific energy terms 3D grid for AU middle point of U RNA: base – base What we can do ... ?
SimRNA specific energy terms: A-U 3D grid for AU middle point of U What we can do ... ?
SimRNA specific energy terms: A-G 3D grid for AG middle point of G What we can do ... ?
What SimRNA is doing? Monte Carlo simulation decision(Ebefore,Eafter) initial sample accepted sample decision(Ebefore,Eafter) decision(Ebefore,Eafter) sample accepted rejected decision(Ebefore,Eafter) sample accepted
conformer change (single nucleotide) • one atom position change(-P- or -C4’-) • two atoms position change(either -P-C4’- or -C4’-P-) • change direction of a bigger consecutive fragment SimRNA move set – geometry modifications atom changes involve recalculation of positions of conformers
Folding of sircin-ricin motifpdb_id: 1fqz 1st cluster: 15.6 Å 2nd cluster: 5.66 Å 3rd cluster: 2.84 Å
A Some RNA-Puzzles examples Examples of Bujnicki-group expert predictions, supported by SimRNA, in RNA Puzzles (left: experimentally determined reference structure, right: our top model). (A) Puzzle 14: L-glutamine riboswitch (bound) (B) Puzzle 8: SAM I/IV riboswitch. model reference B reference model
Idea of RMSD vs. energy plots Energy RMSD (dissimilarity)
1l2x folding pathway investigation Energy RMSD GGCGCGGCACCGUCCGCGGAACAAACGG ..(((((......))))).......... .........((((...........))))
1l2x energy landscape Energy RMSD 1l2x shape of folding funnel (projection)
Folding pathwaysHow molecules fold? Energy function requirements? folding funnel pic. from the internet
ACKNOWLEDGEMNTS ICM help: Maciej Marchwiany, Witold Rudnicki Joanna Jędraszczyk other authors: Grzegorz Łach, Konrad Tomala, Wayne Dawson, Tomasz Sołtysiński, Paweł Łukasz, Kristian Rother Janusz Bujnicki my former boss: Andrzej Koliński help and valuable discussion: Grzegorz Chojnowski Staszek Dunin-Horkawicz Juliusz Stasiewicz Boguś Kluge Tomasz Waleń Dorota Matelska Marcin Pawłowski Wojciech Potrzebowski Łukasz Kozłowski Piotr PokarowskiKrzysztof Formanowicz Masoud Farsani ICM resources: (Okeanos) Topola 100 GB of diskspace C/C++ compiler This work was supported by the Polish National Science Center Poland (NCN) (grant 2016/23/B/ST6/03433 to Michal J. Boniecki). Previous developments were supported by the Polish Ministry of Science (HISZPANIA/152/2006 grant to Janusz M. Bujnicki and PBZ/MNiSW/07/2006 grant to Michał Boniecki) and by the EU (6FP grant “EURASNET” LSHG-CT-2005-518238) and DFG (SPP 1258), ERC (RNA+P=123D). We thank the current and former members of the Bujnicki group (in particular developers of methods and participants of the RNA-Puzzles experiment) for their intellectual contribution.