1 / 24

Standardizer

Standardizer. Molecular Cosmetics for Chemoinformatics. Gy ö rgy Pirok N ó ra M á te István Cseh Szil á rd D ó r á nt Péter Kovács Szabolcs Csepregi Ferenc Csizmadia. Why standardize structures?. Canonicalisation

dympna
Download Presentation

Standardizer

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Standardizer Molecular Cosmetics for Chemoinformatics György Pirok Nóra Máte István Cseh Szilárd Dóránt Péter Kovács Szabolcs Csepregi Ferenc Csizmadia

  2. Why standardize structures? • Canonicalisation • Uniformization of structures without changing the chemical content to recognize duplicates, functional groups (aromatization, mesomers, tautomers, ... ) • Beautification • Making the structures visually more attractive (dearomatization, cleaning coordinates, wedge orientation, ... ) • Modification • Conversion of structures by modifying its original content as a preparation step for further chemoinformatics tasks (transformations, removing stereo, removing R-groups, ...). often difficult to categorize the standardization actions

  3. Canonicalisation • Hydrogens • Tautomers making hydrogens explicit converting to canonical tautomer form making hydrogens implicit transforming to user defined tautomer form • Resonant structures • Other aromatizing Kekülé rings removing small fragments converting to canonical mesomer form removing user defined fragments transforming to user defined mesomer form expanding stoichiometry setting the chiral flag

  4. Mesomers

  5. Tautomersoxo-enol, enamine-imine

  6. Tautomerspyridone-pyridol

  7. Fragment removal

  8. Specific counterion removal

  9. Solvent removal

  10. Stoichiometry expansionexpanding salt stoichiometry

  11. Stoichiometry expansionexpanding reaction stoichiometry

  12. Beautification • Hydrogens • Cleaning making hydrogens implicit calculating 2D coordinates reallocating wedge bonds • Resonant structures template based cleaning converting aromatic rings to Kekülé format 3D geometry optimization • Groups contracting/expanding/ungrouping abbreviated and multiple groups

  13. Template-based Cleaning2D-coordinate calculation of macrocycles or bridged systems

  14. Template-based Cleaningaligning search results to the query query

  15. Canonicalization During Database Import client server input structures JChem Base / Cartridge Standardizer canonicalization configuration canonicalized structures original structures Relational Database

  16. Sending Query to the Database client server query structure JChem Base / Cartridge Standardizer query is compared to the canonicalized structures canonicalization configuration canonicalizedquery Relational Database

  17. Displaying Result Structures client server beautified structures JChem Base / Cartridge Standardizer beautification configuration original structures Relational Database

  18. Modification + custom transformations

  19. Standardizer st = new Standardizer(new File("standardize.xml")); st.standardize(mol); standardize input.sdf -c config.xml -o output.smiles API and command line interface

  20. Live Demonstration

  21. Applications: Virtual Synthesis

  22. Applications: Structure Databases

  23. How can ChemAxon Help • Free for non commercial websites • Free for academic teaching and research“Academic Package” • Free Academic Package to be extended to cover academic networks – campus-wide roll out

  24. Acknowledments • Ferenc Csizmadia • Nóra Máté • István Cseh • Szabó Attila • Szilárd Dóránt • Péter Kovács • Szabolcs Csepregi

More Related