1 / 33

Protein Database in Europe

Protein Database in Europe. Deposition , Validation, Search and Analysis Services. Gaurav Sahni, Ph.D. worldwide Protein Data Bank (wwPDB). Consists of four sites RCSB (USA), PDB-j (Japan) BMRB (USA) and PDBe. Single repository of macromolecular structures.

sirvat
Download Presentation

Protein Database in Europe

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Protein Database in Europe Deposition , Validation, Search and Analysis Services Gaurav Sahni, Ph.D.

  2. worldwide Protein Data Bank (wwPDB) • Consists of four sites • RCSB (USA), PDB-j (Japan) BMRB (USA) and PDBe. • Single repository of macromolecular structures. • Started in 1971 and now ~61,000 entries, adding ~200 new entries/week. • Deposited by experimentalists and contents is freely available. • The format of the archive is flat-files with fixed line format, although an improved flat-file format (mmCIF) and XML are also available.

  3. Protein Databank in Europe (PDBe) group • Is one of the four sites around the world that where 3D structures may be deposited. • Provides stable and clean repository of macromolecular structure data. • Has services that allow users to access, search and retrieve structural data from a single web access point.

  4. PDBe Tasks Deposition and Validation Database design and implementation Retrieve data Analysis tools & Services

  5. Depositions and Curation Deposition via AutoDep4 (http://www.ebi.ac.uk/pdbe-xdep/autodep) Closely collaborate with the other wwPDB members for a single unified archive.. Depositions via EMDEP (http://www.ebi.ac.uk/pdbe-emdep/emdep) Depositions started June 2002

  6. Validation of Structures • Authentication of source That the protein is from human and not rabbit, for example ! • Authentication of structure Comparison of structure against raw data. Geometry and Stereochemistry. Provide results back to depositor. • Validation of correct methodology used Whether X-Ray, NMR or EM. • Conformity to standards Follows PDB format specifications • Error checks • Consistency checks - to identify simple typos Homo sapiens and not Homo sapien (single human?). • Outlier detection - to identify suspect records

  7. PDBe Tasks Deposition site Database design and implementation Retrieve data Analysis tools & Services

  8. Disadvantages of Flat files… • Macromolecular structures are very complex. • Existing PDB format is incapable of fully describing few existing structures also. • Format is not readily extensible, to cope, for example, with structural genomics data. • Historical archive is non-uniform and poorly populated. • Search and retrieval of flat files is difficult and/or inaccurate.

  9. Uniform Data Improved Query Functionality PDBe Relational Database Crystallographers Biologists Time Effort Usefulness Usage Programmers Bioinformaticians

  10. PDBe Tasks Deposition site Database design and implementation Retrieve data Analysis tools & Services

  11. Some Implementation Issues • The PDBe database is large and complex: • ~61,000 PDB entries • Cross-referenced against SwissProt, PubMed etc. • Making data accessible without adding additional complexity. • Tools for different categories of end-user • Simple – biobar • Intermediate - PDBelite • Advanced – PDBepro • New - PDBeView

  12. biobar A toolbar search application for Mozilla/Netscape or firefox browsers http://biobar.mozdev.org/ Simple and quick retrieval of data from PDBe and 45 other Databases

  13. PDBelite A simple form-based query system to search the PDBe Databases

  14. PDBelite Search Results

  15. Features of Search Interface • Strengths: • simple, easy to use form • allows multiple search fields to be combined • relatively fast, despite performing quite complex SQL queries • Weaknesses: • not exposing the power of a relational database • limited logical operators between search fields: • "name" AND "title" AND "keyword“ • "name" OR "title" OR "keyword“ • ( "name" OR "title" ) AND NOT "keyword" • the search form is defined by the authors of the search system, not the author of a query

  16. PDBepro A java-based flexible graphical search interface for advanced searching

  17. Complex searches • User have comprehensive control of their query • Applet provide a dynamic form, as compared to a static HTML form: • choose the fields to be searched • specify the relationships between search fields • choose the result fields and how results are presented • perform “complex” sub-queries e.g. SSM, FASTA • PDBepro uses an applet for constructing queries and a server to execute them • The user describes their query entirely graphically, including logical operations such as AND, OR and NOT

  18. PDBeView

  19. Search result: The Atlas page

  20. PDBe Tasks Deposition site Database design and implementation Retrieve data Analysis tools & Services

  21. AstexViewer™: Visualization@PDBe • View structures as wireframe, backbone or ribbons • Built-in sequence viewer • Calculate and display surfaces • Various display options: • Ramachandran plots • Distance matrix • B-factors Based on the AstexViewer™ from Astex Technology Limited and modified under licence by the PDBe group

  22. PDBeChem Ligand Database

  23. PDBeSite What is the environment aroundalpha-D-mannoseandbeta-D-mannose?

  24. PDBeSite What binds ASP ASP HIS LYS ?

  25. PDBeSite How does ATP generally interact with LYS in all structures ?

  26. PDBeAnalysis Assess Quality of a Structure Bond Distances Bond Angles Ramachandran Plot

  27. PDBePisa What assembly can my structure have ?

  28. PDBeFold Discover unknown relationships… • Are there any structures in the PDB that are similar to mine? • What SCOP and/or CATH family could my structure belong to ? • Can I get some idea about the possible function of my protein based on similarity with others based on structural similarity ? • Mutiple alignment of many of my structures ?

  29. ChemSearch Sub-structure based search of a million chemicals

  30. PDBeAnalysis/PDBeValidate Online PDB validation

  31. PDBeStatus PDB Deposition status search

  32. PDBe provides… • Clean biological data • Integrated data • A single web access point • Query interfaces for different users (Beginner, Occasional or expert). • Interconnected views of the data relating structure, sequence, text & experimental details.

  33. Linking to Domain data, eFamily Sequence Mapping, SIFTS PDBechem ligand data Electron Density Visualisation AstexViewer PDBePro, PDBelite PISA biological assemblies Active sites Fold matching Surface Matching

More Related