1 / 39

Structural Validation of Homology

Structural Validation of Homology. 19% Seq ID. Z = 12.2. Adenylate Kinase Guanylate Kinase. Dali Domain Dictionary Deitman, Park, Notredame, Heger, Lappe, and Holm Nucleic Acids Res. 29: 5557 (2001).

Download Presentation

Structural Validation of Homology

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Structural Validation of Homology 19% Seq ID Z = 12.2 Adenylate Kinase Guanylate Kinase

  2. Dali Domain DictionaryDeitman, Park, Notredame, Heger, Lappe, and Holm Nucleic Acids Res. 29: 5557 (2001) • Dali Domain Dictionary is a numerical taxonomy of all known domain structures in the PDB • Evolves from Dali / FSSP Database Holm & Sander, Nucl. Acid Res. 25: 231-234 (1997) • Dali Domain Dictionary Sept 2000 • 10,532 PDB enteries • 17,101 protein chains • 5 supersecondary structure motifs (attractors) • 1375 fold types • 2582 functional families • 3724 domain sequence families

  3. courtesy of C. Chothia

  4. Most proteins in biology have been produced by the duplication, divergence and recombination of the members of a small number of protein families. courtesy of C. Chothia

  5. courtesy of C. Chothia

  6. courtesy of C. Chothia

  7. courtesy of C. Chothia

  8. courtesy of C. Chothia

  9. Cadherins courtesy of C. Chothia

  10. courtesy of C. Chothia

  11. courtesy of C. Chothia

  12. A Global Representation of Protein Fold SpaceHou, Sims, Zhang, Kim, PNAS 100: 2386 - 2390 (2003) Database of 498 SCOP “Folds” or “Superfamilies” The overall pair-wise comparisons of 498 folds lead to a 498 x 498 matrix of similarity scores Sijs, where Sij is the alignment score between the ith and jth folds. An appropriate method for handling such data matrices as a whole is metric matrix distance geometry . We first convert the similarity score matrix [Sij] to a distance matrix [Dij] by using Dij = Smax - Sij, where Smax is the maximum similarity score among all pairs of folds. We then transform the distance matrix to a metric (or Gram) matrix [Mij] by using Mij = Dij2 - Dio2 - Djo2 where Di0, the distance between the ith fold and the geometric centroid of all N = 498 folds. The eigen values of the metric matrix define an orthogonal system of axes, called factors. These axes pass through the geometric centroid of the points representing all observed folds and correspond to a decreasing order of the amount of information each factor represents.

  13. A Global Representation of Protein Fold SpaceHou, Sims, Zhang, Kim, PNAS 100: 2386 - 2390 (2003)

More Related