Eccb 2020 gent introduction to protein structure validation and improvement gert vriend
Download
1 / 36

ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend - PowerPoint PPT Presentation


  • 75 Views
  • Uploaded on

Protein structure validation. ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend. Protein structure validation.

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about ' ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend' - shanon


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
Eccb 2020 gent introduction to protein structure validation and improvement gert vriend

Protein structure validation

ECCB 2020 GentIntroduction to protein structure validation(and improvement)Gert Vriend


Protein structure validation

The plan for today:Gert Vriend: Introduction to validationRobbie Joosten: X-ray structure validation and improvementJurgen Doreleijers: NMR structure validation (and improvement )Bas Vroling (and you): YASARAAll (and you): General validation practicalsSplit-up in groups: General validation issues X-ray specific issues NMR specific issues Continuation of validation practicalsAt the end: Overview of validation and related facilitiesAnd in-between we have coffee, lunch, tea, and whatever else they throw at us at any moment that anybody feels like it.


Structure validation
Structure validation

Everything that can go wrong, will go wrong, especially with things as complicated as protein structures.



What is real1
What is real?

ATOM 1 N LEU 1 -15.159 11.595 27.068 1.00 18.46

ATOM 2 CA LEU 1 -14.294 10.672 26.323 1.00 9.92

ATOM 3 C LEU 1 -14.694 9.210 26.499 1.00 12.20

ATOM 4 O LEU 1 -14.350 8.577 27.502 1.00 13.43

ATOM 5 CB LEU 1 -12.829 10.836 26.772 1.00 13.48

ATOM 6 CG LEU 1 -11.745 10.348 25.834 1.00 15.93

ATOM 7 CD1 LEU 1 -11.895 11.027 24.495 1.00 13.12

ATOM 8 CD2 LEU 1 -10.378 10.636 26.402 1.00 15.12




X ray2
X-ray

And now move the atoms around till the calculated reflections best match the observed ones.

‘FFT-inv’

FFT-inv



X ray r factor
X-ray R-factor

2

Error = Σ w.(obs-calc)

R-factor = Σ w.|obs-calc|




Nmr data
NMR data

NMR data consists of short inter-atomic distances between atoms. We call these NOEs.

Most NOEs are between close neighbours in the sequence. Those hold little information.

The ‘good’ NOEs are between atoms far away in the sequence. There are few of those, normally.

NOEs are known with low precision. E.g. NOEs are binned 2.5-4.0, 4.0-5.5, and 5.5-7.0.

NMR can also measure some angles, and relative orientations. The latter, called RDCs are powerful.


Nmr q factor
NMR Q-factor

2

Error = Σ NOE/RDC-violations + Energy term


Nmr versus x ray
NMR versus X-ray

With X-ray you measure reflections. Each reflection holds information about each atom.

With NMR you measure pair-wise distances, angles, and orientations. These all hold local information.

X-ray requires crystals, and crystals cause/are artefacts.

NMR is in solution, but provides much less precision.


Nmr versus x ray1
NMR versus X-ray

NMR X-ray

‘Error’ 1-2 Å 0.1-0.5 Å

Mobility yes not really

Crystal artefacts no yes

Material needed 20 mg 1 mg

Cost of hardware 4 M Euro near infinite (share)

Drug design no almost

Better combine and use the best of both worlds.


Why validation
Why validation ?

Why does a sane (?) human being spend twenty years to search for millions of errors in the PDB?


Validation because
Validation because:

Everything we know about proteins comes from PDB files.

Errors become less dangerous when you know about them.

And, going back to the red thread through this series, if a template is wrong the model will be wrong.


What kind of errors can the software find
What kind of errors can the software find?

Administrative errors.

Crystal-specific errors.

NMR-specific errors.

Really wrong things.

Improbable things.

Things worth looking at.

Ad hoc things.


Smile or cry
Smile or cry?

A 5RXN 1.2

B 7GPB 2.9

C 1DLP 3.3

D 1BIW 2.5





How bad is bad

X-ray

How bad is bad?


Check with force fields

A force field is a set of parameters together with a set of rules to use those parameters to see how normal something is. Most force fields are designed to score events, or to predict the future.

In structure validation we often look at structures and count ‘things’ For example, we count that the number of buried hydrogen bond donors that do not make a hydrogen bond is 4.6+/-1.2 per 100 amino acids in well-solved proteins. So we call that normal, and now, using ΔG=-RTln(K), we can calculate the energy penalty for proteins with more than 4.6 unsatisfied buried unsatisfied hydrogen bonds.

Check with force fields


Contact probability
Contact Probability rules to use those parameters to see how normal something is. Most force fields are designed to score events, or to predict the future.


Contact probability1
Contact Probability rules to use those parameters to see how normal something is. Most force fields are designed to score events, or to predict the future.


Contact probability box
Contact probability box rules to use those parameters to see how normal something is. Most force fields are designed to score events, or to predict the future.


One slide about homology modelling
One slide about homology modelling rules to use those parameters to see how normal something is. Most force fields are designed to score events, or to predict the future.


His asn gln flips
His, Asn, Gln ‘flips’ rules to use those parameters to see how normal something is. Most force fields are designed to score events, or to predict the future.


Hydrogen bond network
Hydrogen bond network rules to use those parameters to see how normal something is. Most force fields are designed to score events, or to predict the future.


Your best check
Your best check: rules to use those parameters to see how normal something is. Most force fields are designed to score events, or to predict the future.


How difficult can it be
How difficult can it be? rules to use those parameters to see how normal something is. Most force fields are designed to score events, or to predict the future.

1CBQ

2.2 A


How difficult can it be1
How difficult can it be? rules to use those parameters to see how normal something is. Most force fields are designed to score events, or to predict the future.


Errors or discoveries
Errors or discoveries? rules to use those parameters to see how normal something is. Most force fields are designed to score events, or to predict the future.

Buried histidine.

Warning for buried histidine triggered biochemical follow -up and new mechanism for KH-module of Vigilin.

(A. Pastore, 1VIG).


Acknowledgements
Acknowledgements: rules to use those parameters to see how normal something is. Most force fields are designed to score events, or to predict the future.

Rob Hooft

Elmar Krieger

Sander Nabuurs

Chris Spronk

Maarten Hekkelman

Robbie Joosten


ad