1 / 41

Systems Biology

Systems Biology . Two ways of looking a problem. Top down or bottom up Either look at the whole organism and abstract large portions of it Or try to understand each small piece and then after understanding every small piece assemble into the whole

Download Presentation

Systems Biology

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Systems Biology

  2. Two ways of looking a problem • Top down or bottom up • Either look at the whole organism and abstract large portions of it • Or try to understand each small piece and then after understanding every small piece assemble into the whole • Both are used, valid and complement each other

  3. Bottom up is traditional approach • You would study a pathway in detail not worrying about how that pathway might interact with other elements in the cell. • You would strive to understand a gene or pathway in great detail, eventually you might extend this knowledge to other organisms and compare and contrast. • With top down you need other tools...

  4. Definitions • At a recent NIH SysBio SIG retreat almost every talk started with that speakers definition of what systems biology is. • Leroy Hood came up with the following (my summary) • As global a view as possible • Fundamentally quantitative • Different scales integrated

  5. The Systems Biology Institute take: • Understand the structure of the system • Regulatory and biochemical networks • Understand the dynamics of the the system • Construct model with predictive capabilities • Understand the control methods

  6. Common “themes” • Cross disciplinary • Lots of data/information/knowledge • Concepts of networks for abstract portrayal of many interaction types. • Model development • Predictive models • Models to drive experimentation • Models to understand processes

  7. “Inner life of a Cell”SIGGRAPH 2006 showcase winner • Need to fight infection • WBC • Need to keep blood from leaking out

  8. Requires a higher level of understanding • Many tools “feed” into this understanding • Microarrays • Homology tools (BLAST, alignments COGS) • Biochemical literature • Genomic sequence • Specialized databases • Any faults in these tools lead to problems in the analysis

  9. A complex problem • 35,000 genes either on or off (huge simplification!) would have 2^35,000 solutions • Things can be simplified by grouping and finding key genes which regulate many other genes and genes which may only interact with one other gene • In reality there are lots of subtle interactions and non-binary states.

  10. Some real numbers from E. coli • 630 transcription units controlled by 97 transcription factors. • 100 enzymes that catalyse more than one biochemical reaction . • 68 cases where the same reaction is catalysed by more than one enzyme. • 99 cases where one reaction participates in multiple pathways. • The regulatory network is at most 3 nodes deep. • 50 of 85 studied transcription factors do not regulate other transcription factors, lots of negative auto-regulation

  11. Theoretical hurdles to jump • Switching delay (McAdams and Arkin 1997) • More transcripts, less protein/transcript = more energy less noise • Fewer transcripts, More protein/transcript = less energy more noise. • Selection drives this trade-off • Two critical times; how long after trigger does a protein reach a critical level how long after removal of the trigger does the protein level decline to below critical level. • How critical is the level

  12. Conclusions from Arkin: • Simulations found 3-20 minutes from transcript to active protein. • Many processes are stochastic (random) not deterministic. • The probabilities are definitely skewed but still have long tails • This means that with a large population there are cells which may be in very different states than most of the rest of the population. • Complex interplay between regulation, lag and activity that has implications when trying to reconstruct a network.

  13. Surviving heat shock: Control strategies for robustness and performance • Taking engineering principles and applying them to systems biology

  14. Air conditioning • Setpoint (temperature you set) • Sensor (thermostat) • Error signal (temp exceeded) • Controller (thermostat/ac) • Actuator (ac on)

  15. Heat shock protein • Increased heat -> mRNA -32 mRNA melting • Make 32 • Interacts with RNAP to activate specific sub-sets of genes • Make a bunch >10,000 protein copies to deal with heat

  16. Heat shock response

  17. DNAK Chaperone representative Binds to 32 and degraded proteins FtsH Protease degrading 32 Titrated away by degraded proteins 32 Temperature regulation at translation Components

  18. Need to turn off (cooler) Don’t want to activate inappropriately (energy waste) Fast response (proteins degrading) Proportional response (it’s a little hot) Why make it more difficult?

  19. Theoretical types of control

  20. Effects of control types on response levels

  21. Adding metabolic cost as a design parameter

  22. Using feedback to get robustness

  23. Sometimes simple is better but: Often some complexity adds desirable features Trade off between complexity, robustness, and economy Modules, reuse “Helps” evolution Can help biologist Summary

  24. Techniques • Advanced Methods and Algorithms for Biological Networks Analysis “such questions are conventionally viewed as computationally intractable. Thus, biologists and engineers alike are often forced to resort to inefficient simulation methods or translate their problems into biologically unnatural terms in order to use available algorithms; hence the necessity for an algorithmic scalable infrastructure the systematically addresses these questions”

  25. Problems of modeling • Compare model to data • But with complex model and large parameter set any data set can be made to fit • Could a simpler model also work • Untested parameters

  26. Alternative to exhaustive searches • Use sum of squares to generate dynamical behavior barriers • Don’t test all possible values just see where they make a difference • Stocastic simulation is another way but • Uses months to simulate picoseconds • Robustness provides a key • Biological systems must exhibit robustness • This robustness also limits the search space

  27. Case studies • Consistency between literature and microarray profiles. • Galactose utilization in yeast.

  28. Case study 1: Microarrays -> regulatory networks • Long been a dream, all this data should tell me everything. • Try with E. coli: • How consistent is the literature knowledge base with the microarray expression profile • Genome Research 13:2435-2443 2003 • Literature compiled into the RegulonDB database • Correlation was significant 70-89% but…

  29. But… • Noise filtering removed >50% of the genes on the microarray • 83/179 known regulatory genes where used the rest discarded also due to noise filtering. • Simple conditions: Minimal media, anaerobic and stationary phase growth. • 32% of the 83 where always off. • Fell to ~40% if effector metabolites not considered.

  30. Case study 2: Figure out Galactose utilization in yeast • Classic last line: “As technologies for cellular perturbation and global measurement mature, these approaches will soon become feasible in higher eukaryotes” • Combines: literature knowledge, microarray, proteomics, visualization, and network techniques to refine what is known about galactose utilization in yeast. • Science 292:929-934

  31. Utilization of galactose is well studied 1625 papers in PubMed dating back to the 1950s. • Simple process get Galactose into the cell then modify this sugar into the more usable form of glucose-6-P; don’t waste a lot of energy doing it if: (1.) there is no gal or (2.) you have plenty of glucose.

  32. Galactose metabolism

  33. The Process • Define all genes in the genome, particularly the subset of genes and other small molecules that are involved in the gal pathway (DONE) • For each gene or condition change (ie delete the gene) and measure the global effect on both mRNA and protein levels. • Integrate the changes in respect to the first point with all known protein-protein and protein-DNA networks • Form new hypothesises and test

  34. Expression measurements

  35. Blue line (pp) Yellow line (pd) Visualizing the data

  36. Networks the “system” of systems biology • Humans produce some pretty complex structures. • Computer chips • Oil refineries • Airplanes • The goals for these structures are similar to life forms • Survive • Do it at a cheap cost • Reproduce/evolve??

  37. Basic network terminology • Nodes • Edges • Scale-free • Power laws • Exponential/Random networks • Robustness • Ability to respond to different conditions • Robust yet fragile • Complexity • Not the number of parts… consider a lump of coal • The number of different parts AND the organization of those parts

  38. Graph theory, networks • Two types of networks • Exponential and scale free • Most cellular networks are scale free • It makes the most sense to study the interactions of the central nodes not the outer nodes

  39. High Throughput data sources • Microarray data • Already well covered in the last couple of weeks. • Probably the most mature • Proteomics • Several processes • Separation of the products • Digest the products • Find the mass of the products • Problems • Contamination • Phosphorylation, glycosylation, Acylation, methylation, cleavage.

  40. Cytoscape • Software tool to manage data and develop predictive models (Genome Research Shannon et al. 2003) • Not directed specifically to a cellular process or disease pathway • Combine • Protein-protein interactions • RNA expression • Genetic interactions • Protein-dna interactions • Protein abundance • Protein phosphorylation • Metabolite concentrations • Integrate (global) molecular interactions and state measurements. • Organized around a network graph

More Related