1 / 29

CSCI2950-C Lecture 12 Networks

CSCI2950-C Lecture 12 Networks. Ben Raphael November 18, 2008. http://cs.brown.edu/courses/csci2950-c/. Biological Interaction Networks. Many types: Protein-DNA (regulatory) Protein-metabolite (metabolic) Protein-protein (signaling) RNA-RNA (regulatory)

hworkman
Download Presentation

CSCI2950-C Lecture 12 Networks

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. CSCI2950-CLecture 12Networks Ben Raphael November 18, 2008 http://cs.brown.edu/courses/csci2950-c/

  2. Biological Interaction Networks Many types: • Protein-DNA (regulatory) • Protein-metabolite (metabolic) • Protein-protein (signaling) • RNA-RNA (regulatory) • Genetic interactions (gene knockouts)

  3. Outline • Biology of cellular interaction networks • Network Alignment • Network Motifs • Network Integration

  4. Metabolic Networks Nodes = reactants Edges = reactions labeled by enzyme (protein) that catalyzes reaction

  5. Regulatory Networks

  6. Regulatory Networks Nodes = genes Edges = regulatory interaction A “activates” B A B A “represses” C C Protein-DNA interaction network

  7. Signaling Networks

  8. Protein Interaction Networks • Proteins rarely function in isolation, protein interactions affect all processes in a cell. • Forms of protein-protein interactions: • Modification, complexation [Cardelli, 2005]. phosphorylation protein complex

  9. Protein Interaction Networks High-throughput methods are available to find all interactions, “PPI network”, of a species. • an undirected graph • nodes: protein, edges: interactions • Edges may have weights • Yeast DIP network: ~5K proteins, ~18K interactions • Fly DIP network: ~7K proteins, ~20K interactions PPI network

  10. How are protein-protein interaction networks derived? • Yeast two-hybrid screens

  11. How are protein-protein interaction networks derived? • Protein purification and separation

  12. Computational Problems • Classifying Network Topology • Finding paths, cliques, dense subnetworks, etc. • Comparing Networks Across Species • Using networks to explain data • Dependencies revealed by network topology • Modeling dynamics of networks

  13. Alignment Networks Evolve via gain/loss of proteins or interactions (?) • human • mouse • Sequences • Evolve via substitutions • Conservation implies function • EFTPPVQAAYQKVVAG • DFNPNVQAAFQKVVAG

  14. Motivation • By similar intuition, subnetworks conserved across species are likely functional modules

  15. mismatch/substitution Network Alignment • “Conserved” means two subgraphs contain proteins serving similar functions, having similar interaction profiles • Key word is similar, not identical

  16. Alignment Analogy Sharan and Ideker. Modeling cellular machinery through biological network comparison. Nature Biotechnology 24, pp. 427-433, 2006

  17. Earlier approaches: interologs • Interactions conserved in orthologs • Orthology (descended from common ancestor) is a fuzzy notion • Sequence similarity not necessary for conservation of function

  18. Complications • Protein sequence similarity not 1-1 • Orthologs • Paralogs • Interaction data: • Noisy • Incomplete • Dynamic • Computational tractability

  19. Network Alignment Sharan and Ideker. Modeling cellular machinery through biological network comparison. Nature Biotechnology 24, pp. 427-433, 2006

  20. The Network Alignment Problem Given: k different interaction networks belonging to different species, Find: Conserved sub-networks within these networks Conserved defined by protein sequence similarity (node similarity) and interaction similarity (network topology similarity)

  21. A B C D A’ X’ D’ Score: match + gap + mismatch + match PathBLAST • Goal: identify conserved pathways (chains) • Idea: can be done efficiently by dynamic programming if networks are DAGs Kelley et al (2003)

  22. Why paths?

  23. PathBLAST (Kelley, et al. PNAS 2003) • Find conserved pathways in protein interaction maps of two species • Model & Scoring: See class notes.

  24. 2 1 4 1 4 2 3 5 5 3 5 2 1 4 3 PathBLAST • Problem: Networks are neither acyclic nor directed • Solution: Randomize Impose random ordering on nodes, perform DP; repeat many times • On average, highest scoring path preserved in 2/L! subgraphs • Finds conserved paths of length L within networks of size n in O(L!n) expected time • Drawbacks • Computationally expensive • Restricts search to specific topology Kelley et al (2003)

  25. PathBLAST

  26. PathBLAST: Computational Formulation • Given: • Undirected weighted graph G = (V, E, w) • Set of start vertices I, and end vertex v, • Find: a minimum-weight simple path starting in I and ending at v. • NP-hard in general (reduction from TSP) • Dynamic Programming formulation (see class notes) Scott, et al. JCB 2006

  27. Color-coding(Alon, Yuster, & Zwick) • Assign each vertex random color between 1 and k. • Search for path w/ distinct colors. (colorful path) • Resulting paths are simple. • High-scoring path not discovered when two vertices have same color. • Repeat for many random colorings.

  28. Color-coding(Alon, Yuster, & Zwick) • Extends to many other cases of subgraph isomorphism problem: • Does a graph G have a subgraph isomorphic to graph H? • H = simple path of length k. • H = simple cycle of length k. • H = tree. • H = graph of fixed (bounded) tree-width

  29. Additional Problems • Efficient querying of a network (e.g. QNET) • Find conserved subgraphs Heavy subgraphs in product graph • Multiple network alignment

More Related