1 / 49

Visualizing RNA Expression Data

Visualizing RNA Expression Data. John Quackenbush VIZBI 16 March 2011. Northern Blots: Before the dawn of Time. Northern Blots. Northern Blots. Quantitative RT-PCR The Pre-Modern Era. Quantitative PCR. Quantitative PCR and other Methods. Large-scale Quantitative RT-PCR:

teleri
Download Presentation

Visualizing RNA Expression Data

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Visualizing RNA Expression Data John Quackenbush VIZBI 16 March 2011

  2. Northern Blots: Before the dawn of Time

  3. Northern Blots

  4. Northern Blots

  5. Quantitative RT-PCR The Pre-Modern Era

  6. Quantitative PCR

  7. Quantitative PCR and other Methods

  8. Large-scale Quantitative RT-PCR: The Dawn of the Modern Age

  9. An Aside: The Birth of Clustering

  10. Our World Today: A Microarray Overview

  11. History is written by the victors (or those who produce software): The Birth of Clustering This was also the start of tormenting the red-green color-blind.

  12. Truth is determined by the person giving the talk: MeV is the best clustering tool ever! http://www.tm4.org

  13. Truth is determined by the person giving the talk: MeV is the best clustering tool ever!

  14. Truth is determined by the person giving the talk: MeV is the best clustering tool ever!

  15. Public Microarray Data ArrayExpress 20,423 Experiments (572,682 hybs/arrays) GEO 21,320 Experiments (529,108 arrays) CIBEX 148 Experiments (2,711 arrays) SMD 21,521Expts (80,319 incl private data)

  16. >1,000,000 arrays x $500 = $500,000,000 Cancer Studies account for >14% of all studies in databases…

  17. EBI’s Expression Atlas Rocks!

  18. Disease Progression and Personalized Care Birth Treatment Death Quality Of Life Natural History of Disease Clinical Care Environment + Lifestyle Outcomes Treatment Options Disease Staging Patient Stratification Early Detection Genetic Risk Biomarkers

  19. Welcome to the post-Modern World: Next-Gen Technologies have Dramatically Expanded our Genomic Universe

  20. Browser-mania rules!

  21. Back to Excel, Man’s Best Friend RNA-Seq data of 7 FFPE blocks

  22. And more websites are integrating data

  23. Cells Converge to Attractive States • Stuart Kauffman presented the idea of a gene expression landscape with attractors • ~250 stable cell types each represent attractors • Cells can be "pushed" or induced to converge to an attractor. • Once in the attractor, a cell is robust to small perturbations. Jess Mar

  24. Differentiation of Promyelocytes into Neutrophil-Like Cells Affymetrix GeneChip Time 0 Promyeloctyes (HL-60 Cell Line) RA used in differentiation therapy for acute promyelocytic leukemia. ~6 days DimethylSulfoxide (DMSO) All-Trans Retinoic Acid (ATRA) Combined with chemotherapy, complete remission rates as high as 90-95% can be achieved. Neutrophil-like Cells Day 7 Huang et al. PRL 2005 Collins et al. PNAS 1978 Jess Mar

  25. DMSO, ATRA GEDI: Cells Display Divergent Trajectories That Eventually Converge as they Differentiate Graphical representation of the results from a Self-Organizing Map clustering. Expression data from a single sample (time point) clustered according to a grid. What factors drive this divergent-then-convergent behavior? Huang et al. PRL 2005

  26. Our Hypothesis Observed Trajectory (Perturbation 1) State A State B Observed Trajectory (Perturbation 2) Transient Pathway (Perturbation 1) State A Core Differentiation Pathway State B Transient Pathway (Perturbation 2) Jess Mar

  27. Observed Trajectory 4 hrs 8 hrs 12 hrs 18 hrs 1 day 2 hrs ATRA ATRA DMSO DMSO ATRA ATRA DMSO DMSO 2 days 3 days 4 days 5 days 6 days 7 days Jess Mar

  28. Transient Trajectory 4 hrs 8 hrs 12 hrs 18 hrs 1 day 2 hrs ATRA DMSO ATRA DMSO 2 days 3 days 4 days 5 days 6 days 7 days Jess Mar

  29. Core Trajectory 4 hrs 8 hrs 12 hrs 18 hrs 1 day 2 hrs ATRA DMSO ATRA DMSO 2 days 3 days 4 days 5 days 6 days 7 days Jess Mar

  30. Extracellular Membrane Cytoplasm Nuclear high variance low variance Ultimately, we’d like to get to pathways:Functional Roles Are Associated with Constraint High-variance genes tend to function as cell surface receptors. Low-variance genes function as kinases and transferases.

  31. But the tools are very primative

  32. SZ Group PD Group P-value 2.5  10-4 P-value 2.8  10-7 Density Density Node Degree Node Degree Control Group P-value 3.5  10-4 Density high variance low variance Variance Constraints Alter Network Topology Degree distributions for the MAPK module are significantly different (Kolmogorov-Smirnov test). Degree of statistical significance is altered by disease status.

  33. So we’re back to Heat Maps The transcriptional profiles of ONS XS cells from SZ patients more closely resemble those of healthy fibroblasts than any other stem cell signature.

  34. And of course, we’ve left out the interestingg stuff, like where genes are expressed.

  35. LGRC Research Portal

  36. LGRC Research Portal

  37. PAGE DETAILS • Search • Facets • Search within results • Keyword prompts • Search history • Table: • Paged results • Sortable columns • Actions: • Go to Gene detail page • Add genes to ‘gene set’

  38. PAGE DETAILS • Annotation summary & summary view for each assay/data type: • Accordion style sections • GEXP – expression profile across major Dx categories • RNASeq – Exon structure of the gene • SNPs – Table of SNPs in region of gene, highlighting association with major Dx group • - Methylation – Methylation profile in region around gene • Genomic alterations – table of CNVs & alterations observed w/ freq in region around gene • Actions: • - Click through to assay detail page • Add gene to set Annotation Summary Gene Expression Summary RNASeq

  39. LGRC Research Portal

  40. PAGE DETAILS • - View aggregate statistics • View cohort details • Build cohort sets • Build composite phenotypes • Actions: • -Go to data download for selected cohort • Go to assay detail for selected cohort • Go to cohort manager

  41. LGRC Research Portal

  42. Analysis Tools • PAGE DETAILS • Very minimal parameters and options…here just 2 cohorts of interest, maybe p-value cutoff • Generates comprehensive report • Edit in place results – Don’t set parameters, edit the results • Analysis goes into queue, email notification when finished Cohort 1: Set 1 Cohort 2: Set 2 Job name: My job 1 Start Analysis View analysis parameters Job Status Running

  43. Analysis of Differential Expression: My Job 1 • PAGE DETAILS • Very minimal parameters and options. • Generates comprehensive report • Edit in place results – Don’t set parameters, edit the results • Accordion style result sections • Generate PDF report of analysis • Analysis goes into queue, email notification when finished Supervised Analysis Meta analysis Unsupervised analysis

  44. Before I came here I was confused about this subject. After listening to your lecture, I am still confused but at a higher level. - Enrico Fermi, (1901-1954)

  45. Genomics is here to stay

  46. Acknowledgments <johnq@jimmy.harvard.edu> The Gene Index Team Corina Antonescu Valentin Antonescu Fenglong Liu Geo Pertea Razvan Sultana John Quackenbush Center for Cancer Computational Biology Mick Correll HowieGoodell Kristina Holton Jerry Papenhausen Patricia Papastamos John Quackenbush http://cccb.dfci.harvard.edu Microarray Expression Team Stefan Bentink Thomas Chittenden Aedin Culhane Kristina Holton Jane Pak Renee Rubio Array Software Hit Team Katie Franklin Eleanor Howe Sarita Nair Jerry Papenhausen John Quackenbush Dan Schlauch RaktimSinha Joseph White (Former) Stellar Students Martin Aryee Kaveh Maghsoudi Jess Mar Systems Support Stas Alekseev, Sys Admin Eskitis Institute Christine Wells Alan Mackay-Sim Assistant Joan Coraccio Juliana Coraccio http://compbio.dfci.harvard.edu

  47. Shameless self-promotion

More Related