1 / 17

Gene Expression Deconvolution with Single-cell Data

Gene Expression Deconvolution with Single-cell Data. James Lindsay 1 Caroline Jakuba 2 Ion mandoiu 1 Craig Nelson 2. University Of Connecticut 1 Department of Computer Science and Engineering 2 Department of Molecular and Cell Biology. Mouse Embryo. ANTERIOR / HEAD. Neural tube.

cliff
Download Presentation

Gene Expression Deconvolution with Single-cell Data

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Gene Expression Deconvolution with Single-cell Data James Lindsay1 Caroline Jakuba2 Ion mandoiu1 Craig Nelson2 University Of Connecticut 1Department of Computer Science and Engineering 2Department of Molecular and Cell Biology

  2. Mouse Embryo ANTERIOR / HEAD Neural tube Somites Node Primitive streak POSTERIOR / TAIL

  3. Unknown Mesoderm Progenitor What is the expression profile of the progenitor cell type? NSB=node-streak border; PSM=presomitic mesoderm; S=somite; NT=neural tube/neurectoderm; EN=endoderm

  4. Characterizing Cell-types • Goal: Whole transcriptome expression profiles of individual cell-types • Technically challenging to measure whole transcriptome expression from single-cells • Approach: Computational Deconvolution of cell mixtures • Assisted by single-cell qPCR expression data for a small number of genes

  5. Modeling Cell Mixtures Mixtures (X) are a linear combination of signature matrix (S) and concentration matrix (C) cell types mixtures mixtures cell types genes genes

  6. Previous Work • Coupled Deconvolution • Given: X, Infer: S, C • NMF Repsilber, BMC Bioinformatics, 2010 • Minimum polytope Schwartz, BMC Bioinformatics, 2010 • Estimation of Mixing Proportions • Given: X, S Infer: C • Quadratic Prog Gong, PLoS One, 2012 • LDA Qiao, PLoS Comp Bio, 2o12 • Estimation of Expression Signatures • Given: X, C Infer: S • csSAMShen-Orr, Nature Brief Com, 2010

  7. Single-cell Assisted Deconvolution Given: X and single-cells qPCR data Infer: S, C Approach: • Identify cell-types and estimate reduced signature matrix using single-cells qPCR data • Outlier removal • K-means clustering followed by averaging • Estimate mixing proportions C using • Quadratic programming, 1 mixture at a time • Estimate full expression signature matrix S using C • Quadratic programming , 1 gene at a time

  8. Step 1: Outlier Removal + Clustering Remove cells that have maximum Pearson correlation to other cells below .95 unfiltered filtered

  9. Step 2: Estimate Mixture Proportions For a given mixture i:

  10. Step 3: Estimating Full Expression Signatures cell types mixtures mixtures cell types genes genes C: known from step 2 x: observed signals from new gene s: new gene to estimate signatures Now solve:

  11. Experimental Design • Simulated Concentrations • Sample uniformly at random [0,1] • Scale column sum to 1. • Single Cell Profiles • 92 profiles • 31 genes • Actual Mixtures • 12 mixtures • 31 genes • Dimensions • k = 3 • m = 31 • n = 92, 12 • # mixtures = {10…300} • Simulated Mixtures • Choose single-cells randomly with replacement from each cluster • Sum to generate mixture

  12. Data Processing RT-qPCR • CT values are the cycle in which gene was detected • Relative Normalization to house-keeping genes • HouseKeeping genes • gapdh, bactin1 • geometric mean • Vandesompele, 2002 • dCT(x) = geometric mean – CT(x) • expression(x) = 2^dCT(x)

  13. Accuracy of Inferred Mixing Proportions

  14. Concentration Matrix: Concordance predicted

  15. Leave-one-out Accuracy of Inferred Gene Expression Signatures

  16. Future Work • Apply gene signature estimation technique using more genes in mixed samples • Identify PSM-Pr Signature • Confirm the anatomical location of the putative PSM-Prcell population through exhaustive ISH

  17. Conclusion Special Thanks to: • Ion Mandoiu • Craig Nelson • Caroline Jakuba • Mathew Gajdosik James.Lindsay@engr.uconn.edu

More Related