Download
an integrated statistical framework for mapping quantitative traits in mice n.
Skip this Video
Loading SlideShow in 5 Seconds..
An integrated statistical framework for mapping quantitative traits in mice PowerPoint Presentation
Download Presentation
An integrated statistical framework for mapping quantitative traits in mice

An integrated statistical framework for mapping quantitative traits in mice

128 Views Download Presentation
Download Presentation

An integrated statistical framework for mapping quantitative traits in mice

- - - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript

  1. An integrated statistical framework for mapping quantitative traits inmice Richard Mott Jonathan Flint Wellcome Trust Centre for Human Genetics, Oxford Richard.Mott@well.ox.ac.uk

  2. Outline • Introduction • QTL Mapping • Multiple Phenotype Heterogeneous Stock Experiment • Testing for Functional Variants • Expression Data • Future

  3. Genetic Traits • Quantitative (height, weight) • Dichotomous (affected/unaffected) • Factorial (blood group) • Mendelian - controlled by single gene (cystic fibrosis) • Complex – controlled by multiple genes*environment (diabetes, asthma)

  4. Quantitative Trait Loci QTL: Quantitative Trait Locus chromosome genes

  5. Quantitative Trait Loci QTL: Quantitative Trait Locus chromosome QTG: Quantitative Trait Gene

  6. Quantitative Trait Loci QTL: Quantitative Trait Locus chromosome QTG: Quantitative Trait Gene QTN: Quantitative Trait Nucleotide

  7. Disease studied directly Population and environment stratification Very many SNPs (1,000,000?) required Hard to detect trait loci – very large sample sizes required to detect loci of small effect (5,000-10,000) Potentially very high mapping resolution – single gene Very Expensive Animal Model required Population and environment controlled Fewer SNPs required (~100-10,000) Easy to detect QTL with ~500 animals Poorer mapping resolution – 1Mb (10 genes) Relatively inexpensive Map inHumans or Animal Models ?

  8. Inbred founders Mosaic Crosses G3 GN F20 inbreeding mixing chopping up Heterogeneous Stock, Advanced Intercross, Random Outbreds F2, diallele Recombinant Inbred Lines

  9. Sizes of Behavioural QTL in rodents (% of total phenotypic variance)

  10. Effect size of cloned genes

  11. Mapping Resolution • F2 crosses • Powerful at detecting QTL • Poor at Localisation – 20cM • Too few recombinants • Increase number of recombinants: • more animals • more generations in cross

  12. Heterogeneous Stocks • cross 8 inbred strains for >10 generations

  13. Heterogeneous Stocks • cross 8 inbred strains for >10 generations

  14. Heterogeneous Stocks • cross 8 inbred strains for >10 generations 0.25 cM

  15. Multiple Phenotype QTL Experiment

  16. Multiple Phenotypes measured on a Heterogeneous Stock • 2000 HS mice (Northport, Bob Hitzeman) 84 families 40th generation • 150 traits measured on each animal • Standardised phenotyping protocol • Covariates Recorded • Experimenter • Time/Date • Litter • Microchipping

  17. Phenotypes • Anxiety (Conditioned and Unconditioned Tests) • Asthma (Plethysmography) • Diabetes (Glucose Tolerance Test) • Haematology • Immunology • Biochemistry • Wound Healing (Ear Punch) • Gene Expression • ….others….

  18. High throughput phenotyping facility

  19. Neophobia

  20. Fear Potentiated Startle

  21. Ovalbumin sensitization

  22. Plethysmograph

  23. Intraperitoneal Glucose Tolerance Test

  24. Ears

  25. Genotyping • 15360 SNPs genotyped by Illumina • 2000 HS mice • 300 HS parents • 8 inbred HS founders • 500 other inbreds • www.well.ox.ac.uk/mouse/snp.selector • 13459 SNPs successful • 99.8% accuracy (parent-offspring)

  26. Distribution of Marker Spacing (chromosome X) (9 Markers)

  27. LD Decay with distance 99.2% marker pairs on different autosomes have R2 < 0.05.

  28. Genetic Drift in HS • 40 generations of breeding • Allele Frequency in founders will drift • 8% of genome fixed

  29. Analysis • Automated analysis pipeline • R HAPPY package • Single Marker Association • Each phenotype analysed independently • Transformed to Normality, outliers removed • Tailored set of covariates • Linear models for most phenotypes • Survival models for latency phenotypes

  30. Twisted Pair Analysis of Heterogeneous Stock chromosome markers • Want to predict ancestral strain from genotype • We know the alleles in the founder strains • Single marker association lacks power, can’t distinguish all strains • Multipoint analysis – combine data from neighbouring markers alleles 1 1 2 1 2 1 1 1 2 2 1 2 2 1 1 1 1 2 1 1 2 1 1 1 1 1 2 2 1 2 1 2 1 1

  31. Twisted Pair Analysis of Heterogeneous Stock chromosome markers • Hidden Markov model HAPPY • Hidden states = ancestral strains • Observed states = genotypes • Unknown phase of genotypes • Analyse both chromosomes simultaneously • Twisted pair of HMMs • Mott et al 2000 PNAS alleles 1 1 2 1 2 1 1 1 2 2 1 2 2 1 1 1 1 2 1 1 2 1 1 1 1 1 2 2 1 2 1 2 1 1

  32. Testing for a QTL • piL(s,t) = Prob( animal i is descended from strains s,t at locus L) • piL(s,t) calculated by HMM using • genotype data • founder strains’ alleles • Phenotype is modelled E(yi) = Ss,tpiL(s,t)T(s,t) + mi Var(yi) = s2 • Test for no QTL at locus L • H0: T(s,t) are all same • ANOVA partial F test

  33. Genome Scan • Additive and dominance models • Record all peaks that exceed 5% genome-wide significance, • Threshold based on 200 permutations • 9000 preliminary candidate QTL found

  34. Jointly Significant QTL • Forward selection over candidate QTL • Test each QTL conditional on other QTL • Rescan genome conditional on selected QTLs to identify new QTL • 5% genome-wide significance threshold • Bootstrap residuals to find QTL confidence regions

  35. Results • ~7 jointly significant QTL per phenotype • 95% Confidence Interval ~ 2 Mb • ~50% of QTL have a significant non-additive component • Only 3 phenotypes were explained by single major QTL • Most phenotypes are complex

  36. Distribution of QTL Effects Mean Effect size 2.7%

  37. Distribution of #Genes under QTL #QTL +20 QTL With > 10 genes #genes

  38. %Variance Explained [% Additive Genetic Variance calculated using 3-generation pedigree data, not genotypes]

  39. Coat colour genes

  40. A known QTL: HDL HS mapping Wang et al, 2003

  41. New QTLs: two examples • Ear Punch Hole Area Regrowth • wound healing • Cue Conditioning Freeze.During.Tone • measure of fear

  42. Cue Conditioning • Freeze.During.Tone: huge effect, small number of genes chr15 cntn1: Contactin precursor (Neural cell surface protein)

  43. Gene x Environment Gene x Sex • Repeat analysis looking for QTLs that interact with • Gender • Litter number • Season, Month, etc • Experimenter • Compare models E(y) = m + locus + env E(y) = m + locus * env

  44. Gene x Environment • 431 jointly significant GxE QTLs • 27 gene x experimenter, • 81 gene x litter number, • 67 gene x age, • 105 gene x study day • 151 gene x season. • 13% of variation is GxE • 25 GxE QTLs overlapped with original joint QTL • defined as lying within 4Mb of the peak position • 42 GxSex QTLs

  45. Testing for Functional Variants • Is a SNP functional for a trait? • Is a functional assay measured in founders related to a trait? • Gene expression • DNA-Protein binding