430 likes | 596 Views
Gene hunting from a Bayesian viewpoint. Tuan V. Nguyen Bone and Mineral Research Program Garvan Institute of Medical Research Sydney, Australia. Gene search is justified? . Exploration of disease pathway Public health implications Pharmacological applications Treatment? . Complex traits.
E N D
Gene hunting from a Bayesian viewpoint Tuan V. Nguyen Bone and Mineral Research Program Garvan Institute of Medical Research Sydney, Australia
Gene search is justified? • Exploration of disease pathway • Public health implications • Pharmacological applications • Treatment?
BMD: genetics and environments Variation in complex trait = G + E + GxE (G=genetics; E=environment, x=interaction) MZ twins; r=0.73 DZ twins; r=0.47
Genetics, BMD and fracture Fracture BMD
Does familial risk of BMD affect fracture risk? Intraclass correlation in BMD RR of BMD/Fx r = 0.8 r = 0.9 _________________________________________________ 5 1.14 1.16 6 1.17 1.20 7 1.21 1.24 8 1.24 1.28 _________________________________________________ Genes that affect BMD explain small variation in fx risk
Fracture risk: genetics and environments Twin study P Kannus et al, BMJ 1999; 319:1334-7
Current strategies • Linkage analysis • Association analysis • Genome-wide screen • “Candidate gene”
Linkage analysis – identical by descent (ibd) AB AC AB CD AB CD AB AC AC AD BC BC IBD = 0 IBD = 1 IBD = 2
Linkage analysis: basic model Squared difference in BMD among siblings o o o o o o o o o o o o o o o o o o o o o o o o o o o 0 1 2 Number of alleles shared IBD
Population-based association analysis Fracture AC AB AC BC AA AB BB AA AC AB Controls AC BB BC BC CC AB BB CC BC BB
Family-based association analysis AB AA AB AC BC AA AB BC AB
Genome-wide vs candidate gene approach Candidate gene analysis Genome-wide screen Complex No prior knowledge of mechanism Expensive No specific genes Simple Prior knowledge of mechanism Inexpensive Specific genes
Test statistic Test statistic = signal / noise = effect size / random error Result: significant (+ve) or not significant (-ve) Criteria: P-value P<0.05 +ve P>0.05 -ve
Consider an example OR = (0.1 / 0.9) / (0.05 / 0.95) = 2.11 LnOR = 0.74; SE(lnOR) = 0.18 P-value < 0.0001
Diagnostic analogy Diagnosis Genetic research
The meaning of P-value • P-value: probability of getting a significant statistical test given that there is no association (or no linkage) • P-value = P(significant stat | Ho is true)
The logic of P-value • If there was truly no association, the the observation is unlikely • The observation occurred • The no-association hypothesis is unlikely • If Tuan has hypertension, he is unlikely to have pheochromocytoma • Tuan has pheochromocytoma • Tuan is unlikley to have hypertension
P-value “Thinking about P values seems quite counter-intuitive at first, as you must use backwards, awkward logic. Unless you are a lawyer or a Talmudic scholar … you will probably find this sort of reasoning uncomfortable” (Intuitive Statistics)
What do we want to know? Clinical P(+ve | cancer), or P(cancer | +ve) ? Research P(Significant test | Association), or P(Association | Significant test) ?
Breast Cancer Screening Prevalence = 1%; Sensitivity = 90%; Specificity = 91% Population Cancer (n=10) No Cancer (n=990) +ve N=9 -ve N=900 -ve N=1 +ve N=90 P(Cancer| +ve result) = 9/(9+90) = 9%
Genetic association Prior prob. association = 0.05; Power = 90%; Pvalue = 5% 1000 SNPs True (n=50) False (n=950) +ve N=45 -ve N=902 -ve N=5 +ve N=48 P(True association| +ve result) = 45/(45+48) = 48%
P value is • NOT the likelihood that findings are due to chance • NOT the probability that the null hypothesis is true given the data • A p-value = 0.05 does not mean that there is a 95% chance that a real difference exists • The lower p-value, the stronger the evidence for an effect
Bayes factor P(data | H0) BF = _________________ P(data | H1)
Minimum Bayes factor Prob of null hypothesis Where P(H0) is the prior probability of the null hypothesis
Bayes Factor for genetic association study let p be the allelic frequency of genetic marker, then BF can be shown to be:
Bayes Factor and posterior probability of an association let p0 be the prior probability of a true association, the posterior probability of the association is:
Bayes factor and p-value Bayes factor P-value • Non-comparative • Observed + hypothetical data • Evidence only negative • Sensitive • No formal justification or interpretation • Comparative • Only observed data • Evidence +ve or –ve • Insensitive • Formal justification and interpretation
Summary • The criteria of p<0.05 is not an adequate measure of a genetic association • Bayes factor is potentially a relevant measure of association
Distribution of sample sizes Ioannidis et al, Trends Mol Med 2003
Distribution of effect sizes Ioannidis et al, Trends Mol Med 2003
Correlation between the odds ratio in the first studies and in subsequent studiesIoannidis et al, Nat Genet 2001
Evolution of the strength of an association as more information is accumulatedIoannidis et al, Nat Genet 2001
Predictors of statistically significant discrepancies between the first and subsequent studies of the same genetic association Ioannidis et al, Nat Genet 2001
Risk factors for fracture • Drinking coffee • Drinking tea • Coca cola • High protein intake • Blonde hair • Being tall • Wear trouser (women) • High heel (women)
Cancer risk • Being a waiter • Owning a pet bird • Being short • Being tall • Hot dogs • Have a refrigerator! • Electric razors • Broken arms (women) • Fluorescent lights • Allergies • Breeding reindeer Altman and Simon, JNCI 1992
“Half of what doctors know is wrong. Unfortunately we don’t know which half.” Quoted from the Dean of Yale Medical School, in “Medicine and Its Myths”, New York Times Magazine, 16/3/2003