Genomes for ALL
Jeremy Edwards
Molecular Genetics and Microbiology, Cancer Research and Treatment Center, Chemical and Nuclear Engineering
jsedwards@salud.unm.edu
Sanger Sequencing
$1,000 Genome
• Error rate must be significantly lower than the variation (~1 in 1 kbp)
• Error rate goal of 1/100,000
• Raw read accuracy goal = 99.7% with 3x coverage
• 3x coverage of the human diploid genome: ~40 billion bases
• Requires 40 million bases per $1 for the $1,000 genome
• Is a 20 bp read long enough?
• If the human genome sequence were random, one would expect each 20 bp read to be unique: 4^20 >> 3×10^9
• However, the human genome is not random.
• Only 73% of the potential 20 bp reads could be uniquely placed in the genome.
• Here, we collected 26 bp reads, with extension to 34 bp.
• The 6+7, 7+6 = 26 bp read could be improved to a 9+9, 9+9 = 36 bp read using a different ligase
• Mate pairs greatly help the situation.
• Should be able to increase this to 72 bp.
• 60 bp is thought to be the minimum (Shendure, et al)
• Which works for a 40 Mbp contig (34 bp)
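The numbers on this slide can be sanity-checked with a little arithmetic. The sketch below is not from the talk: it assumes read errors are independent, that an erroneous read is equally likely to show any of the 3 wrong bases, and that the consensus is a strict majority vote over the 3 reads. The function name `consensus_error` is illustrative only.

```python
from math import comb

def consensus_error(p, n=3, wrong_bases=3):
    """Probability that a strict majority of n reads agree on one wrong base.

    Assumptions (not from the slides): errors are independent with
    per-read rate p, and each error is equally likely to be any of
    `wrong_bases` alternatives.
    """
    q = p / wrong_bases            # chance a read shows one specific wrong base
    majority = n // 2 + 1          # smallest strict majority
    per_base = sum(comb(n, k) * q**k * (1 - q)**(n - k)
                   for k in range(majority, n + 1))
    # Strict majorities on different wrong bases are mutually exclusive.
    return wrong_bases * per_base

raw_accuracy = 0.997
err = consensus_error(1 - raw_accuracy)

bases_per_dollar = 40e9 / 1000     # 40 billion bases for $1,000
possible_20mers = 4 ** 20          # number of distinct 20 bp sequences
genome_size = 3e9                  # haploid human genome, in bp

print(f"consensus error rate: {err:.2e} (about 1 in {1 / err:,.0f})")
print(f"bases per dollar: {bases_per_dollar:,.0f}")
print(f"4^20 / genome size: {possible_20mers / genome_size:,.0f}")
```

Under these assumptions, 99.7% raw accuracy at 3x coverage lands close to the 1/100,000 goal. If all errors instead pointed to the same wrong base (`wrong_bases=1`), the consensus error would be roughly three times higher.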
Polonies
• Amplify template DNA in a thin polyacrylamide gel
• All normal PCR ingredients plus polyacrylamide are mixed and polymerized onto a glass slide
• Exponential DNA amplification is obtained
• DNA remains localized due to diffusion limitations
Mitra and Church. (1999) Nucleic Acids Research.
[Figure labels: acrylamide gel + PCR reagents + primers and DNA; thermal cycle]
Polonies Mitra and Church. (1999) Nucleic Acids Research.
Polonies: 75 µl PCR, ~2 mm thick; 15 µl PCR, ~50 µm thick
Applying the technology Map Check
Mutants & function
• 54 variants of the hG6PD gene were functionally tested
• 50 mutants represent all possible amino acid substitutions in the 8-residue G6P binding site that can arise from single nucleotide substitutions
• 3 variants, each with reported protein activity, carried single nucleotide mutations located away from the G6P binding site
• The last variant was the wild-type hG6PD.
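The count of "all possible amino acid substitutions from single nucleotide substitutions" depends on the codons at each residue. A minimal sketch of that enumeration follows, using the standard genetic code. The codons in `site` are made up for illustration and are not the hG6PD binding-site sequence; excluding stop codons and synonymous changes is also an assumption.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def single_nt_substitutions(codon):
    """Amino acids reachable from `codon` by changing exactly one nucleotide.

    Synonymous changes and stop codons are excluded (an assumption).
    """
    original = CODON_TABLE[codon]
    reachable = set()
    for pos in range(3):
        for base in BASES:
            if base == codon[pos]:
                continue
            mutant = codon[:pos] + base + codon[pos + 1:]
            aa = CODON_TABLE[mutant]
            if aa != original and aa != "*":
                reachable.add(aa)
    return reachable

# Hypothetical 8-codon site, NOT the real hG6PD sequence.
site = ["AAG", "GGT", "CAC", "TAC", "ATT", "CGT", "GAA", "TGG"]
total = 0
for codon in site:
    subs = single_nt_substitutions(codon)
    total += len(subs)
    print(codon, CODON_TABLE[codon], "->", "".join(sorted(subs)))
print("total single-nucleotide amino acid substitutions:", total)
```

For a lysine codon such as AAG, the reachable set includes glutamate, which is the kind of change a variant like Lys131Glu represents.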
Parallel Competition
• 50% Mutant A, 50% Mutant B
• Competitive growth of mutant population
• Isolation of DNA from population
• Dilute DNA and polymerize on a glass microscope slide with PCR components
• Thermal cycle to amplify PCR colonies (polonies) arising from individual DNA molecules
Merritt et al. (2003) Nucleic Acids Research; Merritt et al. (2005) Biotech Bioeng
Parallel Competition
• Growth competition used to determine relative activities of mutant proteins; polonies and SBEs used to measure concentrations of each
• Workflow: culture mutant pool, isolate plasmid DNA, polonies/SBE
• Time points: 0 hr, 48 hr, 96 hr
• Green = Lys131Glu; Red = WT (at this position)
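One common way to turn polony counts from a growth competition into a relative fitness estimate is to fit the slope of ln(mutant/WT) against time. The sketch below shows that calculation. It is a generic approach, not necessarily the analysis used in Merritt et al., and the counts are hypothetical.

```python
import math

def log_ratio_slope(times_hr, mutant_counts, wt_counts):
    """Least-squares slope of ln(mutant/WT) versus time, in units of 1/hr.

    A negative slope means the mutant is being outcompeted by wild type.
    """
    ys = [math.log(m / w) for m, w in zip(mutant_counts, wt_counts)]
    n = len(times_hr)
    t_mean = sum(times_hr) / n
    y_mean = sum(ys) / n
    num = sum((t - t_mean) * (y - y_mean) for t, y in zip(times_hr, ys))
    den = sum((t - t_mean) ** 2 for t in times_hr)
    return num / den

# Hypothetical polony counts at the three time points on the slide.
times = [0, 48, 96]
mutant = [500, 250, 125]   # e.g. green polonies
wt = [500, 500, 500]       # e.g. red polonies

slope = log_ratio_slope(times, mutant, wt)
print(f"slope of ln(mutant/WT): {slope:.4f} per hour")
```

In this made-up example the mutant fraction halves every 48 hours, so the slope equals ln(0.5)/48 per hour.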
Barcode design Molecular Barcode
Barcode Readout [Figure: Tag1 and Tag2 readouts; sequences shown: AATAAA, TCCCTT]
Polony Sequencing (Old School) Mitra, et al. (2003) Anal Biochem
FISSEQ Mitra, et al. (2003) Anal Biochem β-mercaptoethanol treatment
FISSEQ – 34 bp reads Mitra, et al. (2003) Anal Biochem
Bead Based Higher Throughput Assays Map Check
BEAMing Dressman, D., et al. (2003) PNAS
Polonies or Beads? Well, we will just call them both polonies if they are in a gel
Polony manipulations …and just about anything else … To Be Continued
Acetate Data Edwards, et al. (2001) Nat Biotechnol
Predictive Capability Edwards, et al. (2001) Nat Biotechnol
We are wrong? Ibarra, et al. (2002) Nature
Adaptation to glycerol Ibarra, et al. (2002) Nature
Overview: genome, then ~1 kb fragments, then sequence the 13 bp at the end of each ~1 kb fragment
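The idea of reading only short tags from the ends of ~1 kb fragments can be sketched in a few lines. This toy example assumes one 13 bp tag is taken from each end of a fragment (matching the 6+7, 7+6 = 26 bp read mentioned earlier) and uses a random sequence in place of a real genome. All names are illustrative.

```python
import random

def end_tags(fragment, tag_len=13):
    """Return the (left, right) tags from the two ends of a fragment."""
    if len(fragment) < 2 * tag_len:
        raise ValueError("fragment too short for two non-overlapping tags")
    return fragment[:tag_len], fragment[-tag_len:]

def random_fragments(genome, frag_len=1000, n_fragments=5, seed=0):
    """Pick fragments of fixed length at random start positions."""
    rng = random.Random(seed)
    frags = []
    for _ in range(n_fragments):
        start = rng.randrange(len(genome) - frag_len + 1)
        frags.append((start, genome[start:start + frag_len]))
    return frags

# Toy "genome": 20 kb of random sequence, standing in for real data.
rng = random.Random(42)
genome = "".join(rng.choice("ACGT") for _ in range(20_000))

for start, frag in random_fragments(genome):
    left, right = end_tags(frag)
    # The two tags are ~1 kb apart in the genome, which is what makes
    # them useful as a mate pair when placing reads.
    print(start, left, right)
```

Because the two tags come from a known separation, a pair can often be placed even when one tag alone is not unique.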
Overview Shendure, et al. (2005) Science
Fragmentation of Genome: ~1 kb fragments; fragmented genome, 10 µg
Next few steps
• A-tailing
• Circularization: key step for improvement (<0.1% efficient)
[Figure labels: A-tailed fragment; T-tailed T30 linker, 30 bp, contains MmeI sites and anchor primer hyb sites]
RCA [Figure labels: RCA; MmeI]
MmeI-digested RCA product: gel-purified 66-68 bp band (Tag-T30-Tag) [Gel label: 1 kb]
Ligate Primers to the end
• Blunt end ligation produces three different products:
• F-TAG-T30-TAG-R
• F-TAG-T30-TAG-F
• R-TAG-T30-TAG-R
• Gel purify the whole triplet of bands, then PCR amplify
• The correct band is enriched, but the other two are still present.
• Gel purify the center (134 bp) band from the others (116 & 152 bp)
Gel of Final Library
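The three band sizes follow from which primer lands on each end of the Tag-T30-Tag insert. The sketch below enumerates the combinations. The insert length (66 bp, the low end of the 66-68 bp band) and the primer lengths (25 and 43 bp) are assumptions chosen only because they reproduce the 116/134/152 bp bands; the talk does not give them, and which primer is the shorter one is arbitrary here. The 1:2:1 ratio further assumes each end picks up F or R independently with equal probability.

```python
from itertools import product

INSERT_LEN = 66                     # assumed; the slide gives 66-68 bp
PRIMER_LEN = {"F": 25, "R": 43}     # assumed lengths, not from the talk

def ligation_products(insert_len=INSERT_LEN, primer_len=PRIMER_LEN):
    """Enumerate blunt-end ligation products by primer pair.

    Returns {name: {"length": bp, "ways": number of end assignments}}.
    F-R and R-F are counted as the same product.
    """
    products = {}
    for left, right in product(primer_len, repeat=2):
        name = "-".join(sorted((left, right)))
        length = primer_len[left] + insert_len + primer_len[right]
        entry = products.setdefault(name, {"length": length, "ways": 0})
        entry["ways"] += 1
    return products

for name, info in sorted(ligation_products().items(),
                         key=lambda kv: kv[1]["length"]):
    print(f"{name}: {info['length']} bp, {info['ways']} of 4 end assignments")
```

Only the mixed F-R product has different primers on its two ends, which fits the slide's point that the center band is the one to keep.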
Ligation Sequencing: + direction, 6 bp; − direction, 7 bp. Inspired by Brenner, et al. (2000) Nat Biotechnol
Sequencing “Center”
• Hybridize ‘anchor primer’ complementary to common library sequence
• Ligate pool of fluorescently-labeled ‘query primers’ specific to one tag-position
• Image to determine which primer pool ligated to each bead
• Repeat
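The hybridize, ligate, image, repeat cycle amounts to one base call per bead per cycle. A minimal sketch of the calling step follows, assuming four query-primer pools (one per base, each with its own fluorophore) and simple brightest-channel calling. The intensities are hypothetical, and real pipelines also need image registration and quality filtering.

```python
def call_base(intensities):
    """Return the base whose query-primer pool gave the brightest signal."""
    return max(intensities, key=intensities.get)

def sequence_bead(cycles):
    """Concatenate one base call per ligation cycle for a single bead."""
    return "".join(call_base(c) for c in cycles)

# Hypothetical normalized intensities for one bead over three cycles,
# each cycle interrogating a different tag position.
bead_cycles = [
    {"A": 0.91, "C": 0.05, "G": 0.08, "T": 0.03},
    {"A": 0.04, "C": 0.10, "G": 0.88, "T": 0.06},
    {"A": 0.07, "C": 0.02, "G": 0.05, "T": 0.93},
]

read = sequence_bead(bead_cycles)
print(read)
```

Each cycle starts from a fresh anchor primer, so an error at one position does not carry over to the next.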
“Reads” [Figure panels: all data; “Good Reads”. Positions shown: Proximal -1, Proximal -7]