
1000G Phase 1 Release chr20 call sets

Ryan Poplin, Genome Sequencing and Analysis, Medical and Population Genetics. January 25, 2011.


Presentation Transcript


  1. 1000G Phase 1 Release chr20 call sets
     Ryan Poplin
     Genome Sequencing and Analysis, Medical and Population Genetics
     January 25, 2011

  2. Data and Definitions -- Pipeline
     • Full indel cleaning process, including known indels
     • BAQ calculation using the GATK implementation of H. Li's method
     • Called by main continental AP and by admixed+ AP
     • Variant quality score recalibration
     • Quality cut chosen using HapMap3.3 + Omni 2.5M chip sensitivity
     • Cut at 99.2% of accessible sites
     • Genotype refinement not yet done
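A sensitivity-based quality cut of the kind described on this slide can be sketched as follows. This is only an illustration, not the pipeline's actual code: it assumes each truth-set site (for example, sites on the HapMap3.3 and Omni 2.5M chips) carries a recalibrated quality score where higher is better, and it reads "99.2%" as the fraction of those truth sites that must survive the cut. The function name `threshold_for_sensitivity` and its parameters are hypothetical.

```python
import math


def threshold_for_sensitivity(truth_scores, target=0.992):
    """Return the highest score cutoff that still keeps at least
    `target` of the truth-set sites (sites with score >= cutoff pass).

    `truth_scores` is assumed to hold one quality score per truth site,
    with higher scores meaning more confident calls.
    """
    if not truth_scores:
        raise ValueError("need at least one truth-set score")
    if not 0.0 < target <= 1.0:
        raise ValueError("target must be in (0, 1]")

    ranked = sorted(truth_scores, reverse=True)
    # Smallest number of top-ranked sites whose share reaches the target.
    # The small epsilon guards against floating-point overshoot in target * n.
    keep = max(1, math.ceil(target * len(ranked) - 1e-9))
    return ranked[keep - 1]
```

Calls whose score is at or above the returned cutoff would be retained; everything below it would be filtered. Ties at the cutoff are all kept, so the realized sensitivity can be slightly above the target.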

  3. Data and Definitions -- 1004 Samples
     • ASN = CHB + CHS + JPT
     • ASN+ = CHB + CHS + JPT + MXL + CLM + PUR
     • EUR = CEU + FIN + GBR + TSI + IBS
     • EUR+ = CEU + FIN + GBR + TSI + IBS + MXL + CLM + PUR + ASW
     • AFR = LWK + YRI + ASW
     • AFR+ = LWK + YRI + ASW + CLM + PUR
     • AMR = MXL + CLM + PUR
     • AMR+ = MXL + CLM + PUR + ASW
     • Note: these definitions differ from the other groups
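The panel definitions on this slide can be written down as plain sets, which makes it easy to see which populations each "+" panel adds to its base panel. The sketch below only transcribes the slide; the names `PANELS` and `added_populations` are hypothetical and not part of any 1000 Genomes or GATK tooling.

```python
# Panel definitions transcribed from the slide (population codes as given).
PANELS = {
    "ASN": {"CHB", "CHS", "JPT"},
    "ASN+": {"CHB", "CHS", "JPT", "MXL", "CLM", "PUR"},
    "EUR": {"CEU", "FIN", "GBR", "TSI", "IBS"},
    "EUR+": {"CEU", "FIN", "GBR", "TSI", "IBS", "MXL", "CLM", "PUR", "ASW"},
    "AFR": {"LWK", "YRI", "ASW"},
    "AFR+": {"LWK", "YRI", "ASW", "CLM", "PUR"},
    "AMR": {"MXL", "CLM", "PUR"},
    "AMR+": {"MXL", "CLM", "PUR", "ASW"},
}


def added_populations(base):
    """Return the populations that the '+' panel adds to its base panel."""
    return sorted(PANELS[base + "+"] - PANELS[base])


if __name__ == "__main__":
    for base in ("ASN", "EUR", "AFR", "AMR"):
        print(f"{base}+ adds: {', '.join(added_populations(base))}")
```

Written this way, the asymmetries in the definitions are visible at a glance: ASW, for example, belongs to the base AFR panel but appears in EUR+ and AMR+ only as an addition.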

  4. Final chr20 callsets including fragment-based calling and contrastive VQSR clustering
