PDCB BioC for HTS topic Understanding the tech. 02

PDCB BioC for HTS topicUnderstanding the tech. 02 LCG Leonardo Collado Torres lcollado@wintergenomic.com lcollado@ibt.unam.mx September 2nd, 2010

Topics • Basecalling • Quality Filtering • FASTQ format • Error rates • A gamma of problems / reports • Fragment of James Huntley’s ppt on best practices

Basecalling: Illumina

Cross-talk

SWIFT: cross-talk correction

Phasing and Prephasing options

Some warnings!

Describe each case

Quality Filtering: Purity and Chastity

What artifact can be derived from this step?

FASTQ format @ is the seq id sequence + is the qual id Quality in ASCII chars

Originally…

Q to error probability (p) formulas Qphred Qsolexa1.3

FASTQ types What is the quickest way to distinguish fastq-sanger from fastq-illumina? Tip: Check the ASCII table 

phred.R

It is NOT clear what quals of 1 and 2 mean in Illumina (version 1.5+)

FASTQ in CS Base 1 does not include a quality value! (It’s a 0)

Error rates

IlluminavsSOLiD: % per cycle

IlluminavsSOLiD: num of errs

Understanding 454 (GS20) a bit more

454 error types

454 errors

Presence of Ns correlates with error rate (454)

IlluminavsSOLiD

Helicos

A gamma of problems / reports • Aligned to the wrong reference • Did not use the correct quality encoding • Barcodes are trimmed or have mismatches • Trimming the 1st and last base  losing barcodes • GC bias • Sample degradation will affect your data!

PDCB BioC for HTS topic Understanding the tech. 02

PDCB BioC for HTS topic Understanding the tech. 02

Presentation Transcript

Advanced Microeconomics Topic 02: Consumer Demand

Today’s Topic (02/03/14)

PDCB BioC for HTS topic Understanding the tech. 01

Tech Topic

HTS Platforms

YEA Tech Topic

MMG /BIOC 352

Powerpoint Presentation – Tech Tutorial for LIS 488-02

HTS Magnets for ARIES-AT

MMG /BIOC 352

BIOC 300D

Tech Topic Presentation

Bioc 300: Bioinformatics

Tech Topic Presentation Underwriters Laboratories

Tech Topic: Link State

HTS Solutions - Always For You

HTS Classification

HTS Libraries

Bioc 300: Bioinformatics

Understanding Guitar: The Best Content For the Topic Is Here

Understanding SEO for Tech Startups

Understanding the ITIL Certification Online Path for Tech Upward Mobility