Microarray Design and Analysis
1 / 33

Microarray Design and Analysis - PowerPoint PPT Presentation

  • Updated On :

Microarray Design and Analysis. Jeremy D. Glasner. Genetics 875 November 20, 2007. What is a Microarray?. A collection of DNA sequences arrayed on a solid substrate usually with thousands of individual DNA spots. Gene expression analysis. Massively parallel biochemistry aimed

I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
Download Presentation

PowerPoint Slideshow about 'Microarray Design and Analysis' - terrel

An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.

- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
Slide1 l.jpg

Microarray Design and Analysis

Jeremy D. Glasner

Genetics 875

November 20, 2007

Slide2 l.jpg

What is a Microarray?

A collection of DNA sequences arrayed on a solid substrate

usually with thousands of individual DNA spots

Slide3 l.jpg

Gene expression analysis

Massively parallel biochemistry aimed

at measuring RNA levels

Slide4 l.jpg

Why do gene expression analysis?

  • Predict new gene functions by expression patterns

  • Determine the effect of a drug on gene expression

  • Compare a mutant strain to wild-type

  • Identify an expression signature for a cancer type

  • Find the targets of a transcriptional regulator

  • Measure the half-lives of RNAs in the cell




Slide6 l.jpg

TraSH: transposon site hybridization

Genome wide localization of insertion mutations

Sassetti CM, Boyd DH, Rubin EJ. Proc Natl Acad Sci U S A. 2001 98(22):12712-7

Slide7 l.jpg

CGH: Comparative genome hybridization

Rajashekara G, Glasner JD, Glover DA, Splitter GA. J Bacteriol. 2004 186(15):5040-51.

Slide8 l.jpg

ChIP-Chip: Chromatin immunoprecipitation, chip hybridization

a.k.a. genome-wide occupancy profiling

Identify the chromosomal locations of a DNA binding protein

Slide9 l.jpg

Flow of Information in Array Analyses

Experimental Design

Array Production

Sample Preparation


Image Analysis

Data Processing

Data Analysis

Information Integration

Slide10 l.jpg

Experimental Design Issues

Number of Replicates

What samples should be compared?

Directly on same chip/across arrays?

Calibrators and common references

What controls are necessary?

Slide11 l.jpg

Considerations when designing the sequences for a chip

Sequence Annotation

ORF, UTR, functional RNA prediction

Oligo Selection

Array Design, Replication

Slide12 l.jpg

Sensitivity vs. specificity as a function of oligo length

They are inversely related

Hughes TR, et al., Nat Biotechnol 2001. Apr;19(4):342-7.

Slide13 l.jpg

Two methods for array production



Slide14 l.jpg

Affymetrix Chips

105-106 “Probes”

Perfect Match and Mismatch

Average Difference Values

Slide15 l.jpg

Affymetrix “Units”



A “probe set”

A “probe pair”

Dmd the digital light switch l.jpg
DMD “The Digital Light Switch”

DMD Close-Up

  • Mirrors spacing 17 um

  • Mirror transit time <20 us

  • Tilt angle +10 degrees

  • Five mirrors = diameter human hair

  • Analog pictures from digital switches?

Slide19 l.jpg

Sample Preparation

RNA samples are extracted for the experiment and fluorescent dyes are incorporated

RNA stabilization

Direct vs. indirect labeling

Slide20 l.jpg

Hybridization & Scanning

PMT settings



Data Tracking

Slide21 l.jpg

Image Analysis

Spot Finding

Background subtraction

Intensity Calculation

Slide22 l.jpg

Automatic Grid Finding

Sum signal intensities in X and Y directions

Slide24 l.jpg

Estimating Foreground and Background with the “Histogram” Method




Slide25 l.jpg

Quality Filtering of Data “Histogram” Method

From Tseng et al., 2001. NAR 29(12):2549-2557

Slide26 l.jpg

Data Normalization “Histogram” Method

Data Normalization is necessary if the overall signal differs between experiments and can be complicated if the relationship is nonlinear.

Internal controls can also be used for normalization.

Data from Schadt et al 2001. Journal of Cellular Biochemistry Supplement 37:120-125.

Slide27 l.jpg

M-A Plots “Histogram” Method

Slide28 l.jpg

Print Tip Group Normalization “Histogram” Method

Slide29 l.jpg

Intensity Dependent Normalization “Histogram” Method

Slide30 l.jpg

Normalization Methods “Histogram” Method

Assume linear relationship

Apply non-linear normalization

Normalize to “house-keeping genes”

Normalize to internal Standards

Slide31 l.jpg

Ratio Calculation Methods “Histogram” Method

Slide32 l.jpg

Detecting differential expression “Histogram” Method

Determine which changes are significant:

Fixed cutoff (fold-change>4)

Replication allows assessment of variability

Common statistics such as the t-test are often used for gene expression data. Significance of the value is then determined by referring to the t distribution. This assumes that the data is normally distributed, which may not be true.

Gene expression experiments may require thousands of statistical tests and significance should be adjusted to reflect this. A standard Bonferroni correction is the p-value multiplied by the number of tests but is likely too conservative.

Different methods different results l.jpg
Different methods, different results “Histogram” Method

Millenaar et al., BMC Bioinformatics2006, 7:137