
Xinbin Dai, Ph.D.


Presentation Transcript


  1. Affymetrix Probeset Mapping and Medicago Genome Annotation (Mt4.0 RC1) Xinbin Dai, Ph.D.

  2. Agenda • About the Affymetrix Medicago GeneChip • Mapping Algorithm and Tool • Bioinformatics Resources for Medicago truncatula

  3. Affymetrix GeneChip Probes [Figure: target transcript (mRNA) with 5’ UTR, EXON-I, EXON-II, EXON-III and 3’ UTR. A probeset consists of 11 probes, each a 25-mer, shown as perfect match (PM) and mismatch (MM) probes over positions 1 to 25.]

  4. Probeset Types • id_at: Designates probe sets that uniquely recognize target transcripts. • id_a_at: Designates probe sets that recognize alternative transcripts from the same gene. • id_s_at: Designates probe sets with common probes among multiple transcripts from different genes. • id_x_at: Designates probe sets where it was not possible to select either a unique probe set or a probe set with identical probes among multiple transcripts. Rules for cross-hybridization were dropped in order to design the _x probe sets. These probe sets share some probes identically with two or more sequences and, therefore, may cross-hybridize in an unpredictable manner. Source: GeneChip® Expression Analysis Data Analysis Fundamentals.

  5. About the Medicago GeneChip Reference sequences: early version of IMGAG, DFCI Gene Index and alfalfa ESTs
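The suffix rules on this slide can be sketched as a small lookup. This is only an illustration: the function name `probeset_type` and the example IDs are hypothetical, and the code assumes the type is encoded purely by the trailing `_at`, `_a_at`, `_s_at` or `_x_at` of the probeset ID.

```python
import re

# Short descriptions paraphrased from the slide.
SUFFIX_MEANING = {
    "_at": "uniquely recognizes the target transcript",
    "_a_at": "recognizes alternative transcripts from the same gene",
    "_s_at": "shares probes among transcripts from different genes",
    "_x_at": "may cross-hybridize in an unpredictable manner",
}

# Optional _a/_s/_x marker followed by the mandatory _at at the end of the ID.
_SUFFIX_RE = re.compile(r"(_[asx])?_at$")


def probeset_type(probeset_id: str) -> str:
    """Return the type suffix of a probeset ID, or 'unknown' if none is found."""
    match = _SUFFIX_RE.search(probeset_id)
    if match is None:
        return "unknown"
    return (match.group(1) or "") + "_at"


if __name__ == "__main__":
    # Hypothetical IDs, used only to exercise the function.
    for pid in ["Example.1.1.S1_at", "Example.2.1.S1_s_at", "Example.3.1.S1_x_at"]:
        suffix = probeset_type(pid)
        print(pid, suffix, SUFFIX_MEANING.get(suffix, "unknown type"))
```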

  6. Mapping Algorithm and Tool • Gene transcripts were matched to corresponding Affymetrix probe sets using a position-weighted scoring index in which mismatches near the middle of a probe were most heavily penalized. Weights for probe positions 1 to 25: [1,1,1,1,1,2,2,2,2,2,3,3,3,3,3,2,2,2,2,2,1,1,1,1,1], so a perfectly matching probe yields a score of 45. • Matches were declared when at least 8 of 11 probes had scores of 43 or higher. Cutoff for matching: 43 × 8 = 344. Originated from Affymetrix, Inc.
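A minimal sketch of the scoring rule described on this slide, not the AffyProbeMapping implementation itself. It assumes a probe's score is the sum of the weights at positions where probe and target agree, that probes are given in the same orientation as the transcript, and that alignment is ungapped; the function names are hypothetical.

```python
# Position weights for a 25-mer probe, as listed on the slide (sum = 45).
WEIGHTS = [1] * 5 + [2] * 5 + [3] * 5 + [2] * 5 + [1] * 5

PROBE_CUTOFF = 43   # minimum score for a probe to count as matching
MIN_PROBES = 8      # minimum number of matching probes per probeset


def probe_score(probe: str, window: str) -> int:
    """Sum the weights of positions where probe and window agree (assumed rule)."""
    if len(probe) != len(WEIGHTS) or len(window) != len(WEIGHTS):
        raise ValueError("probe and window must both be 25 bases long")
    return sum(w for w, p, t in zip(WEIGHTS, probe.upper(), window.upper()) if p == t)


def best_probe_score(probe: str, transcript: str) -> int:
    """Best ungapped score of the probe over every 25-base window of the transcript."""
    n = len(WEIGHTS)
    scores = (probe_score(probe, transcript[i:i + n])
              for i in range(len(transcript) - n + 1))
    return max(scores, default=0)


def probeset_matches(probes: list[str], transcript: str) -> bool:
    """Declare a match when at least MIN_PROBES probes reach PROBE_CUTOFF."""
    hits = sum(1 for p in probes if best_probe_score(p, transcript) >= PROBE_CUTOFF)
    return hits >= MIN_PROBES
```

Under these assumptions a single mismatch at a weight-3 position drops a probe to 42, below the cutoff, while one mismatch at a weight-1 or weight-2 position, or two mismatches at weight-1 positions, still passes.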

  7. AffyProbeMapping: An Online Affymetrix Probeset Mapping Tool http://bioinfo3.noble.org/affymap/ • Input sequence: • Transcript • cDNA • EST/Unigene • CDS

  8. Output of AffyProbeMapping. AffyProbeMapping also supports Affymetrix chips for other species: Lotus japonicus, Arabidopsis thaliana, rice, soybean, maize, Populus, cotton and tomato

  9. Bioinformatics & Data Resources for Medicago truncatula Data Sources: • Mt3.5v4 (2011, version for the Nature paper): optical mapping; 44,124 BAC-based gene loci + 18,264 Illumina (nr) gene models • Mt3.5v5 (2012, minor changes): 45,859 BAC-based gene loci + 18,264 Illumina gene models • Mt4 RC1 (2013, PAG 2013 conference): Illumina contigs anchored onto pseudochromosomes; 84,993 gene loci (BAC + Illumina). Chromosome sequences are frozen; some gene models might be removed. • DFCI Gene Index Release 11: 294k ESTs/ETs, 68,814 Unigenes

  10. Statistics on Mt3.5v4 vs. Probesets Mapping Results using AffyProbeMapping

  11. Statistics on Mt4 RC1 vs. Probesets Mapping Results using AffyProbeMapping

  12. Statistics on Gene Index R11 vs. Probesets Mapping Results using AffyProbeMapping

  13. Mapping between the Medicago genome and the Affymetrix Medicago chip http://bioinfo3.noble.org/affymap/Dataset.gy

  14. Bioinformatics Tools For Medicago • Sequence Search and Annotation • DOBLAST --- http://bioinfo3.noble.org/doblast/ , a BLAST search tool accelerated by parallel computing • Features: • Preloaded with many Medicago data resources • Capable of handling big datasets • The “Tab-delimited bioparser output format” works well with Excel

  15. Bioinformatics Tools For Medicago • Sequence download and cutting by coordinates • “Sequence Download” page of DOBLAST --- batch download sequences or cut sequences by coordinates • Preloaded with many Medicago data resources • Batch download • Get a fragment of a sequence by coordinates
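Cutting a fragment by coordinates can be illustrated with a few lines of code. This is a sketch only and does not describe how DOBLAST works internally: it assumes 1-based, inclusive coordinates (a common genome annotation convention) and an optional strand argument, and the function name `cut_sequence` is hypothetical.

```python
# Translation table for complementing DNA bases (N stays N).
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def cut_sequence(seq: str, start: int, end: int, strand: str = "+") -> str:
    """Return seq[start..end] using 1-based inclusive coordinates (assumed convention).

    For strand '-', the fragment is reverse-complemented.
    """
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates must satisfy 1 <= start <= end <= len(seq)")
    if strand not in ("+", "-"):
        raise ValueError("strand must be '+' or '-'")
    fragment = seq[start - 1:end]
    if strand == "-":
        fragment = fragment.translate(_COMPLEMENT)[::-1]
    return fragment


if __name__ == "__main__":
    print(cut_sequence("AACCGGTT", 3, 6))       # forward-strand fragment
    print(cut_sequence("AACCGGTT", 1, 3, "-"))  # reverse complement of the fragment
```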

  16. DOBLAST sequence download page

  17. Bioinformatics Tools For Medicago • LegumeIP: An Integrative Platform to Study Gene Function and Genome Evolution in Legumes. • Features: • Synteny analysis among model legumes • Phylogenetic analysis for gene family • Gene to gene association analysis • Gbrowser • http://plantgrn.noble.org/LegumeIP/ • We are updating to Version 2

  18. LegumeIP: Synteny analysis for Medicago genome

  19. LegumeIP: Phylogenetic analysis for Medicago gene family

  20. LegumeIP: Gene association network analysis for Medicago gene
