1 / 35

Basic features for portal users

Basic features for portal users. Agenda - Basic features. Overview features and navigation Browsing data Files and Samples Gene Summary pages Performing Analyses on the portal Co-expression, differential expression, GSEA Managing your shelf. Overview - portal home page.

hinda
Download Presentation

Basic features for portal users

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Basic features for portal users

  2. Agenda - Basic features Overview features and navigation Browsing data Files and Samples Gene Summary pages Performing Analyses on the portal Co-expression, differential expression, GSEA Managing your shelf

  3. Overview - portal home page http://www.humanimmunology.org/cchi

  4. Overview - organization • The portal data is organized around 4 main concepts • Laboratories (aka projects) • Studies (aka experiments) • Data Sets • Files • Access control is organized around • Users • Groups http://www.humanimmunology.org/cchi

  5. Overview - Labs and Studies • Laboratories • Have defined ‘curator’ groups and ‘reader’ groups • Contain zero or more studies • Studies • Represent a collection of data assembled to answer a question • Contain zero or more datasets • ‘Reader’ groups are a subset of their Lab’s reader groups

  6. Overview - Datasets and Files • Datasets • All data is of one type (gene expression, CN, etc) • Multiple datasets of the same type is OK • contain zero or more files • ‘reader’ groups are a subset of their Study’s reader groups • Files • The basic unit of data in the portal • May be any format • unrecognized formats may not be analyzed but may be shared and downloadable

  7. Overview - Laboratories/projects http://www.humanimmunology.org/cchi

  8. Browsing data Sharing Data You can see (and download) any data files you can see Filter data types with the checkboxes on top Page Info At the top of most pages - brief help for the page My Shelf Save datasets to your shelf for later (re)use

  9. Browsing data

  10. Browsing Samples Interactive browser of sample annotations Filter samples based on phenotypic information provided Thumb-scrollers for numeric data

  11. Exercise 1. Browse the portal • Go to the portal in a web browser http://www.humanimmunology.org/cchi • Login/register if needed • Click on the ‘BROWSE’ menu item Then the ‘DATA’ submenu • Uncheck the ‘Sample annotation’ and ‘undefined’ filter checkboxes • Click on the ‘BROWSE’ menu item again • then the ‘SAMPLES’ submenu • Select a dataset to browse • Experiment with filtering options

  12. Gene Summary Pages Provide an overview of the information about a gene Heatmaps showing expression in the datasets that you can see Gene description (from Entrez), links to COSMIC Optional Display summaries of mutations - if any are loaded in the portal Display plot of copy number by expression - requires paired CN and expression samples & linking ids

  13. Gene Search Enter a gene name in the search box on the home page or near the menus Multiple hits indicates multiple species (we’ll make this more explicit in a later version) click

  14. Gene Summary Pages

  15. Exercise 2. Review your favorite gene • Enter a gene name in the search box e.g. EGFR, FGFR3 • Click a gene name on the results page • Review the gene summary page

  16. Performing Analyses The portal is built to allow non-computational biologists to perform many common analyses Look for co-expressed genes Look for differentially expressed genes Look for gene set enrichment Analyses are performed by a GenePattern server using its modules Co-expression -> Gene Neighbors Diff. Expression -> Comparative Marker Selection Gene Set enrichment -> GSEA

  17. Performing Analyses - details Analysis parameter defaults are set by the portal curator These are set portal-wide To change the parameters and/or assumptions, download the data and analyze it in GenePattern directly Detailed descriptions of the analyses, how to run them, and default parameters are available on the help menu Text tutorials for all Video tutorials for some

  18. Performing Analyses - help

  19. Co-expression Find genes with similar gene expression profiles to a particular gene You provide a gene and select a dataset An analysis is launched to detect the 20 most correlated genes in the dataset using Pearson Correlation The analysis displays a heat map This is a java applet, you must tell your browser to ‘allow’ it when asked or you will not see it The heat map viewer can be ‘popped’ out of the browser to allow you to see more detail Menus (on the viewer) provide numerous other options to explore

  20. Co-expression

  21. Co-expression results

  22. Exercise 3. Find co-expressed genes • Go to the portal home page • Select the ‘Analyses’ menu • Select the GeneNeighbors button, click ‘Next step’ • Enter a gene name (e.g. EGFR), click ‘Select Gene Symbol’ • Click the gene name (if needed), click ‘Select Data Set’ • Select ‘YFV_2008…’, click ‘Select Probe’ • Click ‘Run Analysis’

  23. Differential expression This looks for genes whose expression levels vary between 2 conditions Select a dataset, then define 2 classes based on the sample annotations An analysis is launched to detect the 20 top ranked genes in each direction using 2-sided SNR (median) and 1000 permutations The analysis displays a heat map and a table with the genes and their significance This heatmap is just an image, not an applet

  24. Differential expression

  25. Differential expression results

  26. Exercise 4. differentially expressed genes • Go to the portal home page • Select the ‘Analyses’ menu • Select the Comparative Marker Selection button, click ‘Next step’ • Create a Sample Set, Select ‘YFV_2008…’, click ‘Create Sample Set’ • For Class 1, Click ‘Tcell activation’ and the range 0.49-1.6 • For Class 2, Click ‘Tcell activation’ and the range 9-12.1 • Enter a name and description, • Click ‘Run Analysis’ • Open results from ‘My Shelf’ when complete

  27. Gene Set Enrichment Analysis Skeletal muscle biopsies Normal Diabetic • Example: human diabetes • No single gene significant • GSEA was used to assess enrichment of 149 gene sets including 113 pathways from internal curation and GenMAPP, and 36 tightly co-expressed clusters from a compendium of mouse gene expression data. Sometimes no individual genes are significantly differentially expressed We improve statistical power by comparing gene sets These GSEA results appeared in Mootha et al. Nature Genetics 15 June 2003, vol. 34 no. 3 pp 267 – 273:

  28. Enrichment: KS-score Max. Enrichment Score ES Enrichment Score S Gene Set G Phenotype Gene List Order Index Ordered Marker List hit (member of G) miss (non-member of G) • Rank genes according to their “correlation” with the class of interest. • Test if a gene set (e.g., a GO category, a pathway, a different class signature), “enriches” any of the classes. • Use Kolmogorov-Smirnoff score to measure enrichment. Mootha et al., Nature Genetics 2004 Subramanian et al., PNAS 2005

  29. Enrichment: KS-score Enriched Gene Set Un-enriched Gene Set Max. Enrichment Score ES Max. Enrichment Score ES Enrichment Score S Enrichment Score S Gene List Order Index Gene List Order Index Every hit go up by 1/NH Every miss go down by 1/NM The maximum height provides the enrichment score

  30. Performing GSEA • Like differential expression, select a dataset and define classes • GSEA uses the c2 curated gene sets representing metabolic and signaling pathways (http://www.broadinstitute.org/gsea/msigdb)

  31. GSEA Results

  32. Exercise 5. GSEA • Go to the portal home page • Select the ‘Analyses’ menu • Select the GSEA button, click ‘Next step’ • Create a Sample Set, Select ‘YFV_2008…’, click ‘Create Sample Set’ • For Class 1, Click ‘neutralizing antibody titer’ and the range 482-1280 • For Class 2, Click ‘neutralizing antibody titer’ and the range 20-280 • Enter a name and description, • Click ‘Run Analysis’ • Open results from ‘My Shelf’ when complete

  33. Managing ‘My Shelf’ http://www.humanimmunology.org/cchi

  34. Exercise 6. Review your shelf • Click on the ‘My Shelf’ button at the top right • Click on the ‘Analyses’ tab -Review the analyses you did earlier - revisit the results • Click on the ‘Sample Sets’ tab Review the Sample Sets you created for CMS, GSEA • Click on the ‘Profile’ tab Review your email and group memberships

  35. Review of Basic Features • Overview • features and navigation • Browsing data • Files and Samples • Gene Summary pages • Performing Analyses on the portal • Co-expression, differential expression, GSEA • Managing your shelf

More Related