60 likes | 141 Views
This system annotates microarray probes using databases like Entrez Gene and UniGene for cancer research. It automatically updates annotations, allows web-based and API queries, and aims for caGRID compliance. Utilizes caCORE SDK for code generation. Data sources include UniSTS, Gene Ontology, dbSNP, and more. Limitations include slow PubMed curation, non-optimized complex searches, and lack of meta-data storage in caArray.
E N D
caFE Server Siteman Cancer Center
Introduction • Annotate probes on microarrays using publicly available biomedical databases • Automatically update annotations on a regular basis • Web-based and API based query search • Uses caCORE SDK (version 1.0.3) generated code • Working towards caGRID compliance
Data Sources • Entrez Gene • UniGene • Gene Ontology • dbSNP • UniSTS • HomoloGene • PubMed
Annotation Interlinks Microarrays caArray Custom Entrez Gene UniGene GenBank Gene ID UniGene ClusterID Accession Number HomoloGene dbSNP PubMed Gene Ontology Orthologus genes RefSNP ID Pubmed relationship GO terms
Limitations • Initial run of PubMed data curation takes 2-3 months • APIs generated using caCORE SDK • Not optimized for complex searches • Performing searches that involves complex computations. • For example, searching for literature correlation with depth 2 or more • Meta-information of caArray is not stored