1 / 16

Semantic Cyberinfrastructure for Knowledge and Information Discovery (SCiKID) Proposal

Semantic Cyberinfrastructure for Knowledge and Information Discovery (SCiKID) Proposal. For NSF EAGER Grant. Principle Investigator: Eric Rozell Tetherless World Constellation Rensselaer Polytechnic Institute. Table of Contents. Cover Sheet Information Project Summary Project Description

Download Presentation

Semantic Cyberinfrastructure for Knowledge and Information Discovery (SCiKID) Proposal

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Semantic Cyberinfrastructure for Knowledge and Information Discovery(SCiKID) Proposal For NSF EAGER Grant Principle Investigator: Eric Rozell Tetherless World Constellation Rensselaer Polytechnic Institute

  2. Table of Contents • Cover Sheet Information • Project Summary • Project Description • Unfunded Collaborations • EArly Grants for Exploratory Research(EAGER) “Fitness” • References Cited • Biographical Sketch • Budget Justification • Review

  3. Cover Sheet Information • Awardee: Eric Rozell • Primary Location: Rensselaer Polytechnic Institute • Program: EAGER • Unit of Consideration: NSF Office of Cyberinfrastructure (OCI) • Title: Semantic Cyberinfrastrucutre for Knowledge and Information Discovery (SciKID) • Budget: $300,000 • Duration: 2 years

  4. Project Summary - Overview • Federated search capabilities • Multi-paradigm search tools (e.g., hierarchical, faceted, semantic, etc.) • Multi-disciplinary data discovery platform • Data integration and quality analysis tools

  5. Project Summary – Intellectual Merit • General cyberinfrastructure contribution applicable to many science domains (breadth) • Accelerate scientific discovery • Extensible architecture for data analysis and visualization tools (depth) • Data scientists / informaticists focus on algorithms and tools

  6. Project Summary –Broader Impacts • Applicable to any data-centric science domain • Range of tools supporting experts (e.g., specialized informaticists) to non-experts (e.g., undergraduates) in scientific discovery • Support education on best practices in data pipeline

  7. Project Description - Content • Objectives • To advance the fields of data science and cyberinfrastructure • Significance • Will accelerate scientific discovery in data-centric science domains • Long-term Goals • To provide cyberinfrastructure covering all aspects of the data pipeline enabling the preservation of accessible and reusable data for centuries to come • Related Work • Virtual Observatories (many refs.) • Faceted Browse (many refs.) • Semantic Search (Noesis, other refs.) • Web service discovery (ESIP Discovery Cluster, P2P systems, et al.) • Data provenance (Peter’s DSRC talk)

  8. Project Description - Content • (2 months) Extend S2S to support federated search • (2 months) Investigate search paradigms • E.g., faceted browse, semantic search algorithms, hierarchical search • (4 months) Extend S2S to support variety of search paradigms • (4 months) Investigate data discovery techniques • E.g., service metadata (ontology) models, registry architectures • (2 months) Extend S2S with necessary discovery infrastructure • (6 months) Investigate provenance models that will enable data integration and data QA as presented in Peter’s DSRC talk • (4 months) Extend S2S with more elaborate “data” model

  9. Project Description – Results from Prior Work • No prior NSF funding (as PI / co-PI) • Worked as RA for SeSF? • Results include S2S (see refs.)

  10. Unfunded Collaborations • Will utilize existing connections with… • High Altitude Observatory, NCAR, CO • Geology & Geophysics, WHOI, MA • Physical Oceanography, WHOI, MA • BCO-DMO, WHOI, MA

  11. EAGER “Fitness” • Transformative • Federated search across web service standards • Persistent storage and reuse of user interface components and integration and analysis tools in a web environment • Interdisciplinary • Involving many science domains (for use cases and design input), visualization, web science, data science, software engineering

  12. EAGER “Unfitness” • Not necessarily “Untested” • May work as “regular” NSF proposal

  13. References Cited • S2S Publications • E. Rozell, P. Fox, A. Maffei, S. Zednik (2011), A Framework for Earth Science Search Interface Development, Abstract EGU2011-13413 presented at General Assembly 2011, EGU, Vienna, Austria, 03-08 Apr • E. Rozell, A. Maffei, S. Beaulieu, P. Fox (2010), A Framework for Integrating Oceanographic Data Repositories, Abstract IN23A-1349 presented at 2010 Fall Meeting, AGU, San Francisco, Calif., 13-17 Dec • E. Rozell, P. Fox, A. Maffei, Ontology and Application for Reusable Search Interface Design, to be submitted to Computers & Geosciences • Plus everything needed from the related work

  14. Biographical Sketch • Eric Rozell, Ph.D. Student • Degrees • B.S., Computer Science, RPI • M.S., Management in TC&E*, RPI (in pursuit) • Ph.D., Computer Science, RPI (in pursuit) • Appointments • WHOI Summer Student Fellow • Darrin Fdn. Summer Ugrad. Research Fellow • Collaborators • TWC Professors • WHOI Scientists

  15. Budget Justification • Annual Budget ($150,000 annually) • $30,000 (meager RA salary) • $50,000 (tuition & fees) • $55,000 (jr. software engineer salary) • $10,000 (consulting fees and travel) • $5,000 (hardware) • Cumulative Budget • Above x2

  16. Review • Needs work…

More Related