Chris Abernathy – Oak Ridge National Laboratory
Aaron Myers & Phil Nugent
Oak Ridge National Laboratory Bioenergy Program
Sun Grant/DOE Regional Feedstock Partnership Report and Planning Meeting
San Antonio, Texas
February 23-24, 2010
ORNL KDF Development Team
• Principal Investigator
  • Budhendra Bhaduri
• Project Coordination
  • Amy Rose
• Spatial Data Integration and Analysis
  • Aaron Myers (Lead), Phil Nugent, Cheng Liu, Steve Peterson, Sujit Das, Shani Mammuni, Alex Sorokine
• SGI Data Model and Interface
  • Phil Nugent (Lead), Chris Abernathy
• Database Management and Server Development
  • Aaron Myers (Lead), Ray Flannery, Paul Edwards, Eric Axley, Robert Stewart
Bioenergy Knowledge Discovery Framework
• Strategy:
  • Design and develop a robust geospatial technology framework that
    • Offers guidance and access to comprehensive data, modeling, and visualization resources
    • Is standards based
• Benefit:
  • Provide ready access to the status of bioenergy in the US
  • Distribute data needed for assessments
  • Support the emerging bioindustry by providing an integrative perspective
  • Support policymaking by visualizing the outcomes of proposed policies
  • Define where research or demonstration funds should be targeted
  • Foster education and outreach
• Goal:
  • Provide analytical and visualization capabilities for efficient planning, development, and management of the US bioenergy infrastructure
  • Allow analysis, synthesis, and visualization of data that facilitates informed decision making
• Outcome:
  • A national decision support framework for addressing the overall sustainability and security of bioenergy infrastructure
Current State of the KDF
• User Account Requests and Management
• Data and Knowledge Base Development
  • Almost 1400 Datasets
  • 75 Models
  • 115 Journal Articles
• Data Upload and Download Functionality
• Attribute Search and Spatial Query Capabilities
• Dynamic Data Visualization
• Integration of Transportation Routing Analysis
• SGI Data Management Upgrade
FY10 Goals
• Enhanced Security Capabilities
• Advanced Spatial Query Capabilities
• Targeted and Dynamic Data Acquisition
• Advanced Analysis and Reporting
• Map Sharing and Collaboration
• Implement Data Quality Assurance and User Feedback
• High Resolution Imagery Integration
KDF Beyond FY10
• Continue Integration of Multi-Disciplinary Databases
• Customizable Dashboard and Reporting Components
• Tagging of Datasets for Dynamic Organization Conforming to Multiple Data Models
• Knowledge Discovery Tool Development and Implementation
  • Document Analysis (text)
  • Spatio-Temporal Data Mining and Visualization
SGI Data Upload
• Purpose
  • Data submission
    • Fulfills OBP requirement to submit data
    • Replaces SharePoint as data upload portal
  • Data validation for inclusion in data model
  • Internal analysis and visualization
• SGI data will remain separate from rest of KDF data
• Upload Demonstration
Chris Abernathy
Oak Ridge National Laboratory Bioenergy Program
Sun Grant/DOE Regional Feedstock Partnership Report and Planning Meeting
San Antonio, Texas
February 23-24, 2010
2009 Management Meeting – Data & Reporting Decisions and Results
“Within 3 months after the end of the growing season, the PIs will provide field level data to the SharePoint in FY09; and to the KDF in FY10 and beyond”
What was accomplished in 2009:
• Data folders were created by region and by species to allow for data upload by the PIs.
• Data security is provided through SharePoint and the permissions granted on the data folders.
2009 Management Meeting – Data & Reporting Decisions and Results
“Within 3 months after the end of the growing season, the PIs will provide field level data to the SharePoint in FY09; and to the KDF in FY10 and beyond”
What was accomplished in 2009 (cont.):
• PIs have access to their individual data folders, Crop Coordinators have access to their species folders, and Team Leads have access to team member folders.
• Several trials did not generate data because they required an establishment period of at least one growing season.
• For annual species and CRP trials, data submissions are occurring as soon as circumstances allow (weather, field conditions, manpower, funding).
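The folder access rule on this slide can be sketched as a small permission check. This is only an illustration under stated assumptions, not the SharePoint or KDF implementation: the `Folder` and `User` types, the role names, and `can_access` are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Folder:
    region: str
    species: str
    owner: str  # the PI who uploads to this folder (hypothetical field)


@dataclass(frozen=True)
class User:
    name: str
    role: str  # assumed role names: "pi", "crop_coordinator", "team_lead"
    species: frozenset = frozenset()  # species a Crop Coordinator covers
    team: frozenset = frozenset()     # PI names on a Team Lead's team


def can_access(user: User, folder: Folder) -> bool:
    """Apply the three access rules described on the slide."""
    if user.role == "pi":
        # PIs see only their individual data folders.
        return folder.owner == user.name
    if user.role == "crop_coordinator":
        # Crop Coordinators see every folder for their species.
        return folder.species in user.species
    if user.role == "team_lead":
        # Team Leads see the folders of their team members.
        return folder.owner in user.team
    return False


# Example data: the names below are made up for illustration.
folder = Folder(region="Southeast", species="switchgrass", owner="pi_a")
pi_a = User(name="pi_a", role="pi")
pi_b = User(name="pi_b", role="pi")
coordinator = User(name="coord", role="crop_coordinator",
                   species=frozenset({"switchgrass"}))
lead = User(name="lead", role="team_lead", team=frozenset({"pi_a"}))
```

With these definitions, `can_access(pi_a, folder)` is true while `can_access(pi_b, folder)` is false, which mirrors the "individual data folders" rule.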
2009 Management Meeting – Data & Reporting Decisions and Results
“A universal data template would be provided by ORNL to record and enter data for upload into the SharePoint and eventually the KDF”
What was accomplished in 2009:
• The universal template proved to be unwieldy and not very user-friendly, with too many unused fields.
• Crop-specific templates were developed (with input from the Crop Leads) to standardize the data being collected.
• The literature review template was also changed (with input from the Team Lead) to allow for bibliographic references and to provide for the data most commonly found in the literature.
2009 Management Meeting – Data & Reporting Decisions and Results
“OBP and Laboratory staff will determine what data and variables from the PIs will be included in a national database and used for national analyses”
What was accomplished in 2009:
• The crop data templates were created to reflect the statements of work and actual field data collection.
• A subset of data variables in the crop templates was chosen for inclusion in the aggregated SGI database.
• The data model for the national database was developed by KDF team members.
2009 Management Meeting – Data & Reporting Decisions and Results
“OBP and Laboratory staff will determine what data and variables from the PIs will be included in a national database and used for national analyses”
What was accomplished in 2009 (cont.):
• The KDF team developed a data validation process to ensure consistency.
• The KDF team also developed a means of populating the KDF database directly from the template.
• In addition, the original data file will always be available for review.
2010 Management Meeting Decisions to be Made
Three issues for discussion:
- Data Access
- Data Attribution
- Data Synthesis Reports
2010 Management Meeting – Data & Reporting Decisions to be Made
“How long can/should data remain ‘hidden’ in the KDF for PI publication before it is made public?”
• Currently, this is up to the data contributor to determine. No mandate exists to make unpublished data available after a certain time period expires.
• Some suggestions have been made for a pre-determined amount of time (5 years) that the data can reside on the KDF before it becomes available for public access regardless of publication.
Discussion...
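As a discussion aid, the suggested fixed embargo can be written as simple date arithmetic. This is a sketch only: the 5-year period is the suggestion quoted on the slide, not an adopted policy, and the function names and the choice to count from the upload date are assumptions.

```python
from datetime import date

# Suggested embargo length from the slide; no mandate currently exists.
EMBARGO_YEARS = 5


def public_release_date(uploaded: date, years: int = EMBARGO_YEARS) -> date:
    """Return the date on which data uploaded on `uploaded` would become public."""
    try:
        return uploaded.replace(year=uploaded.year + years)
    except ValueError:
        # Uploaded on Feb 29 and the target year is not a leap year.
        return uploaded.replace(year=uploaded.year + years, day=28)


def is_public(uploaded: date, today: date, years: int = EMBARGO_YEARS) -> bool:
    """True once the embargo has expired, regardless of publication status."""
    return today >= public_release_date(uploaded, years)
```

Whether the clock should start at upload, at the end of the growing season, or at the end of the trial is one of the points the discussion would need to settle; the sketch simply takes a start date as input.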
2010 Management Meeting – Data & Reporting Decisions to be Made
“How will data be attributed to field trial PIs when used in the future via the KDF?”
Before the data is available to the public but is needed by DOE/Labs and PIs:
• For the foreseeable future, DOE and the Labs will be the only users of the data when conducting their regional and national analyses. Appropriate attribution will be provided.
Once the data is available to the public:
• There is no mechanism available to “force” external data users to cite data that is accessed from the KDF. Data attribution is ultimately the responsibility of the user.
Discussion...
2010 Management Meeting – Data & Reporting Decisions to be Made
“Field trial PIs will contribute to species-level synthesis reports (led by crop team leads) and regional-level synthesis reports (led by regional GIS leads) as authors”
• Will these synthesis reports use data other than that funded via the DOE Regional Feedstock Partnership? If so, how will this be addressed?
  • Yes, if the PI and the Crop and GIS leads agree.
• What is the timeline for developing these synthesis reports?
  • Possibly in the fifth year of the trial.
Discussion...