320 likes | 339 Views
An open-source semantic web application that allows for the discovery of research and scholarship across institutions. Populated with detailed profiles of faculty and researchers, with powerful search functionality.
 
                
                E N D
VIVO Researcher Networking Update April 5, 2011 1-2 p.m. Leslie McIntosh Vivo National Evaluator Washington University Ellen J. Cramer Special Projects Lead Cornell University Jonathan Corson-Rikert Vivo Development Lead Cornell University
University of Florida Mike Conlon (VIVO and UF PI) Beth Auten Chris Barnes Cecilia Botero Kerry Britt Erin Brooks Amy Buhler Ellie Bushhousen Linda Butson Chris Case Christine Cogar Valrie Davis Mary Edwards Nita Ferree Rolando Garcia-Milan George Hack Chris Haines Sara Henning Rae Jesano Margeaux Johnson Meghan Latorre Yang Li Paula Markes Hannah Norton Narayan Raum Alexander Rockwell Sara Russell Gonzalez Nancy Schaefer Dale Scheppler Nicholas Skaggs Syraj Syed Matthew Tedder Michele R. Tennant Alicia Turner Stephen Williams Indiana University Katy Borner (IU PI) Kavitha Chandrasekar Bin Chen Shanshan Chen Ryan Cobine Jeni Coffey Suresh Deivasigamani Ying Ding Russell Duhon Jon Dunn Poornima Gopinath Julie Hardesty Brian Keese Namrata Lele Micah Linnemeier Nianli Ma Robert H. McDonald Asik Pradhan Gongaju Mark Price Michael Stamper Yuyin Sun Chintan Tank Alan Walsh Brian Wheeler Feng Wu Angela Zoss VIVO Collaboration Weill Cornell Medical College Curtis Cole (Weill PI) Paul Albert Victor Brodsky Mark Bronnimann Adam Cheriff Oscar Cruz Dan Dickinson Richard Hu Chris Huang ItayKlaz Kenneth Lee Peter Michelini Grace Migliorisi John Ruffing Jason Specland Tru Tran VinayVarughese Virgil Wong Washington University School of Medicine in St. Louis RakeshNagarajan (WUSTL PI) Kristi L. Holmes CaerieHouchins George Joseph Sunita B. Koul Jasmine Owens Leslie D. McIntosh Cornell University Dean Krafft (Cornell PI) ManoloBevia Jim Blake Nick Cappadona Brian Caruso EllyCramer MedhaDevare Elizabeth Hines Huda Khan Brian Lowe Joseph McEnerney Holly Mistlebauer Stella Mitchell AnupSawant Christopher Westling Tim Worrall Rebecca Younes Jon Corson-Rikert The Scripps Research Institute Gerald Joyce (Scripps PI) Catherine Dunn Brant Kelley Paula King Angela Murrell Barbara Noble Cary Thomas Michaeleen Trimarchi Ponce School of Medicine Richard J. Noel, Jr. (Ponce PI) Ricardo Espada Colon Damaris Torres Cruz Michael Vega Negrón This project is funded by the National Institutes of Health, U24 RR029822 "VIVO: Enabling National Networking of Scientists”
An open-source semantic web application that enables the discovery of research and scholarship across disciplines in an institution. Populated with detailed profiles of faculty and researchers; displaying items such as publications, teaching, service, and professional affiliations. A powerful search functionality for locating people and information within or across institutions.
Participating Institutions • National Network Team • University of Florida, Gainesville, FL • Cornell University, Ithaca, NY • Indiana University, Bloomington, IN • The Scripps Research Institute, La Jolla, CA
Get the Data Manage the Data Who owns the data now? Do you need to create a data management system? How will you refresh your data? How often? • Who owns the data? • Where are the data sources? • What permissions do you need to use the data? Your data are only as good as the source.
Contribute to the Community More to open-source than contributing code • Data • Documentation • IRC communication • Listservs • Lessons learned vivoweb.org vivo.sourceforge.net
VIVO Cornell: In-house to National Cloud 2003-2007 Development of research profiles using ontologies in a database-driven website to meet the needs of the Life Sciences initiative. 2007 Converted to Semantic Web standards. Expanded to include disciplines across the institution 2007–2011+ With NIH grant, moved to national and international network of institutions and organizations and their faculty and researcher profiles
Local Outreach • Provost Office - institutional support • Data providers – HR, Annual faculty reporting, Grants, Courses, Other • Librarian VIVO liaisons -subject areas • Web developers - repurposing of data • Department editors - training
Networking Other sites piloting or adopting VIVO technology Arizona State University, Duke University, IICA, Los Alamos National Laboratory, Northwestern University, Stony Brook University, University of Arkansas, University of Buffalo, University of Colorado – Boulder, University of Delaware, University of Oregon, University of Virginia, USDA Integration partners APA (Digital Trust), Duke (Widgets), Harvard University (Harvard Profiles), Indiana University (HUBzero), Orchid, Stony Brook University (UMLS), University of Hong Kong (Knowledge Exchange), University of Pittsburgh (Digital Vita), Weill Cornell Medical College (Google Refine). • International efforts • ANDS-Vitro Consortium (Griffith, QUT, University of Melbourne, VeRSI) • Chinese Academy of Sciences • IICA (Inter-American Institute for Cooperation on Agriculture) is • considering options like VIVO for a researcher network for their SIDALC • Application and there is a pilot VIVO implementation at the El Colegio de Postgraduadosof Mexico.
VIVO update part III • VIVO core design principles • Enhancements during the NIH grant • Planned development • VIVO at web scale • Mini-grants and collaborations • Building community and sustainability
First, it’s about data • Consistent formatting, in a language of the Web • Self-describing • Ontology • Context inherent in the data • Distributed • De-referenceable • Reusable without (or with) modification • Persistent independently of any application
VIVO is not just people or profiles • Anything can be a type (and have individuals) • All individuals have the same structure • Varying attributes & relationships • Inheritance • Extend the ontology without modifying the app • Tradeoffs of generality vs. optimal interface
Highlights of recent improvements VIVO Core Ontology Linked Open Data Harvester Visualizations Application HR data theming grants Drupal importer navigation eagle-i research resources Pubmed scalability MVC structure page templates external authentication self-editing
Deliverables by August, 2011 VIVO Core Ontology Linked Open Data Harvester Visualizations Application more pub formats Map of Science theming Bioportal submission Drupal importer GeoMap navigation national grant data scalability MVC structure Search-related functionalities page templates aggregator software linking between VIVOs external authentication self-editing RDF to Solr indexer local/ national search UI role-based authorization
“National” search • NIH mandated no reliance on sustained centralized infrastructure • Aggregation of RDF from multiple sources • Harvard Profiles, Collexis, and likely others • Solr indexing leveraging the VIVO ontology • Aggregator and indexing will be configurable to harvest any desired set of sources
National networking & search Scripps VIVO UF VIVO WashU VIVO IU VIVO Other VIVOs Other CTSA VIVOs Ponce VIVO Harvard Profiles RDF VIVO aggregator triple store Cornell Ithaca VIVO Other VIVOs Future CTSA triple store Solr search index Weill Cornell VIVO Other RDF Future state or regional triple store Future CTSA Solr index VIVO national network search future Solr index Other RDF Linked Open Data
VIVO at web scale • Connections directly between VIVOs • Multiple campuses of 1 institution • Multiple institutions within a consortium • Data resides & served from home institution • Individuals linked by URI or common identifier • Updates via linked data harvesting or pingback
As the linked data cloud grows • Search enhanced by authoritative, structured, and updated data • Retrieval and filtering by type & relationship, not just text • Enables better data mining and analysis • Reduces reporting burden • Unique semantic advantages • Categorization implicit in defined ontologies • Common references to shared terminologies • ORCID and other initiatives leading to common references to individuals
Community development • VIVOweb.org • VIVO on sourceforge • Fully open source (BSD license) • Subversion repository – download or check out • Active development and implementation mail lists & forums • Installation and upgrade documentation • Wiki-based documentation effort • Supplemental materials • Many ways to contribute and benefit
Mini-grants address key areas • Controlled vocabularies (Stony Brook) • Author IDs and disambiguation (ORCID) • Widgets to re-use VIVO data in standard web pages (Duke) • Direct output to biosketchesand CVs (Pittsburgh) • Connection to the HUBzero scientific simulation and grid services platform, via Joomla CMS (IU) • Google Refine for data cleanup and export (Weill Cornell)
Community collaborations • ORCID • Connections to institutional repositories, as other libraries implement VIVO • Library of Congress support for Exhibit API with VIVO as one target • Dataset metadata discovery and registry work, with Australian VIVO consortium
Questions yet to address • What access points and services need to be provided for national (or international) research networking to succeed? • How will people be able to integrate this data into their daily workflow and research process? • How will boundaries between public and private data and services work? • Federating group privileges as well as identities across multiple VIVOs and to other research-enabling tools