1 / 47

CORE: Aggregating, Enriching and Reusing Open Access

CORE: Aggregating, Enriching and Reusing Open Access. Petr Knoth Knowledge Media institute The Open University. Outline. Aggregating Open Access (OA) publications W hy agregate and who is it for The added value of aggregations The CORE system

marv
Download Presentation

CORE: Aggregating, Enriching and Reusing Open Access

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. CORE: Aggregating, Enriching and Reusing Open Access Petr Knoth Knowledge Media institute The Open University

  2. Outline Aggregating Open Access (OA) publications Why agregate and who is it for The added value of aggregations The CORE system Supporting research in mining databases of scientific publications (DiggiCORE)

  3. Outline Aggregating Open Access (OA) publications Why agregate and who is it for The added value of aggregations The CORE system Supporting research in mining databases of scientific publications (DiggiCORE)

  4. The rapid rise of OA articles The graph (from Laasko and Bjork's paper - BMC Medicine 2012, 10:124) shows the numbers of papers published in three different types of online open access journals from 2000 to 2011.

  5. Growth of items in Open Access repositories

  6. Growth of Open Access repositories

  7. Why we need aggregations? “Each individual repository is of limited value for research: the real power of Open Access lies in the possibility of connecting and tying together repositories, which is why we need interoperability. In order to create a seamless layer of content through connected repositories from around the world, Open Access relies on interoperability, the ability for systems to communicate with each other and pass information back and forth in a usable format. Interoperability allows us to exploit today's computational power so that we can aggregate, data mine, create new tools and services, and generate new knowledge from repository content.’’ [COAR manifesto]

  8. Access to information according to the level of abstraction Analytical information access Repository Repository Repository Interfaces OLTP OLAP Aggregation Transaction information access Semantic Enrichment Metadata Content Metadata Transfer Interoperability Raw data access

  9. Who should be supported by aggregations? The following users groups (divided according to the level of abstraction of information they need): Raw data access. Transaction information access. Analytical information access.

  10. Who should be supported by aggregations? The following users groups (divided according to the level of abstraction of information they need): Raw data access. Developers, DLs, DL researchers, companies … Transaction information access. Researchers, students, life-long learners … Analytical information access. Funders, government, bussiness intelligence …

  11. Existing aggregation systems

  12. BASE Interfaces Analytical information access Repository Repository Repository OLTP OLAP Enrichment Aggregation Transaction information access Metadata Content Metadata Transfer Interoperability Raw data access

  13. OCLC WorldCAT Interfaces Analytical information access Repository Repository Repository OLTP OLAP Enrichment Aggregation Transaction information access Metadata Content Metadata Transfer Interoperability Raw data access

  14. The power of full-text aggregations (WorldCat vs CORE)

  15. RepUK Interfaces Analytical information access Repository Repository Repository OLTP OLAP Enrichment Aggregation Transaction information access Metadata Content Metadata Transfer Interoperability Raw data access

  16. RepUK

  17. Aggregations need access to content, not just metadata! Certain metadata types can be created only at the level of the aggregation Certain metadata can be changing in time Ensuring content: accessibility availability validity quality …

  18. CiteSeerX (computer science) Interfaces Analytical information access Repository Repository Repository OLTP OLAP Enrichment Aggregation Transaction information access Metadata Content Metadata Transfer Interoperability Raw data access

  19. Should an aggregation system support all three access levels? Can be realised by more than one system providing that the dataset is the same!

  20. The problem of result transparency Google Scholar Microsoft Academic Search

  21. Outline Aggregating Open Access (OA) publications – why, how, what for? The CORE system Supporting research in mining databases of scientific publications (DiggiCORE)

  22. CORE objective CORE aims to provide a comprehensive technical infrastructure for Open Access scholarly publications that will support access and reuse of scholarly materials at different levels of abstraction.

  23. CORE functionality Content harvesting, processing

  24. CORE functionality Semantic enrichment

  25. CORE functionality Providing services

  26. What does CORE provide at different access levels? Repository Analytics Interfaces Analytical information access Repository Repository Repository OLTP OLAP CORE Portal, CORE Mobile, CORE Plugin Enrichment Aggregation Transaction information access Metadata Content Metadata Transfer Interoperability CORE API CORE API Raw data access

  27. CORE Applications CORE Portal – Allows searching and navigating scientific publications aggregated from Open Access repositories

  28. CORE Applications CORE Mobile – Allows searching and navigating scientific publications aggregated from Open Access repositories

  29. CORE Plugin – A plugin to system that recommendations for related items. CORE Applications

  30. CORE Applications Repository Analytics – is an analytical tool supporting providers of open access content (in particular repository managers).

  31. CORE Applications CORE API – Enables external systems and services to interact with the CORE repository. Search service Pdf and plain text service Similarity service Classification service Citation service

  32. CORE Applications CORE API registered users: British Education Index Cottagelabs UKCORR Europeana ULCC Library, The Open University Los Alamos National Laboratory, USA University of Manchester Library Universidad de los Andes. Bogotá, Colombia UNESCO

  33. Outline Aggregating Open Access (OA) publications – why, how, what for? The CORE system Supporting research in mining databases of scientific publications ( )

  34. Partners Advisory Board

  35. Objective Software for exploration and analysis of very large and fast-growing amounts of research publications stored across Open Access Repositories (OAR).

  36. DiggiCORE networks Three networks: (a) semantically related papers, (b) citation network, (c) author citation network

  37. DiggiCORE objectives Allow researchers to use this platform to analyse publications. Why? To identifying patterns in the behaviour of research communities To detect trends in research disciplines To gain new insights into the citation behaviour of researchers To discover features that distinguish papers with high impact

  38. Questions the system can help answering? What are the attributes of impact publications? Do these attributes differ in the humanities, social sciences and computer sciences? What are the features of research groups within disciplines and how do these features relate to contributions generated by the group? What are the attributes of high-impact authors and what is their role within the group? What are the dynamics of successful research groups?

  39. Questions the system can help answering? What is the mechanism of cross-fertilisation within disciplines, especially between the humanities and the sciences? Who are the authors whose work is worth monitoring because they contribute to the achievements of their own discipline and also inspire other disciplines? How should the novice in the discipline get acquainted with key achievements in the discipline? How should he/she search for the most important publications?

  40. Summary Aggregations should serve the needs of different user groups. We need to aggregate content, not just metadata. Machine access to publications provides lots of new opportunities. We can have many services that are part of the infrastructure, but should work with the same data. CORE (and DiggiCORE) aims to prepare the way for innovative open access services demonstrate the benefits of programmable access to publications

  41. Acknowledgement This work has been partly funded by JISC and AHRC. Contributors to the CORE software: Vojtech Robotka, Magdalena Krygielova, Drahomira Herrmannova, Tomas Korec, Jakub Novotny, Ian Tindle, Harriett Cornish, Gabriela Pavel, Markus Muhr, Loukas Anastasiou, Colin Smith, Chris Yates Participants of CORE Advisory Boards and other CORE related meetings Owen Stephens, Bill Hubbard, Andy McGregor, Stuart Dempster, Andreas Juffinger, Markus Muhr, Jan Molendijk, Paul Walk, Chris Yates, Chris Biggs, Non Scantlebury

More Related