1 / 24

The DRIVER initiative for networking repositories

The DRIVER initiative for networking repositories. Wolfram Horstmann Universität Bielefeld. DRIVER motivation. Scholarly communication changes towards distributed provision of text, data and services Repositories are thought as a saviour in this development building such a distributed system

reba
Download Presentation

The DRIVER initiative for networking repositories

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. The DRIVER initiative for networking repositories Wolfram Horstmann Universität Bielefeld

  2. DRIVER motivation Scholarly communication changes towards distributed provision of text, data and services Repositories are thought as a saviour in this development building such a distributed system An infrastructure supporting distributed repositories and services is needed (needs explanation) (and reactions)

  3. Some observations on repositories They represent a shift towards … open internet-exposure as opposed to closed database (‚graveyards‘) content orientation as opposed to mere technical orientation (‚web-servers‘) distributed systems centralized structures not immediateley required nowadays

  4. „Everybody can be a publisher“ Common description standards e.g. Dublin Core Metadata Initiative Many subject-specific standards Common transfer protocols e.g. OAI-PMH, but also FTP, XML-RPC, WS, etc. Searchability is possible! Still: many results are lost to re-use/remix Closed: too sensible, weakly described, unimportant (???) Missing service frameworks / infrastructures Problems: Data and service interoperability Solution: „Infrastructure“ Repositories can solve access problem

  5. What infrastructures are: DRIVER terms Not an infrastructure Single repository Single application for search and retrieval (e.g. BASE) Only local operation Backwards causation on repositories is missing Maybe an infrastructure Distributed repository landscape as a whole As a capacity for emergent properties, e.g. quality and quantity incentive for data population Nurturing development of service providers Definitely an infrastructure Many service providers in one organisational and technical context (e.g. run-time environment) Enabling re-use and remix of data and services

  6. DRIVER Objectives Organisational structure for repositories e.g. the „Confederation“ Improving quality and standards in local rep. e.g. validation procedures Building a distributed runtime system e.g. service and data sharing Target Groups Repository Managers Service Providers Information System Executives

  7. The DRIVER approach is incremental Start with publication metadata Existing distributed system, somehow connected Considerable homogeneity and formats: OAI-PMH Extend geographical coverage From 5 countries, to 10, to 27, to ??? Extend towards other contents From publication metadata to enhanced publications, i.e. representations of „texts + data“ Learn about subject specificity Data bring in disciplinary requirements

  8. The DRIVER Initiative 8 DRIVER-I 6/2006 – 11/2007 • Organisational Models and Technical Test-Bed DRIVER-II 12/2007 – 11/2009 • Running Organisation and Production Infrastructure DRIVER-Confederation 2010ff • Operations Office and Technical Deployment NB: DRIVER is not an authoritative body, it is a liberal bottom-up initiative of stakeholders

  9. DRIVER partners and related projects Networking, Support, Policy, Studies Göttingen, Nottingham, SURF, Genth, Ljubiljana, Minho, Copenhagen Technical development and deployment Athens, Bielefeld, Pisa, Warsaw Partners make links to many other things OA-services: Sherpa-ROMEO, OpenDOAR, BASE… Projects: Europeana, PEER, DELOS, DL.org, D4Science, PARSE-Insight, NESTOR… Orgs: DINI, JISC, LIBER, SPARC, KE … Platforms: DSPACE/FEDORA/OPUS/ePrints

  10. Some Results: Studies

  11. Some Results: A Portal

  12. Some Results: A Search

  13. Some Results: Repository Registration

  14. Some Results: Guidelines • Build on knowledge from past & current IR projects (EU) • 26 actively involved contributors (experts and repository managers) from 8 countries. • Practical answers on how to: • Improve full-text access • Standardize metadata quality • Create a reliable infrastructure for permanent identification, resolution, traceability and storage • Resolve semantic and classification issues

  15. Some Results: Support structures

  16. Some Results: Repositories 185+ harvested repositories 21 countries 856,264+ documents

  17. Some Results: Service-Oriented-Arch. 9 hosting nodes 25+ Functionality typologies (services) 36 service Instances 3 applications: DRIVER Main, Belgium, Spain-Recolecta

  18. Some Results: Runtime-System & Hosting National portals Advanced User Interfaces Project Applications End users Functionality Layer EU Open Access Repositories Data Layer Administrators Enabling Layer 18

  19. Some Results: A software Meant for large service providers only!

  20. Current Work: DRIVER-II Networking Confederation with who-is-who advisory board Outreach: LIBER, SPARC, US, JAPAN etc… Consolidation DRIVER-I Services packaged and performing in production quality Enhancement DRIVER-I Services Improved indexing and data aggregation functionalities DRIVER-II Services Enhanced publication management and functionality

  21. Lessons learnt Distributed data infrastructure requires links between organisational and technical concepts Data specialists, computer scientists, service providers Guidelines / content policies as a „glue“ In distributed data provision, quality and access measures are the most ‚expensive‘ tasks Distributed service operation (not data provision) can be solved but asks novel questions (SLAs) „Infrastructure“ is a very tough concept to get across and eventually forms a complex system Simplification makes it weaker, e.g. re-use is restricted

  22. Summary DRIVER tackles the data infrastructure challenge from the text-repository side (mostly OAI-PMH) DRIVER handshakes with primary & secondary data through „enhanced publications“ DRIVER isn‘t only a project but a forum for information specialists ‚Products‘ include: Studies, Infrastructure run-time-system in production, software, support … DRIVER has adressed many problems for data and service interoperability and found solutions

  23. Agenda today 09.00 – 09.15 DRIVER Overview Wolfram Horstmann 9.15 - 9.30 DRIVER Guidelines Friedrich Summann 9.30 – 09.45 Mentor service Mary Robinson 09.45 – 10.55 DRIVER Infrastructure Paolo Manghi & Natalia Manola 10.55 – 11.10 DRIVER Confederation Dale Peters 11.10 – 11.30 Discussion & Wrap-Up Dale Peters and Wolfram Horstmann

  24. Thanks

More Related