1 / 49

Interface, Preservation, and Future of Digital Libraries INFO653 Digital Libraries Week 9

Interface, Preservation, and Future of Digital Libraries INFO653 Digital Libraries Week 9. Xia Lin College and Information Science and Technology Drexel University. The Future of Digital Libraries. Depends on Innovative Technologies Better interfaces Better semantics

naasir
Download Presentation

Interface, Preservation, and Future of Digital Libraries INFO653 Digital Libraries Week 9

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Interface, Preservation, and Future of Digital Libraries INFO653 Digital LibrariesWeek 9 Xia Lin College and Information Science and Technology Drexel University

  2. The Future of Digital Libraries • Depends on • Innovative Technologies • Better interfaces • Better semantics • Better repositories and preservation • Innovative services • Life-time learning • Knowledge management • Knowledge discovery

  3. Improving User Interfaces • Go beyond searching & browsing • Go beyond textual information • Making it visual.

  4. Example 1: AquaBrowser

  5. Example 2: Visual Concept Explorer

  6. Example 3: OCLC Dewey Browser

  7. Example 4: AuthorSpace

  8. Digital Preservation • Digital preservation is defined as the managed activities necessary: • For the long term maintenance of a byte stream (including metadata) sufficient to reproduce a suitable facsimile of the original document and • For the continued accessibility of the document contents through time and changing technology.

  9. What are trusted Repositories? • Trusted digital content • Trusted storage media • Trusted system architecture • Trusted organization

  10. Trusted Digital Content • In the traditional environment • Publishers and other professional information providers provide genuine content to libraries. • In the digital environment • What is original, • What is “published,” • What is professional or amateur writing, • Might be blur. • Who provide trusted content to libraries?

  11. Trusted Storage Media • How long does our current storage media last? • The media is changed every five years? • What to do when the data are loss? • Are there any difference between digital preservation and digital collection back ups?

  12. Trusted System Architecture • Digital content depends on a digital repository system. • Each system has its own architecture and data formats/standards. • Its’ quite possible that digital objects are “there” and no systems or technology can read the content out. • This problem is particularly serious with non-text materials such as images, audio, video or other multimedia materials.

  13. Trusted Organizations • If a publisher is gone, books it publisher will still be there. • If an organization is gone, digital repositories maintained by the organization will ……. • the willpower of the institution is a major success factor in digital preservation. • Having a preservation strategy is another. • Activity management is also required.

  14. Solutions? • Trusted digital content? • Trusted storage media? • Trusted system architecture? • Trusted organization?

  15. A New Reading • Research Libraries Group. (2002). Attributes of a trusted digital repository: Meeting the needs of research resources. An RLG-OCLC report.http://www.rlg.org/legacy/longterm/repositories.pdf

  16. Trusted Digital Repositories • accept responsibility for the long-term maintenance of digital resources on behalf of its depositors and for the benefit of current and future users; • have an organizational system that supports not only long-term viability of the • repository, but also the digital information for which it has responsibility; • demonstrate fiscal responsibility and sustainability; • design its system(s) in accordance with commonly accepted conventions and standards to ensure the ongoing management, access, and security of materials deposited within it; • establish methodologies for system evaluation that meet community expectations of trustworthiness; • be depended upon to carry out its long-term responsibilities to depositors and users openly and explicitly; • have policies, practices, and performance that can be audited and measured;

  17. Attributes of Trusted Digital Repository • Compliance with the Reference Model for an Open Archival Information System (OAIS) • Administrative responsibility • Organizational viability • Financial sustainability • Technological and procedural suitability • System security • Procedural accountability

  18. Semantic Digital Libraries • Making Digital Libraries “delicious” • Creating life-long learning environment • Using Digital Libraries for Knowledge Management

  19. Creating Semantics • NKOS • Networked Knowledge Organization Systems • Simile • leverages and extends DSpace • JeromeDL • Digital libraries with semantics

  20. Relationship Models: Ontologies Semantic networksThesauri Strongly-structured Classification &Categorization: Classification schemes TaxonomiesCategorization schemes Subject Headings Metadata-like Models: GazetteersDirectories Authority Files Synonym Rings Weakly-structured Term Lists: Glossaries/Dictionaries Pick lists Natural language Controlled language A Taxonomy of KOS

  21. NKOS • KOS that talks to each others on the Web • Semantic interoperability • Syntactic interoperability • Reusable ontologies, classes, attributes • Ontology registry

  22. SKOS • W3C latest standards (Nov. 2005) • Simple Knowledge Organization System • provides a model for expressing the basic structure and content of concept schemes such as thesauri, classification schemes, subject heading lists, taxonomies, 'folksonomies', other types of controlled vocabulary.

  23. Simile • A open source project ioined conducted by MIT and W3C to • Extend DSpace and enhance its support to metadata • enhance inter-operability among digital assets, schemata/vocabularies/ontologies, metadata, and services.

  24. JeromeDL

  25. JeromeDL • searching and browsing, • users' profile management • resources management. • AND • enable communication with other digital library systems • utilize results of latest results in Semantic Web and communication and information management research.

  26. Making Libraries Delicious • Folksonomy • Social classification • Social Tagging • Social bookmarking

  27. Del.icio.us

  28. Folksonomies • Folksonomies: Cooperative Classification and communication through shared metadata • How does it work? • Metadata for the masses • Minimum barries to use; litle cognitve costs • Serendipity • Good interfaces • Lack of structures • “the lack of hierarchy, synonym control and semantic precision are precisely why it works” • Do it for yourself; benefit for the community; • Values come from aggregration.

  29. Making Libraries more delicious • Use social bookmaking in digital libraries • Allow user-centered tagging • Promote resource sharing • Encourage self-service publishing • Foster collaboration

  30. Creating a Learning Environment • Search is not enough • Information space for learning and intelligent work • Knowledge space for collaboration

  31. DL: Search is not Enough • What do users do with Digital Libraries? • find information?

  32. Library use is part of a larger task context

  33. Beyond Searching and Browsing • Next level of DL Applications • Documents/terms Clustering • Documents/terms Categorization • Thesaurus/Ontology Generation • Knowledge Discovery • Information Extraction • Document Summarizing • Question Answering

  34. Information Extraction • analyze unrestricted text in order to extract specific types of information. • Convert unstructured to structured • Convert unstructured text documents into codified database entries. • Semantic analysis, from word recognition, to sentence analysis, to understanding at the sentence level on up to discourse analysis at the level of the full text document.

  35. Example: Salvadoran President-elect Alfredo Cristiania condemned the terrorist killing of Attorney General Roberto Garcia Alvarado and accused the Farabundo Marti Natinal Liberation Front (FMLN) of the crime. … Garcia Alvarado, 56, was killed when a bomb placed by urban guerillas on his vehicle exploded as it came to a halt at an intersection in downtown San Salvador. … According to the police and Garcia Alvarado’s driver, who escaped unscathed, the attorney general was traveling with two bodyguards. One of them was injured.

  36. Extraction results: • Incident: Date 19 Apr 89 • Incident: Location El Salvador: San Salvador • Incident: Type Bombing • Perpetrator: Individual ID “urban guerillas” • Perpetrator: Organization ID “FMLN” • Human Target: Name “Roberto Garcia Alvarado” • … ...

  37. Text Mining • Information Retrieval • Find information in documents • Match information needs of the user to information in documents • Text Mining • Find information not in any individual document but in a collection of documents. • Seek new information that wasn't previously known to anyone.

  38. Knowledge Discovery • Most famous example: • Don Swanson explored medical literature that were logically related yet not cited each other. • His explorations led to hypotheses for causes of rare diseases, which were later confirmed by experimental evidence.

  39. Question Answering • TREC-Q&A Track • answer questions by selecting sentences from the documents in which the answers occur. • use semantic networks and computational lexicology.

  40. Question & Answering • Capture, organize and index experts’ answers to previous questions. • Match new questions from stored answers whatever possible. • Match new questions to the right experts

  41. Knowledge Management • Are Digital Libraries a KM tool? • Or a Content Management tool? • Or an information system tool? • One difference • CM deals with tangible knowledge • KM includes intangible knowledge • The difference is blurry • There is intangible knowledge in Digital lbiraries

  42. A final reading … • Why Information Technology Inspired But Cannot Deliver Knowledge Management • By Richard McDermott • California Management Review, 41(4) 103-117, Summer 1999.

  43. Avoid Creating Information Junkyards • Creating an information system without understanding what knowledge professionals needed, or the form and level of detail they needed, did little to leverage knowledge.

  44. Combine Human and Information systems • Leveraging knowledge involves a unique combination of human and information systems. • Knowledge is a human act. • Knowledge belongs to communities. • New knowledge is created at the boundaries of old.

  45. Example • The art of professionals practice is to turn information into solutions. • To know a city is to know its streets, • Not as a list of street names or a map • But as a set of sights and routes useful for different purposes.

  46. Implications for Leveraging knowledge • To leveraging knowledge, develop communities. • Create forums for thinking as well as systems for sharing information. • Use the community’s terms for organizing knowledge. • Let the community decide what to share and how to share it.

More Related