90 likes | 171 Views
Explore the use of similarity search to enhance community building by identifying interesting objects or persons in archive content. This approach aims to address interoperability challenges and improve retrieval in learning environments.
E N D
Community Support Based on Thematic Objects and Similarity Search N. Pinkwart, N. Malzahn, D. Westheide, H.U. Hoppe
Basic Idea • Computational tools used in learning contexts are highly heterogeneous – role of the computer differs • Frequently: digital media as a means for sharing and exchanging knowledge • Shared resources offer potential for community building • Approach: Based on archive content, determine interesting objects or persons
Existing tools/techniques • Metadata standards • Address interoperabilty problems • Often manual creation neccessary • Does not solve naviagation and retrieval issues • Recommender Systems • Manual rating needed • Critical mass of participants/artifacts needed • Domain model needed for a lot of algorithms
Proposed Approach Avoid problems with traditionalrecommendation algorithms – Use similar objects instead! What is similar? • Depends on the document content • Depends on the current learning context
Similarity Search Metadata based similarity search • Manual creation of metadata is time consuming:Semi automatic (tool embedded) generation of metadata from the task context • Retrieval often requires rather complicated forms: Use current document as a first search template (associative lookups) • Standard Metadata is too generic:Extension of standard metadata to cover specific needs of the particular domain. Make use of ontologies to allow for higher-level access
Personal Computer: << component>>:Learning Object Repoitory << component>>:Cool Modes << component>>:ArchiveService Server: Java Method calls << component>>:WebService :SearchStrategy :SimilarityModel SQL Architecture sketch SOAP
Similarity Search • Current state • Based on simple search strategies / similarity measures (boolean retrieval of keywords simple hit counting) • Promising results despite simple approach • Future Work • More elaborated (e.g. ontology based) retrieval and measures taking into account the structure of the document • Larger and distributed test cases