eXtract: A Snippet Generation System for XML Search
http://eXtract.asu.edu
Yu Huang, Ziyang Liu, Yi Chen
Arizona State University
Motivation: Good snippets help users easily judge relevance and find the desired results.
Problem: How to generate good snippets for XML search?
No prior work addresses XML snippet generation.
Contributions: eXtract, the first system for snippet generation in XML search [Huang et al., SIGMOD ’08]
Challenge: What are good snippets?
Solution: Identified desirable properties of snippets
Challenge: What information in the result is significant for achieving these properties?
Solution: Designed an algorithm to generate the IList
Challenge: How to select instances in the result so that the snippet covers as much of the IList as possible within a size bound?
Solution: Designed an efficient and effective algorithm that generates good snippets from the IList
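The instance-selection challenge above can be illustrated with a small greedy sketch. This is an illustrative heuristic under stated assumptions, not necessarily the algorithm used in eXtract: the result is assumed to be a tree given as a hypothetical `parent` map, each node carries one text label, and the snippet size is counted in tree edges. For each IList item in order, the sketch picks the instance that adds the fewest new edges to the snippet so far and keeps it only if the size bound still holds.

```python
from typing import Dict, List, Optional, Set


def path_to_root(node: int, parent: Dict[int, Optional[int]]) -> List[int]:
    """Return the nodes from `node` up to the root, inclusive."""
    path = []
    cur: Optional[int] = node
    while cur is not None:
        path.append(cur)
        cur = parent[cur]
    return path


def greedy_snippet(parent: Dict[int, Optional[int]],
                   label: Dict[int, str],
                   ilist: List[str],
                   size_bound: int) -> Set[int]:
    """Greedily pick instances of IList items (hypothetical helper).

    The snippet is a connected subtree containing the root; its size
    is its number of edges (nodes - 1).
    """
    root = next(n for n, p in parent.items() if p is None)
    snippet: Set[int] = {root}
    for item in ilist:
        best: Optional[List[int]] = None
        for node, text in label.items():
            if text != item:
                continue
            # Nodes that would have to be added to connect this instance.
            new_nodes = [n for n in path_to_root(node, parent)
                         if n not in snippet]
            if best is None or len(new_nodes) < len(best):
                best = new_nodes
        # Each new node contributes exactly one new edge.
        if best is not None and len(snippet) - 1 + len(best) <= size_bound:
            snippet.update(best)
    return snippet


# Toy result tree (made up for illustration, not the poster's example).
parent = {0: None, 1: 0, 2: 0, 3: 0, 4: 3, 5: 3, 6: 5}
label = {0: "retailer", 1: "Texas", 2: "apparel", 3: "store",
         4: "Houston", 5: "clothes", 6: "suit"}
ilist = ["Texas", "apparel", "retailer", "store", "suit"]
print(sorted(greedy_snippet(parent, label, ilist, size_bound=3)))
```

Processing the IList in order means higher-ranked items are covered first, so a tight size bound drops the lower-ranked items rather than the query keywords.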
Query: retailer apparel Texas
(of size 11)
Find the apparel retailers in Texas.
A Query Result
Features and their occurrences
Dominance score (DS): DS(Houston) = 2/(3/2) ≈ 1.33, DS(children) = 53/(300/3) = 0.53
IList: Texas, apparel, retailer, store, Brook Brothers, outwear, suit, casual, men
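The dominance score arithmetic above can be reproduced with a short sketch. The parameter names are assumptions inferred from the two worked numbers: the numerator is read as the number of occurrences of the feature value, and the denominator as the average number of occurrences per distinct value of the same feature type (total occurrences divided by the number of distinct values).

```python
def dominance_score(occurrences: int, type_total: int,
                    distinct_values: int) -> float:
    """Occurrences of one value divided by the average occurrences
    per distinct value of its feature type (assumed reading of DS)."""
    return occurrences / (type_total / distinct_values)


# Houston: 2 occurrences, 3 occurrences of its type, 2 distinct values.
print(round(dominance_score(2, 3, 2), 2))
# children: 53 occurrences, 300 occurrences of its type, 3 distinct values.
print(round(dominance_score(53, 300, 3), 2))
```

Under this reading, a score above 1 means the value occurs more often than the average value of its type, and a score below 1 means it occurs less often.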
34th International Conference on Very Large Data Bases, August 23-28, 2008, Auckland, New Zealand