eXtract: A Snippet Generation System for XML Search. Yu Huang, Ziyang Liu, Yi Chen Arizona State University . http://eXtract.asu.edu. Motivation: . Good snippets help users to easily judge the relevance and find desired results. Problem: How to generate good snippets for XML search?.
eXtract: A Snippet Generation System for XML Search
Yu Huang, Ziyang Liu, Yi Chen
Arizona State University
Good snippets help users to easily judge the relevance and find desired results.
Problem: How to generate good snippets for XML search?
No existing work on XML snippet generation yet.
Contributions: eXtract - the first system on snippet generation for XML search[Huang et al, SIGMOD ’08]
Challenge: What are good snippets?
Challenge: What information in result is significant to achieve the properties?
Solution: Designed an algorithm to generate IList
Solution: Identified desirable properties
Challenge: How to select instances in the result when generating a snippet to maximally cover IList within a size bound?
Solution: Designed an efficient and effective algorithm that generates good snippets from IList
retailer apparel Texas
(of size 11)
Find the apparel retailers in Texas.
A Query Result
Features and their occurrences
Dominance score (DS): DS (Houston) = 2/(3/2) = 1.33, DS (children) = 53/(300/3) = 0.53
IList : Texas, apparel, retailer, store, Brook Brothers, outwear, suit, casual, men
34th International Conference on Very Large Data Bases, August 23th-28th, 2008, Auckland, New Zealand