Managing Unstructured Data. AnHai Doan University of Wisconsin-Madison. Unstructured Data. Appears in many forms emails, Web pages, memos, call center text record, etc. Is pervasive 80% of the world data, and is growing Managed by many players
Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.
University of Wisconsin-Madison
We should work on it, or risk missing the boat!
But what sets us apart from the above guys?
DB + IR + IE + II, in a best-effort, Web 2.0 fashion
Best-effort, pay-as-you-go, improving over time
Scale up to huge data (by running over clusters)