1 / 8

Database Issues for the Web - Dasfaa 2003 Panel -

Database Issues for the Web - Dasfaa 2003 Panel -. Dik Lun Lee Department of Computer Science Hong Kong University of Science and Technology. Characteristics of the Web. Publication of information by: Businesses, large and small Individuals

Download Presentation

Database Issues for the Web - Dasfaa 2003 Panel -

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Database Issues for the Web- Dasfaa 2003 Panel - Dik Lun Lee Department of Computer Science Hong Kong University of Science and Technology

  2. Characteristics of the Web • Publication of information by: • Businesses, large and small • Individuals • No global rules or structures: structures and rules are spontaneously created by web authors • Result: we can only discover, not control, the web

  3. Why is Google so Successful • Exploit collective opinions of web authors • Use link analysis to identify web pages of high authority and/or quality • Use anchor texts as an indicator for web page theme • Both are created by human authors • Large scale ensures robustness Human-created index/knowledge Sub-consciously created Works only under large scale

  4. topics topics Beyond Google • Web communities • Topic distillation

  5. Research Issues: mining meaning • Derivation of concise and precise web communities • Link and content analysis • Integration of multiple sources of “intelligence”: Click-stream, search engine log, human-created directories: Open Directory and ontological databases • Creation of multiple hierarchies between communities • Topic drifting: restructuring of communities and their corresponding topics; migration of “drifted” pages to another community • Implementation: centralized versus peer-to-peer

  6. Research Issues: Mobile Web • Modeling of physical space to support, e.g., semantic nearest neighbors (time and space) • I am at the Kansei airport now, lead me to the platform for the Kyoto-bound train • Fine-grain, step by step instructions • Continuous monitoring to ensure I am at the right place at the right time • Processing nearest-neighbor, k-nn, window and continuous queries • Indexing of locations of data objects • Specialized index for NN and k-NN queries • Caching: validation of answers as user moves along • Connection: point-to-point, broadcast, continuous

  7. Research Issues: Wireless Web • Wireless mesh networks: nodes are connected wirelessly; peer-to-peer network with limited reach • Applications: sensor networks, low-cost substitution of 3G networks • Dissemination of large volume of data • How to announce availability of new data or updates? • Data broadcast; message hopping across the network • Filter on-the-air • Finding information on the network • Limited bandwidth and batter power: continuous active querying is discouraged

  8. Mobile/wireless web Summary Ubiquitous information access Personalization, query tracking Clustering, term suggestion, query tracking google, link analysis Ubiquitous information publication

More Related