1 / 37

Mining Interesting Locations and Travel Sequences from GPS Trajectories

Mining Interesting Locations and Travel Sequences from GPS Trajectories. Outline. Abstract Introduction System Overview Tree-Based Hierarchical Graph Location Interest Inference Experiment. Abstract. Object:

llavine
Download Presentation

Mining Interesting Locations and Travel Sequences from GPS Trajectories

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Mining Interesting Locations and Travel Sequences from GPS Trajectories

  2. Outline • Abstract • Introduction • System Overview • Tree-Based Hierarchical Graph • Location Interest Inference • Experiment

  3. Abstract • Object: • Mine interesting locations and classical travel sequences in a given geospatial region • How: • Model multiple individuals' location histories with a tree-based hierarchical graph (TBHG) • HITS (Hypertext Induced Topic Search)-based inference model • Mine classical travel sequences • Evaluate: • Dataset: 107 users, over a period of one year, GPS data

  4. Outline • Abstract • Introduction • System Overview • Tree-Based Hierarchical Graph • Location Interest Inference • Experiment

  5. Introduction • The increasing pervasiveness of location-acquisition technologies such as GPS, GSM network, etc., is leading to the collection of large spatio-temporal datasets and the opportunity of discovering valuable knowledge about movement behavior. • GPS data related community have appeared on the Internet . Users can visualized and manage their GPS trajectories on a Web map. Further, they can obtain reference knowledge from sharing experiences. • However, almost all of these applications still directly use raw GPS data, like coordinates and time stamps, without much understanding. Facing such a large dataset in a community, it is impossible for a user to browse each GPS trajectory one by one.

  6. Introduction (cont.) • Object • Mine interesting locations and classical travel sequences in a given geospatial region • Challenge • The interest of a location does not only depend on the number of users visiting this location but also lie in these users' travel experience. • An individual's travel experience and interest of a location are relative values (i.e., it is not reasonable to judge whether or not a location is interesting), and are region-related. (i.e., conditioned by the given geospatial region).

  7. Introduction (cont.) • Done in this paper: • Mine the top n interesting locations and the top m classical travel sequences in a given geospatial region. • Infer the most k experienced users in a geo-related community. • Contributions: • Tree-based hierarchical graph (TBHG) model multiple users' travel sequences. • HITS-based model to infer users’ travel experiences and interest of a location within a region.

  8. Related Work • Mining individual location history • detecting significant locations • predicting movement • recognizing user-specific activities at each location (行为方式识别) • Mining multiple user's location histories • mine similar sequences • recognize social pattern

  9. Outline • Abstract • Introduction • System Overview • Tree-Based Hierarchical Graph • Location Interest Inference • Experiment

  10. System Overview

  11. System Overview (cont.) • GPS data: • GPS log, GPS trajectory • A stay point s stands for a geographic region where a user stayed over a certain time interval. • Location history: an individual's location history (LocH) is represented as a sequence of stay points they visited with corresponding arrival and leaving times.

  12. Outline • Abstract • Introduction • System Overview • Tree-Based Hierarchical Graph • Location Interest Inference • Experiment

  13. TBHG • Tree-based hierarchical graph (TBHG) model multiple users' travel sequences. • Reason: the location histories of various people are inconsistent and incomparable as the stay points pertaining to different individuals are not identical.

  14. TBHG (cont.) • Constructing the TBHG • Formulate a tree-based Hierarchy H: hierarchically cluster the detected user's stay points. • Build graphs G on each level: connect the clusters of the same level based on the tree-based hierarchy H and user's location histories. • Tree-Based Hierarchy H: H is a collection of stay point-based clusters C with a hierarchy structure L. • Tree-Based Hierarchical Graph (TBHG): A TBHG is the integration of H and G, TBHG=(H, G).

  15. Modeling Location History

  16. Outline • Abstract • Introduction • System Overview • Tree-Based Hierarchical Graph • Location Interest Inference • Experiment

  17. HITS-based inference model • HITS stands for hypertext induced topic search, which is a search-query-dependent ranking algorithm for Web information retrieval. • A page's authority score is the sum of the hub scores of the pages it points to, and its hub score is the sum of the hub scores of the pages it points to, and its hub score is the integration of authority scores of the pages pointed to by it. • Using a power iteration method, the authority and hub scores of each page can be calculated.

  18. HITS-based inference model (cont.) Algorithm: Compute authority and hub values G := set of pages for each page p in G do p.auth = 1 // p.auth is the authority score of the page p p.hub = 1 // p.hub is the hub score of the page p function HubsAndAuthorities(G) for step from 1 to k do // run the algorithm for k steps for each page p in G do // update all authority values first for each page q in p.incomingNeighbors do p.auth += q.hub for each page p in G do // then update all hub values for each page r in p.outgoingNeighbors do p.hub += r.auth

  19. HITS-based inference model (cont.) • Similar to HITS, a hub is a user who has accessed many places, and an authority is a location which has been visited by many users. Therefore, users' travel experiences (hub scores) and the interests of locations (authority scores) have a mutual reinforcement relationship. • Location Interest: The interest of a location is represented by a collection of authority scores. • User Travel Experience: A user's travel experience is represented by a set of hub scores.

  20. HITS-based inference model (cont.) • Strategy for Data Selection: A location might have multiple authority scores based on the different region scales it falls in. Also, a user might have multiple hub scores conditioned by the regions of different clusters. • Speedup: Thus, calculate in advance the interest of every location using the regions specified by ascendant clusters.

  21. HITS-based inference model (cont.) • Advantages: • The strategy, which sets multiple hub scores for a user and multiple authority scores for a location, has the following two advantages: • First, we are able to leverage the main strength of HITS to rank locations and users with the contexts of geospatial region (query topic). • Second, these hub and authority scores can be calculated offline. Therefore, we can ensure the efficiency of our system while allowing users specify any regions on a map.

  22. HITS-based inference model (cont.) • Compute Score: • Adjacent matrix M: between users and locations based on the users' access on these locations. In this matrix, item stands for the times that user has visited to that cluster. • Then, the mutual reinforcement relationship of user travel experience and location interest is represented as :

  23. HITS-based inference model (cont.)

  24. Mining Classical Travel Sequences • The classical score of a sequence is the integration of the following three aspects. • 1) The sum of hub scores of the users who have taken this sequence. • 2) The authority scores of the locations contained in this sequence. • 3) These authority scores are weighted based on the probability that people would take a specific sequence. • For instance:

  25. Mining Classical Travel Sequences (cont.)

  26. Outline • Abstract • Introduction • System Overview • Tree-Based Hierarchical Graph • Location Interest Inference • Experiment

  27. Experiment • 107 users (49 females and 58 males), 2007.05 - 2008.10

  28. Experiment (cont.) • GPS logs: 166,372 KM; GPS points: 5,081,369

  29. Experiment (cont.) • Data Processing: • Stay point detection: threshold are 20 minutes and 200 meters. Extracted 10,354 stay points. • Clustering:

  30. Experiment (cont.) • Framework of the Evaluation

  31. Experiment (cont.) • In this study, 29 subjects (14 females and 15 males), who have been in Beijing for more than 6 years, were invited to answer the evaluation questions. Using the region specified by the fourth ring road of Beijing, we retrieved the top 10 interesting locations and the top 10 classical travel sequences based on a variety of approaches, including our methods and some baselines. • Baseline: • interesting locations: rank-by-count and rank-by-frequency • classical travel sequences: rank-by-count, rank-by-interests and rank-by-experience

  32. Experiment (cont.) • Measurements • MAP (Mean Average Precision): In this experiment, it stands for the mean of the precision score after each interesting location is retrieved. Here, a location is deemed as an interesting location if its interest level equals to 2. • For instance, the MAP of an interest rating vector, G = <2, 0, 2, 0, 1, 0, 0, 0, -1>, for the top 10 location, is computed as follows. • MAP = (1 + 2/3 + 3/8)/3 = 0.681

  33. Experiment (cont.) • Measurements • nDCG (normalized discounted cumulative gain)

  34. Experiment (cont.) • Results: • Presentation ability (mean score of ratings)

  35. Experiment (cont.) • Results: • Ranking ability • Results Related to Classical Sequences

  36. Experiment (cont.) • There is more : Correlation between locations and users

  37. The End • Question?

More Related