
Discovering Similar Multidimensional Trajectories



  1. Discovering Similar Multidimensional Trajectories ICDE, San Jose, CA, 2002

  2. Outline • Introduction • Similarity Measures • Compute the Similarity • Indexing Trajectories • Experimental Evaluation • Related Work • Conclusion

  3. Introduction • The trajectory of a moving object is typically modeled as a sequence of consecutive locations in a multidimensional Euclidean space • An appropriate and efficient model for defining the similarity of trajectory data is therefore important for the quality of data analysis tasks.

  4. Examples of 2D trajectories

  5. Hierarchical clustering of 2D series (displayed as 1D for clarity)

  6. Similarity Measures • Let A and B be two trajectories of moving objects with sizes n and m respectively, where A = ((a_{x,1}, a_{y,1}), …, (a_{x,n}, a_{y,n})) and B = ((b_{x,1}, b_{y,1}), …, (b_{x,m}, b_{y,m})). For a trajectory A, let Head(A) be the sequence Head(A) = ((a_{x,1}, a_{y,1}), …, (a_{x,n-1}, a_{y,n-1}))

  7. Definition 1 • Given an integer δ and a real number 0 < ε < 1, we define LCSS_{δ,ε}(A, B) as follows:
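The recursion itself appeared as an image on the original slide; a reconstruction in LaTeX, following the standard LCSS formulation of Vlachos et al. (2002), is:

$$
LCSS_{\delta,\varepsilon}(A,B) =
\begin{cases}
0 & \text{if } A \text{ or } B \text{ is empty},\\
1 + LCSS_{\delta,\varepsilon}(Head(A), Head(B)) & \text{if } |a_{x,n} - b_{x,m}| < \varepsilon,\ |a_{y,n} - b_{y,m}| < \varepsilon \text{ and } |n - m| \le \delta,\\
\max\!\big(LCSS_{\delta,\varepsilon}(Head(A), B),\ LCSS_{\delta,\varepsilon}(A, Head(B))\big) & \text{otherwise.}
\end{cases}
$$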

  8. Definition 2 • We define the similarity function S1 between two trajectories A and B, given δ and ε, as follows:
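The formula (an image on the slide) normalizes the LCSS by the length of the shorter trajectory; following the paper:

$$ S1(\delta, \varepsilon, A, B) = \frac{LCSS_{\delta,\varepsilon}(A, B)}{\min(n, m)} $$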

  9. A region of δ & ε for a trajectory

  10. Definition 3 • Given δ, ε and the family F of translations, we define the similarity function S2 between two trajectories A and B as follows:
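S2 takes the best S1 score over all translations of B; following the paper, with f_{c,d} denoting translation by (c, d):

$$ S2(\delta, \varepsilon, A, B) = \max_{f_{c,d} \in F} S1\big(\delta, \varepsilon, A, f_{c,d}(B)\big), \quad \text{where } f_{c,d}(B) = ((b_{x,1}+c,\ b_{y,1}+d), \ldots, (b_{x,m}+c,\ b_{y,m}+d)). $$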

  11. Translation of trajectory B

  12. Definition 4 • Given δ, ε and two trajectories A and B, we define the following distance functions:
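The distance functions (shown as an image on the slide) are simply the complements of the two similarity functions:

$$ D1(\delta, \varepsilon, A, B) = 1 - S1(\delta, \varepsilon, A, B), \qquad D2(\delta, \varepsilon, A, B) = 1 - S2(\delta, \varepsilon, A, B). $$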

  13. Compute the Similarity • Similarity function S1 • Given two trajectories A and B, with |A| = n and |B| = m, we can find LCSS_{δ,ε}(A, B) in O(δ(n + m)) time. • Similarity function S2 • Given two trajectories A and B, with |A| = n and |B| = m, we can compute S2(δ, ε, A, B) in O((n + m)³δ³) time.
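A minimal runnable sketch of the LCSS/S1 computation (my own illustration, not the authors' code): it fills the full O(nm) table for clarity, allowing a match only when two points are within ε on each axis and within δ in time; restricting the computation to the band |i − j| ≤ δ is what yields the O(δ(n + m)) bound quoted above.

def lcss(A, B, delta, eps):
    # A, B: lists of (x, y) points; delta: time-matching window; eps: spatial threshold
    n, m = len(A), len(B)
    L = [[0] * (m + 1) for _ in range(n + 1)]  # L[i][j] = LCSS of first i points of A, first j of B
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            (ax, ay), (bx, by) = A[i - 1], B[j - 1]
            # a match requires closeness in both coordinates and in time index
            if abs(ax - bx) < eps and abs(ay - by) < eps and abs(i - j) <= delta:
                L[i][j] = L[i - 1][j - 1] + 1
            else:
                L[i][j] = max(L[i - 1][j], L[i][j - 1])
    return L[n][m]

def s1(A, B, delta, eps):
    # Definition 2: normalize by the length of the shorter trajectory
    return lcss(A, B, delta, eps) / min(len(A), len(B))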

  14. Approximate Algorithm

  15. Indexing Structure • For every node C of the tree we store the medoid (M_C) of each cluster. The medoid is the trajectory that has the minimum distance (or maximum LCSS) to every other trajectory in the cluster:
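One way to write the medoid selection, consistent with the slide's description (the notation here is mine, not the slide's):

$$ M_C = \arg\max_{o \in C} \sum_{p \in C} LCSS_{\delta,\varepsilon}(o, p) \quad \text{(equivalently, the trajectory minimizing the total distance to the rest of the cluster).} $$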

  16. Time and Accuracy Experiments • Similarity values and running times from the SEALS dataset

  17. Experiment 1 - Video tracking data

  18. Experiment 2 & 3 - Australian Sign Language Dataset (ASL)

  19. Evaluating the indexing technique

  20. Related Work • Approaches that use a p-norm distance to define the similarity measure [2, 37, 18, 14, 10, 32, 10, 20, 24, 23] • Approaches based on the time warping technique [5, 25, 28, 33] • Approaches that find the longest common subsequence (LCSS) of two sequences [3, 7, 11] • Approaches that define time series similarity based on extracting certain features [13, 17, 29, 31]

  21. Conclusion • Efficient techniques to accurately compute the similarity between trajectories • Approximate algorithms with provable performance bounds • An efficient index structure

  22. Comments • Good approach for similarity queries • Use real GPS trajectory data? • …

  23. Dynamic Time Warping • Sequences are similar but accelerate differently along the time axis • Enforcing a temporal constraint δ on the warping window size improves computation efficiency and accuracy • Application: Speech recognition (Berndt and Clifford, 1996)
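A minimal sketch (my own illustration, for 1D series) of DTW with a warping window of size δ: cells outside the band are left at infinity, which is what makes the constrained computation cheaper.

import math

def dtw(C, Q, delta):
    # C, Q: 1D numeric sequences; requires |len(C) - len(Q)| <= delta to stay inside the band
    n, m = len(C), len(Q)
    D = [[math.inf] * (m + 1) for _ in range(n + 1)]
    D[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(max(1, i - delta), min(m, i + delta) + 1):  # warping-window constraint
            cost = abs(C[i - 1] - Q[j - 1])
            D[i][j] = cost + min(D[i - 1][j], D[i][j - 1], D[i - 1][j - 1])
    return D[n][m]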

  24. Longest Common Subsequence Similarity • Match two sequences by allowing some elements to be left unmatched • Example: C = {1,2,3,4,5,1,7} and Q = {2,5,4,5,3,1,8}; the longest common subsequence is {2,4,5,1} • Application: Bioinformatics • (The slide also shows the filled LCS dynamic-programming table for C and Q.) Vlachos et al., 2002

  25. Longest Common Subsequence Similarity • Input sequences C[1..m] and Q[1..n] • Compute the LCS between C[1..i] and Q[1..j] for all 1 ≤ i ≤ m and 1 ≤ j ≤ n, and store it in L[i,j] • L[m,n] = length of the LCS

for i := 1..m
  for j := 1..n
    if C[i] = Q[j]
      L[i,j] := L[i-1,j-1] + 1
    else
      L[i,j] := max(L[i,j-1], L[i-1,j])
return L[m,n]

(The slide also shows the filled table for the example; boundary entries L[i,0] and L[0,j] are 0.) Vlachos et al., 2002
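As a check on the example from slide 24, a small runnable version (my own sketch) of the same recurrence:

def lcs_length(C, Q):
    m, n = len(C), len(Q)
    L = [[0] * (n + 1) for _ in range(m + 1)]  # boundary: L[i][0] = L[0][j] = 0
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            if C[i - 1] == Q[j - 1]:
                L[i][j] = L[i - 1][j - 1] + 1
            else:
                L[i][j] = max(L[i][j - 1], L[i - 1][j])
    return L[m][n]

# For C = {1,2,3,4,5,1,7} and Q = {2,5,4,5,3,1,8} this returns 4,
# matching the longest common subsequence {2,4,5,1} from slide 24.
print(lcs_length([1, 2, 3, 4, 5, 1, 7], [2, 5, 4, 5, 3, 1, 8]))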
