Tree-based indexing methods for similarity search in metric and nonmetric spaces
Department of Software Engineering
Faculty of Mathematics and Physics
Charles University in Prague
Mgr. Jakub Lokoč
Supervisor: Doc. RNDr. Tomáš Skopal, Ph.D.
[Figure: similarity search pipeline with the stages query object, feature extraction, and similarity evaluation]
[Figure: range query for query object Q in Euclidean 2D space over objects O1 to O11, with the processing STACK shown]
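The range query illustrated above can be sketched in a few lines. This is a minimal sketch only, assuming a small hand-built hierarchy of ball regions (each with a center and a covering radius) that is traversed with an explicit stack. The names `Node` and `range_query`, the tree shape, and all coordinates are hypothetical and are not taken from the slides. It is not necessarily the index structure used in the presentation.

```python
import math
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical ball region: every object below lies within `radius` of `center`."""
    center: tuple
    radius: float
    children: list = field(default_factory=list)  # child regions (inner node)
    objects: list = field(default_factory=list)   # (name, point) pairs (leaf node)


def range_query(root, q, r):
    """Return names of all objects whose Euclidean distance to q is at most r."""
    result = []
    stack = [root]  # regions still waiting to be examined
    while stack:
        node = stack.pop()
        # Triangle inequality: if d(q, center) > r + radius, no object in
        # this region can lie within r of q, so the region is skipped.
        if math.dist(q, node.center) > r + node.radius:
            continue
        for name, point in node.objects:
            if math.dist(q, point) <= r:
                result.append(name)
        stack.extend(node.children)
    return sorted(result)


# Illustrative data only (coordinates are invented for this sketch).
leaf_a = Node((1.0, 1.0), 1.5, objects=[("O1", (0.0, 0.0)), ("O2", (1.0, 2.0)), ("O3", (2.0, 1.0))])
leaf_b = Node((6.0, 6.0), 1.5, objects=[("O4", (5.0, 6.0)), ("O5", (7.0, 7.0)), ("O6", (6.0, 5.0))])
root = Node((3.5, 3.5), 5.0, children=[leaf_a, leaf_b])

print(range_query(root, (1.5, 1.5), 1.0))
```

With the query at (1.5, 1.5) and radius 1.0, the second leaf region is pruned without computing any distance to its objects, which is the point of indexing when distance computations are expensive.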
CoPhIR (color layout and structure), dimension 76, database size 250,000
1. Aggregation
2. Parallel batch loading
3. Traditional inserting
Objects not inserted:
“Split-generating” objects: inserted in the traditional way (exploiting limited parallelism)
Postponed objects: inserted during the next batch
CoPhIR, database size 1,000,000
Dimension 76 (12 + 64)
L5.123456 distance
Inner / leaf node size: 24 / 25
Cache size: 512 MB
Identity
Non-negativity
Symmetry
Triangle inequality
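The four metric postulates listed above can be checked empirically on a finite sample. The following is a minimal sketch, assuming points are tuples and the distance is an ordinary Python function. The name `violated_postulates` and the sample points are hypothetical. A check of this kind can only expose a violation on the given sample. It cannot prove that a distance is a metric.

```python
import itertools
import math


def violated_postulates(d, sample, tol=1e-9):
    """Return the set of metric postulates that d violates on the sample."""
    violated = set()
    for x, y in itertools.product(sample, repeat=2):
        dxy = d(x, y)
        if dxy < -tol:
            violated.add("non-negativity")
        # Identity: d(x, y) = 0 exactly when x = y.
        if (x == y) != (abs(dxy) <= tol):
            violated.add("identity")
        if abs(dxy - d(y, x)) > tol:
            violated.add("symmetry")
    for x, y, z in itertools.product(sample, repeat=3):
        if d(x, z) > d(x, y) + d(y, z) + tol:
            violated.add("triangle inequality")
    return violated


def squared_euclidean(a, b):
    """A simple nonmetric distance: squared Euclidean distance."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


sample = [(0, 0), (1, 0), (2, 0), (0, 1)]
print(violated_postulates(math.dist, sample))          # Euclidean distance
print(violated_postulates(squared_euclidean, sample))  # squared Euclidean
```

On this sample the Euclidean distance passes all four checks, while the squared Euclidean distance fails only the triangle inequality: the distance from (0, 0) to (2, 0) is 4, but the path through (1, 0) costs 1 + 1 = 2.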
[Figure: two example 2NN (two nearest neighbors) queries, each of the form 2NN(query object) = {first neighbor, second neighbor}]
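A kNN query such as the 2NN examples above returns the k database objects closest to the query object. The following is a minimal sketch using a sequential scan rather than an index, assuming 2D points and Euclidean distance. The name `knn_query` and the data are hypothetical.

```python
import heapq
import math


def knn_query(db, q, k, dist=math.dist):
    """Return the names of the k objects in db that are closest to q."""
    nearest = heapq.nsmallest(k, db, key=lambda item: dist(q, item[1]))
    return [name for name, _ in nearest]


# Illustrative data only: (name, point) pairs.
db = [("O1", (0.0, 0.0)), ("O2", (1.0, 2.0)), ("O3", (2.0, 1.0)), ("O4", (5.0, 6.0))]

print(knn_query(db, (0.0, 0.5), 2))
```

A tree-based index aims to return the same answer while computing fewer distances than this scan.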
