1 / 26

A Linear-Algebraic Technique with an Application in Semantic Image Retrieval

A Linear-Algebraic Technique with an Application in Semantic Image Retrieval. Jonathon S. Hare, Paul H. Lewis 2006,CIVR. Outline. Introduction Using Linear-Algebra to Associate Image and Terms Method Decomposing the Observation Matrix Using the Terms as a Basis for New Documents

alima
Download Presentation

A Linear-Algebraic Technique with an Application in Semantic Image Retrieval

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. A Linear-Algebraic Technique with an Application in Semantic Image Retrieval Jonathon S. Hare, Paul H. Lewis 2006,CIVR

  2. Outline • Introduction • Using Linear-Algebra to Associate Image and Terms • Method • Decomposing the Observation Matrix • Using the Terms as a Basis for New Documents • Experimental Result

  3. Introduction • Automatic annotation of image

  4. Outline • Introduction • Using Linear-Algebra to Associate Image and Terms • Method • Decomposing the Observation Matrix • Using the Terms as a Basis for New Documents • Experimental Result

  5. Using Linear-Algebra to Associate Image and Terms • Any document can be described by a series of observations , or measurements. • We refer to each of these observations as terms. • Don’t explicitly annotate images.

  6. Using Linear-Algebra to Associate Image and Terms Given a corpus of n documents, it is possible to form a matrix of m observations or measurements (i.e. a term-document matrix). This m × n observation matrix, O, essentially represents a combination of terms and documents, and can be factored into a separate term matrix, T, and document matrix, D:

  7. Outline • Introduction • Using Linear-Algebra to Associate Image and Terms • Method • Decomposing the Observation Matrix • Using the Terms as a Basis for New Documents • Experimental Result

  8. Decomposing the Observation Matrix • Lemma 1(The rank principle for a noise-free term-document matrix) Without noise, the observation matrix ,O, has a rank at most equal to the number of independent terms or documents observed. • In reality, the observation term-document matrix is not at all noise free.

  9. Decomposing the Observation Matrix • The observation matrix, O can be decomposed using SVD into a m × r matrix U,a r × r diagonal matrix Σ and a r × n matrix VT , O = UΣVT, such that UTU = VVT = VTV = I, where I is the identity matrix.

  10. Decomposing the Observation Matrix • Assume O∗is the ideal, noise-free observation matrix, with k independent terms. Now partitioning the U, Σ andVTmatrices as follows:

  11. Decomposing the Observation Matrix • Lemma 2 (The rank principle for a noisy term-document matrix). All of the information about the terms and documents in O is encoded in its k largest singular values together with the corresponding left and right eigenvectors.

  12. Decomposing the Observation Matrix • define the estimated noise-free term matrix, ,and document matrix, . • ,where represents the estimated noise-free observation matrix.

  13. Decomposing the Observation Matrix • The rows of the term matrix create a basis representing a position in the space of each of the observed terms. • The columns of the document matrix represent positions of the observed documents in the space.

  14. Outline • Introduction • Using Linear-Algebra to Associate Image and Terms • Method • Decomposing the Observation Matrix • Using the Terms as a Basis for New Documents • Experimental Result

  15. Using the Terms as a Basis for New Documents • Theorem 1 (Projection of partially observed measurements). The term-matrix of a decomposed fully-observed measurement matrix can be used to project a partially observed measurement matrix into a document matrix that encapsulates estimates of the unobserved terms.

  16. Using the Terms as a Basis for New Documents • To project a new partially observed measurement matrix into a basis created from a fully observed training matrix, we need only pre-multiply the new observation matrix by the transpose of the training term matrix.

  17. Using the Terms as a Basis for New Documents • In order to query the document set for documents relevant to a term, we just need to rank all of the documents based on their position in the space with respect to the position of the query term in the space. • The cosine similarity is a suitable measure for this task.

  18. Outline • Introduction • Using Linear-Algebra to Associate Image and Terms • Method • Decomposing the Observation Matrix • Using the Terms as a Basis for New Documents • Experimental Result

  19. Experimental Result

  20. Experimental Result • we chose a value of k = 100 for the following • experiments.

  21. Experimental Result

  22. Experimental Result

  23. Experimental Result

  24. Experimental Result • we use a much simpler feature: a 64-bin • global RGB histogram.

  25. Experimental Result

  26. Conclusions • This paper presented a novel approach. • The method does not produce equal performance for all queries. • The featuresare not enough. • The training data are not enough.

More Related