Experiments on Using Semantic Distances Between Words in Image Caption Retrieval

Experiments on Using Semantic Distances Between Words in Image Caption Retrieval Alan F. Smeaton and Ian Quigley School of Computer Applications Dublin City University Presenter: Cosmin Adrian Bejan

IR implementation - traditional approach • Represent: • a user query = a bag of query terms • document = a bag of index terms • Compute: • a degree of similarity between a document and a query based on the overlap or number of query terms in common between them.

Problems in IR implementation • caused by • same words describing different things (“bar”, “bank”) • different words describing same thing (“stomach pain” – “belly ache”) • natural language is fraught with ambiguities at all levels leading to multiple interpretations of words, phrases, etc. • Common way to address these problems: query expansion • The approach in this paper: when computing the degree of similarity between query and document instead of basing similarity on the terms in common between the two incorporate a quantitative measure of the semantic similarity between index terms into the measure.

Measuring semantic distance between words • knowledge base – hierarchical concept graphs (HCGs) automatically constructed from WordNet • The similarity of two classes or synsets: • Computing the similarity between two word senses (nouns) can only be done if both are in the same HCG, otherwise they are regarded as being dissimilar. information content of the class ci P(ci) the class probability of class ci

Experimental Set-up • Hand-caption 2714 images • Manually disambiguate polysemous words in caption • Manually built a collection of 60 queries • Compute various query-caption similarity measure using word-word semantic distances.

Retrieval Strategies [1-2] • Notation • query Q={q1, q1, … qm}. • caption C={c1, c1 … cn} where a qi or a cj is the original term used only as a representation for its synset. • Sim(ti, tj) is the similarity between the sense-disambiguated form of two terms ti and tj. • Run1 • Run2 straightforward statistically-based tf*IDF match between the word forms or strings, i.e. not using word sense disambiguated captions or queries. where terms in caption in query are both expanded to include other word strings from their sense disambi-guated sysnsets (query expansion).

Retrieval Strategies [3-5] • Run3 • Run4 • Run5 when considering different threshold values for each HCG, given that there is a concentration of usage of concepts from some HCGs (like entity) and hardly any use of others (like shape).

Retrieval Strategies [6-8] • Run6 • Run7 • Run8

Experimental Results

Experiments on Using Semantic Distances Between Words in Image Caption Retrieval

Experiments on Using Semantic Distances Between Words in Image Caption Retrieval

Presentation Transcript

Image Retrieval

Information Retrieval on the Semantic Web Using Ontology-based Visualization

Content-Based Image Retrieval using the Bag-of-Words Concept

Image Retrieval

“Semantic” Image Annotation and Retrieval

Seminar on Image Similarity and Image Retrieval

Image Retrieval

Measuring Semantic Similarity between Words Using HowNet

Hierarchical Semantic Indexing for Large Scale Image Retrieval

Image Retrieval

Descriptive Semantic Image Retrieval

Image Retrieval

Using Semantic Relations to Improve Information Retrieval

Keypoints in Image Retrieval

Special Topic on Image Retrieval

Multimodal Semantic Indexing for Image Retrieval

Special Topic on Image Retrieval

Document Image Retrieval using Bag of Visual Words Model

Image Retrieval

Content-Based Image Retrieval using the Bag-of-Words Concept

Image Retrieval using Neutrosophic Sets

Image Retrieval