ALIP: Automatic Linguistic Indexing of Pictures. Jia Li The Pennsylvania State University. Can a computer do this?. “Building, sky, lake, landscape, Europe, tree”. Outline. Background Statistical image modeling approach The system architecture The image model Experiments
Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.
The Pennsylvania State University
Annotation: “man, male, people, cloth, face”
Training images used to train a concept with
description “man, male, people, cloth, face”
Regard an image as a grid. A feature vector is computed for each node.
The underlying states are governed by a Markov mesh.
(i’,j’)<(i,j) if i’<i; or i’=i & j’<j
Context: the set of states for (i’, j’): (i’, j’)<(i, j)
by wavelet transform
An approximation to the
classification EM approach
Computer Prediction: people, Europe, man-made, water
Building, sky, lake, landscape, Europe, tree
People, Europe, female
Food, indoor, cuisine, dessert
Snow, animal, wildlife, sky, cloth, ice, people