400 likes | 529 Views
Online Manifold Regularization: A New Learning Setting and Empirical Study. Andrew B. Goldberg, Ming Li, and Xiaojin Zhu.(ECML PKDD, 2008). Hu EnLiang Friday, April 17, 2009. Standard online learning VS. Online Manifold Regularization.
E N D
Online Manifold Regularization: A New Learning Setting and Empirical Study Andrew B. Goldberg, Ming Li, and Xiaojin Zhu.(ECML PKDD, 2008). Hu EnLiang Friday, April 17, 2009
Standard online learning VS. Online Manifold Regularization • Both of them are long-life learning and learn non-iid sequentially; • Standard online learning: traditionally assumes that every input point is fully labeled, it cannot take advantage of unlabeled data; • Online MR: it learns even when the input point is unlabeled.
Online MR VS. batch MR (advantages) • Online MR scales better than batch MR in time and space; • Online MR achieves comparable performance to batch MR; • Online MR can handle concept drift; • Online MR is an “anytime classifier”, which continuously is being improved and its training is cheap.
The relationship of batch risk, instantaneous regularized risk and average instantaneous risk
A BriefIntroduction to CBIR(Content-based Image Retrieval) Hu en liang Tuesday, April 08, 2008
Background:Content-based Image Retrieval • Properties: • Querying image according to user’s semantic-concepts. • Querying images according to image’s contents, such as: color, texture, shape, etc. • Hypothesis——similar contents means semantic affinity; • ‘Semantic gap’——semantic affinity doesn't means similar contents.
Background: The Difficulty of ‘Semantic Gap’ • Key problems: • How to extract user’s semantic-concept (intention)? • How to bridge between content and semantic ? • Main methods: • Machine learning based RF (Relevance-Feedback); • The prior knowledge such as the historical logs.
How to Connect CBIR to ML? • (Semi-)supervised Metric Learning; • Manifold Learning, Dimension Reduction… • (Semi-)supervised Classification; • Active Learning; Co-training; • Assembling Classifier; • Ranking; …
Some Individual Characteristics for feedback-based CBIR • In contrast to typical ML, there are some special characteristics for RF-CBIR: • The problem of the small size sample; • The problem of asymmetrical training sample; • The online algorithm with real-time requirement;
Manifold Regularization (MR) Manifold regularization: a geometric framework for learning from labeled and unlabeled examples. Mikhail Belkin, Partha Niyogi, Vikas Sindhwani. Journal of machine Learning Research 7, pp 2399-2434, 2006
To Modify MR for CBIR • There are some intrinsic characteristics for CBIR: • The problem of the small size sample; • The problem of asymmetrical training sample; • The online algorithm with real-time requirement; The (1+x)-manifolds hypothesis There only single submanifold for positive class, but multi-submanifolds for negative class!
positive manifold Negative manifold The Problem of MR for the Multi-Submanifolds Case