Automatic Content Filtering. KDDI R&D Laboratories Inc. UGC(User Generated Content) is very popular and becoming a high part of online volume. Industry sources tell us that YouTube content submissions are moving to 5M minutes of new content uploads per day
Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.
Automatic Content Filtering
KDDI R&D Laboratories Inc.
UGC(User Generated Content) is very popular and becoming a high part of online volume.
Industry sources tell us that YouTube content submissions are moving to 5M minutes of new content uploads per day
A large variety of formats, resolutions and sizes of videos and images are uploaded to the internet daily
How can a company can check all this picture and movie content?
Drawbacks of Manual checking :
Subjective evaluation is time and resource consuming
Subjective evaluation introduces fluctuations in results
What are the key drivers for automatic content filtering?
Can operate 55Pics/sec. using only Laptop PC
Adopt proprietary image features
Fast training by introducing iSVM
SVM (Support Vector Machine) : Concept and Problem
Concept : Mapping to multidimensional space and determining boundary between OK/NG
Problem :Huge calculations are needed to support working on these huge datasets.
Conventional SVM cannot handle a huge training dataset
There’s a Strong Need for Fast Training Algorithm while maintaining high accuracy
Incremental SVM (iSVM) : Concept, Features, Benefit
Introducing KDDI R&D Labs’ proprietary adaptive training algorithm - iSVM
Now calculation cost increases are proportional to the amount of data!
Conventional methods SVM cube the proportion of calculation to data!!!
We have confirmed that iSVM accelerates calculation speeds
up to 8X for 5,000,000 training datasets.
5X faster than other product