Information Extraction From Automobile Advertisements





Nipun Bhatia

Rakshit Kumar

Shashank Senapaty

Problem Definition
  • Craigslist offers only rudimentary keyword search.
  • Keyword search is not a natural way to search for cars.
  • It is difficult to efficiently find ads with particular attributes.
  • Goal: structured search over attributes.
    • Attributes: Make, Model, Price, Year, Mileage, Transmission, PostedBy, Location, Contact
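
To make the goal of structured search concrete, here is a minimal sketch of what an extracted ad record and an attribute filter might look like. The `CarAd` class, the `search` function, the field types, and the sample ads are all assumptions for illustration; the slides only name the nine attributes and do not describe a storage or query layer.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class CarAd:
    """One extracted ad; field names mirror the attribute list (types are assumed)."""
    make: Optional[str] = None
    model: Optional[str] = None
    price: Optional[int] = None         # assumed: US dollars
    year: Optional[int] = None
    mileage: Optional[int] = None       # assumed: miles
    transmission: Optional[str] = None
    posted_by: Optional[str] = None     # e.g. "owner" or "dealer" (assumed values)
    location: Optional[str] = None
    contact: Optional[str] = None


def search(ads: List[CarAd],
           make: Optional[str] = None,
           max_price: Optional[int] = None,
           min_year: Optional[int] = None) -> List[CarAd]:
    """Return the ads matching every constraint that was supplied."""
    results = []
    for ad in ads:
        if make is not None and (ad.make or "").lower() != make.lower():
            continue
        # An ad with a missing attribute cannot satisfy a constraint on it.
        if max_price is not None and (ad.price is None or ad.price > max_price):
            continue
        if min_year is not None and (ad.year is None or ad.year < min_year):
            continue
        results.append(ad)
    return results


# Made-up sample records, not taken from the dataset.
sample_ads = [
    CarAd(make="Honda", model="Civic", price=5000, year=2003),
    CarAd(make="Toyota", model="Camry", price=7000, year=2005),
    CarAd(make="honda", model="Accord", price=9000, year=2008),
]
cheap_hondas = search(sample_ads, make="Honda", max_price=6000)
```

A query such as "Hondas under $6000" becomes a filter over fields, which a plain keyword search cannot express reliably.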
Dataset & Issues
  • 350 postings from the cars & trucks section in Craigslist.
  • Manually annotated with the attributes.
Feature Selection
  • Features:
    • Title: isPresentLexicon, hasDollar_hasDigit, hasParanthesis, hasDigit, hasApostrophe_hasDigit, PrevLabel, Word
    • Body: isPresentTrLexicon, isPresentOwLexicon, hasDigit_hasDash, hasDigit_hasDot, hasDigit_hasParanthesis, Word_Representation, Neighbor
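
The title features could be computed per token roughly as follows. This is a sketch under stated assumptions: the slides give only the feature names, so the definitions below are guesses based on those names, and `MAKE_MODEL_LEXICON`, `title_features`, and the sample title are hypothetical.

```python
# Hypothetical lexicon of makes and models; the real lexicon is not given in the slides.
MAKE_MODEL_LEXICON = {"honda", "civic", "toyota", "camry"}


def title_features(tokens, i, prev_label):
    """Build a feature dict for the token at position i of an ad title.

    Each definition is an assumed reading of the feature name on the slide.
    """
    word = tokens[i]
    has_digit = any(ch.isdigit() for ch in word)
    # Strip surrounding punctuation before the lexicon lookup.
    bare = word.lower().strip("()$',.")
    return {
        "isPresentLexicon": bare in MAKE_MODEL_LEXICON,
        "hasDollar_hasDigit": "$" in word and has_digit,          # e.g. a price
        "hasParanthesis": "(" in word or ")" in word,             # e.g. a location
        "hasDigit": has_digit,
        "hasApostrophe_hasDigit": "'" in word and has_digit,      # e.g. a year like '02
        "PrevLabel": prev_label,                                  # label of the previous token
        "Word": word.lower(),
    }


# Made-up title for illustration.
tokens = "'02 Honda Civic $4500 (Palo Alto)".split()
features = [title_features(tokens, i, "START") for i in range(len(tokens))]
```

The feature names keep the spelling used on the slide (including `hasParanthesis`) so they can be matched against the original list. In a real tagger, `PrevLabel` would carry the label predicted for the preceding token rather than a constant.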
Results

Body Classifier

Title Classifier
