Information extraction from automobile advertisements
Download
1 / 8

Information Extraction From Automobile Advertisements - PowerPoint PPT Presentation


  • 210 Views
  • Updated On :

Information Extraction From Automobile Advertisements. Nipun Bhatia Rakshit Kumar Shashank Senapaty. Problem Definition. Craigslist - Rudimentary keyword search. Not a natural way to search for cars. Difficult to efficiently find ads with particular attributes.

Related searches for Information Extraction From Automobile Advertisements

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about 'Information Extraction From Automobile Advertisements' - daniel_millan


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
Information extraction from automobile advertisements l.jpg

Information Extraction From Automobile Advertisements

Nipun Bhatia

Rakshit Kumar

Shashank Senapaty


Problem definition l.jpg
Problem Definition

  • Craigslist - Rudimentary keyword search.

  • Not a natural way to search for cars.

  • Difficult to efficiently find ads with particular attributes.

  • Want structured search over attributes.

    • Attributes : Make, Model, Price, Year, Mileage, Transmission, PostedBy, Location, Contact


Dataset issues l.jpg
Dataset & Issues

  • 350 postings from the cars & trucks section in Craigslist.

  • Manually annotated with the attributes.



Feature selection l.jpg
Feature Selection

  • Features:

    • Title : isPresentLexicon, hasDollar_hasDigit, hasParanthesis, hasDigit, hasApostrophe_hasDigit, PrevLabel, Word

    • Body :isPresentTrLexicon, isPresentOwLexicon, hasDigit_ hasDash, hasDigit_hasDot, hasDigit_ hasParanthesis, Word_Representation, Neighbor


Results l.jpg
Results

Body Classifier

Title Classifier




ad