1 / 13

Theme Based Blog Opinion Extraction

Theme Based Blog Opinion Extraction. -- cs498cxz course project. By: Xu Ling Matthew Ryan Wondra 05/02/2006. Motivation. The explosive spread of weblogs has attracted increasing research work in automatically mining large numbers of blog pages for opinions and recommendations.

gray-boone
Download Presentation

Theme Based Blog Opinion Extraction

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Theme Based Blog Opinion Extraction -- cs498cxz course project By: Xu Ling Matthew Ryan Wondra 05/02/2006

  2. Motivation • The explosive spread of weblogs has attracted increasing research work in automatically mining large numbers of blog pages for opinions and recommendations. • Personal opinions on the same event are typically addressed in different aspects. For example, students say a lot of things about UIUC in their blogs… • This project aims to extract blog opinions over different aspects. The mining results will be represented as different opinions in multiple themes modeling different aspects of the same event.

  3. Existing Work We wantBlog Miner … • http://www.opinmind.com/ • http://blogsearch.google.com/

  4. Blog seracher (google blog search) Blog articles Query Ranking Training articles Comparative Text mining Theme models Sentiment models System Architecture User

  5. Implementation • Crawler • Comparative text mining … • Sentence splitter • Ranking strategies (KL-divergence) • Interface

  6. Comparative Text Mining (1)

  7. C1 C2 Cm Background model θB Collection Specific Model θ1 Collection Specific Model θ2 Collection Specific Model θm Comparative Text Mining (2) • Learn theme/sentiment models separately (supervised learning)

  8. positive negative Background model θB Collection Specific Model θ1 Collection Specific Model θ2 Theme 1 in common Theme 2 in common Theme k in common Comparative Text Mining (3) • Learn theme/sentiment models simultaneously (semi-supervised learning)

  9. Experiments & Results (1) • Cars theme models finance mpg maintains otd msrp rebates invoice rebate edmunds mats fees dealers ttl card discount quote doc civic leftover jerk invoive college website package march greatly include options finance leather upshift sales offer graduate offers cheaper quoted splash loan depreciation accessory cash taxes current sl missed moonroof incentives guys apr qualify discounts payments Fluid oil maintenance viscosity belt filter replace plastic timing box release brake yourself lock key required parts inspected oils valve 10w rotation slot screws rotors glove spark chain water 5w20 needed inspection trust effect claim tray plug clock extensor pcv list 93 removing shop runs coil finding stereo hoses smell Highway mph traffic 32 75 averaged cruise tank mile speeds mpg 65 gallons hwy trips filled steady 85 limit overall cc efficient tanks downhill saving short mileage gal fast fillup accelerate diameter gotten comment odometer hemi 37 deer calculated account tripletreds sedan 45 making spd ratio tail east gals 67

  10. Experiments & Results (2) • Cars sentiment models positive negative Awesome love loved freddie prefer loves together amazing neat might decent building asked adore hilarious common willing currently bless opportunity summer accord totem logs roadster dress miata fabulous lights shadow wenders tom shopping yours 350 newer titan loving lang 2004 till kinds motorcycles praise miles stratus fantastic 93 following california Hate boring sucks stupid horrible ugly difficult drivers suck desperate information owners sucked start stupidest himself lets frustrated rental saudi sit nearly current evil walk worry general awful terrible disappointment erect lady despise starting minivan theatre lane square orange driven community ugliest loser sent minutes jk pissed sucking dislike fears

  11. Experiments & Results (3) • movies sentiment models positive negative awesome amazing love laced volume copy smiles tonight loves role enjoying appreciate onto scraping nicey men perfect several saying tonite arnold forgotten mountains toyota barely cars frank trek 1st p sometime superb loved ice sky edition idk save superman moons age deep myspace minds disappoint shrek suppose please rent zetten Stupid depressing terrible evil awful difficult hate sucked lousy head dislike knight annoying useless beast loathe plane fake against country making em seemed society behalf worst despise cannot planet community aka wrong prove indigenous complained annoyed biased lost r armor company father vendetta complain shaved former wall peoples viviana stealing

  12. Demo • Blog Miner xs • Cached results:hondahonda accordtoyota camrylord of the rings

  13. Problems & Future Improvement • Supervised vs. unsupervised • Ranking strategy: how to combine the the theme relevance score and the sentiment score? • Other functionalities?

More Related