
Riding the tide of sentiment change: sentiment analysis with evolving online reviews

Yang Liu, Xiaohui Yu, Aijun An, Xiangji Huang. General idea: perform sentiment analysis on IMDB movie reviews and use the sentiment to predict box office revenues. Sentiment analysis uses PLSA.



Presentation Transcript


  1. Riding the tide of sentiment change: sentiment analysis with evolving online reviews • Yang Liu, Xiaohui Yu, Aijun An, Xiangji Huang

  2. General Idea • Perform sentiment analysis on IMDB movie reviews • Use the sentiment to predict box office revenues

  3. Sentiment Analysis: PLSA • PLSA can be used as an unsupervised classifier to determine the mixture of sentiments in a review • PLSA cannot adapt itself as new data become available • The brute-force approach is to retrain the whole model • This paper proposes two ways to deal with new data

  4. Data structure • Only the appraisals in reviews are recognized, and the frequency of each appraisal is counted in each review • Appraisal lexicon • Polarity indicates whether a negation such as “not”, “no”, or “non” is present
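
The data structure on this slide can be illustrated with a minimal sketch. It assumes a tiny hypothetical lexicon and a deliberately simple rule in which a negation word affects only the token that immediately follows it; `APPRAISAL_LEXICON`, `NEGATIONS`, and `count_appraisals` are illustrative names, not taken from the paper.

```python
from collections import Counter

# Hypothetical appraisal lexicon and negation list (illustrative only).
APPRAISAL_LEXICON = {"good", "great", "boring", "bad"}
NEGATIONS = {"not", "no", "non"}

def count_appraisals(review):
    """Count (appraisal word, negated?) pairs in one review."""
    counts = Counter()
    negated = False
    for raw in review.lower().split():
        token = raw.strip(".,!?;:\"'")
        if token in NEGATIONS:
            negated = True
        elif token in APPRAISAL_LEXICON:
            counts[(token, negated)] += 1
            negated = False
        else:
            # Assumption: a negation only reaches the very next token.
            negated = False
    return counts
```

Each review then becomes a vector of counts over (appraisal, polarity) pairs, which is the kind of input a PLSA model can be trained on.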

  5. EM algorithm for PLSA
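
The slide's equations did not survive in the transcript. As a stand-in, here is a minimal sketch of the textbook EM procedure for asymmetric PLSA, where P(w|d) = Σ_z P(w|z) P(z|d). It is not necessarily the exact formulation used in the paper; the function names and the dense list-of-lists layout are assumptions made for illustration.

```python
import math
import random

def _normalize(row):
    """Scale a row to sum to 1 (uniform if the row is all zeros)."""
    total = sum(row)
    if total <= 0:
        return [1.0 / len(row)] * len(row)
    return [x / total for x in row]

def plsa_em(counts, n_topics, n_iter=50, seed=0):
    """Fit PLSA by EM. counts[d][w] is the count of word w in review d.

    Returns (p_w_z, p_z_d): P(w|z) indexed [z][w], P(z|d) indexed [d][z].
    """
    rng = random.Random(seed)
    n_docs, n_words = len(counts), len(counts[0])
    p_w_z = [_normalize([rng.random() + 1e-3 for _ in range(n_words)])
             for _ in range(n_topics)]
    p_z_d = [_normalize([rng.random() + 1e-3 for _ in range(n_topics)])
             for _ in range(n_docs)]
    for _ in range(n_iter):
        acc_w_z = [[0.0] * n_words for _ in range(n_topics)]
        acc_z_d = [[0.0] * n_topics for _ in range(n_docs)]
        for d in range(n_docs):
            for w in range(n_words):
                n_dw = counts[d][w]
                if n_dw == 0:
                    continue
                # E-step: P(z|d,w) is proportional to P(w|z) * P(z|d)
                post = _normalize([p_w_z[z][w] * p_z_d[d][z]
                                   for z in range(n_topics)])
                # Accumulate expected counts for the M-step
                for z in range(n_topics):
                    acc_w_z[z][w] += n_dw * post[z]
                    acc_z_d[d][z] += n_dw * post[z]
        # M-step: renormalize expected counts into probabilities
        p_w_z = [_normalize(row) for row in acc_w_z]
        p_z_d = [_normalize(row) for row in acc_z_d]
    return p_w_z, p_z_d

def log_likelihood(counts, p_w_z, p_z_d):
    """Log-likelihood of the observed counts under the fitted model."""
    ll = 0.0
    for d, row in enumerate(counts):
        for w, n_dw in enumerate(row):
            if n_dw:
                p = sum(p_w_z[z][w] * p_z_d[d][z] for z in range(len(p_w_z)))
                ll += n_dw * math.log(p)
    return ll
```

In the sentiment setting, `p_z_d[d]` would be read as the mixture of hidden sentiment factors in review `d`. Because every EM iteration cannot decrease the likelihood, comparing `log_likelihood` across iteration counts is a simple sanity check.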

  6. Adapting to new data • Light-weight incremental model • Suited to data that do not change dramatically over time

  7. Adapting to new data • Quasi-Bayesian model • X is the data available up to time n • The probability distribution on each time interval is assumed to be determined by some parameters • A Dirichlet distribution is used • φ = {α, β} are the hyperparameters of the Dirichlet distribution
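
The slide names the Dirichlet prior but the transcript does not carry its equations. For reference, the general Dirichlet density and the standard MAP estimate it yields are given below. Here α is a generic Dirichlet parameter vector and c_k are expected counts from an E-step; how the paper maps α and β onto the individual PLSA parameters is not stated in the slide, so treat this as background rather than the paper's update rule.

```latex
% Dirichlet density over a probability vector \theta = (\theta_1, \dots, \theta_K)
p(\theta \mid \alpha) =
  \frac{\Gamma\left(\sum_{k=1}^{K} \alpha_k\right)}{\prod_{k=1}^{K} \Gamma(\alpha_k)}
  \prod_{k=1}^{K} \theta_k^{\alpha_k - 1}

% MAP estimate given expected counts c_k
% (assumes every c_k + \alpha_k - 1 \ge 0 and at least one is positive)
\hat{\theta}_k =
  \frac{c_k + \alpha_k - 1}{\sum_{j=1}^{K} \left( c_j + \alpha_j - 1 \right)}
```

The hyperparameters act like pseudo-counts carried over from earlier data, which is what lets the model absorb a new time interval without retraining from scratch.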

  8. Application to sales prediction • Linear regression model
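
The slide gives no details of the regression. As a hedged sketch, the following fits ordinary least squares with an intercept by solving the normal equations. The choice of features (for example, per-interval sentiment scores) and the function names are assumptions, not the paper's specification.

```python
def fit_linear_regression(X, y):
    """Ordinary least squares with intercept via the normal equations.

    X: list of feature rows, y: list of targets (e.g. box office revenue).
    Returns coefficients [b0, b1, ..., bk] with b0 the intercept.
    """
    rows = [[1.0] + [float(v) for v in x] for x in X]
    k = len(rows[0])
    # Augmented system (A^T A | A^T y)
    aug = [[sum(r[i] * r[j] for r in rows) for j in range(k)]
           + [sum(r[i] * t for r, t in zip(rows, y))]
           for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("features are collinear; cannot solve")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(k):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [aug[i][k] / aug[i][i] for i in range(k)]

def predict(coef, x):
    """Apply fitted coefficients to one feature row."""
    return coef[0] + sum(c * v for c, v in zip(coef[1:], x))
```

Note that if the features are sentiment proportions that sum to 1, one of them has to be dropped, because together with the intercept they would be collinear.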

  9. Evaluation • 28,353 reviews for 20 drama films released in the US from 1 May 2006 to 1 September 2006 • Box office revenue of the corresponding drama films from 1 May 2006 to 1 September 2006 • 4 time intervals for the training of PLSA • Mean absolute percentage error is used to measure the prediction accuracy
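
Mean absolute percentage error (MAPE) is the average of |actual − predicted| / |actual| over all predictions, usually reported as a percentage. A minimal sketch, with the function name chosen here for illustration:

```python
def mape(actual, predicted):
    """Mean absolute percentage error, in percent."""
    if not actual or len(actual) != len(predicted):
        raise ValueError("inputs must be non-empty and of equal length")
    if any(a == 0 for a in actual):
        raise ValueError("MAPE is undefined when an actual value is zero")
    total = sum(abs((a - p) / a) for a, p in zip(actual, predicted))
    return 100.0 * total / len(actual)
```

A lower value means the predicted revenues are closer to the real ones relative to their size.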

  10. Evaluation • Time interval set to 2, 4, 6, and 8 days

  11. Evaluation • Time interval set to 5, 10, 15, and 20 days
