1 / 22

Summarizing Contrastive Viewpoints in Opinionated Text

Summarizing Contrastive Viewpoints in Opinionated Text. Michael J. Paul, ChengXiang Zhai, Roxana Girju EMNLP ’ 10 Speaker: Hsin-Lan, Wang Date: 2010/12/07. Outline. Introduction Modeling Viewpoints Topic-Aspect Model Features Multi-Viewpoint Summarization Comparative LexRank

flint
Download Presentation

Summarizing Contrastive Viewpoints in Opinionated Text

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Summarizing Contrastive Viewpoints in Opinionated Text Michael J. Paul, ChengXiang Zhai, Roxana Girju EMNLP’10 Speaker: Hsin-Lan, Wang Date: 2010/12/07

  2. Outline • Introduction • Modeling Viewpoints • Topic-Aspect Model • Features • Multi-Viewpoint Summarization • Comparative LexRank • Summary Generation • Experiment and Evaluation • Conclusion

  3. Introduction • The amount of opinionated text available online has been growing rapidly. • In this paper, we study how to summarize opinionated text in a such a way that highlights contrast between multiple viewpionts.

  4. Introduction • Generate two types of multi-view summaries: • macro multi-view summary • Contains multiple sets of sentences, each representing a different viewpoint. • micro multi-view summary • Contains a set of pairs of contrastive sentences.

  5. Modeling Viewpoints • Challenge: to model and extract viewpoints which are hidden in text. • Solve: Topic-Aspect Model (TAM)

  6. Modeling Viewpoints • TAM

  7. Modeling Viewpoints • Features • Words • baseline approach • do not do any stop word removal • stemming • Dependency Relations • use Stanford parser • full-tuple: rel(a,b) • split-tuple: rel(a,*), rel(*,b)

  8. Modeling Viewpoints • Features • Negation • Rel(wi, wj), if either wi or wj is negated, then we simply rewrite it as . • Polarity • use Subjectivity Clues lexicon • amod(idea, good)→ amod(idea,+) and amod(*,good) • →rel(a,-).

  9. Modeling Viewpoints • Features • Generalized Relations • use Stanford dependencies • Rewrite rel(a,b) as Rrel(a,b).

  10. Multi-Viewpoint Summarization • Comparative LexRank • Make it favor jumping to a good representative excerpt x of any viewpoint v. • Make it favor jumping between two excerpts that can potentially form a good contrastive pair.

  11. Multi-Viewpoint Summarization • Comparative LexRank

  12. Multi-Viewpoint Summarization • Summary Generation • Macro contrastive summarization • Using the random walk stationary distribution across all of the data to rank the excerpts. • Separate the top ranked excerpts into two disjoint sets. • Remove redundancy and produce the summary. • Micro contrastive summarization • Consist of a pair (xi,xj) with the pairwise relevance score. • Rank these pairs and remove redundancy.

  13. Experiments and Evaluation • Experimental Setup • First dataset: 948 verbatim responses to a Gallup phone survey about the 2010 U.S. healthcare bill. • Second dataset: use the Bitterlemons corpus, a collection of 594 editorials about the Israel-Palestine conflict.

  14. Experiments and Evaluation • Stage One: Modeling Viewpoints

  15. Experiments and Evaluation • Stage Two: Summarizing Viewpoints • Gold Standard Summaries • Gallup healthcare poll

  16. Experiments and Evaluation • Stage Two: Summarizing Viewpoints • Baseline Approaches • Graph-based algorithms • When λ=1, the random walk model only transitions to sentences within the same viewpoint. • The modified algorithm produces the same ranking as the unmodified LexRank. • Model-based algorithms • Compare against the approach of Lerman and McDonald.

  17. Experiments and Evaluation • Stage Two: Summarizing Viewpoints • Metrics • using the standard ROUGE evaluation metric • For evaluating the macro-level summaries: • For evaluating the micro-level summaries:

  18. Experiments and Evaluation • Stage Two: Summarizing Viewpoints • Evaluation Results

  19. Experiments and Evaluation • Unsupervised Summarization • Bitterlemons corpus (without a gold set) • Asked 8 people to guess if each viewpoint’s summary was written by Israeli or Palestinian authors.

  20. Experiments and Evaluation • Unsupervised Summarization • Macro-level summaries: • Correctly labeled 78% of the summary sets. • Micro-level summaries: • Many of the sentences are mislabeled, and the ones that are correctly labeled are not representative of the collection.

  21. Conclusion • Present steps toward a two-stage system that can automatically extract and summarize viewpoints in opinionated text. • First: the accuracy of clustering documents by viewpoint can be enhanced by using rich dependency features.

  22. Conclusion • Second: use Comparative LexRank to generate contrastive summaries both at the macro and micro level.

More Related