1 / 50

Style and Influence in Social Text

Style and Influence in Social Text. 11-27-29. Announcement. Project reports next week same drill as midterm reports reverse order as midterm reports W e know you’re not done yet … but you will be by midnight Mon 12/10, right? start with one slide summarizing midterm. FCE’s. Are now open

quito
Download Presentation

Style and Influence in Social Text

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Style and Influence in Social Text 11-27-29

  2. Announcement • Project reports next week • same drill as midterm reports • reverse order as midterm reports • We know you’re not done yet • … but you will be by midnight Mon 12/10, right? • start with one slide summarizing midterm

  3. FCE’s • Are now open • We do read them…and people do care • Especially this year • free-text comments on assignments/structure/layout of course very welcome

  4. Puzzle time • Thssntnchs n vwls • ieee a o osoa

  5. Today’s topics • Summary: there are signals in common words • What can you infer from how people use the mostfrequent words in text?

  6. Today’s topics • Summary: there are signals in common words • What can you infer from how peopleuse the mostfrequent words in text?

  7. Today’s topics • Summary: there are signals in common words • What can you infer from how people use the most frequent words in text?

  8. Today’s topics • Summary: there are signals in common words • What can you infer from how people use the most frequent words in text? • Patterns of usage  ”literary style” • predicts: authorship, gender, … • Style changes according to situation • and is transmitted from person to person • Outline: • some background and two recent papers

  9. Background: Authorship attribution • Mosteller and Wallace, 1964. “Inference and Disputed Authorship”: frequency of function words can be used to classify documents by author. • Function words are not under conscious control • Function word use is independent of content • Histogram of function words is ok

  10. Authorship attribution SchlomoArgamon, SchlomoLevitan SVM on histogram of 200 most frequent words

  11. COLING 2006

  12. LIWC • 1986: writing about emotional upheavals improved physical health (!) • Can you refine this statement? • what sort of writings yield the best results? • but: people don’t agree on ratings • and: “judges tend to get depressed when reading depressing stories.” • so: design an automatic “instrument” to rate writings (Linguistic Inquiry and Word Count) based on most frequent words

  13. LIWC words - cover about 55% of the tokens (not types) in most text Categories are mostly designed by hand, by committee

  14. Another signal of rank: starting a fashion

  15. most frequent 200 words

  16. People adopt each other’s mannerisms and style in many ways….

  17. Corpus • Pennebaker & Niederhoffer, 2002: • 98 pairs in the lab + Watergate tapes • Twitter A: • 1.3M “conversations” between 300k users--many are too short to analyze successfully • Twitter B: More crawling • all pairs with 2+ conversations • all posts from these pairs • 15M tweets, 7800 users, 215k conversations, 2200 pairs

  18. Measuring “cohesion” for a property C

  19. Measuring “cohesion” Tweet T contains word from class C T and R are a “turn” Reply R contains word from class C

  20. Measuring “accommodation” and “influence” Tb, from b, is a reply to Ta, from a

  21. Tb uses word class C in a reply to a after a uses C Tb uses word class C in a reply to a

  22. Evidence of fashion in linguistic style spreading through a conversation • Time lag suggests influence not associative sorting • We don’t have anything like direction…..

  23. If Acc(a,b)>0: • Symmetric: Acc(b,a) > 0 • Default asymmetric: Acc(b,a) = 0 • Divergent asymmetric: • Acc(b,a) < 0

  24. Does one party accommodate more than the other? Accommodation does not correlate with “status” features like #followers, #days on Twitter, ….

  25. Does one party accommodate more than the other? • ????

  26. Datasets • Wikipedia: wikipedia editors talk pages: 240k conversations; plus 32k discussions over who gets promoted to admins. • Status: admin vs non-admin • Dependence: learning to support/reject • Supreme court: 50k verbal exchanges for 204 cases. • Status: chief justice vs justice vs lawyer • Dependence: leaning to support/learning to reject

  27. Experiments • Similar notion of “coordination” (=accomodation) • Hypotheses: e.g., you accommodate more when speaking to a big shot and he coordinates less with other people

  28. more coordination with admins than non-admins admins coordinate more with others than non-admins

  29. Why? Maybe the folks that become admins are different somehow? eg more accommodating? admins coordinate more with others than non-admins

  30. the people that eventually become admins coordinate more than people who eventually fail to become admins

  31. revised hypothesis: after you become an admin you will coordinate with others less than you did before

  32. What about the court dataset?

  33. What about the court dataset?

  34. Status prediction • Given conversation between x,ypredict if status(x)>status(y) or vice-versa • Very easy to do in Supreme Court domain (“your honor,….”) • Hard for humans in Wikipedia (inter-annotator aggrement ~= 80%, accuracy ~=70%)

  35. One more observation…

  36. So to summarize… • Summary: there are signals in common words • Even though we don’t think about how we use them • Patterns of usage  ”literary style” • predicts: authorship, gender, … • Style changes according to situation • and is transmitted from person to person • you can observe that transmission (accommodation, coordination) and determine its direction • the direction of accommodation it tells you something about the status of the speakers

More Related