
Topic Trends from CiteSeer Data



Presentation Transcript


  1. Topic Trends from CiteSeer Data Michal Rosen-Zvi Padhraic Smyth Mark Steyvers

  2. Data and Topic Models
     • Author-topic-word model for 70k authors and 300 topics, built from 162,489 CiteSeer abstracts
     • Each word in each document is assigned to a topic
     • For the subset of 131,602 documents whose year is known:
       • Group documents by year
       • Calculate the fraction of words in each year that are assigned to a topic
       • Plot the resulting time series, 1990 to 2002
     • Caveats:
       • Data set is incomplete (see next slide)
       • Variability (noise) will be high for 2001 and 2002
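
The per-year calculation described on slide 2 could be sketched roughly as follows. This is a minimal illustration, not the authors' code: the function name `topic_fraction_by_year` and the input layout (one `(year, topic_id)` pair per word token, with `None` for documents of unknown year) are assumptions made for the example.

```python
from collections import Counter


def topic_fraction_by_year(tokens, topic, first_year=1990, last_year=2002):
    """Return {year: fraction of word tokens in that year assigned to `topic`}.

    `tokens` is an iterable of (year, topic_id) pairs, one per word token
    (a hypothetical layout). Tokens with an unknown year (None) or a year
    outside [first_year, last_year] are skipped.
    """
    total = Counter()  # all word tokens seen per year
    hits = Counter()   # word tokens assigned to `topic` per year
    for year, assigned_topic in tokens:
        if year is None or not (first_year <= year <= last_year):
            continue
        total[year] += 1
        if assigned_topic == topic:
            hits[year] += 1
    # Only years with at least one token appear, so no division by zero.
    return {year: hits[year] / total[year] for year in sorted(total)}


# Toy data: two tokens in 1990 (one on topic 3), one in 1991 (on topic 3),
# one with unknown year, and one outside the plotted range.
example_tokens = [(1990, 3), (1990, 5), (1991, 3), (None, 3), (1989, 3)]
example_series = topic_fraction_by_year(example_tokens, topic=3)
```

Plotting the returned values against the years would give one time series per topic. Years with few tokens yield noisy fractions, which is consistent with the caveat about 2001 and 2002.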

  3. Trends within Database Research

  4. NLP and IR

  5. Security research reborn…

  6. Rise of machine learning, data mining

  7. Bayes lives on…

  8. Rise in Web/Mobile topics

  9. (Not so) Hot Topics

  10. Vision and Robotics

  11. Decline in programming languages, OS, …

  12. Decline in CS Theory

  13. Decrease in use of Greek Letters 

  14. Burst of French writing in the mid-90s?

  15. Why the decrease in these “topics”?
