slide1 n.
Download
Skip this Video
Loading SlideShow in 5 Seconds..
Introduction Vandalism -deliberate activity that compromises Wikipedia integrity. PowerPoint Presentation
Download Presentation
Introduction Vandalism -deliberate activity that compromises Wikipedia integrity.

Loading in 2 Seconds...

play fullscreen
1 / 1

Introduction Vandalism -deliberate activity that compromises Wikipedia integrity. - PowerPoint PPT Presentation


  • 108 Views
  • Uploaded on

Elusive Vandalism Detection in Wikipedia Deepika Sethi, Raga Sowmya T Computer Science Department University of Georgia dsethi@uga.edu, sowmya@uga.edu. Introduction Vandalism -deliberate activity that compromises Wikipedia integrity.

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about 'Introduction Vandalism -deliberate activity that compromises Wikipedia integrity.' - wiley


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
slide1

Elusive Vandalism Detection in WikipediaDeepika Sethi, Raga Sowmya TComputer Science DepartmentUniversity of Georgiadsethi@uga.edu, sowmya@uga.edu

  • Introduction
  • Vandalism-deliberate activity that compromises Wikipedia integrity.
  • Problem-In general around 8% of Wikipedia edits are vandalized.
  • Elusive vandalism-Doesn’tcontain normal characteristics of vandalism and hence hard to detect.
  • Ex. abusive words, changing dates.
  • Use Google to check for co-occurrence probability of Wikipedia word and its page title.
  • Probability too low might imply out of context and vandalism.
  • Preliminary Results
  • Observing order of magnitude between vandalized and non-vandalized words.
  • Able to distinguish vandalism edits that were undetected by humans.

Objective

As a large number of users rely on Wikipedia for useful information, we tried to detect vandalism on Wikipedia pages

Extract Words

  • Context Aware Approach
  • Detecting vandalism based on the context in which it is used.
  • Identifies words that are out of context with the existing words in an article.

Acknowledgements

We would like to express our appreciation to Professor Dr. LakshmishRamaswamy and Dr.Kang Li