1 / 19

GSAT (General Sentiment Analysis Tool)

GSAT (General Sentiment Analysis Tool). Design Review By Asaf Bruner. Problem Description. Big Data & Sentiment Analysis. Let’s start with a short video: http:// www.youtube.com/watch?v=ij5yC-moPCM Textual information is either facts or opinions.

dalecarl
Download Presentation

GSAT (General Sentiment Analysis Tool)

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. GSAT(General Sentiment Analysis Tool) Design Review By Asaf Bruner

  2. Problem Description

  3. Big Data & Sentiment Analysis • Let’s start with a short video: http://www.youtube.com/watch?v=ij5yC-moPCM • Textual information is either facts or opinions. • Very little research has been made on the processing of opinions until only recently. Yet, opinions are so important that whenever we need to make a decision we want to hear others’ opinions.

  4.    The specific problem I will be dealing with • Currently there is no unified solution that can answer the problem which was discussed above. • I will design and build a system which does the following: • Automatically collects the talkbacks from websites • Analyzes the data using NLP tools • Draws conclusions from the gathered information • Displays it in an easy to understand way • Answer some very interesting and important questions.

  5. Where else can we use GSAT? • Individuals making purchasing decisions. • Organizations can use this tool to replace opinion polls, surveys, and focus groups. • Trend analysis.

  6. General scheme of the proposed solution

  7. The Data • I am using an open source, java based, web crawler – crawler4j by Google to collect my data. • Using regular expression and DOM analysis I extract the main text & talkbacks from the article while cleaning advertisements and unrelated text. • The list of sites I am crawling is defined in advance.

  8. The algorithm – Design review

  9. The algorithm – Design review

  10. The Tools and infrastructure I am using • The program is written in java (eclipse IDE). • Crawling using crawler4j. • NLP & sentiment analysis using AlchemyAPI. • Database using MySQL. • GUI using Google visualize.

  11. Expected deliverables

  12.  What is actually going to be delivered and how it can be used • I am going to present a specific use case – analyzing ynetnews.com and haaretz.com for political entities and sentimental information relating to them. • Other then that this will be a fully functional program. Meaning only slight changes will have to be made to generalize this use case.

  13. Potential intellectual property that could come out of the project • Integration between several tools • Algorithm

  14. Competing solutions

  15. Well… • Currently no free open source tool is available that does what GSAT offers!

  16. Other ways the problem can be solved • Currently there are 30 US based companies that offer paid sentiment analysis. None of them offers freely the combination of data mining and text analysis.

  17. Characterization of the users

  18. Initial group of users and the most general group • Everyone who wants to know what is being wrote and thought about entities in which they have interest. • Everyone who has interest in analyzing trends.

  19. How do you think one could make money out of your product • Advertisement market (campaign evaluating). • Product comparison (retail companies). • Trend analysis. • And many more…

More Related