slide1 n.
Download
Skip this Video
Loading SlideShow in 5 Seconds..
The user entered the query “What is the historical relation between Greek and Roma”. PowerPoint Presentation
Download Presentation
The user entered the query “What is the historical relation between Greek and Roma”.

Loading in 2 Seconds...

play fullscreen
1 / 1

The user entered the query “What is the historical relation between Greek and Roma”. - PowerPoint PPT Presentation


  • 67 Views
  • Uploaded on

Natural Sciences Engineering Sciences. Client. Server. HTTP Servlet. HTTP Client : Java script. Morphological Analyzer. LDA : Jar. Manager. Learning Content Models for Semantic Search. Manager. GUI. SolR. Eran Peer, Hila Shalom.

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about 'The user entered the query “What is the historical relation between Greek and Roma”.' - dareh


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
slide1

Natural Sciences

Engineering Sciences

Client

Server

HTTP Servlet

HTTP Client :

Java script

Morphological Analyzer

LDA : Jar

Manager

Learning Content Models for Semantic Search

Manager

GUI

SolR

Eran Peer, Hila Shalom

Advisors: Prof. Michael Elhadad, Dr. Menachem Adler

to allow the user to explore a repository of textual documents, and discover material of interest even when he is not familiar with the content of the repository.

Existing search engines provide powerful features to identify known documents by using a short description of their content (keywords, name of document). For example, a user wants to find information about a term that he knows, he enters the term's name in a search engine, and the search engine returns the address of the term's value at Wikipedia.

The task we address is what happens when the user does not know the name of the term he is looking for, or when the term the user enters has several meanings. The system that we developed will help the user explore interactively the repository and the terms that are most significant in this repository.

User Graphical Interface

The project goal

Search in the search results

Enter text for searching

Choose an article

The user entered the query “What is the historical relation between Greek and Roma”.

Here are the query’s results.

The user can enter a query for new search or to search in the results.

Search by

article’s data

The user can choose one of the topics to add it to the search.

The user can choose one of the articles, to view it.

The user clicked the topic “Roman copies of Greek sculptures”. Here are the results to the query with the topic.

The user can choose to remove parts of the query.

The user can view the topics as a list or as a wheel

System Architecture

The user can choose to search one of the topics.

The user chose the article “Venus de’ Medici”.

Here is the article.

The users send questions in natural language (English/Hebrew), and receive relevant results

Search by

article’s topics

The user can choose to search by the article’s data.

The system builds automatically hierarchical topics by existing corpus

Corpus

  • Client
  • HTTP Client – The functions in the client side
  • GUI – Web based graphical user interface
  • Server
  • HTTP Servlet – Handles the connection with the client
  • Morphological Analyzer – Analyze the articles into verb, noun, adjective etc.
  • LDA – Algorithm that creates hierarchical topics from the corpus. Tags all the articles with the topics. Can be replaced with a better one.
  • Corpus – Contains the articles and their data (author, date etc.). New and existing articles can be added or removed. Can be at English or Hebrew.
  • SolR – System for searching quickly and efficiently in huge corpus.

The system tags the articles in the corpus with the topics

What is…?

This project was programmed in Java and Javascript using Eclipse.

The system was developed using Mallet, SolR, Morphological analyzer and Tomcat.

Web system - available every where, every time

The users can navigate in the system to

understand

the corpus

Our code at http://code.google.com/p/semantic-search2011