1 / 10

Discussion Class 2

Discussion Class 2. A Vector Space Model for Automated Indexing. Discussion Classes. Format: Questions. Ask a member of the class to answer. Provide opportunity for others to comment. When answering: Stand up. Give your name. Make sure that the TA hears it.

Download Presentation

Discussion Class 2

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Discussion Class 2 A Vector Space Model for Automated Indexing

  2. Discussion Classes Format: Questions. Ask a member of the class to answer. Provide opportunity for others to comment. When answering: Stand up. Give your name. Make sure that the TA hears it. Speak clearly so that all the class can hear. Suggestions: Do not be shy at presenting partial answers. Differing viewpoints are welcome.

  3. Question 1: Reading a Research Paper Who are the authors of this paper? What is their background? Why did they write this paper? (b) When was the paper written? Since then, what has changed about computing? (c) Since the paper was published was has changed about information retrieval?

  4. Question 2. Reading a Research Paper

  5. Question 3: Research Methodology Define precision and recall.

  6. Question 4. Summary of the paper What is the overall hypothesis that is examined in this paper? How does Section 2, Correlation between Indexing Performance and Space Density, relate to the hypothesis? How does Section 3, Correlation between Space Density and Indexing Performance, relate to the hypothesis? (d) How does Section 4, The Discrimination Value Model, relate to the hypothesis?

  7. Question 5: Document Space Explain this diagram

  8. Question 6: Weighting -- Term Frequency The paper examines the effect of term weighting on the space density of index terms. (a) Why is this of interest in information retrieval? (b) What form of term frequency (tf) is used in this paper? (c) How does this form of term frequency differ from the standard form discussed in class? Under what circumstances is this difference significant?

  9. Question 7: Discrimination Value Model Explain the following expression, which the authors use to measure the contribution of term k to the space density. DVk = Qk - Q What does this tell about the discriminant value of term k?

  10. Question 7: Question 8: Discuss this graph

More Related