1 / 37

Natural Language Generation

Natural Language Generation. Ling 571 Fei Xia Week 8: 11/17/05. Outline. NLG basics What is NLG? Example NLG systems Applications of NLG Architecture for NLG Three subtasks of NLG Document planning Microplanning Surface realization. NLG basics. What is NLG?.

sawyer
Download Presentation

Natural Language Generation

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Natural Language Generation Ling 571 Fei Xia Week 8: 11/17/05

  2. Outline • NLG basics • What is NLG? • Example NLG systems • Applications of NLG • Architecture for NLG • Three subtasks of NLG • Document planning • Microplanning • Surface realization

  3. NLG basics

  4. What is NLG? • Definition (McDonald 1992): the process of deliberately constructing a NL text in order to meet specified communicative goals. • Input: non-linguisticrepresentation of info • Output: text, hypertext, speech

  5. NLU vs. NLG • NLU: ambiguity, ill-formed input • NLG: choice Meaning NLU NLG Text Text

  6. NLG system #1: FoG • FoG: Forecast Generator • Input: weather map • Output: textual weather report in English and French • Developer: CoGen Tex • Status: in operational use since 1992

  7. NLG system #2: SumTime-Mousam • FoG: Forecast Generator • Input: weather data • Output: textual weather report in English • Developer: University of Aberdeen • Status: Used by one company to generate weather forecasts for offshore oil rigs. • Demo

  8. NLG System #3: STOP • Input: Questionnaire about smoking attitudes, history, beliefs • Output: a personalized smoking-cessation leaflet • Developer: University of Aberdeen • Demo: non-tailored letter, questionnaire, a tailored letter. • Status: undergoing clinical evaluation

  9. Applications of NLG • Automated document production: • Weather forecasts, letters, …. • Dialogue systems: e.g., ELIZA • Text summarization • Machine translation • Question Answering • ….

  10. ElIZA system • H: My head hurts • E: Why do you say your head hurts? • H: My mother hates me. • E: Who else in your family hates you? • H: *******. • E: Tell me more about it. • H: What should I do? • E: That’s a tough decision. • H: ****? • E: Does that question interest you?

  11. ELIZA system (cont) • The famous 1966 computer system by Joseph Weizenbaum, which acted as a therapist • Named after “Eliza Doolittle” in in Shaw's Pygmalion (“My fair lady”). • Use pattern matching rules • Very successful: prompt him to write a book to explain the limits of computers

  12. Challenges of NLG • Decide what to say and how to say it. • The output has to be well-formed text. • Orthography, morphology, syntax • Reference, word choice, pragmatics • Example: weather report

  13. Component tasks in NLG • Content determination: what content to express • Document structuring: how to structure the info to make a coherent text. • Aggregation: combine units in a document plan tree. • Lexicalization: what words to use? • Referring expression generation: NP or pronoun? • Structure realization: add markups • Linguistic realization: tense, aspect, voice, …

  14. Tasks and modules in NLG

  15. A pipelined architecture Communicative goal Knowledge base Grammar …. Document Planning Document plan Microplanning Text Specification Surface realization Text

  16. Another architecture Knowledge base Communicative Goal Discourse Planner Discourse specification Surface Realizer NL output

  17. Outline • NLG basics • What is NLG? • Example NLG systems • Applications of NLG • Architecture for NLG • Three subtasks of NLG • Document planning • Microplanning • Surface realization

  18. Document planning • Content determination: • What is important? • a domain-dependent expert-system-like task. • Ex: Weather summary: • This Nov was very dry. • The temperature was lower. • Document structuring: • Use RST or other discourse theory

  19. Document plan tree … Nucleus Nucleus (sequence) Nucleus Satellite (Elaboration) Satellite (contrast) Nucleus Msg3 Msg1 Msg2

  20. Microplanning • Aggregation • Lexicalization • Referring expression generation

  21. Aggregation • Combinations can be on the basis of • Information content • Possible forms of realization • Some possibilities: • Conjunction • Ellipsis • Embedding • …

  22. Aggregation via conjunction • Without aggregation: • Light rain fell on the 6th. • Light rain fell on the 8th. • With Aggregation: • Light rain fell on the 6th and light rain fell on the 8th. (conjunction) • Light rain fell on the 6th and the 8th.

  23. Aggregation via embedding • Without aggregation: • November had a rainfall of 20mm. • November normally is the wettest month. • With aggregation: • November, which normally is the wettest month, had a rainfall of 20mm this year. • Although November is the wettest month, this November had a rainfall of only 20mm.

  24. Aggregation strategies • Conform to genre conventions and rules, and take account of pragmatic goals: • Ex: making sentences shorter for poor readers • Observe structural properties: • Ex: aggregate only messages that are siblings in the document plan tree.

  25. An aggregation rule S Nucleus Satellite (contrast) S S Msg1 Msg1 Conj S Msg2 although Msg2

  26. Lexicalization • The process of choosing words to communicate the info in messages • When several lexicalizations are possible, consider: • User knowledge and preference • Consistency with previous usage: sometimes, it is better to vary lexemes • Pragmatics

  27. Examples • Light rain • A small amount of rain • It is encouraging that you have many reasons to stop. • It’s good that you have a lot of reasons to stop.

  28. Referring expression generation • How do we identify specific objects and entities? • Two cases: • Initial introduction of an object • Subsequence references to an already mentioned object.

  29. Initial reference A few options: • Use a full name: John Smith • Relate to an entity that is already salient • One of Dr. Klein’s patients • The person who came to the clinic yesterday

  30. Subsequence reference Some possibilities: • Pronouns: He is very determined. • Definite NPs: This person is very determined. • Proper names: John is very talented.

  31. Choosing referring expressions • Some suggestions from the literature: • Use a pronoun if it refers to an entity mentioned in the previous clause, and there is no other entity in the previous clause that this pronoun could refer to. • Otherwise, use a name (a short one if possible) • Otherwise, use a definite NP.

  32. Choosing referring expressions (cont) • Considering genre conventions and the context • Ex: • Nov 2005 • November • This month

  33. Surface realization • Goal: to convert text specification into actual text. • Structural realization: e.g., add html markup • Linguistics realization: • Insert function words • Choose correct inflection • Order words within a sentence • Add punctuation

  34. Linguistics realization • Systemic Grammar • Functional unification grammar • Ex: • Input: (:action rain :tense past :time November :degree little) • Output: it rained little in November

  35. Summary • NLG basics • What is NLG? • Example NLG systems • Applications of NLG • Architecture for NLG • Three subtasks of NLG • Document planning • Microplanning • Surface realization

  36. Beyond text generation • Flat text • Structured text: itemized lists, section, chapter, …. • Text and graphics: e.g., picture with caption • Speech • ….

  37. Resources • SIGGEN (ACL special interest group for Generation): www.siggen.org • Book: “Building NLG systems” by Reiter et. al., Cambridge University Press, 2000. • List of NLG systems: • Companies: • CoGenTex: www.cogentex.com • ERLI: www.erli.com

More Related