1 / 31

Pushpak Bhattacharyya CSE Dept., IIT Bombay 12 th April, 2011

CS460/626 : Natural Language Processing/Speech, NLP and the Web (Lecture 37– Semantics; Universal Networking Language). Pushpak Bhattacharyya CSE Dept., IIT Bombay 12 th April, 2011. Semantics: wikipedia.

eshe
Download Presentation

Pushpak Bhattacharyya CSE Dept., IIT Bombay 12 th April, 2011

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. CS460/626 : Natural Language Processing/Speech, NLP and the Web(Lecture 37– Semantics; Universal Networking Language) Pushpak BhattacharyyaCSE Dept., IIT Bombay 12th April, 2011

  2. Semantics: wikipedia • Semantics (from Greeksēmantiká, neuter plural of sēmantikós)is the study of meaning. • It typically focuses on the relation between signifiers, such as words, phrases, signs and symbols, and what they stand for, their denotata.

  3. Computational Semantics: wikipedia • Computational semantics is the study of how to automate the process of constructing and reasoning with meaning representations of natural language expressions. • Some traditional topics of interest are: construction of meaning representations, semantic underspecification, anaphora resolution, presupposition projection, and quantifier scope resolution. • Methods employed usually draw from formal semantics or statistical semantics. • Computational semantics has points of contact with the areas of lexical semantics (word sense disambiguation and semantic role labeling), discourse semantics, knowledge representation and automated reasoning (in particular, automated theorem proving). • Since 1999 there has been an ACL special interest group on computational semantics, SIGSEM.

  4. A hurdle: signifier-denotata dichotomy • Divide between a word and what it stands for • “red” is NOT red in colour • “red wine”, “red rose”, “he is in the red” denote very different sense of the word • Translation into another language reveals this difference

  5. A Perpective Discourse Pragmatics Semantics Syntax Lexicon Morphology

  6. Our tryst with semantics: Universal Networking Language (UNL)

  7. Motivation • Extraction of semantics, i.e., deep meaning is important for many applications. • Machine Translation, Meaning-based IR, CLIR • Robust, scalable & efficient methods of knowledge extraction required • Machine Translation and Cross Lingual IR: a need of the hour for crossing language barrier

  8. Interlingua: a vehicle for machine translation Hindi English Interlingua (UNL) Analysis Chinese French generation

  9. UNL: a United Nations project • Started in 1996 • 10 year program • 15 research groups across continents • First goal: generators • Next goal: analysers (needs solving various ambiguity problems) • Current active language groups • UNL_French (GETA-CLIPS, IMAG) • UNL_English+Hindi • UNL_Italian (Univ. of Pisa) • UNL_Portugese (Univ of Sao Paolo, Brazil) • UNL_Russian (Institute of Linguistics, Moscow) • UNL_Spanish (UPM, Madrid)

  10. World-wide Universal Networking Language (UNL) Project Marathi • Language independent meaning representation. English Russian UNL Spanish Japanese Hindi Others

  11. The UNL MT System: an Overview

  12. NLP@IITB

  13. Foundations and Applications • UNL Foundations • Semantic Relations • Universal Words • Attributes • How to write UNL expressions • UNL Applications • Machine Translation: Rule based and Statistical • Search • Text Entailment • Sentiment Analysis

  14. IR: Cross Lingual Search Crawling Indexing Multilingual Relevance Feedback Information Extraction: Part of Speech tagging Named Entity Recognition Shallow Parsing Summarization Language Processing & Understanding Machine Translation: Statistical Interlingua Based EnglishIndian languages Indian languagesIndian languages Indowordnet Machine Learning: Semantic Role labeling Sentiment Analysis Text Entailment (web 2.0 applications) Using graphical models, support vector machines, neural networks Resources: http://www.cfilt.iitb.ac.in Publications: http://www.cse.iitb.ac.in/~pb Linguistics is the eye and computation the body

  15. UNL represents knowledge: John eats rice with a spoon Universal words Semantic relations attributes Repository of 42 Semantic Relations and 84 attribute labels

  16. Sentence embeddings Deepa claimed that she had composed a poem. [UNL] agt(claim.@entry.@past, Deepa) obj(claim.@entry.@past, :01) agt:01(compose.@past.@entry.@complete, she) obj:01(compose.@past.@entry.@complete, poem.@indef) [\UNL]

  17. Constituents of Universal Networking Language • Universal Words (UWs) • Relations • Attributes • Knowledge Base

  18. forward(icl>send) @ entry @ past agt gol obj he(icl>person) minister(icl>person) @def mail(icl>collection) @def UNL Graph He forwarded the mail to the minister.

  19. UNL Expression agt (forward(icl>send).@ entry @ past, he(icl>person)) obj (forward(icl>send).@ entry @ past, minister(icl>person)) gol (forward(icl>send ).@ entry @ past, mail(icl>collection). @def)

  20. What is a Universal Word (UW)? • Words of UNL • Constitute the UNL vocabulary, the syntactic-semantic units to form UNL expressions • A UW represents a concept • Basic UW (an English word/compound word/phrase with no restrictions or Constraint List) • Restricted UW (with a Constraint List ) • Examples: “crane(icl>device)” “crane(icl>bird)”

  21. TheLexicon Format of the dictionary entry e.g., [minister] {}“minister(icl>person)”(N,ANIMT,PHSCL,PRSN); • Head word • Universal word • Attributes • Morphological - Pl(plural), V_ed(past tense form) • Syntactic - V(verb),VOA(verb of action) • Semantic - ANIMT(animate), PLACE, TIME [headword] {}“Universal word“(Attribute list);

  22. TheLexicon (cntd) He forwarded the mail to the minister. Content words: [forward] {} “forward(icl>send)” (V,VOA) <E,0,0>; [mail] {}“mail(icl>message)”(N,PHSCL,INANI) <E,0,0>; [minister] {}“minister(icl>person)” (N,ANIMT,PHSCL,PRSN) <E,0,0>; Headword Universal Word Attributes

  23. TheLexicon (cntd) He forwarded the mail tothe minister. function words: [he] {} “he” (PRON,SUB,SING,3RD) <E,0,0>; [the] {} “the” (ART,THE) <E,0,0>; [to] {} “to” (PRE,#TO) <E,0,0>; Headword Universal Word Attributes

  24. Hindi example: संज्ञा का उदाहरण १/२ सार्वभौम शब्द मुख्य शब्द गुण farmer farmer(icl>creator) E N,ANIMT,FAUNA,MML,PRSN शेतकरी M N,M,ANIMT,FAUNA,MML,PRSN किसान H N,M,ANIMT,FAUNA,MML,PRSN,Na

  25. The Features of a UW • Every concept existing in any language must correspond to a UW • The constraint list should beas small as necessary to disambiguate the headword • Every UW should be defined in the UNL Knowledge-Base

  26. Restricted UWs • Examples • He will hold office until the spring of next year. • The spring was broken. • Restricted UWs, which are Headwords with a constraint list, for example: “spring(icl>season)” “spring(icl>device)” “spring(icl>jump)” “spring(icl>fountain)”

  27. How to create UWs? • Pick up a concept • the concept of “crane" as "a device for lifting heavy loads” or as “a long-legged bird that wade in water in search of food” • Choose an English word for the concept. • In the case for “crane", since it is a word of English, the corresponding word should be ‘crane' • Choose a constraint list for the word. • [ ] ‘crane(icl>device)' • [ ] ‘crane(icl>bird)'

  28. How to create UNL expressions

  29. English sentences: basic structure verb • A <verb> B • John eats bread • agt(eat.@entry, John) • obj(eat.@entry, bread) • A <verb> • John sleeps • aoj(sleep.@entry, John) • A <be> B • John is good • aoj(good.@entry, John) R2 R1 A B R2 verb R1 A B aoj A

  30. Hindi sentences: basic structure verb • A B <verb> • John rotikhaataahai • agt(eat.@entry, John) • obj(eat.@entry, bread) • A <verb> • John sotaahai • aoj(sleep.@entry, John) • A <be> B • John acchaahai • aoj(good.@entry, John) R2 R1 A B R2 verb R1 A B aoj A

  31. Complex English sentences: Use recursion on the basic structure eat A <verb> B • John who is a good boy eats bread which is toasted • agt(eat.@entry, :01) • obj(eat.@entry, :02) • aoj:01(boy, John.@entry) • mod:01(boy, good) • obj:01(toast, bread.@entry.@focus) agt obj :01 :02 :02 :01 toast boy aoj mod obj John good Bread Red arrows indicate entry nodes

More Related