Svetla Koeva, Svetlozara Lesseva, Ivelina Stoyanova. 6th INTEX Workshop, Sofia, 28-30 May

INTEX as an educational subject in the Master's program in Computational Linguistics at Sofia University.

Presentation Transcript


  1. INTEX as an educational subject in the Master's program in Computational Linguistics at Sofia University. Svetla Koeva, Svetlozara Lesseva, Ivelina Stoyanova. 6th INTEX Workshop, Sofia, 28-30 May 2003

  2. The beginning • INTEX was included in the curriculum of the Master's program in Computational Linguistics in the academic year 2001-2002. http://compling.ibl.bas.bg • INTEX is used in teaching the subject Computer Systems for NLP.

  3. Main goals • To expand the students' competence in formal linguistic representation. • To help students grasp the theoretical complexity and scope of linguistic phenomena. • To develop the students' competence in finite-state automata and their application in natural language processing.

  4. Standard tasks for all students • Enhancing the existing FST for delimiting sentence boundaries • FSTs for the analytic cardinal and ordinal numerals in Bulgarian • FSTs for dates written with Latin (Roman) and Arabic numerals
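As a rough illustration of what a date FST like the one on slide 4 has to accept, the sketch below uses a Python regular expression, which is equivalent in power to a finite-state automaton. The date layout (day.month.year, with the month written as either an Arabic or a Roman numeral) and the names `DATE_RE` and `find_dates` are assumptions made for this example; they are not the graphs built in the course.

```python
import re

# Roman numerals I..XII for the month (assumed notation, e.g. 28.V.2003).
ROMAN_MONTH = r"(?:XII|XI|IX|X|VIII|VII|VI|IV|V|III|II|I)"
# Arabic month 1..12 with an optional leading zero (e.g. 28.05.2003).
ARABIC_MONTH = r"(?:0?[1-9]|1[0-2])"
# Day 1..31 with an optional leading zero.
DAY = r"(?:0?[1-9]|[12][0-9]|3[01])"

DATE_RE = re.compile(
    rf"\b(?P<day>{DAY})\.(?P<month>{ROMAN_MONTH}|{ARABIC_MONTH})\.(?P<year>\d{{4}})\b"
)

def find_dates(text):
    """Return (day, month, year) string triples for every date found in text."""
    return [(m["day"], m["month"], m["year"]) for m in DATE_RE.finditer(text)]
```

An INTEX graph would express the same alternatives as parallel paths through the transducer; the regular expression only makes the accepted language explicit.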

  5. Individual tasks • DELAF and DELACF dictionaries for: historical periods and events, names of institutions, names of companies. • DELAF and DELACF dictionaries for: chemical compound terms, botanical and zoological terms, toponyms, abbreviations.

  6. Individual tasks • DELACF dictionary of phraseologisms. • DELACF dictionary of frozen expressions. • Decision-making in presenting the paradigms.

  7. Some examples • Modifier + noun head: carbon dioxide
     въглероден диоксид,въглероден диоксид.N+M:s
     въглеродния диоксид,въглероден диоксид.N+M:sh
     въглеродният диоксид,въглероден диоксид.N+M:sl

  8. Some examples • Metal oxide:
     метален оксид,метален оксид.N+M:s
     металния оксид,метален оксид.N+M:sh
     металният оксид,метален оксид.N+M:sl
     метални оксиди,метален оксид.N+M:p
     металните оксиди,метален оксид.N+M:pd
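The entries on slides 7 and 8 follow the pattern inflected form, comma, lemma, period, grammatical codes. A minimal sketch of reading such lines into structured records is given below. The `Entry` class and `parse_entry` function are hypothetical names introduced for illustration, the inflection codes (`s`, `sh`, `sl`, `p`, `pd`) are kept as opaque strings, and the parser assumes no escaped commas or periods inside the forms.

```python
from dataclasses import dataclass

@dataclass
class Entry:
    form: str        # inflected form as it appears in text
    lemma: str       # dictionary (citation) form
    category: str    # first grammatical code, e.g. "N"
    features: list   # codes joined with "+" after the category, e.g. ["M"]
    inflection: str  # code after ":", e.g. "sh"

def parse_entry(line):
    """Parse one 'form,lemma.CAT+FEAT:infl' line into an Entry."""
    form, rest = line.split(",", 1)             # first comma separates form and lemma
    lemma, codes = rest.rsplit(".", 1)          # last period starts the code section
    grammar, _, inflection = codes.partition(":")
    category, *features = grammar.split("+")
    return Entry(form.strip(), lemma.strip(), category, features, inflection)
```

For example, `parse_entry("металните оксиди,метален оксид.N+M:pd")` yields the form `металните оксиди`, the lemma `метален оксид`, category `N`, features `["M"]` and inflection code `pd`.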

  9. Individual tasks • FSTs for the analytic forms in the grammatical paradigms of verbs, nouns and adjectives. • FSTs for recognizing analytic verb forms in the indicative mood, active voice: present perfect, pluperfect, future, future perfect, future in the past, and future perfect in the past. • The students are currently developing FSTs for the passive voice of the indicative mood in all tenses and for conditional mood forms in all tenses.
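To give a feel for what recognizing an analytic verb form involves, here is a toy sketch for one tense only, the Bulgarian present perfect (a present-tense form of the auxiliary "съм" next to a past active participle, in either order). It is not the course's graph: the function names are hypothetical, and the suffix test in `is_participle` is a crude assumption standing in for the dictionary lookup an INTEX grammar would use, so it will also accept ordinary nouns ending in "л".

```python
# Present-tense forms of the auxiliary "съм" (to be).
AUX = {"съм", "си", "е", "сме", "сте", "са"}
# Typical past active participle endings (heuristic, see the note above).
PARTICIPLE_ENDINGS = ("л", "ла", "ло", "ли")

def is_participle(token):
    """Heuristic stand-in for a dictionary lookup of participles."""
    return len(token) > 2 and token.endswith(PARTICIPLE_ENDINGS)

def find_perfect(tokens):
    """Return (start, end) index pairs of adjacent auxiliary/participle pairs."""
    spans = []
    i = 0
    while i < len(tokens) - 1:
        a, b = tokens[i].lower(), tokens[i + 1].lower()
        # Accept both word orders: "съм чел" and "чел съм".
        if (a in AUX and is_participle(b)) or (is_participle(a) and b in AUX):
            spans.append((i, i + 2))
            i += 2
        else:
            i += 1
    return spans
```

Calling `find_perfect("аз съм чел книгата".split())` marks tokens 1 to 2 ("съм чел"). The other tenses listed on the slide, and forms with material between the auxiliary and the participle, would need additional paths.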

  10. Individual tasks • Negative tensed forms are a particular case: negative forms are always analytic, so FSTs are also devised for tenses that otherwise have synthetic formation. Negation and question patterns have a particular word order, which has to be considered as well. • Some examples

  11. Future Perfect in the Past in Bulgarian

  12. Perfect tense in Bulgarian

  13. Result of the application of analytic_tenses.grf

  14. Tasks • Expanding the existing linguistic corpus • Unifying the notations • Developing larger and richer libraries of dictionaries and FSTs

  15. Master's theses • A master's thesis was written on the representation of Bulgarian compound nouns in INTEX. • This year students are going to use INTEX to analyze recognition errors for a grammar checker and for OCR correction.

  16. Future directions • Introducing a wider range of Bulgarian researchers to INTEX. • Applying INTEX in a wider range of activities. • Enhancing the system with more resources. • Cooperating in expanding the functionality of INTEX.
