Information-Analytical System “Manuscript”: technologies and tools of creation of electronic col...
Sponsored Links
This presentation is the property of its rightful owner.
1 / 29

Victor BARANOV Linguistics Department Izhevsk State Technical University PowerPoint PPT Presentation


  • 117 Views
  • Uploaded on
  • Presentation posted in: General

Information-Analytical System “Manuscript”: technologies and tools of creation of electronic collections of ancient and medieval documents. Victor BARANOV Linguistics Department Izhevsk State Technical University Laboratory of Computer-Aided Philological Research Udmurtia State University.

Download Presentation

Victor BARANOV Linguistics Department Izhevsk State Technical University

An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -

Presentation Transcript


Information-Analytical System “Manuscript”: technologies and tools of creation of electronic collections of ancient and medieval documents

  • Victor BARANOV

  • Linguistics Department

  • Izhevsk State Technical University

  • Laboratory of Computer-Aided Philological Research

  • Udmurtia State University


Title page of the portalof IAS “Manuscript”

Digital Historical Corpora


Model of hierarchies and subnets of manuscript and text units

Digital Historical Corpora


Net of linguistic relationships

Text

<…> се быша дроузи мои .<…>

Εnd of the “single" relationship

Relationship

Predicate part

се быша дроузи мои .

Εnd of the “multiple" relationship

Средство связи

Mean of relationship

Syntactic group

се

быша дроузи мои

Word-form

се

быша

дроузи

мои

Word-combination

Дроузи мои

быша дроузи

Co-ordination

се быша дроузи

Dependence

с

е

б

ы

ш

а

д

р

оу

з

и

м

о

и

.

Digital Historical Corpora


Model of the Manuscript system

Digital Historical Corpora


Editor OldEd: main panels

Digital Historical Corpora


Editor OldEd: Text input and editing

Digital Historical Corpora


Editor OldEd: Fragmentation of the manuscript texts into units and relationships with the dictionary units

Dictionary of fragments

Properties of fragments

Fragments

Digital Historical Corpora


Editor OldEd: Visualization of unit relationships

Symbol

Geometric hierarchy:

Line

Page

Linguistic hierarchy:

word-form

normalize forms

Dictionary:

Lemma

Properties and values of the Lemma

Dictionary:

word-forms of texts

Digital Historical Corpora


Editor OldEd: Page layout

Digital Historical Corpora


Result of creation of the layout on the site

Marginalia

Marginalia

Marginalia

Digital Historical Corpora


Automated lemmatization and establishing relationships between words and lemmas

Digital Historical Corpora


Electronic edition: search page

Collections & Manuscripts

Search criteria

Search result

Digital Historical Corpora


Search result: word index and concordance

Digital Historical Corpora


Module of retrievals: selection of the text

Digital Historical Corpora


Module of retrievals: selection of the unit

Digital Historical Corpora


Module of retrievals: setting the unit properties and values

Digital Historical Corpora


Module of retrievals: saving the query

Digital Historical Corpora


Module of retrievals: specifying the compositionof the query result

Digital Historical Corpora


Comparative index of the wordforms

Digital Historical Corpora


Comparative index of the fragments

Digital Historical Corpora


Grammar dictionaries

Grammar dictionary of the modern Russian language

Grammar dictionary of the Old Russian language

Grammar dictionary of the Old Slavonic language

Grammar dictionarypseudo-elements

Text N

Text 6

Text 5

Text 4

Text 3

Text 2

Text 1

Digital Historical Corpora


Grammar dictionaries: retrieval form

Digital Historical Corpora


Grammar dictionaries: bringing the Old Russian word-forms to the lemma

Digital Historical Corpora


Grammar dictionaries: оbtaining paradigm of lemma

Digital Historical Corpora


Electronic editions

Digital Historical Corpora


Electronic edition:reverse index of word-forms and context

Digital Historical Corpora


Acknowledgment

The work on the creation of IRS Manuscript is being carried out with the support from the Russian Foundation of Basic Research

(Grant # 05-07-90217в).

Τhe work on the creation of the automated morphologic analyzer with the support of the Russian Foundation for the Humanities

(Grant # 05-04-12408в).

Digital Historical Corpora


Contacts

Laboratory of Computer-Aided

Philological Research

Udmurtia State University

Linguistics Department

Izhevsk State Technical University

Izhevsk, Russia

Victor Baranov - [email protected]

http://manuscripts.ru/index_en.html

Digital Historical Corpora


  • Login