1 / 2

The Diachronic Electronic Corpus of Tyneside English

The Diachronic Electronic Corpus of Tyneside English. What we have: The Newcastle Electronic Corpus of Tyneside English (NECTE: http://research.ncl.ac.uk/necte/ ) NECTE is an AHRC-funded corpus of dialect speech from Tyneside.

maxim
Download Presentation

The Diachronic Electronic Corpus of Tyneside English

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. The Diachronic Electronic Corpus of Tyneside English What we have: The Newcastle Electronic Corpus of Tyneside English (NECTE: http://research.ncl.ac.uk/necte/) NECTE is an AHRC-funded corpus of dialect speech from Tyneside. It is based on two existing corpora, one from the 1960s and the other from 1994. It amalgamates the legacy corpora into a single TEI-conformant XML-encoded corpus and makes them available in a variety of formats: digitized audio, standard orthographic transcription, phonetic transcription, and part-of-speech tagged, all time-aligned. 2. What we’re developing: The Diachronic Electronic Corpus of Tyneside English (DECTE: http://research.ncl.ac.uk/decte/) DECTE, also AHRC-funded, updates and extends NECTE. It will incorporate about 100 additional interviews from 2007-current. It also incorporates thematic mark-up and associated graphical material. The aim is to make DECTE usable by the general public, the cultural industries, and by all levels of the education sector from primary to higher.

  2. The Diachronic Electronic Corpus of Tyneside English 3. What we would like • Not to have wasted our time, that is, to have NECTE / DECTE used and to remain usable for the foreseeable future. • For the language corpus community to converge on a set of formatting, archiving, and access standards, that is, to reverse the babble of Babel. • To develop a wider range of analytical tools usable by corpora which adhere to these standards.

More Related