Compiling an oral corpus of child language (G.S.C.C)

Gavriilidou Zoe (Democritus University of Thrace)

Elina Chadjipapa (FLEXSEM, Autonomous University of Barcelona)

Anna Giannakopoulou (FLEXSEM, Autonomous University of Barcelona)


PLAN

  • Purposes

  • Definition of a corpus

  • Characteristics

  • Application fields

  • Construction

  • Description

  • Research based on G.S.C.C.

  • Perspectives


PURPOSE

  • CONSTRUCTION OF A REPRESENTATIVE ORAL CORPUS OF PRE-SCHOOL-AGE CHILDREN

  • ITS USE IN FURTHER APPLICATIONS AND RESEARCH


WHAT A CORPUS IS

  • A corpus is a collection of pieces of language text in electronic form, selected according to external criteria to represent, as far as possible, a language or language variety as a source of data for linguistic research (Sinclair, J. 2004, "Intuition and annotation - the discussion continues").

  • The concept of carrying out research on written or spoken texts is not restricted to corpus linguistics (Tony McEnery and Andrew Wilson, "Corpus Linguistics").


Corpora can be either (Borodal, 2002):

  • Tagged: all words have been marked in some way (e.g. evaluation test)

  • Untagged: not processed at all (spontaneous)
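The difference between the two kinds of corpus can be illustrated with a minimal Python sketch. The utterance and the tag set (DET, NOUN, VERB) are invented for illustration and are not taken from the G.S.C.C; the helper names are hypothetical as well.

```python
# Hypothetical example utterance; not taken from the G.S.C.C.
untagged = "the dog runs"

# Tagged version: every word carries a label. The tag set used here
# (DET, NOUN, VERB) is an assumption chosen for illustration only.
tagged = [("the", "DET"), ("dog", "NOUN"), ("runs", "VERB")]


def strip_tags(tagged_tokens):
    """Recover the plain (untagged) text from a tagged utterance."""
    return " ".join(word for word, _tag in tagged_tokens)


def words_with_tag(tagged_tokens, tag):
    """Return the words carrying a given tag; this needs tagged data."""
    return [word for word, t in tagged_tokens if t == tag]


print(strip_tags(tagged))
print(words_with_tag(tagged, "NOUN"))
```

A tagged corpus can always be reduced to an untagged one, while the reverse requires annotation work.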


CHARACTERISTICS OF A CORPUS

  • Representative sample of a language

  • Quantitative and qualitative analysis of the sample

  • Direct and quick enrichment of the sample

  • Electronic availability


APPLICATION FIELDS

  • Teaching

  • Literature

  • Lexicography

  • Linguistics

  • Sociolinguistics

  • Psycholinguistics

  • Computational Linguistics


CONSTRUCTING THE GREEK SPEAKING CHILDREN CORPUS (G.S.C.C)


BASED ON THE CONSTRUCTION RULES

  • Size of the Sample

  • Authenticity of the Corpus

  • Range of the Sample


SIZE OF THE G.S.C.C

151,380 words

Approximately 45 hours of speech

Available at http://utopia.duth.gr/~zgabriil
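As a rough orientation, the two size figures on this slide can be combined into an average speech rate. The sketch below assumes that the word count is to be read as one hundred fifty-one thousand three hundred eighty words and treats the 45 hours as exact, although the slide gives it only as an approximation. The variable names are illustrative.

```python
# Figures from the slide; the hours value is approximate in the source.
TOTAL_WORDS = 151_380
HOURS_OF_SPEECH = 45

# Average density of the recordings (recorded time includes pauses and
# any interviewer speech, so this is not a per-child speaking rate).
words_per_hour = TOTAL_WORDS / HOURS_OF_SPEECH
words_per_minute = words_per_hour / 60

print(f"{words_per_hour:.0f} words per hour")
print(f"{words_per_minute:.1f} words per minute")
```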


DESCRIPTION OF THE G.S.C.C

  • Interviews of 60 children 3-6 years old

  • 35 females

  • 25 males

  • 10 children 3-4 years

  • 35 children 4-5 years

  • 15 children 5-6 years
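The counts on this slide can be cross-checked against the stated total of 60 children. The following is only a small sketch; the dictionary layout and names are assumptions made for illustration.

```python
# Counts as given on the slide.
TOTAL_CHILDREN = 60
by_sex = {"female": 35, "male": 25}
by_age = {"3-4": 10, "4-5": 35, "5-6": 15}


def shares(counts):
    """Convert absolute counts into percentages of their own total."""
    total = sum(counts.values())
    return {group: 100 * n / total for group, n in counts.items()}


# Both breakdowns should add up to the stated number of children.
sex_total_ok = sum(by_sex.values()) == TOTAL_CHILDREN
age_total_ok = sum(by_age.values()) == TOTAL_CHILDREN

for group, pct in shares(by_age).items():
    print(f"{group} years: {pct:.1f}%")
```

The shares show that the 4-5 age group makes up more than half of the sample, which matters when comparing age groups.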


De-recording and Transcription of the interviews

  • 35 Greek speaking children (Standard Greek)

  • 15 Cypriot dialect speaking children

  • Phonetic Transcription of the Interviews (IPA)


CONTENTS (DATA) OF THE INTERVIEWS

  • Children with speech problems

  • Greek-speaking residents (bilingual)

  • Words and phrases used by children 3-6 years old (vocabulary)

  • Idioms from many Greek regions


CORPUS COLLECTION REGIONS

  • Greece

      • Orestiada

      • Aleksandroupolis

      • Kavala

      • Thessalonica

      • Edessa

      • Athens

  • Cyprus


Applications

  • Within the framework of linguistic research

  • Difficulties encountered by children 3-6 years old

  • Level of the language of communication

  • Number of words and level of frozen phrases

  • Basic vocabulary


Research based on G.S.C.C.

  • “Phonological and Phonetic analysis of cases of G.S.C.C.”

  • 23 subjects, 15 females and 8 males

  • SPSS was used for the statistical analysis


AIMS

  • Frequency of phonological errors

  • Frequency of distorted phonemes

  • Comparison between sexes

  • Language acquisition among age groups

  • Context (environment) of the distorted phonemes

  • Accented/unaccented syllables in the word
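The study itself used SPSS. Purely as an illustration of the kind of tabulation these aims imply, the Python sketch below computes error rates by sex and by phoneme. All records are invented toy data, and the record layout and function name are assumptions, not the study's actual coding scheme.

```python
from collections import Counter

# Invented toy records: (child_id, sex, target_phoneme, produced_correctly).
# None of these values come from the G.S.C.C.
records = [
    ("c01", "F", "r", False),
    ("c01", "F", "s", True),
    ("c02", "M", "r", False),
    ("c02", "M", "θ", False),
    ("c03", "F", "r", True),
]


def error_rate_by(records, key_index):
    """Share of incorrectly produced phonemes per group.

    key_index selects the grouping field: 1 for sex, 2 for phoneme.
    """
    totals, errors = Counter(), Counter()
    for rec in records:
        key = rec[key_index]
        totals[key] += 1
        if not rec[3]:
            errors[key] += 1
    return {key: errors[key] / totals[key] for key in totals}


by_sex = error_rate_by(records, 1)
by_phoneme = error_rate_by(records, 2)
print(by_sex)
print(by_phoneme)
```

The same grouping function could be reused for age group or syllable stress by adding the corresponding field to each record.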


FUTURE WORK

  • Enlarge the G.S.C.C

  • Increase the hours of recorded speech

  • Add more dialects

  • Add non-native Greek-speaking children


PERSPECTIVES

Use of the G.S.C.C:

  • In further linguistic analysis, such as syntax, grammar, vocabulary and morphology.

  • As a tool in corrective phonetics for the Greek linguistic system.

  • In other fields of linguistics (psycholinguistics, computational linguistics, sociolinguistics, etc.).


THANK YOU FOR YOUR ATTENTION

