
Compiling an oral corpus of child language (G.S.C.C)

Gavriilidou Zoe (Democritus University of Thrace), Elina Chadjipapa (FLEXSEM, Autonomous University of Barcelona), Anna Giannakopoulou (FLEXSEM, Autonomous University of Barcelona)


Presentation Transcript


  1. Compiling an oral corpus of child language (G.S.C.C) Gavriilidou Zoe (Democritus University of Thrace), Elina Chadjipapa (FLEXSEM, Autonomous University of Barcelona), Anna Giannakopoulou (FLEXSEM, Autonomous University of Barcelona)

  2. PLAN • Purposes • Definition of a corpus • Characteristics • Application fields • Construction • Description • Research based on G.S.C.C. • Perspectives

  3. Purpose • CONSTRUCTION OF A REPRESENTATIVE ORAL CORPUS OF PRE-SCHOOL-AGE CHILDREN • ITS USE IN FURTHER APPLICATIONS AND RESEARCH

  4. WHAT A CORPUS IS • A corpus is a collection of pieces of language text in electronic form, selected according to external criteria to represent, as far as possible, a language or language variety as a source of data for linguistic research (Sinclair, J. 2004, "Intuition and annotation - the discussion continues"). • The concept of carrying out research on written or spoken texts is not restricted to corpus linguistics (McEnery, T. and Wilson, A., "Corpus Linguistics").

  5. Corpora can be either (Borodal, 2002): • Tagged: all words have been marked in some way, e.g. evaluation test • Untagged: not processed at all (spontaneous)
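As a rough illustration of the tagged/untagged distinction on slide 5, the sketch below represents the same utterance both ways. The utterance, the toy lexicon, the tag labels, and the helper name `tag_tokens` are all hypothetical; none of them come from the G.S.C.C.

```python
# Hypothetical illustration only: the utterance and tag set are invented,
# not taken from the G.S.C.C.

# Untagged form: the raw text, not processed at all.
untagged = "the dog eats"

# Toy lexicon mapping word forms to part-of-speech labels (an assumption).
LEXICON = {"the": "DET", "dog": "NOUN", "eats": "VERB"}


def tag_tokens(text, lexicon):
    """Split text on whitespace and pair each token with its tag.

    Tokens missing from the lexicon receive the placeholder tag 'UNK'.
    """
    return [(token, lexicon.get(token, "UNK")) for token in text.split()]


# Tagged form: every word carries a label.
tagged = tag_tokens(untagged, LEXICON)
print(tagged)
```

A real tagged corpus would use a documented tag set and usually a trained tagger rather than a lookup table; the lookup is only the simplest way to show what "marked in some way" means.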

  6. CHARACTERISTICS OF A CORPUS • Representative sample of a language • Quantitative and qualitative analysis of the sample • Direct and quick enrichment of the sample • Electronic availability

  7. APPLICATION FIELDS • Teaching • Literature • Lexicography • Linguistics • Sociolinguistics • Psycholinguistics • Computational Linguistics

  8. CONSTRUCTING THE GREEK SPEAKING CHILDREN CORPUS (G.S.C.C)

  9. BASED ON THE CONSTRUCTION RULES • Size of the Sample • Authenticity of the Corpus • Range of the Sample

  10. SIZE OF THE G.S.C.C • 151,380 words • Approximately 45 hours of speech • Available at http://utopia.duth.gr/~zgabriil

  11. DESCRIPTION OF THE G.S.C.C • Interviews of 60 children 3-6 years old • 35 females • 25 males • By age: 10 children aged 3-4, 35 children aged 4-5, 15 children aged 5-6

  12. De-recording and Transcription of the interviews • 35 Greek-speaking children (Standard Greek) • 15 Cypriot-dialect-speaking children • Phonetic Transcription of the Interviews (IPA)

  13. CONTENTS (DATA) OF THE INTERVIEWS • Children with speech problems • Greek-speaking residents (bilingual) • Words and phrases used by children 3-6 years old (vocabulary) • Idioms from many Greek regions

  14. COLLECTION REGIONS OF THE CORPUS • Greece: Orestiada, Aleksandroupolis, Kavala, Thessalonica, Edessa, Athens • Cyprus

  15. Applications • Within the frame of linguistic research • Difficulties encountered by children aged 3-6 • Level of the language used in communication • Number of words and level of frozen phrases • Basic vocabulary

  16. Research based on G.S.C.C. • “Phonological and Phonetic analysis of cases of G.S.C.C.” • 23 subjects, 15 females and 8 males • SPSS was used for the statistical analysis

  17. AIMS • Frequency of phonological errors • Frequency of distorted phonemes • Comparison between sexes • Language acquisition among age groups • Context (environment) of the distorted phonemes • Accented/unaccented syllables in the word
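The kind of comparison listed in these aims (error frequency by sex and by age group) can be sketched as follows. The presentation states that SPSS was used for the statistical analysis; this Python version is only an illustrative stand-in. The records, the field names, and the helper `error_rate_by` are hypothetical and are not data or methods from the G.S.C.C. study.

```python
from collections import defaultdict

# Hypothetical records invented for illustration; NOT data from the G.S.C.C.
# Each record holds one child's number of transcribed phonemes and errors.
records = [
    {"child": "c01", "sex": "F", "age_group": "3-4", "phonemes": 200, "errors": 30},
    {"child": "c02", "sex": "M", "age_group": "3-4", "phonemes": 150, "errors": 30},
    {"child": "c03", "sex": "F", "age_group": "5-6", "phonemes": 250, "errors": 10},
    {"child": "c04", "sex": "M", "age_group": "5-6", "phonemes": 100, "errors": 5},
]


def error_rate_by(rows, key):
    """Pool errors and phonemes per group, then return errors / phonemes."""
    totals = defaultdict(lambda: [0, 0])  # group -> [errors, phonemes]
    for row in rows:
        totals[row[key]][0] += row["errors"]
        totals[row[key]][1] += row["phonemes"]
    return {group: errors / phonemes for group, (errors, phonemes) in totals.items()}


by_sex = error_rate_by(records, "sex")        # comparison between sexes
by_age = error_rate_by(records, "age_group")  # comparison among age groups

for label, rates in (("sex", by_sex), ("age group", by_age)):
    for group, rate in sorted(rates.items()):
        print(f"{label} {group}: {rate:.3f} errors per phoneme")
```

Pooling counts within each group before dividing weights children by how much speech they produced; averaging per-child rates instead would be an equally defensible choice and can give different numbers.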

  18. FUTURE WORK • Enlarge the G.S.C.C • Increase the hours of recorded speech • Add more dialects • Add non-native Greek-speaking children

  19. PERSPECTIVES Use of the G.S.C.C: • In further linguistic analysis, such as syntax, grammar, vocabulary and morphology. • As a tool in Corrective Phonetics for the Greek linguistic system. • In further applications in other fields of Linguistics (Psycholinguistics, Computational Linguistics, Sociolinguistics, etc.)

  20. THANK YOU FOR YOUR ATTENTION
