Project Overview: Technology Based Assessment of Language and Literacy. The TBALL Project Data Collection: Making a Young Children's Speech Corpus
Technology Based Assessment of Language and Literacy
The TBALL Project Data Collection: Making a Young Children's Speech Corpus
Abe Kazemzadeh*, Hong You+, Markus Iseli+, Barbara Jones+, Xiaodong Cui+, Margaret Heritage+, Patti Price^, Elaine Anderson*, Shrikanth Narayanan*, and Abeer Alwan+
* University of Southern California, + University of California Los Angeles, and ^ PPRICE Speech and Language Techology
Results and Observations
Data Collection Motivation
Native Language Distribution of Recorded Subjects
Wizard of Oz Interface
Language Background Effects
Our recording setup was similar to our target application:
Higher Level Phenomena
This project is supported in part by the NSF.
In addition, this work would not be possible with out the hard work of transcribers Daylen Riggs and Nathan Go; the patience and bilingualism of Kimberly Reynolds and Blanca Martinez; the careful recordings of Erdem Unal, Vivek Rangarajan, Shiva Sundaram, Yirong Yang, Jinjin Ye, and Yijian Bai; and the planning of Larry Casey and Christy Boscardin.