Corpus-informed exercises for learners of English: the TestBuilder program. INTER 2002 “Language Technology”, Łódź, 21 June 2002. Aleksandra Wojnowska & Przemysław Kaszubski, School of English, Adam Mickiewicz University, Poznań, Poland
Why TestBuilder (TB) was created: • 'real language' (corpus-based) • fast computerised search for test items • lexicogrammatical context • frequency information (corpus-driven) • one tool: from e-text to test output • highly user-controlled (corpus, query, output editing, etc.) INTER 2002, Łódź, Poland
Miscellaneous & TB-supported test types: • Multiple choice • Cloze (every n-th word deleted) • Gap-filling / Completion (meaningful words deleted) • Hangman • Paraphrasing • Transformation • Matching • Sentence re-ordering • Finding errors • Word-building
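The cloze type listed above (every n-th word deleted) can be illustrated with a short Python sketch. This is not TestBuilder's own code: the function name `make_cloze`, the gap marker and the whitespace-based tokenization are all assumptions made for illustration.

```python
import string


def make_cloze(text, n, gap="_____"):
    """Replace every n-th word of `text` with a gap.

    Hypothetical helper, not part of TestBuilder.
    Returns the gapped text and the list of deleted words (the answer key).
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    gapped, answers = [], []
    # Naive tokenization: split on whitespace only (an assumption).
    for position, token in enumerate(text.split(), start=1):
        # Separate the word from surrounding punctuation so that
        # commas and full stops stay visible in the test item.
        core = token.strip(string.punctuation)
        if position % n == 0 and core:
            answers.append(core)
            gapped.append(token.replace(core, gap, 1))
        else:
            gapped.append(token)
    return " ".join(gapped), answers


item, key = make_cloze("The quick brown fox jumps over the lazy dog.", 3)
print(item)
print(key)
```

A gap-filling exercise differs only in how the deleted words are chosen: instead of counting positions, the teacher (or a query) selects the meaningful words to remove.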
TestBuilder queries: overview. Single or multiple occurrences and/or strings of: • word(s) • prefixes, suffixes, infixes, other part-words • word-tag pairs • (fixed) phrases • collocations & collocational frameworks • colligations • tag combinations (contiguous / non-contiguous) • words/tags at (a specific distance from) sentence-initial / sentence-final positions • sentences of specific word length
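As a rough illustration of one query type from this list, the Python sketch below finds sentences containing words with a given suffix and counts how often each matching word occurs. It is only a sketch under stated assumptions: the function name `find_suffix_items`, the plain-text corpus and the punctuation-based sentence splitting are hypothetical and do not reflect TestBuilder's actual query syntax or implementation.

```python
import re
from collections import Counter


def find_suffix_items(corpus, suffix):
    """Return candidate test sentences and word frequencies for a suffix query.

    Hypothetical helper, not part of TestBuilder.
    """
    # Naive sentence splitting on ., ! or ? followed by whitespace (an assumption).
    sentences = re.split(r"(?<=[.!?])\s+", corpus.strip())
    # At least one word character must precede the suffix; matching ignores case.
    pattern = re.compile(r"\b\w+" + re.escape(suffix) + r"\b", re.IGNORECASE)
    hits, freq = [], Counter()
    for sentence in sentences:
        words = pattern.findall(sentence)
        if words:
            hits.append(sentence)  # one sentence per candidate test item
            freq.update(word.lower() for word in words)
    return hits, freq


corpus = "She spoke quickly. He left. They quietly and quickly agreed."
hits, freq = find_suffix_items(corpus, "ly")
print(hits)
print(freq.most_common())
```

The counter gives the kind of frequency feedback a test author could use to decide which matching words are common enough in the corpus to be worth testing.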
TestBuilder queries: other features • query building • query batch list • frequency feedback
TestBuilder: Help files
TestBuilder: corpus selection & pre-processing screen
TestBuilder: main screen
TestBuilder: frequency information table for a query
TestBuilder: gap editing
TB: Current limitations & caveats: • one sentence per test item • no case sensitivity • no punctuation search • ineffective negative operator (!) in combinations • modest display (small fonts, no line wrapping) • tagging: 8.3 filenames only • ‘evidence as good as corpus’ • imperfect tokenization & tagging accuracy
TB: About the author. Aleksandra Wojnowska, third-year student, School of English (IFA), Poznań, woola@ifa.amu.edu.pl. This presentation will shortly be available from P. Kaszubski’s corpus linguistics seminar page: http://main.amu.edu.pl/~przemka/diplsem1.html