The Story of the Learner Corpus LINDSEI_CZ
Article
Permanent link
http://hdl.handle.net/20.500.11956/97524
Identifiers
ISSN: 2336-6702
Collections
- Issue 2 [10]
Author
Date of publication
2017
Publisher
Univerzita Karlova, Filozofická fakulta
Source document
Studie z aplikované lingvistiky - Studies in Applied Linguistics
Year of publication of the periodical: 2017
Volume of the periodical: 8
Issue of the periodical: 2
Rights and licence terms
http://creativecommons.org/licenses/by-nc-nd/2.0/
Keywords (English)
corpus methodology, learner corpora, learner corpus linguistics, LINDSEI, spoken corpora
The article presents the recently completed Czech subcorpus of the multinational learner corpus of advanced spoken English LINDSEI and aims to draw attention to some of the methodological concerns the field of learner corpus linguistics faces. First, it describes the Louvain family of learner corpora, where this project originated, and provides a detailed description of LINDSEI, its history, design, structure, transcription system and metadata. It then outlines the nature of the Czech subcorpus LINDSEI_CZ, telling the story of its compilation and providing a quantitative description of the corpus size, task sizes and learner variables, as well as a description of the transcription process. The core part of this text discusses methodological concerns affecting learner corpus design and construction and deals with such issues as task design, recording instructions, the matter of learner-participant proficiency, and the transcription system employed. It concludes with a consideration of various methodological suggestions and offers the possible view that, despite certain weaknesses, LINDSEI is an invaluable source of highly authentic learner data. The last section provides a thematic categorisation of existing studies on LINDSEI and concludes with descriptions of some future projects. The article calls for a thorough reconsideration of learner corpus design and practice and for the formulation of compilation and research standards which would lead to an increase in the reliability and exploitation potential of learner corpora.