Home > Research > Publications & Outputs > The Trinity Lancaster Corpus


Text available via DOI:

View graph of relations

The Trinity Lancaster Corpus: Development, Description and Application

Research output: Contribution to journalJournal articlepeer-review

<mark>Journal publication date</mark>24/09/2019
<mark>Journal</mark>International Journal of Learner Corpus Research
Issue number2
Number of pages33
Pages (from-to)126-158
Publication StatusPublished
<mark>Original language</mark>English


This paper introduces a new corpus resource for language learning research, the Trinity Lancaster Corpus (TLC), which contains 4.2 million words of interaction between L1 and L2 speakers of English. The corpus includes spoken production from over 2,000 L2 speakers from different linguistic and cultural backgrounds at different levels of proficiency engaged in two to four tasks. The paper provides a description of the TLC and places it in the context of current learner corpus development and research. The discussion of practical decisions taken in the construction of the TLC also enables a critical reflection on current methodological issues in corpus construction.