Building LANA-CASE, a spoken corpus of American English conversation - Research Portal

Linguistics and English Language

Electronic data

Hanks_et_al_Abstract
Accepted author manuscript, 76.9 KB, PDF document
Available under license: CC BY: Creative Commons Attribution 4.0 International License

Text available via DOI:

https://doi.org/10.32714/ricl.12.02.03
Final published version
Available under license: CC BY: Creative Commons Attribution 4.0 International License

Building LANA-CASE, a spoken corpus of American English conversation: Challenges and innovations in corpus compilation

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

Elizabeth Hanks
Anthony McEnery
Jesse Egbert
Tove Larsson
Douglas Biber
Randi Reppen
Paul Baker
Vaclav Brezina
Gavin Brookes
Isobelle Clarke
Raffaella Bottini

Journal publication date	31/10/2024
Journal	Research in Corpus Linguistics
Issue number	2
Volume	12
Number of pages	21
Pages (from-to)	24-44
Publication Status	Published
Early online date	3/03/24
Original language	English

Abstract

The Lancaster-Northern Arizona Corpus of Spoken American English (LANA-CASE) is a collaborative project between Lancaster University and Northern Arizona University to create a publicly available, large-scale corpus of American English conversation. In this article, we describe the design of LANA-CASE in terms of the challenges that have arisen and how these have been addressed – including decisions related to operationalizing the domain, sampling the data, recruiting participants, and selecting instruments for data collection. In addressing these challenges, we were able to draw on and further develop strategies established in the creation of other spoken corpora (including the British English counterpart to LANA-CASE, the Spoken British National Corpus 2014) as well as to implement recent theoretical and technical innovations related to each step. We hope that this discussion can inform future projects focused on the design and construction of spoken corpora.
