First catch your corpus - Research Portal

Linguistics and English Language

Associated organisational unit

DisTex - Discourse and Text Research Group

Electronic data

Sealey&Pak-Corpora-13-2
Rights statement: This is an Accepted Manuscript of an article published by Edinburgh University Press in Corpora. The Version of Record is available online at: https://www.euppublishing.com/doi/abs/10.3366/cor.2018.0145
Accepted author manuscript, 306 KB, PDF document
Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

Text available via DOI:

https://doi.org/10.3366/cor.2018.0145
Final published version

View graph of relations

First catch your corpus: methodological challenges in constructing a thematic corpus

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

Standard

First catch your corpus: methodological challenges in constructing a thematic corpus. / Sealey, Alison Jean ; Pak, Christopher Adams.
In: Corpora, Vol. 13, No. 2, 01.08.2018, p. 229-254.

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Bibtex

@article{fd446a8998b647f48d1523c1fd53e77d,

title = "First catch your corpus: methodological challenges in constructing a thematic corpus",

abstract = "This paper describes the process by which we have constructed a corpus of heterogeneous texts about non-human animals. It aims to contribute both methodologically – in respect of the challenges of compiling a thematic corpus – and substantively – in relation to the identification of some features of discourse about animals. Having introduced the research project and its guiding questions, the article describes the principles of data selection and the procedures used in analysis. We highlight the methods we devised both to avoid the potential circularity associated with pre-determined search terms, and to overcome the limitations of a relatively small corpus containing a wide range of relevant vocabulary. We go on to report some initial findings on the most frequent animal naming terms and adjectives describing them, including a small case study of the adjectives {\textquoteleft}live{\textquoteright} and {\textquoteleft}dead{\textquoteright}. The article concludes by indicating the ways in which the iterative methods we have employed are open to further extension, and points to some methodological and substantive implications of this enterprise.",

author = "Sealey, {Alison Jean} and Pak, {Christopher Adams}",

note = "This is an Accepted Manuscript of an article published by Edinburgh University Press in Corpora. The Version of Record is available online at: https://www.euppublishing.com/doi/abs/10.3366/cor.2018.0145",

year = "2018",

month = aug,

day = "1",

doi = "10.3366/cor.2018.0145",

language = "English",

volume = "13",

pages = "229--254",

journal = "Corpora",

issn = "1749-5032",

publisher = "Edinburgh University Press",

number = "2",

}

RIS

TY - JOUR

T1 - First catch your corpus

T2 - methodological challenges in constructing a thematic corpus

AU - Sealey, Alison Jean

AU - Pak, Christopher Adams

N1 - This is an Accepted Manuscript of an article published by Edinburgh University Press in Corpora. The Version of Record is available online at: https://www.euppublishing.com/doi/abs/10.3366/cor.2018.0145

PY - 2018/8/1

Y1 - 2018/8/1

N2 - This paper describes the process by which we have constructed a corpus of heterogeneous texts about non-human animals. It aims to contribute both methodologically – in respect of the challenges of compiling a thematic corpus – and substantively – in relation to the identification of some features of discourse about animals. Having introduced the research project and its guiding questions, the article describes the principles of data selection and the procedures used in analysis. We highlight the methods we devised both to avoid the potential circularity associated with pre-determined search terms, and to overcome the limitations of a relatively small corpus containing a wide range of relevant vocabulary. We go on to report some initial findings on the most frequent animal naming terms and adjectives describing them, including a small case study of the adjectives ‘live’ and ‘dead’. The article concludes by indicating the ways in which the iterative methods we have employed are open to further extension, and points to some methodological and substantive implications of this enterprise.

AB - This paper describes the process by which we have constructed a corpus of heterogeneous texts about non-human animals. It aims to contribute both methodologically – in respect of the challenges of compiling a thematic corpus – and substantively – in relation to the identification of some features of discourse about animals. Having introduced the research project and its guiding questions, the article describes the principles of data selection and the procedures used in analysis. We highlight the methods we devised both to avoid the potential circularity associated with pre-determined search terms, and to overcome the limitations of a relatively small corpus containing a wide range of relevant vocabulary. We go on to report some initial findings on the most frequent animal naming terms and adjectives describing them, including a small case study of the adjectives ‘live’ and ‘dead’. The article concludes by indicating the ways in which the iterative methods we have employed are open to further extension, and points to some methodological and substantive implications of this enterprise.

U2 - 10.3366/cor.2018.0145

DO - 10.3366/cor.2018.0145

M3 - Journal article

VL - 13

SP - 229

EP - 254

JO - Corpora

JF - Corpora

SN - 1749-5032

IS - 2

ER -

Research

Associated organisational unit

Electronic data

Links

Text available via DOI:

First catch your corpus: methodological challenges in constructing a thematic corpus

Standard

Harvard

APA

Vancouver

Author

Bibtex

RIS

Quick Links

Connect With Us

Faculties & Depts

Contact Us