Home > Research > Publications & Outputs > Analysing language, sex and age in a corpus of ...

Electronic data

  • Paper_v3

    Accepted author manuscript, 218 KB, Word document

    Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

Links

View graph of relations

Analysing language, sex and age in a corpus of patient feedback: A comparison of approaches

Research output: Book/Report/ProceedingsBook

Published
Publication date21/07/2022
Place of PublicationCambridge
PublisherCambridge University Press
Number of pages75
ISBN (electronic)9781009031042
ISBN (print)9781009013772
<mark>Original language</mark>English

Publication series

NameElements in Corpus Linguistics
PublisherCambridge University Press

Abstract

This Element explores approaches to locating and examining social identity in corpora with and without the aid of demographic metadata. This is a key concern in corpus-aided studies of language and identity, and this Element sets out to explore the main challenges and affordances associated with either approach and to discern what either approach can (and cannot) show. It describes two case studies which each compare two approaches to social identity variables – sex and age – in a corpus of 14-million words of patient comments about NHS cancer services in England. The first approach utilises demographic tags to group comments according to patients' sex/age while the second involves categorising cases where patients disclose their sex/age in their comments. This Element compares the findings from either approach, with the approaches themselves being critically discussed in terms of their implications for corpus-aided studies of language and identity.