
Electronic data

  • Indexing_paper_final(pdf)

    Accepted author manuscript, 517 KB, PDF document

    Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License



Corpus linguistics for indexing

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Journal publication date: 1/06/2019
Journal: The Indexer: The International Journal of Indexing
Issue number: 2
Number of pages: 20
Pages (from-to): 105-124
Publication status: Published
Original language: English


This methodological paper demonstrates how methods from corpus linguistics, a collection of computer-assisted approaches to the analysis of large volumes of text, can be used in the creation of indexes. We begin by introducing corpus linguistics, including its main principles and advantages. We then demonstrate how indexers can use corpus methods through a case study in which we create an index for an academic journal article using the established corpus techniques of frequency, keywords, collocation and concordance. The case study shows how, when combined with human input and intuition, corpus linguistics methods can give indexers new perspectives on the texts they are working on while increasing the systematicity, replicability and objectivity of the indexing process itself.
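
To give a rough sense of three of the techniques named in the abstract (frequency, collocation and concordance), the following is a minimal Python sketch. It is not the authors' procedure or tooling: the function names, the tokenisation rule, the window size and the stopword handling are all assumptions made for illustration. Keyword analysis is omitted because it requires comparison against a reference corpus.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens (assumed rule)."""
    return re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())


def frequency_list(tokens, stopwords=frozenset()):
    """Count how often each token occurs, skipping any supplied stopwords."""
    return Counter(t for t in tokens if t not in stopwords)


def concordance(tokens, node, window=3):
    """Return key-word-in-context lines: each hit of `node` with its neighbours."""
    lines = []
    for i, tok in enumerate(tokens):
        if tok == node:
            left = " ".join(tokens[max(0, i - window):i])
            right = " ".join(tokens[i + 1:i + 1 + window])
            lines.append(f"{left} [{tok}] {right}".strip())
    return lines


def collocates(tokens, node, window=3):
    """Count words that occur within `window` tokens of each hit of `node`."""
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok == node:
            counts.update(tokens[max(0, i - window):i])
            counts.update(tokens[i + 1:i + 1 + window])
    return counts
```

In a workflow of this kind, an indexer might scan the frequency list for candidate headings, then read the concordance lines and collocates for each candidate to decide on subheadings and cross-references. The final selection remains a matter of human judgement, as the abstract stresses.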