Extending the possibilities of corpus-based research on English in the twentieth century: a prequel to LOB and FLOB.

Research output: Contribution to Journal/Magazine, Journal article, peer-review

Published

Standard

Extending the possibilities of corpus-based research on English in the twentieth century: a prequel to LOB and FLOB. / Leech, Geoffrey; Smith, N.
In: ICAME Journal, Vol. 29, 04.2005, p. 83-98.

Bibtex

@article{eb84312ea7d045c596ccb107823d24d5,
title = "Extending the possibilities of corpus-based research on English in the twentieth century: a prequel to LOB and FLOB",
abstract = "This paper explains the rationale for a new corpus being assembled at Lancaster University to complement the existing Brown {\textquoteleft}family{\textquoteright} of corpora: that is, English language corpora modelled on the original Brown University corpus, such as LOB, Frown, FLOB, Wellington, etc. The purpose of the new corpus, called Lancaster1931, is to extend the chronological span of these corpora into the first half of the twentieth century, and so to afford researchers a stronger empirical basis for examining recent changes in grammatical usage in English. We discuss some methodological issues encountered in extending the Brown model to earlier historical periods. We also outline some developments under way to permit more rigorous computer-assisted analyses within and across these corpora, namely (i) encoding of all the corpora with XML, (ii) adoption of a common grammatical tagset, known as {\textquoteleft}C8{\textquoteright}, and (iii) implementation of a semantic annotation scheme.",
author = "Geoffrey Leech and N. Smith",
year = "2005",
month = apr,
language = "English",
volume = "29",
pages = "83--98",
journal = "ICAME Journal",
publisher = "Walter de Gruyter GmbH",

}

RIS

TY - JOUR

T1 - Extending the possibilities of corpus-based research on English in the twentieth century: a prequel to LOB and FLOB.

AU - Leech, Geoffrey

AU - Smith, N.

PY - 2005/4

Y1 - 2005/4

AB - This paper explains the rationale for a new corpus being assembled at Lancaster University to complement the existing Brown ‘family’ of corpora: that is, English language corpora modelled on the original Brown University corpus, such as LOB, Frown, FLOB, Wellington, etc. The purpose of the new corpus, called Lancaster1931, is to extend the chronological span of these corpora into the first half of the twentieth century, and so to afford researchers a stronger empirical basis for examining recent changes in grammatical usage in English. We discuss some methodological issues encountered in extending the Brown model to earlier historical periods. We also outline some developments under way to permit more rigorous computer-assisted analyses within and across these corpora, namely (i) encoding of all the corpora with XML, (ii) adoption of a common grammatical tagset, known as ‘C8’, and (iii) implementation of a semantic annotation scheme.

M3 - Journal article

VL - 29

SP - 83

EP - 98

JO - ICAME Journal

JF - ICAME Journal

ER -