Home > Research > Publications & Outputs > Short term diachronic shifts in part-of-speech ...
View graph of relations

Short term diachronic shifts in part-of-speech frequencies: a comparison of the tagged LOB and F-LOB corpora.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published

Standard

Short term diachronic shifts in part-of-speech frequencies: a comparison of the tagged LOB and F-LOB corpora. / Mair, C.; Hundt, M.; Leech, Geoffrey et al.
In: International Journal of Corpus Linguistics, Vol. 7, No. 2, 2003, p. 245-264.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

Mair, C, Hundt, M, Leech, G & Smith, N 2003, 'Short term diachronic shifts in part-of-speech frequencies: a comparison of the tagged LOB and F-LOB corpora.', International Journal of Corpus Linguistics, vol. 7, no. 2, pp. 245-264. https://doi.org/10.1075/ijcl.7.2.05mai

APA

Vancouver

Mair C, Hundt M, Leech G, Smith N. Short term diachronic shifts in part-of-speech frequencies: a comparison of the tagged LOB and F-LOB corpora. International Journal of Corpus Linguistics. 2003;7(2):245-264. doi: 10.1075/ijcl.7.2.05mai

Author

Mair, C. ; Hundt, M. ; Leech, Geoffrey et al. / Short term diachronic shifts in part-of-speech frequencies: a comparison of the tagged LOB and F-LOB corpora. In: International Journal of Corpus Linguistics. 2003 ; Vol. 7, No. 2. pp. 245-264.

Bibtex

@article{eacd589f3783413ba0e0f05acd720599,
title = "Short term diachronic shifts in part-of-speech frequencies: a comparison of the tagged LOB and F-LOB corpora.",
abstract = "The paper presents a comparison of tag frequencies in two matching one-million word reference corpora of British standard English, the 1961 LOB-corpus and its 1991 “clone” produced at Freiburg. Both corpora were tagged using a version of the CLAWS part-of-speech-tagger developed at Lancaster, and part of the material was post-edited manually in Freiburg to assess the accuracy of the automatic procedure. The comparison of tag frequencies is an essential complement to work on recent linguistic change carried out on the untagged material, because this work has been based on the – so far unverified – assumption that tag frequencies have remained constant over the thirty-year period in question. In addition, the paper discusses some common and partly contradictory claims about the prevalence of a “nominal” style in present-day written English. It is shown that while part-of-speech frequencies have not remained constant over the period investigated, the shifts are usually not big enough to invalidate the results obtained in analyses of the untagged material. With regard to style, the material shows a significant rise in the frequency of nouns, which, however, is not paralleled by a corresponding decrease in verbs.",
author = "C. Mair and M. Hundt and Geoffrey Leech and N. Smith",
year = "2003",
doi = "10.1075/ijcl.7.2.05mai",
language = "English",
volume = "7",
pages = "245--264",
journal = "International Journal of Corpus Linguistics",
issn = "1569-9811",
publisher = "John Benjamins Publishing Company",
number = "2",

}

RIS

TY - JOUR

T1 - Short term diachronic shifts in part-of-speech frequencies: a comparison of the tagged LOB and F-LOB corpora.

AU - Mair, C.

AU - Hundt, M.

AU - Leech, Geoffrey

AU - Smith, N.

PY - 2003

Y1 - 2003

N2 - The paper presents a comparison of tag frequencies in two matching one-million word reference corpora of British standard English, the 1961 LOB-corpus and its 1991 “clone” produced at Freiburg. Both corpora were tagged using a version of the CLAWS part-of-speech-tagger developed at Lancaster, and part of the material was post-edited manually in Freiburg to assess the accuracy of the automatic procedure. The comparison of tag frequencies is an essential complement to work on recent linguistic change carried out on the untagged material, because this work has been based on the – so far unverified – assumption that tag frequencies have remained constant over the thirty-year period in question. In addition, the paper discusses some common and partly contradictory claims about the prevalence of a “nominal” style in present-day written English. It is shown that while part-of-speech frequencies have not remained constant over the period investigated, the shifts are usually not big enough to invalidate the results obtained in analyses of the untagged material. With regard to style, the material shows a significant rise in the frequency of nouns, which, however, is not paralleled by a corresponding decrease in verbs.

AB - The paper presents a comparison of tag frequencies in two matching one-million word reference corpora of British standard English, the 1961 LOB-corpus and its 1991 “clone” produced at Freiburg. Both corpora were tagged using a version of the CLAWS part-of-speech-tagger developed at Lancaster, and part of the material was post-edited manually in Freiburg to assess the accuracy of the automatic procedure. The comparison of tag frequencies is an essential complement to work on recent linguistic change carried out on the untagged material, because this work has been based on the – so far unverified – assumption that tag frequencies have remained constant over the thirty-year period in question. In addition, the paper discusses some common and partly contradictory claims about the prevalence of a “nominal” style in present-day written English. It is shown that while part-of-speech frequencies have not remained constant over the period investigated, the shifts are usually not big enough to invalidate the results obtained in analyses of the untagged material. With regard to style, the material shows a significant rise in the frequency of nouns, which, however, is not paralleled by a corresponding decrease in verbs.

U2 - 10.1075/ijcl.7.2.05mai

DO - 10.1075/ijcl.7.2.05mai

M3 - Journal article

VL - 7

SP - 245

EP - 264

JO - International Journal of Corpus Linguistics

JF - International Journal of Corpus Linguistics

SN - 1569-9811

IS - 2

ER -