Corpus Linguistics

Lancaster Environment Centre

Text available via DOI:

https://doi.org/10.1093/oxfordhb/9780199276349.013.0024
Final published version

Keywords

Annotation, Application, Comparative, Contrast, Evidence, Multilingual

View graph of relations

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Chapter

Published

Standard

Corpus Linguistics. / McEnery, Tony.
The Oxford Handbook of Computational Linguistics. ed. / Ruslan Mitkov. Vol. 9780199276349 Oxford University Press Inc, 2012.

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Chapter

Harvard

McEnery, T 2012, Corpus Linguistics. in R Mitkov (ed.), The Oxford Handbook of Computational Linguistics. vol. 9780199276349, Oxford University Press Inc. https://doi.org/10.1093/oxfordhb/9780199276349.013.0024

APA

McEnery, T. (2012). Corpus Linguistics. In R. Mitkov (Ed.), The Oxford Handbook of Computational Linguistics (Vol. 9780199276349). Oxford University Press Inc. https://doi.org/10.1093/oxfordhb/9780199276349.013.0024

Vancouver

McEnery T. Corpus Linguistics. In Mitkov R, editor, The Oxford Handbook of Computational Linguistics. Vol. 9780199276349. Oxford University Press Inc. 2012 doi: 10.1093/oxfordhb/9780199276349.013.0024

Author

McEnery, Tony. / Corpus Linguistics. The Oxford Handbook of Computational Linguistics. editor / Ruslan Mitkov. Vol. 9780199276349 Oxford University Press Inc, 2012.

Bibtex

@inbook{20d43c326dc94101a6d37515e17dfe6c,

title = "Corpus Linguistics",

abstract = "Corpus data have emerged as the raw data/benchmark for several NLP applications. Corpus is described as a large body of linguistic evidence composed of attested language use. It may be contrasted against sentences constructed from metalinguist reflection upon language use, rather than as a result of communication in context. Corpus can be both spoken and written. It can be categorized as follows: monolingual, representing one language; comparable, using multiple monolingual corpora to create a comparative framework; parallel corpora, wherein, corpus of one language is considered, and the data obtained, is translated in other languages. The choice of corpus depends on the research question/the chosen application. Adding linguistic information can enhance a corpus. Analysts, human or mechanical, or a combination achieves annotation. The modern computerized corpus has been in vogue only since the 1940s. Ever since, the volume of corpus banks have risen steadily and assumed an increasingly multilingual nature.",

keywords = "Annotation, Application, Comparative, Contrast, Evidence, Multilingual",

author = "Tony McEnery",

year = "2012",

month = sep,

day = "18",

doi = "10.1093/oxfordhb/9780199276349.013.0024",

language = "English",

isbn = "9780199276349",

volume = "9780199276349",

editor = "Ruslan Mitkov",

booktitle = "The Oxford Handbook of Computational Linguistics",

publisher = "Oxford University Press Inc",

}

RIS

TY - CHAP

T1 - Corpus Linguistics

AU - McEnery, Tony

PY - 2012/9/18

Y1 - 2012/9/18

N2 - Corpus data have emerged as the raw data/benchmark for several NLP applications. Corpus is described as a large body of linguistic evidence composed of attested language use. It may be contrasted against sentences constructed from metalinguist reflection upon language use, rather than as a result of communication in context. Corpus can be both spoken and written. It can be categorized as follows: monolingual, representing one language; comparable, using multiple monolingual corpora to create a comparative framework; parallel corpora, wherein, corpus of one language is considered, and the data obtained, is translated in other languages. The choice of corpus depends on the research question/the chosen application. Adding linguistic information can enhance a corpus. Analysts, human or mechanical, or a combination achieves annotation. The modern computerized corpus has been in vogue only since the 1940s. Ever since, the volume of corpus banks have risen steadily and assumed an increasingly multilingual nature.

AB - Corpus data have emerged as the raw data/benchmark for several NLP applications. Corpus is described as a large body of linguistic evidence composed of attested language use. It may be contrasted against sentences constructed from metalinguist reflection upon language use, rather than as a result of communication in context. Corpus can be both spoken and written. It can be categorized as follows: monolingual, representing one language; comparable, using multiple monolingual corpora to create a comparative framework; parallel corpora, wherein, corpus of one language is considered, and the data obtained, is translated in other languages. The choice of corpus depends on the research question/the chosen application. Adding linguistic information can enhance a corpus. Analysts, human or mechanical, or a combination achieves annotation. The modern computerized corpus has been in vogue only since the 1940s. Ever since, the volume of corpus banks have risen steadily and assumed an increasingly multilingual nature.

KW - Annotation

KW - Application

KW - Comparative

KW - Contrast

KW - Evidence

KW - Multilingual

U2 - 10.1093/oxfordhb/9780199276349.013.0024

DO - 10.1093/oxfordhb/9780199276349.013.0024

M3 - Chapter

AN - SCOPUS:84924034821

SN - 9780199276349

VL - 9780199276349

BT - The Oxford Handbook of Computational Linguistics

A2 - Mitkov, Ruslan

PB - Oxford University Press Inc

ER -

Research

Links

Text available via DOI:

Keywords