
EMILLE: a 67-million word corpus of Indic languages: data collection, mark-up and harmonization.

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN > Chapter

Published

Standard

EMILLE: a 67-million word corpus of Indic languages: data collection, mark-up and harmonization. / Baker, Paul; Hardie, Andrew; McEnery, Tony; Cunningham, Hamish; Gaizauskas, Robert.

Proceedings of LREC 2002. Lancaster : Lancaster University, 2002. p. 819-827.

Harvard

Baker, P, Hardie, A, McEnery, T, Cunningham, H & Gaizauskas, R 2002, EMILLE: a 67-million word corpus of Indic languages: data collection, mark-up and harmonization. in Proceedings of LREC 2002. Lancaster University, Lancaster, pp. 819-827.

APA

Baker, P., Hardie, A., McEnery, T., Cunningham, H., & Gaizauskas, R. (2002). EMILLE: a 67-million word corpus of Indic languages: data collection, mark-up and harmonization. In Proceedings of LREC 2002 (pp. 819-827). Lancaster: Lancaster University.

Vancouver

Baker P, Hardie A, McEnery T, Cunningham H, Gaizauskas R. EMILLE: a 67-million word corpus of Indic languages: data collection, mark-up and harmonization. In Proceedings of LREC 2002. Lancaster: Lancaster University. 2002. p. 819-827

Author

Baker, Paul ; Hardie, Andrew ; McEnery, Tony ; Cunningham, Hamish ; Gaizauskas, Robert. / EMILLE: a 67-million word corpus of Indic languages: data collection, mark-up and harmonization. Proceedings of LREC 2002. Lancaster : Lancaster University, 2002. pp. 819-827

Bibtex

@inbook{0d3fe94f210247899fd8b0fa43243be3,
title = "EMILLE: a 67-million word corpus of Indic languages: data collection, mark-up and harmonization.",
author = "Paul Baker and Andrew Hardie and Tony McEnery and Hamish Cunningham and Robert Gaizauskas",
year = "2002",
language = "English",
pages = "819--827",
booktitle = "Proceedings of LREC 2002",
publisher = "Lancaster University",
}

RIS

TY  - CHAP
T1  - EMILLE: a 67-million word corpus of Indic languages: data collection, mark-up and harmonization.
AU  - Baker, Paul
AU  - Hardie, Andrew
AU  - McEnery, Tony
AU  - Cunningham, Hamish
AU  - Gaizauskas, Robert
PY  - 2002
Y1  - 2002
M3  - Chapter
SP  - 819
EP  - 827
BT  - Proceedings of LREC 2002
PB  - Lancaster University
CY  - Lancaster
ER  -