Home > Research > Publications & Outputs > lexiDB

Electronic data

  • lexidb-scalable-corpus

    Rights statement: ©2016 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

    Accepted author manuscript, 147 KB, PDF document

    Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

Links

Text available via DOI:

View graph of relations

lexiDB: a scalable corpus database management system

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Published

Standard

lexiDB : a scalable corpus database management system. / Coole, Matt; Rayson, Paul Edward; Mariani, John Amedeo.

2016 IEEE International Conference on Big Data (Big Data). IEEE, 2016. p. 3880-3884.

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Harvard

Coole, M, Rayson, PE & Mariani, JA 2016, lexiDB: a scalable corpus database management system. in 2016 IEEE International Conference on Big Data (Big Data). IEEE, pp. 3880-3884. https://doi.org/10.1109/BigData.2016.7841062

APA

Coole, M., Rayson, P. E., & Mariani, J. A. (2016). lexiDB: a scalable corpus database management system. In 2016 IEEE International Conference on Big Data (Big Data) (pp. 3880-3884). IEEE. https://doi.org/10.1109/BigData.2016.7841062

Vancouver

Coole M, Rayson PE, Mariani JA. lexiDB: a scalable corpus database management system. In 2016 IEEE International Conference on Big Data (Big Data). IEEE. 2016. p. 3880-3884 doi: 10.1109/BigData.2016.7841062

Author

Coole, Matt ; Rayson, Paul Edward ; Mariani, John Amedeo. / lexiDB : a scalable corpus database management system. 2016 IEEE International Conference on Big Data (Big Data). IEEE, 2016. pp. 3880-3884

Bibtex

@inproceedings{4c8856ab1a6940b581ffd7ccca4a1269,
title = "lexiDB: a scalable corpus database management system",
abstract = "lexiDB is a scalable corpus database management system designed to fulfill corpus linguistics retrieval queries on multi-billion-word multiply-annotated corpora. It is based on a distributed architecture that allows the system to scale out to support ever larger text collections. This paper presents an overview of the architecture behind lexiDB as well as a demonstration of its functionality. We present lexiDB's performance metrics based on the AWS (Amazon Web Services) infrastructure with two part-of-speech and semantically tagged billion word corpora: Historical Hansard and EEBO (Early English Books Online).",
author = "Matt Coole and Rayson, {Paul Edward} and Mariani, {John Amedeo}",
note = "{\textcopyright}2016 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.",
year = "2016",
month = dec,
day = "5",
doi = "10.1109/BigData.2016.7841062",
language = "English",
isbn = "9781467390064",
pages = "3880--3884",
booktitle = "2016 IEEE International Conference on Big Data (Big Data)",
publisher = "IEEE",

}

RIS

TY - GEN

T1 - lexiDB

T2 - a scalable corpus database management system

AU - Coole, Matt

AU - Rayson, Paul Edward

AU - Mariani, John Amedeo

N1 - ©2016 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

PY - 2016/12/5

Y1 - 2016/12/5

N2 - lexiDB is a scalable corpus database management system designed to fulfill corpus linguistics retrieval queries on multi-billion-word multiply-annotated corpora. It is based on a distributed architecture that allows the system to scale out to support ever larger text collections. This paper presents an overview of the architecture behind lexiDB as well as a demonstration of its functionality. We present lexiDB's performance metrics based on the AWS (Amazon Web Services) infrastructure with two part-of-speech and semantically tagged billion word corpora: Historical Hansard and EEBO (Early English Books Online).

AB - lexiDB is a scalable corpus database management system designed to fulfill corpus linguistics retrieval queries on multi-billion-word multiply-annotated corpora. It is based on a distributed architecture that allows the system to scale out to support ever larger text collections. This paper presents an overview of the architecture behind lexiDB as well as a demonstration of its functionality. We present lexiDB's performance metrics based on the AWS (Amazon Web Services) infrastructure with two part-of-speech and semantically tagged billion word corpora: Historical Hansard and EEBO (Early English Books Online).

U2 - 10.1109/BigData.2016.7841062

DO - 10.1109/BigData.2016.7841062

M3 - Conference contribution/Paper

SN - 9781467390064

SP - 3880

EP - 3884

BT - 2016 IEEE International Conference on Big Data (Big Data)

PB - IEEE

ER -