Home > Research > UCREL - University Centre for Computer Corpus Research on Language > Datasets
View graph of relations

UCREL - University Centre for Computer Corpus Research on Language

  1. Annual Reports Key Sections Corpora 2003 to 2017

    El-Haj, M. (Creator), Young, S. (Creator), Rayson, P. (Creator), Lancaster University, 13/03/2019, 10.17635/lancaster/researchdata/271

    Dataset

  2. April Fools Corpus

    Dearden, E. (Creator), Lancaster University, 1/03/2022, 10.17635/lancaster/researchdata/512

    Dataset

  3. Arabic Infectious Disease Ontology

    Alsudias, L. (Creator), Rayson, P. (Creator), Lancaster University, 25/02/2020, 10.17635/lancaster/researchdata/350

    Dataset

  4. Arabic tweets about infectious diseases.

    Alsudias, L. (Creator), Rayson, P. (Creator), Lancaster University, 21/06/2019, 10.17635/lancaster/researchdata/303

    Dataset

  5. Chinese CALLHOME Corpus, XML edition

    McEnery, T. (Creator), Xiao, Z. (Creator), Linguistic Data Consortium, 15/09/2008

    Dataset

  6. COrpus of Urdu News TExt Reuse (COUNTER)

    Muhammad, S. (Creator), Nawab, R. M. A. (Creator), Rayson, P. (Creator), Lancaster University, 2016, 10.17635/lancaster/researchdata/96

    Dataset

  7. COVID-19 Arabic tweets

    Alsudias, L. (Creator), Rayson, P. (Creator), Lancaster University, 7/07/2020, 10.17635/lancaster/researchdata/375

    Dataset

  8. COVID-19 Arabic tweets

    Alsudias, L. (Creator), Rayson, P. (Creator), Lancaster University, 2020, 10.17635/lancaster/researchdata/394

    Dataset

  9. Cross-Language English-Urdu Corpus (CLEU)

    Muneer, I. (Creator), Muhammad, S. (Creator), Iqbal, M. (Creator), Nawab, R. M. A. (Creator), Rayson, P. (Creator), Lancaster University, 2017, 10.17635/lancaster/researchdata/176

    Dataset

  10. Data and scripts for extracting plant names and collocates from historical texts

    Smail, R. (Creator), Donaldson, C. (Creator), Stevens, C. (Creator), Rayson, P. (Creator), Govaerts, R. (Creator), Lancaster University, 2020, 10.17635/lancaster/researchdata/385

    Dataset

  11. Flat Earth Dataset

    Dearden, E. (Creator), Lancaster University, 1/03/2022, 10.17635/lancaster/researchdata/513

    Dataset

  12. Human Judgements of Sentiment Values

    Pak, I. (Creator), Teh, P. L. (Creator), Rayson, P. (Creator), Piao, S. (Creator), Ho, J. S. Y. (Creator), Moore, A. (Creator), Cheah, Y. (Creator), Lancaster University, 2020, 10.17635/lancaster/researchdata/368

    Dataset

  13. Igbo-English Machine Translation: An Evaluation Benchmark

    Ezeani, I. (Creator), Onyenwe, I. E. (Creator), Chinedu, U. (Creator), Rayson, P. (Creator), Hepple, M. (Creator), Github, 1/04/2020

    Dataset

  14. N-gram list for the StratScore metric

    Athanasakou, V. (Creator), El-Haj, M. (Creator), Rayson, P. (Creator), Walker, M. (Creator), Young, S. (Creator), Lancaster University, 2018, 10.17635/lancaster/researchdata/232

    Dataset

  15. Strategic Commentary

    El-Haj, M. (Creator), Young, S. (Creator), Lancaster University, 7/02/2019, 10.17635/lancaster/researchdata/261

    Dataset

  16. UK Annual Reports Key Sections

    El-Haj, M. (Creator), Young, S. (Creator), Rayson, P. (Creator), Lancaster University, 28/02/2019, 10.17635/lancaster/researchdata/262

    Dataset

  17. UNLT: Urdu Natural Language Toolkit

    Shafi, J. (Creator), Nawab, R. M. A. (Creator), Rayson, P. (Creator), Iqbal, R. (Creator), Lancaster University, 2021, 10.17635/lancaster/researchdata/494

    Dataset

  18. Urdu Paraphrase Plagiarism Corpus (UPPC)

    Muhammad, S. (Creator), Rayson, P. (Creator), Nawab, R. M. A. (Creator), Lancaster University, 2016, 10.17635/lancaster/researchdata/67

    Dataset

  19. Urdu Short Text Reuse Corpus (USTRC)

    Sameen, S. (Creator), Muhammad, S. (Creator), Nawab, R. M. A. (Creator), Rayson, P. (Creator), Muneer, I. (Creator), Lancaster University, 2017, 10.17635/lancaster/researchdata/192

    Dataset

Back to top