Home > Research > Researchers > Professor Paul Rayson > Datasets

Professor Paul Rayson

Professor of Natural Language Processing

  1. Igbo-English Machine Translation: An Evaluation Benchmark

    Ezeani, I. (Creator), Onyenwe, I. E. (Creator), Chinedu, U. (Creator), Rayson, P. (Creator), Hepple, M. (Creator), Github, 1/04/2020

    Dataset

  2. Arabic Infectious Disease Ontology

    Alsudias, L. (Creator), Rayson, P. (Creator), Lancaster University, 25/02/2020, 10.17635/lancaster/researchdata/350

    Dataset

  3. Arabic tweets about infectious diseases.

    Alsudias, L. (Creator), Rayson, P. (Creator), Lancaster University, 21/06/2019, 10.17635/lancaster/researchdata/303

    Dataset

  4. Annual Reports Key Sections Corpora 2003 to 2017

    El-Haj, M. (Creator), Young, S. (Creator), Rayson, P. (Creator), Lancaster University, 13/03/2019, 10.17635/lancaster/researchdata/271

    Dataset

  5. UK Annual Reports Key Sections

    El-Haj, M. (Creator), Young, S. (Creator), Rayson, P. (Creator), Lancaster University, 28/02/2019, 10.17635/lancaster/researchdata/262

    Dataset

  6. N-gram list for the StratScore metric

    Athanasakou, V. (Creator), El-Haj, M. (Creator), Rayson, P. (Creator), Walker, M. (Creator), Young, S. (Creator), Lancaster University, 2018, 10.17635/lancaster/researchdata/232

    Dataset

  7. Urdu Short Text Reuse Corpus (USTRC)

    Sameen, S. (Creator), Muhammad, S. (Creator), Nawab, R. M. A. (Creator), Rayson, P. (Creator), Muneer, I. (Creator), Lancaster University, 2017, 10.17635/lancaster/researchdata/192

    Dataset

  8. Cross-Language English-Urdu Corpus (CLEU)

    Muneer, I. (Creator), Muhammad, S. (Creator), Iqbal, M. (Creator), Nawab, R. M. A. (Creator), Rayson, P. (Creator), Lancaster University, 2017, 10.17635/lancaster/researchdata/176

    Dataset

  9. COrpus of Urdu News TExt Reuse (COUNTER)

    Muhammad, S. (Creator), Nawab, R. M. A. (Creator), Rayson, P. (Creator), Lancaster University, 2016, 10.17635/lancaster/researchdata/96

    Dataset

Back to top