Electronic data

  • UNLT.zip

    25.8 MB, multipart/x-zip

    Dataset

    Available under license: CC BY

    Date added: 11/11/21

UNLT: Urdu Natural Language Toolkit

Dataset

  • Jawad Shafi (Creator)
  • Rao Muhammad Adeel Nawab (Creator)
  • Paul Rayson (Creator)
  • Rizwan Iqbal (Creator)

Description

The zip file contains the first version of UNLT (Urdu Natural Language Toolkit), which includes three key text processing tools required for an Urdu NLP pipeline: a word tokenizer, a sentence tokenizer, and a part-of-speech (POS) tagger.
Date made available: 2021
Publisher: Lancaster University
Date of data production: 2021

Contact person