Home > Research > Datasets > UNLT: Urdu Natural Language Toolkit

Electronic data

  • UNLT.zip

    25.8 MB, multipart/x-zip


    Available under license: CC BY

    Date added: 11/11/21

View graph of relations

UNLT: Urdu Natural Language Toolkit


  • Jawad Shafi (Creator)
  • Rao Muhammad Adeel Nawab (Creator)
  • Paul Rayson (Creator)
  • Rizwan Iqbal (Creator)


The zip file contains the first version of the UNLT (Urdu Natural Language Toolkit) which includes three key text processing tools required for an Urdu NLP pipeline; word tokenizer, sentence tokenizer and Part-Of-Speech (POS) tagger.
Date made available2021
PublisherLancaster University
Date of data production2021

Contact person