Home > Research > Publications & Outputs > MultiLS

Links

Text available via DOI:

View graph of relations

MultiLS: An End-to-End Lexical Simplification Framework

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Published
Close
Publication date15/11/2024
Host publicationProceedings of the Third Workshop on Text Simplification, Accessibility and Readability (TSAR 2024)
EditorsMatthew Shardlow, Horacio Saggion, Fernando Alva-Manchego, Marcos Zampieri, Kai North, Sanja Štajner, Regina Stodden
Place of PublicationKerrville
PublisherAssociation for Computational Linguistics (ACL Anthology)
Pages1-11
Number of pages11
ISBN (electronic)9798891761766
<mark>Original language</mark>English
EventThird Workshop on Text Simplification, Accessibility and Readability: TSAR -
Duration: 15/11/2024 → …

Workshop

WorkshopThird Workshop on Text Simplification, Accessibility and Readability
Period15/11/24 → …

Workshop

WorkshopThird Workshop on Text Simplification, Accessibility and Readability
Period15/11/24 → …

Abstract

Lexical Simplification (LS) automatically replaces difficult to read words for easier alternatives while preserving a sentence’s original meaning. Several datasets exist for LS and each of them specialize in one or two sub-tasks within the LS pipeline. However, as of this moment, no single LS dataset has been developed that covers all LS sub-tasks. We present MultiLS, the first LS framework that allows for the creation of a multi-task LS dataset. We also present MultiLS-PT, the first dataset created using the MultiLS framework. We demonstrate the potential of MultiLS-PT by carrying out all LS sub-tasks of (1) lexical complexity prediction (LCP), (2) substitute generation, and (3) substitute ranking for Portuguese.