Home > Research > Publications & Outputs > Analysing and visualising free-text comments
View graph of relations

Analysing and visualising free-text comments: a corpus-based toolkit

Research output: Contribution to conference - Without ISBN/ISSN Conference paperpeer-review

Published
Publication date5/07/2023
<mark>Original language</mark>English
EventCorpus Linguistics - Lancaster University, Lancaster, United Kingdom
Duration: 2/07/20236/07/2023
Conference number: 12
https://wp.lancs.ac.uk/cl2023/

Conference

ConferenceCorpus Linguistics
Abbreviated titleCL2023
Country/TerritoryUnited Kingdom
CityLancaster
Period2/07/236/07/23
Internet address

Abstract

Free-text qualitative comments (e.g. from surveys and questionnaires), pose a particular challenge to a range of companies/institutions, who may not have the expertise to analyse these comments with ease. Following the Welsh Language (Wales) Measure (2011), survey respondents in Wales should be given the opportunity to respond to surveys in English or Welsh, posing even more of a challenge when analysing the resultant data, if adequate Welsh language expertise do not exist. Although a range of sophisticated tools for the analysis of text-based data are already available, many of these tools are not necessarily affordable, quick and easy to use, and/or accessible to non-expert user, nor do they fully support the task of systematically processing free-text responses in Welsh and English.

This presentation reports on the developments of a unique open-source online free-text analysis tool that has been designed to respond to this need: FreeTxt. Funded by the AHRC, and co-designed/co-developed in collaboration with project partners National Museum Wales, National Trust Wales and Cadw, FreeTxt is a unique corpus-based analysis toolkit that is designed to enable the quick and easy analysis of English and Welsh language data, and to engage new user groups with corpus-based methods in new ways. In this presentation we will:

- underline the importance of user feedback, and articulate the key challenges of tackling such data,
- present a novel corpus-based approach to the analysis of FreeTxt data, which can be adapted to multiple languages and contexts,
- outline the key functionalities of the tool, which include: KWIC, POS tagging, semantic tagging, summarisation and sentiment analysis utilities, a novel n-gram frequency tool, text visualisation and multilingual support, and
- provide a demonstration of an early version of the FreeTxt tool in action, using data from survey responses and online feedback forums.