Corpus Linguistics software - Research Portal

Linguistics and English Language

Electronic data

2020rodriguesgomidephd
Final published version, 7.04 MB, PDF document

Text available via DOI:

https://doi.org/10.17635/lancaster/thesis/1165
Final published version


Corpus Linguistics software: Understanding their usages and delivering two new tools

Research output: Thesis › Doctoral Thesis

Published

Andressa Rodrigues Gomide


Publication date	2020
Number of pages	241
Qualification	PhD
Awarding Institution	Lancaster University
Supervisors/Advisors	Hardie, Andrew, Supervisor
Award date	30/09/2018
Publisher	Lancaster University
Original language	English

Abstract

The increasing availability of computers to ordinary users in the last few decades has led to an exponential increase in the use of Corpus Linguistics (CL) methodologies. The people exploring this data come from a variety of backgrounds and, in many cases, are not proficient corpus linguists. Despite the ongoing development of new tools, there is still an immense gap between what CL can offer and what is currently being done by researchers. This study has two outcomes. It (a) identifies the gap between potential and actual uses of CL methods and tools, and (b) enhances the usability of CL software and complements statistical applications through the use of data visualization and user-friendly interfaces. The first outcome is achieved through (i) an investigation of how CL methods are reported in academic publications; (ii) a systematic observation of users of CL software as they engage in routine tasks; and (iii) a review of four well-established pieces of software used for corpus exploration. Based on the findings, two new statistical tools for CL studies with high usability were developed and implemented in an existing system, CQPweb. The Advanced Dispersion tool allows users to graphically explore how queries are distributed in a corpus, which makes it easier for users to understand the concept of dispersion. The tool also provides accurate dispersion measures. The Parlink Tool was designed primarily for beginners with an interest in translation studies and second language education. The tool’s primary function is to make it easier for users to see possible translations for corpus queries in the parallel concordances, without the need to use external resources, such as translation memories.
