Final published version
Research output: Contribution to Journal/Magazine › Special issue › peer-review
}
TY - JOUR
T1 - Text and speech corpora for natural language processing and corpus linguistics
A2 - Demner-Fushman, Dina
A2 - Gatherer, Derek
A2 - Wu, Jian
PY - 2025/7/24
Y1 - 2025/7/24
N2 - Corpus Linguistics (CL) and Natural Language Processing (NLP) are two of the transformative forces in research across the sciences and humanities, reshaping how insights are gleaned from vast text and speech datasets. Their applications span the natural, medical, social and applied sciences, leading the cutting edge in fields such as healthcare diagnostics, biomedicine, environmental science, and computer vision. This Collection presents a series of annotated text and speech corpora alongside linguistic models tailored for CL and NLP applications. These resources aim to enrich the arsenals of CL and NLP users and facilitate interdisciplinary research.
AB - Corpus Linguistics (CL) and Natural Language Processing (NLP) are two of the transformative forces in research across the sciences and humanities, reshaping how insights are gleaned from vast text and speech datasets. Their applications span the natural, medical, social and applied sciences, leading the cutting edge in fields such as healthcare diagnostics, biomedicine, environmental science, and computer vision. This Collection presents a series of annotated text and speech corpora alongside linguistic models tailored for CL and NLP applications. These resources aim to enrich the arsenals of CL and NLP users and facilitate interdisciplinary research.
KW - Natural Language Processing
KW - Corpus Linguistics
KW - corpora
KW - Artificial Intelligence
KW - Machine Learning
KW - Bioinformatics
M3 - Special issue
VL - Special Collection
JO - Scientific Data
JF - Scientific Data
SN - 2052-4463
ER -