Final published version
Licence: CC BY: Creative Commons Attribution 4.0 International License
Research output: Contribution to Journal/Magazine › Journal article › peer-review
}
TY - JOUR
T1 - A Dataset for Toponym Resolution in Nineteenth-Century English Newspapers
AU - Ardanuy, Mariona Coll
AU - Beavan, David
AU - Beelen, Kaspar
AU - Hosseini, Kasra
AU - Lawrence, Jon
AU - McDonough, Katherine
AU - Nanni, Federico
AU - van Strien, Daniel
AU - Wilson, Daniel C. S.
PY - 2022/1/24
Y1 - 2022/1/24
N2 - We present a new dataset for the task of toponym resolution in digitized historical newspapers in English. It consists of 343 annotated articles from newspapers based in four different locations in England (Manchester, Ashton-under-Lyne, Poole and Dorchester), published between 1780 and 1870. The articles have been manually annotated with mentions of places, which are linked—whenever possible—to their corresponding entry on Wikipedia. The dataset consists of 3,364 annotated toponyms, of which 2,784 have been provided with a link to Wikipedia. The dataset is published in the British Library shared research repository, and is especially of interest to researchers working on improving semantic access to historical newspaper content.
AB - We present a new dataset for the task of toponym resolution in digitized historical newspapers in English. It consists of 343 annotated articles from newspapers based in four different locations in England (Manchester, Ashton-under-Lyne, Poole and Dorchester), published between 1780 and 1870. The articles have been manually annotated with mentions of places, which are linked—whenever possible—to their corresponding entry on Wikipedia. The dataset consists of 3,364 annotated toponyms, of which 2,784 have been provided with a link to Wikipedia. The dataset is published in the British Library shared research repository, and is especially of interest to researchers working on improving semantic access to historical newspaper content.
U2 - 10.5334/johd.56
DO - 10.5334/johd.56
M3 - Journal article
VL - 8
JO - Journal of Open Humanities Data
JF - Journal of Open Humanities Data
M1 - 3
ER -