Accepted author manuscript, 410 KB, PDF document
Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License
Final published version
Research output: Contribution to Journal/Magazine › Journal article › peer-review
Research output: Contribution to Journal/Magazine › Journal article › peer-review
}
TY - JOUR
T1 - From digital resources to historical scholarship with the British Library 19th Century Newspaper Collection
AU - Gregory, Ian Norman
AU - Atkinson, Paul David
AU - Hardie, Andrew
AU - Joulain-Jay, Amelia
AU - Kershaw, Daniel
AU - Porter, Catherine
AU - Rayson, Paul Edward
AU - Rupp, Christopher John
PY - 2016
Y1 - 2016
N2 - It is increasingly acknowledged that the Digital Humanities have placed too much emphasis on data creation and that the major priority should be turning digital sources into contributions to knowledge. While this sounds relatively simple, doing it involves intermediate stages of research that enhance digital sources, develop new methodologies and explore their potential to generate new knowledge from the source. While these stages are familiar in the social sciences they are less so in the humanities. In this paper we explore these stages based on research on the British Library’s Nineteenth Century Newspaper Collection, a corpus of many billion words that has much to offer to our understanding of the nineteenth century but whose size and complexity makes it difficult to work with.
AB - It is increasingly acknowledged that the Digital Humanities have placed too much emphasis on data creation and that the major priority should be turning digital sources into contributions to knowledge. While this sounds relatively simple, doing it involves intermediate stages of research that enhance digital sources, develop new methodologies and explore their potential to generate new knowledge from the source. While these stages are familiar in the social sciences they are less so in the humanities. In this paper we explore these stages based on research on the British Library’s Nineteenth Century Newspaper Collection, a corpus of many billion words that has much to offer to our understanding of the nineteenth century but whose size and complexity makes it difficult to work with.
KW - Corpora
KW - GIS
KW - Resource enhancement
KW - Research Methods
KW - OCR quality
M3 - Journal article
VL - 9
SP - 994
EP - 1006
JO - Journal of Siberian Federal University: Humanities and social sciences
JF - Journal of Siberian Federal University: Humanities and social sciences
SN - 1997-1370
IS - 4
ER -