Home > Research > Publications & Outputs > Annotating speaker stance in discourse

Electronic data

Links

Text available via DOI:

View graph of relations

Annotating speaker stance in discourse: the Brexit Blog Corpus

Research output: Contribution to journalJournal articlepeer-review

Published

Standard

Annotating speaker stance in discourse : the Brexit Blog Corpus. / Simaki, Vasiliki; Paradis, Carita; Skeppstedt, Maria; Sahlgren, Magnus; Kucher, Kostiantyn; Kerren, Andreas.

In: Corpus Linguistics and Linguistic Theory, 01.10.2020.

Research output: Contribution to journalJournal articlepeer-review

Harvard

Simaki, V, Paradis, C, Skeppstedt, M, Sahlgren, M, Kucher, K & Kerren, A 2020, 'Annotating speaker stance in discourse: the Brexit Blog Corpus', Corpus Linguistics and Linguistic Theory. https://doi.org/10.1515/cllt-2016-0060

APA

Simaki, V., Paradis, C., Skeppstedt, M., Sahlgren, M., Kucher, K., & Kerren, A. (2020). Annotating speaker stance in discourse: the Brexit Blog Corpus. Corpus Linguistics and Linguistic Theory. https://doi.org/10.1515/cllt-2016-0060

Vancouver

Simaki V, Paradis C, Skeppstedt M, Sahlgren M, Kucher K, Kerren A. Annotating speaker stance in discourse: the Brexit Blog Corpus. Corpus Linguistics and Linguistic Theory. 2020 Oct 1. https://doi.org/10.1515/cllt-2016-0060

Author

Simaki, Vasiliki ; Paradis, Carita ; Skeppstedt, Maria ; Sahlgren, Magnus ; Kucher, Kostiantyn ; Kerren, Andreas. / Annotating speaker stance in discourse : the Brexit Blog Corpus. In: Corpus Linguistics and Linguistic Theory. 2020.

Bibtex

@article{dd05f327028a45f2a6fe8c27e765d797,
title = "Annotating speaker stance in discourse: the Brexit Blog Corpus",
abstract = "The aim of this study is to explore the possibility of identifying speaker stance in discourse, provide an analytical resource for it and an evaluation of the level of agreement across speakers. We also explore to what extent language users agree about what kind of stances are expressed in natural language use or whether their interpretations diverge. In order to perform this task, a comprehensive cognitive-functional framework of ten stance categories was developed based on previous work on speaker stance in the literature. A corpus of opinionated texts was compiled, the Brexit Blog Corpus (BBC). An analytical protocol and interface (Active Learning and Visual Analytics) for the annotations was set up and the data were independently annotated by two annotators. The annotation procedure, the annotation agreements and the co-occurrence of more than one stance in the utterances are described and discussed. The careful, analytical annotation process has returned satisfactory inter- and intra-annotation agreement scores, resulting in a gold standard corpus, the final version of the BBC.",
keywords = "text annotation, blog post texts, modality, evaluation, positioning",
author = "Vasiliki Simaki and Carita Paradis and Maria Skeppstedt and Magnus Sahlgren and Kostiantyn Kucher and Andreas Kerren",
year = "2020",
month = oct,
day = "1",
doi = "10.1515/cllt-2016-0060",
language = "English",
journal = "Corpus Linguistics and Linguistic Theory",
issn = "1613-7027",
publisher = "Walter de Gruyter GmbH & Co. KG",

}

RIS

TY - JOUR

T1 - Annotating speaker stance in discourse

T2 - the Brexit Blog Corpus

AU - Simaki, Vasiliki

AU - Paradis, Carita

AU - Skeppstedt, Maria

AU - Sahlgren, Magnus

AU - Kucher, Kostiantyn

AU - Kerren, Andreas

PY - 2020/10/1

Y1 - 2020/10/1

N2 - The aim of this study is to explore the possibility of identifying speaker stance in discourse, provide an analytical resource for it and an evaluation of the level of agreement across speakers. We also explore to what extent language users agree about what kind of stances are expressed in natural language use or whether their interpretations diverge. In order to perform this task, a comprehensive cognitive-functional framework of ten stance categories was developed based on previous work on speaker stance in the literature. A corpus of opinionated texts was compiled, the Brexit Blog Corpus (BBC). An analytical protocol and interface (Active Learning and Visual Analytics) for the annotations was set up and the data were independently annotated by two annotators. The annotation procedure, the annotation agreements and the co-occurrence of more than one stance in the utterances are described and discussed. The careful, analytical annotation process has returned satisfactory inter- and intra-annotation agreement scores, resulting in a gold standard corpus, the final version of the BBC.

AB - The aim of this study is to explore the possibility of identifying speaker stance in discourse, provide an analytical resource for it and an evaluation of the level of agreement across speakers. We also explore to what extent language users agree about what kind of stances are expressed in natural language use or whether their interpretations diverge. In order to perform this task, a comprehensive cognitive-functional framework of ten stance categories was developed based on previous work on speaker stance in the literature. A corpus of opinionated texts was compiled, the Brexit Blog Corpus (BBC). An analytical protocol and interface (Active Learning and Visual Analytics) for the annotations was set up and the data were independently annotated by two annotators. The annotation procedure, the annotation agreements and the co-occurrence of more than one stance in the utterances are described and discussed. The careful, analytical annotation process has returned satisfactory inter- and intra-annotation agreement scores, resulting in a gold standard corpus, the final version of the BBC.

KW - text annotation

KW - blog post texts

KW - modality

KW - evaluation

KW - positioning

U2 - 10.1515/cllt-2016-0060

DO - 10.1515/cllt-2016-0060

M3 - Journal article

JO - Corpus Linguistics and Linguistic Theory

JF - Corpus Linguistics and Linguistic Theory

SN - 1613-7027

ER -