Annotating speaker stance in discourse: the Brexit Blog Corpus

Research output: Contribution to Journal/Magazine > Journal article > peer-review

  • Vasiliki Simaki
  • Carita Paradis
  • Maria Skeppstedt
  • Magnus Sahlgren
  • Kostiantyn Kucher
  • Andreas Kerren
Journal publication date: 1/10/2020
Journal: Corpus Linguistics and Linguistic Theory
Publication status: Published
Early online date: 28/10/2017
Original language: English


The aim of this study is to explore the possibility of identifying speaker stance in discourse, to provide an analytical resource for it, and to evaluate the level of agreement across speakers. We also explore to what extent language users agree about the kinds of stance expressed in natural language use, or whether their interpretations diverge. To perform this task, a comprehensive cognitive-functional framework of ten stance categories was developed, based on previous work on speaker stance in the literature. A corpus of opinionated texts, the Brexit Blog Corpus (BBC), was compiled. An analytical protocol and interface (Active Learning and Visual Analytics) for the annotations was set up, and the data were independently annotated by two annotators. The annotation procedure, the annotation agreement, and the co-occurrence of more than one stance in an utterance are described and discussed. The careful, analytical annotation process returned satisfactory inter- and intra-annotator agreement scores, resulting in a gold standard corpus, the final version of the BBC.