Home > Research > Publications & Outputs > Rhetorical move detection in English abstracts
View graph of relations

Rhetorical move detection in English abstracts: multi-label sentence classifiers and their annotated corpora

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Published

Standard

Rhetorical move detection in English abstracts: multi-label sentence classifiers and their annotated corpora. / Dayrell, Carmen; Candido Jr, Arnaldo ; Lima, Gabriel et al.
Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012). 2012.

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Harvard

Dayrell, C, Candido Jr, A, Lima, G, Machado Jr, D, Copestake, A, Feltrim, V, Tagnin, S & Aluísio, S 2012, Rhetorical move detection in English abstracts: multi-label sentence classifiers and their annotated corpora. in Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012). <http://www.lrec-conf.org/proceedings/lrec2012/pdf/734_Paper.pdf>

APA

Dayrell, C., Candido Jr, A., Lima, G., Machado Jr, D., Copestake, A., Feltrim, V., Tagnin, S., & Aluísio, S. (2012). Rhetorical move detection in English abstracts: multi-label sentence classifiers and their annotated corpora. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012) http://www.lrec-conf.org/proceedings/lrec2012/pdf/734_Paper.pdf

Vancouver

Dayrell C, Candido Jr A, Lima G, Machado Jr D, Copestake A, Feltrim V et al. Rhetorical move detection in English abstracts: multi-label sentence classifiers and their annotated corpora. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012). 2012

Author

Dayrell, Carmen ; Candido Jr, Arnaldo ; Lima, Gabriel et al. / Rhetorical move detection in English abstracts : multi-label sentence classifiers and their annotated corpora. Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012). 2012.

Bibtex

@inproceedings{605c34a5d8424f6ab0b263431902d5c7,
title = "Rhetorical move detection in English abstracts: multi-label sentence classifiers and their annotated corpora",
abstract = "The relevance of automatically identifying rhetorical moves in scientific texts has been widely acknowledged in the literature. Thisstudy focuses on abstracts of standard research papers written in English and aims to tackle a fundamental limitation of currentmachine-learning classifiers: they are mono-labeled, that is, a sentence can only be assigned one single label. However, such approachdoes not adequately reflect actual language use since a move can be realized by a clause, a sentence, or even several sentences. Here,we present MAZEA (Multi-label Argumentative Zoning for English Abstracts), a multi-label classifier which automatically identifiesrhetorical moves in abstracts but allows for a given sentence to be assigned as many labels as appropriate. We have resorted to variousother NLP tools and used two large training corpora: (i) one corpus consists of 645 abstracts from physical sciences and engineering(PE) and (ii) the other corpus is made up of 690 from life and health sciences (LH). This paper presents our preliminary results and alsodiscusses the various challenges involved in multi-label tagging and works towards satisfactory solutions. In addition, we also makeour two training corpora publicly available so that they may serve as benchmark for this new task.",
keywords = "corpus linguistics, English Abstract, rhetorical moves, multi-label sentence classifier",
author = "Carmen Dayrell and {Candido Jr}, Arnaldo and Gabriel Lima and {Machado Jr}, Danilo and Ann Copestake and Val{\'e}ria Feltrim and Stella Tagnin and Sandra Alu{\'i}sio",
year = "2012",
language = "English",
booktitle = "Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012)",

}

RIS

TY - GEN

T1 - Rhetorical move detection in English abstracts

T2 - multi-label sentence classifiers and their annotated corpora

AU - Dayrell, Carmen

AU - Candido Jr, Arnaldo

AU - Lima, Gabriel

AU - Machado Jr, Danilo

AU - Copestake, Ann

AU - Feltrim, Valéria

AU - Tagnin, Stella

AU - Aluísio, Sandra

PY - 2012

Y1 - 2012

N2 - The relevance of automatically identifying rhetorical moves in scientific texts has been widely acknowledged in the literature. Thisstudy focuses on abstracts of standard research papers written in English and aims to tackle a fundamental limitation of currentmachine-learning classifiers: they are mono-labeled, that is, a sentence can only be assigned one single label. However, such approachdoes not adequately reflect actual language use since a move can be realized by a clause, a sentence, or even several sentences. Here,we present MAZEA (Multi-label Argumentative Zoning for English Abstracts), a multi-label classifier which automatically identifiesrhetorical moves in abstracts but allows for a given sentence to be assigned as many labels as appropriate. We have resorted to variousother NLP tools and used two large training corpora: (i) one corpus consists of 645 abstracts from physical sciences and engineering(PE) and (ii) the other corpus is made up of 690 from life and health sciences (LH). This paper presents our preliminary results and alsodiscusses the various challenges involved in multi-label tagging and works towards satisfactory solutions. In addition, we also makeour two training corpora publicly available so that they may serve as benchmark for this new task.

AB - The relevance of automatically identifying rhetorical moves in scientific texts has been widely acknowledged in the literature. Thisstudy focuses on abstracts of standard research papers written in English and aims to tackle a fundamental limitation of currentmachine-learning classifiers: they are mono-labeled, that is, a sentence can only be assigned one single label. However, such approachdoes not adequately reflect actual language use since a move can be realized by a clause, a sentence, or even several sentences. Here,we present MAZEA (Multi-label Argumentative Zoning for English Abstracts), a multi-label classifier which automatically identifiesrhetorical moves in abstracts but allows for a given sentence to be assigned as many labels as appropriate. We have resorted to variousother NLP tools and used two large training corpora: (i) one corpus consists of 645 abstracts from physical sciences and engineering(PE) and (ii) the other corpus is made up of 690 from life and health sciences (LH). This paper presents our preliminary results and alsodiscusses the various challenges involved in multi-label tagging and works towards satisfactory solutions. In addition, we also makeour two training corpora publicly available so that they may serve as benchmark for this new task.

KW - corpus linguistics

KW - English Abstract

KW - rhetorical moves

KW - multi-label sentence classifier

M3 - Conference contribution/Paper

BT - Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012)

ER -