Home > Research > Publications & Outputs > Development of the multilingual semantic annota...

Electronic data

  • N15-1137

    Final published version, 270 KB, PDF document

Links

View graph of relations

Development of the multilingual semantic annotation system

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Published

Standard

Development of the multilingual semantic annotation system. / Piao, Scott; Bianchi, Francesca; Dayrell, Carmen et al.
The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Proceedings of the Conference. Association for Computational Linguistics, 2015. p. 1268-1274.

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Harvard

Piao, S, Bianchi, F, Dayrell, C, D'egidio, A & Rayson, P 2015, Development of the multilingual semantic annotation system. in The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Proceedings of the Conference. Association for Computational Linguistics, pp. 1268-1274, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics – Human Language Technologies (NAACL HLT 2015), Denver, Colorado, United States, 31/05/15. <http://naacl.org/naacl-hlt-2015/>

APA

Piao, S., Bianchi, F., Dayrell, C., D'egidio, A., & Rayson, P. (2015). Development of the multilingual semantic annotation system. In The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Proceedings of the Conference (pp. 1268-1274). Association for Computational Linguistics. http://naacl.org/naacl-hlt-2015/

Vancouver

Piao S, Bianchi F, Dayrell C, D'egidio A, Rayson P. Development of the multilingual semantic annotation system. In The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Proceedings of the Conference. Association for Computational Linguistics. 2015. p. 1268-1274

Author

Piao, Scott ; Bianchi, Francesca ; Dayrell, Carmen et al. / Development of the multilingual semantic annotation system. The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Proceedings of the Conference. Association for Computational Linguistics, 2015. pp. 1268-1274

Bibtex

@inproceedings{6d2896057ebc48e29a1f6ec3c048674e,
title = "Development of the multilingual semantic annotation system",
abstract = "This paper reports on our research to generate multilingual semantic lexical resources and develop multilingual semantic annotation software, which assigns each word in running text to a semantic category based on a lexical semantic classification scheme. Such tools have an important role in developing intelligentmultilingual NLP, text mining and ICT systems. In this work, we aim to extend an existing English semantic annotation tool to cover a range of languages, namely Italian, Chinese and Brazilian Portuguese, by bootstrapping new semantic lexical resources via automatically translating existing English semantic lexicons into these languages. We used a set of bilingual dictionaries and word lists for this purpose. In our experiment, with minor manual improvement of the automaticallygenerated semantic lexicons, the prototype tools based on the new lexicons achieved an average lexical coverage of 79.86% and an average annotation precision of 71.42% (if only precise annotations are considered) or 84.64% (if partially correct annotations are included) on the three languages. Our experiment demonstrates that it is feasible to rapidly develop prototype semantic annotation tools for new languages by automatically bootstrappingnew semantic lexicons based on existing ones.",
keywords = "semantic annotation tool, multilingual corpus tool, semantic lexicon, USAS",
author = "Scott Piao and Francesca Bianchi and Carmen Dayrell and Angela D'egidio and Paul Rayson",
note = "Date of Acceptance: 20/02/2015; The 2015 Conference of the North American Chapter of the Association for Computational Linguistics – Human Language Technologies (NAACL HLT 2015) ; Conference date: 31-05-2015 Through 05-06-2015",
year = "2015",
month = jun,
day = "1",
language = "English",
isbn = "9781941643495",
pages = "1268--1274",
booktitle = "The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies",
publisher = "Association for Computational Linguistics",

}

RIS

TY - GEN

T1 - Development of the multilingual semantic annotation system

AU - Piao, Scott

AU - Bianchi, Francesca

AU - Dayrell, Carmen

AU - D'egidio, Angela

AU - Rayson, Paul

N1 - Date of Acceptance: 20/02/2015

PY - 2015/6/1

Y1 - 2015/6/1

N2 - This paper reports on our research to generate multilingual semantic lexical resources and develop multilingual semantic annotation software, which assigns each word in running text to a semantic category based on a lexical semantic classification scheme. Such tools have an important role in developing intelligentmultilingual NLP, text mining and ICT systems. In this work, we aim to extend an existing English semantic annotation tool to cover a range of languages, namely Italian, Chinese and Brazilian Portuguese, by bootstrapping new semantic lexical resources via automatically translating existing English semantic lexicons into these languages. We used a set of bilingual dictionaries and word lists for this purpose. In our experiment, with minor manual improvement of the automaticallygenerated semantic lexicons, the prototype tools based on the new lexicons achieved an average lexical coverage of 79.86% and an average annotation precision of 71.42% (if only precise annotations are considered) or 84.64% (if partially correct annotations are included) on the three languages. Our experiment demonstrates that it is feasible to rapidly develop prototype semantic annotation tools for new languages by automatically bootstrappingnew semantic lexicons based on existing ones.

AB - This paper reports on our research to generate multilingual semantic lexical resources and develop multilingual semantic annotation software, which assigns each word in running text to a semantic category based on a lexical semantic classification scheme. Such tools have an important role in developing intelligentmultilingual NLP, text mining and ICT systems. In this work, we aim to extend an existing English semantic annotation tool to cover a range of languages, namely Italian, Chinese and Brazilian Portuguese, by bootstrapping new semantic lexical resources via automatically translating existing English semantic lexicons into these languages. We used a set of bilingual dictionaries and word lists for this purpose. In our experiment, with minor manual improvement of the automaticallygenerated semantic lexicons, the prototype tools based on the new lexicons achieved an average lexical coverage of 79.86% and an average annotation precision of 71.42% (if only precise annotations are considered) or 84.64% (if partially correct annotations are included) on the three languages. Our experiment demonstrates that it is feasible to rapidly develop prototype semantic annotation tools for new languages by automatically bootstrappingnew semantic lexicons based on existing ones.

KW - semantic annotation tool

KW - multilingual corpus tool

KW - semantic lexicon

KW - USAS

M3 - Conference contribution/Paper

SN - 9781941643495

SP - 1268

EP - 1274

BT - The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

PB - Association for Computational Linguistics

T2 - The 2015 Conference of the North American Chapter of the Association for Computational Linguistics – Human Language Technologies (NAACL HLT 2015)

Y2 - 31 May 2015 through 5 June 2015

ER -