Home > Research > Publications & Outputs > Learning with noise

Electronic data

  • acl2017

    Accepted author manuscript, 3.28 MB, PDF document

    Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

Links

Text available via DOI:

View graph of relations

Learning with noise: enhance distantly supervised relation extraction with dynamic transition matrix

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Published
  • Binfeng Luo
  • Yansong Feng
  • Zheng Wang
  • Zhanxing Zhu
  • Songfang Huang
  • Rui Yan
  • Dongyan Zhao
Close
Publication date30/07/2017
Host publicationThe 55th Annual Meeting of the Association for Computational Linguistics: Proceedings of the Conference, Vol. 1 (Long Papers)
Place of PublicationStroudsburg, Pa.
PublisherAssociation for Computational Linguistics
Number of pages10
ISBN (print)9781945626753
<mark>Original language</mark>English
EventAnnual Meeting of the Association for Computational Linguistics - Vancouver, Canada
Duration: 30/07/20174/08/2017
http://acl2017.org/

Conference

ConferenceAnnual Meeting of the Association for Computational Linguistics
Abbreviated titleACL 2017
Country/TerritoryCanada
CityVancouver
Period30/07/174/08/17
Internet address

Conference

ConferenceAnnual Meeting of the Association for Computational Linguistics
Abbreviated titleACL 2017
Country/TerritoryCanada
CityVancouver
Period30/07/174/08/17
Internet address

Abstract

Distant supervision significantly reduces human efforts in building training data for many classification tasks. While promising, this technique often introduces noise to the generated training data, which can severely affect the model performance. In this paper, we take a deep look at the application of distant supervision in relation extraction.
We show that the dynamic transition matrix can effectively characterize the noise in the training data built by distant supervision. The transition matrix can be effectively trained using a novel curriculum learning based method without any direct supervision about the noise. We thoroughly evaluate our approach under a wide range of extraction scenarios. Experimental results show that our approach consistently improves the extraction results and outperforms the state-of-the-art in various evaluation scenarios.