Multilingual Financial Narrative Processing: Analysing Annual Reports in English, Spanish and Portuguese

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Chapter (peer-reviewed)

Published
Publication date: 02/2019
Host publication: Multilingual Text Analysis: Challenges, Models, and Approaches
Editors: Marina Litvak, Natalia Vanetik
Publisher: World Scientific Publishing
ISBN (print): 9789813274877
Original language: English

Abstract

This chapter describes and evaluates the use of Information Extraction and Natural Language Processing methods for extracting and analysing financial annual reports in three languages: English, Spanish and Portuguese.

The work described retains information on document structure, which is needed to distinguish clearly between the narrative and financial statement components of annual reports and between individual sections within the narrative component. Extraction accuracy varies between languages, with English exceeding 95%. We apply the extraction methods to a comprehensive sample of annual reports published by UK, Spanish and Portuguese non-financial firms between 2003 and 2014.