Home > Research > Researchers > Mahmoud El Haj
Profile photoDr Mahmoud El-Haj
Senior Research Associate
UCREL - University Centre for Computer Corpus Research on Language
School of Computing and Communications
E-mail: m.el-haj@lancaster.ac.uk

Education

2012PhD, Computer Science, University of Essex, UK
2008MSc, Information Systems, The University of Jordan, Jordan
2005:BSc, Computer Information Systems, The University of Jordan, Jordan

Projects

An Assessment of Corporate Disclosures of IFRS 15: Revenue from Contracts with Customers

El Haj, M. & Trottier, K.

27/04/201/07/22

BioTM Project

Knight, J., Rayson, P., El Haj, M. & Prentice, S.

1/05/1831/03/19

SenseSourcing

El Haj, M. & Rayson, P.

1/12/141/07/15

UCC: Understanding Corporate Communications

El Haj, M., Rayson, P., McEnery, T., Hardie, A. & Young, S.

1/12/141/10/16

Understanding the influences of Financial Reporting, Corporate : Understanding the Influences of Financial Reporting, Corporate Disclosure and financial media on the Corporate Financial Information Environment

Rayson, P., Young, S. & El Haj, M.

1/12/1230/11/14

VardSourcing

El Haj, M. & Rayson, P.

1/12/141/07/15

Research output

Habibi - a multi Dialect multi National Arabic Song Lyrics Corpus

El-Haj, M., 11/05/2020, LREC 2020, Twelfth International Conference on Language Resources and Evaluation: LREC'20. European Language Resources Association (ELRA), 9 p.

Infrastructure for Semantic Annotation in the Genomics Domain

El-Haj, M., Rutherford, N., Coole, M., Ezeani, I., Prentice, S., Ide, N., Knight, J., Piao, S., Mariani, J., Rayson, P. & Suderman, K., 11/05/2020, LREC 2020, Twelfth International Conference on Language Resources and Evaluation: LREC'20. Paris: European Language Resources Association (ELRA), p. 6921-6929 9 p.

Retrieving, Classifying and Analysing Narrative Commentary in Unstructured (Glossy) Annual Reports Published as PDF Files

El Haj, M., Alves, P., Rayson, P., Walker, M. & Young, S., 1/01/2020, In : Accounting and Business Research. 50, 1, p. 6-34 29 p.

Annual Report Commentary on the Value Creation Process

Athanasakou, V., El-Haj, M., Rayson, P., Walker, M. & Young, S., 2020, p. 1-63, 63 p.

Who’s the Fairest of them All? A Comparison of Methods for Classifying Tone and Attribution in Earnings-related Management Discourse

Young, S., Walker, M., Athanasakou, V., El-Haj, M., Rayson, P. & Schleicher, T., 2020, p. 1-47, 49 p.

Proceedings of the Second Financial Narrative Processing Workshop (FNP 2019)

El-Haj, M. (ed.), Rayson, P. (ed.), Young, S. (ed.), Bouamor, H. (ed.) & Ferradans, S. (ed.), 30/09/2019, Stroudsburg, PA: Association for Computational Linguistics. 87 p.

Readability of Patient Educational Materials in English Versus Arabic

Malik, A., El Haj, M. & Paasche-Orlow, M., 19/07/2019, In : HLRP: Health Literacy Research and Practice. 3, 3, p. e170-e173 4 p.

In Search of Meaning: Lessons, Resources and Next Steps for Computational Analysis of Financial Discourse

El Haj, M., Rayson, P. E., Walker, M., Young, S. E. & Simaki, V., 30/04/2019, In : Journal of Business Finance and Accounting. 46, 3-4, p. 265-306 42 p.

Multilingual Financial Narrative Processing: Analysing Annual Reports in English, Spanish and Portuguese

El Haj, M., Rayson, P. E., Young, S. E., Alves, P. & Herrero Zorita, C., 02/2019, Multilingual Text Analysis: Challenges, Models, and Approaches. Litvak, M. & Vanetik, N. (eds.). World Scientific Publishing

Profiling Medical Journal Articles Using a Gene Ontology Semantic Tagger

El Haj, M., Rayson, P. E., Piao, S. S. & Knight, J., 11/05/2018, LREC 2018, Eleventh International Conference on Language Resources and Evaluation. Calzolari, N., Choukri, K., Cieri, C., Declerck, T., Goggi, S., Hasida, K., Isahara, H., Maegaard, B., Mariani, J., Mazo, H., Moreno, A., Odijk, J., Piperidis, S. & Tokunaga, T. (eds.). European Language Resources Association (ELRA), p. 4593-4597 5 p.

Arabic Dialect Identification in the Context of Bivalency and Code-Switching

El Haj, M., Rayson, P. E. & Aboelezz, M., 9/05/2018, LREC 2018, Eleventh International Conference on Language Resources and Evaluation. Calzolari, N., Choukri, K., Cieri, C., Declerck, T., Goggi, S., Hasida, K., Isahara, H., Maegaard, B., Mariani, J., Mazo, H., Moreno, A., Odijk, J., Piperidis, S. & Tokunaga, T. (eds.). p. 3622-3627 6 p.

Towards a Multilingual Financial Narrative Processing System

El Haj, M., Rayson, P. E., Alves, P. & Young, S. E., 7/05/2018, The First Financial Narrative Processing Workshop: Proceedings of the 11th Edition of the Language Resources and Evaluation Conference - Miyazaki, Japan. El-Haj, M., Rayson, P. & Moore, A. (eds.). p. 52-58 7 p.

Readability of Arabic vs English Patient Educational Materials

El Haj, M., Malik, A. & Paasche-Orlow, M. K., 12/04/2018. 2 p.

Does equity analyst research lack rigour and objectivity? Evidence from conference call questions and research notes

Salzedo, C. J., Young, S. E. & El Haj, M., 2018, In : Accounting and Business Research. 48, 1, p. 5-36 32 p.

A Comparison Between Genetics Papers Relating to Immune Disorders and Psychiatric Disorders

El-Haj, M., Piao, S. S., Rayson, P. E. & Knight, J., 11/09/2017.

Creating and validating multilingual semantic representations for six languages: expert versus non-expert crowds

El-Haj, M., Rayson, P., Piao, S. & Wattam, S., 3/04/2017, Proceedings of the 1st Workshop on Sense, Concept and Entity Representations and their Applications: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics. Association for Computational Linguistics, p. 61-71 11 p.

Learning tone and attribution for financial text mining

El-Haj, M., Rayson, P. E., Young, S. E., Walker, M., Moore, A., Athanasakou, V. & Schleicher, T., 23/05/2016, Proceedings of LREC 2016, Tenth International Conference on Language Resources and Evaluation. Calzolari, N., Choukri, K., Declerck, T., Grobelnik, M., Maegaard, B., Mariani, J., Moreno, A., Odijk, J. & Piperidis, S. (eds.). European Language Resources Association (ELRA), p. 1820-1825 6 p.

Lexical coverage evaluation of large-scale multilingual semantic lexicons for twelve languages

Piao, S. S., Rayson, P. E., Archer, D., Bianchi, F., Dayrell, C., El-Haj, M., Jiménez, R-M., Knight, D., Křen, M., Lofberg, L., Nawab, R. M. A., Shafi, J., Teh, P. L. & Mudraya, O., 23/05/2016, LREC 2016, Tenth International Conference on Language Resources and Evaluation. Calzolari, N., Choukri, K., Declerck, T., Grobelnik, M., Maegaard, B., Mariani, J., Moreno, A., Odijk, J. & Piperidis, S. (eds.). European Language Resources Association (ELRA), p. 2614-2619 6 p.

OSMAN: a novel Arabic readability metric

El-Haj, M. & Rayson, P. E., 23/05/2016, Proceedings of the Language Resources and Evaluation Conference 2016. Calzolari, N., Choukri, K., Declerck, T., Grobelnik, M., Maegaard, B., Mariani, J., Moreno, A., Odijk, J. & Piperidis, S. (eds.). 10 ed. Slovenia: European Language Resources Association (ELRA), p. 250-255 6 p. 77

Creating language resources for under-resourced languages: methodologies, and experiments with Arabic

El-Haj, M., Kruschwitz, U. & Fox, C., 09/2015, In : Language Resources and Evaluation. 49, 3, p. 549-580 32 p.

Does equity analyst research lack rigor and objectivity? Evidence from conference call questions and research notes

Salzedo, C., Young, S. & El-Haj, M., 6/08/2014, Lancaster University Management School, p. 1-50, 50 p. (Department of Accounting and Finance Working Paper Series; no. AF2014/15WP01).

Computer-based analysis of the strategic content of UK annual report narratives

El-Haj, M., Athanasakou, V., Rayson, P., Young, S. & Walker, M., 2014. 6 p.

Detecting document structure in a very large corpus of UK financial reports

El-Haj, M., Rayson, P., Young, S. & Walker, M., 2014, LREC'14 Ninth International Conference on Language Resources and Evaluation . Reykjavik, Iceland: European Language Resources Association (ELRA), p. 1335-1338 4 p. 402. (Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC-2014)).

Language independent evaluation of translation style and consistency: comparing human and machine translations of Camus’ novel “The Stranger”

El-Haj, M., Rayson, P. & Hall, D., 2014, Text, speech and dialogue: 17th International Conference, TSD 2014, Brno, Czech Republic, September 8-12, 2014. Proceedings. Sojka, P., Horák, A., Kopecek, I. & Pala, K. (eds.). Springer International Publishing, p. 116-124 9 p. (Lecture Notes in Computer Science; vol. 8655).

An experiment in automatic indexing using the HASSET thesaurus

El-Haj, M., Balkan, L., Barbalet, S., Bell, L. & Shepherdson, J., 17/09/2013, Computer Science and Electronic Engineering Conference (CEEC), 2013 5th. IEEE, p. 13-18 6 p.

Multi-document multilingual summarization corpus preparation, Part 1: Arabic, English, Greek, Chinese, Romanian

Li, L., Forascu, C., El-Haj, M. & Giannakopoulos, G., 08/2013, Proceedings of the MultiLing 2013 Workshop on Multilingual Multi-document Summarization. Sofia, Bulgaria: Association for Computational Linguistics, p. 1-12 12 p.

Using a keyness metric for single and multi document summarisation

El-Haj, M. & Rayson, P., 08/2013, Proceedings of the MultiLing 2013 Workshop on Multilingual Multi-document summarization . Sofia, Bulgaria: Association for Computational Linguistics, p. 64-71 8 p.

Arabic topic detection using automatic text summarisation

Koulali, R., El-Haj, M. & Meziane, A., 2013, Computer Systems and Applications (AICCSA), 2013 ACS International Conference on. IEEE Computer Society, p. 1-4 4 p.

KALIMAT a multipurpose Arabic corpus

El-Haj, M. & Koulali, R., 2013, p. 22-25. 4 p.

UKDA keyword indexing with a SKOS version of HASSET thesaurus

El-Haj, M., 2013, Cologne, Germany: iAssist.

Arabic multi-document text summarisation

El-Haj, M., 2012, Colchester, Essex: University of Essex. 165 p.

Assessing crowdsourcing quality through objective tasks

Aker, A., El-Haj, M., Albakour, M-D. & Kruschwitz, U., 2012, Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12). Istanbul, Turkey: European Language Resources Association (ELRA), p. 1456-1461 6 p.

Experimenting with automatic text summarization for Arabic

El-Haj, M., Kruschwitz, U. & Fox, C., 2011, Human language technology - challenges for computer science and linguistics: 4th Language and Technology Conference, LTC 2009, Poznan, Poland, November 6-8, 2009, Revised Selected Papers. Vetulani, Z. (ed.). Berlin: Springer, p. 490-499 10 p. (Lecture Notes in Computer Science; vol. 6652).

Exploring clustering for multi-document Arabic summarisation

El-Haj, M., Kruschwitz, U. & Fox, C., 2011, Information Retrieval Technology: 7th Asia Information Retrieval Societies Conference, AIRS 2011, Dubai, United Arab Emirates, December 18-20, 2011. Proceedings. Salem, M. V. M., Shaalan, K., Oroumchian, F., Shakery, A. & Khelalfa, H. (eds.). Berlin: Springer, p. 550-561 12 p. (Lecture Notes in Computer Science; vol. 7097).

Multi-document Arabic text summarisation

El-Haj, M., Kruschwitz, U. & Fox, C., 2011, Computer Science and Electronic Engineering Conference (CEEC), 2011 3rd. IEEE, p. 365-369 5 p.

TAC 2011 MultiLing pilot overview

Giannakopoulos, G., El-Haj, M., Favre, B., Litvak, M., Steinberger, J. & Varma, V., 2011, Text Analysis Conference (TAC) 2011, MultiLing Summarisation Pilot. Maryland, USA: TAC, 17 p.

University of Essex at the TAC 2011 Multilingual Summarisation Pilot

El-Haj, M., Kruschwitz, U. & Fox, C., 2011, Proceedings of the Text Analysis Conference (TAC) 2011, MultiLing Summarisation Pilot, Maryland, USA. Maryland, USA: TAC, 7 p.

Understanding the Quran: a new grand challenge for computer science and artificial intelligence

Atwell, E., Habash, N., Louw, B., Abu Shawar, B., McEnery, T., Zaghouani, W. & El-Haj, M., 2010. 2 p.

Using mechanical Turk to create a corpus of Arabic summaries

El-Haj, M., Kruschwitz, U. & Fox, C., 2010, Language Resources (LRs) and Human Language Technologies (HLT) for Semitic Languages workshop held in conjunction with the 7th International Language Resources and Evaluation Conference (LREC 2010). Valletta, Malta: LREC 2010, p. 36-39 4 p.

Enhancing retrieval effectiveness of diacritisized Arabic passages using stemmer and thesaurus

Hammo, B., Sleit, A. & El-Haj, M., 2008, The 19th Midwest Artificial Intelligence And Cognitive Science Conference Maics2008. p. 189–196 8 p.

Evaluation of query-based Arabic text summarization system

El-Haj, M. & Hammo, B., 2008, Natural Language Processing and Knowledge Engineering, 2008. NLP-KE '08. International Conference on. Beijing, China: IEEE Computer Society, p. 1-7 7 p.

Experimenting with automatic summarization of Arabic text

El-Haj, M., 2008, Amman, Jordan: The University of Jordan.

Effectiveness of query expansion in searching the Holy Quran

Hammo, B., Sleit, A. & El-Haj, M., 2007, The Second International Conference on Arabic Language Processing CITALA'07. Rabat, Morocco, p. 1-10 10 p.