Home > Research > Researchers > Mo El-Haj
Profile photoDr Mahmoud El Haj
NLP Lecturer
Computing and Communications
Data Science Institute
DSI - Foundations
SCC (Data Science)
UCREL - University Centre for Computer Corpus Research on Language
Postal address:
InfoLab21
LA1 4WA
Lancaster
United Kingdom
Email: m.el-haj@lancaster.ac.uk
Phone: +44 1524 510348

Education

2012PhD, Computer Science, University of Essex, UK
2008MSc, Information Systems, The University of Jordan, Jordan
2005:BSc, Computer Information Systems, The University of Jordan, Jordan

Projects

An Assessment of Corporate Disclosures of IFRS 15: Revenue from Contracts with Customers

El-Haj, M. (Principal Investigator) & Trottier, K. (Principal Investigator)

27/04/201/07/22

BioTM Project

Knight, J. (Principal Investigator), Rayson, P. (Team Member), El-Haj, M. (Research Associate) & Menadue, S. (Research Associate)

1/05/1831/03/19

DSI: A Small Welsh Language Model Pilot for Sentiment Analysis Testing

Rayson, P. (Co-Investigator) & El-Haj, M. (Principal Investigator)

Welsh Government

1/07/2428/02/25

DSI: FreeTxt: supporting bilingual free-text survey and questionnaire data analysis

El-Haj, M. (Co-Investigator) & Rayson, P. (Principal Investigator)

AHRC

1/03/2231/10/23

DSI: Welsh Automatic Text Summarisation

El-Haj, M. (Principal Investigator)

Welsh Government

1/05/2130/09/22

DSI: Welsh Digital Grid

Rayson, P. (Principal Investigator) & El-Haj, M. (Co-Investigator)

Welsh Government

1/07/2329/03/24

SenseSourcing

El-Haj, M. (Research Associate) & Rayson, P. (Principal Investigator)

1/12/141/07/15

ThACC – Thesawrws Ar-lein Cymraeg Cyfoes: Using Word Embeddings to Create a Thesaurus of Contemporary Welsh

El-Haj, M. (Principal Investigator)

Welsh Government

1/06/2210/06/23

The Canadian Annual Report Extractor Project (CARE)

El-Haj, M. (Principal Investigator)

HEC Montreal

26/04/2230/04/23

The Canadian Annual Report Extractor Project (CARE) - Accelerate International

El-Haj, M. (Principal Investigator)

Mitacs

1/05/2214/02/24

UCC: Understanding Corporate Communications

El-Haj, M. (Research Associate), Rayson, P. (Co-Investigator), McEnery, T. (Principal Investigator), Hardie, A. (Co-Investigator) & Young, S. (Co-Investigator)

1/12/141/10/16

Understanding the Influences of Financial Reporting, Corporate Disclosure and financial media on the Corporate Financial Information Environment

Rayson, P. (Principal Investigator), Young, S. (Co-Investigator) & El-Haj, M. (Research Associate)

ESRC

1/12/1230/11/14

VardSourcing

El-Haj, M. (Research Associate) & Rayson, P. (Principal Investigator)

1/12/141/07/15

ACC: Welsh Summary Creator

El-Haj, M. (Principal Investigator), Knight, D. (Principal Investigator), Morris, J. (Co-Investigator) & Ezeani, I. (Researcher)

1/05/211/05/22

Research output

FreeTxt: A corpus-based bilingual free-text survey and questionnaire data analysis toolkit

Knight, D., Khallaf, N., Rayson, P., El-Haj, M., Ezeani, I. & Morris, S., 31/12/2024, In: Applied Corpus Linguistics. 4, 3, 100103.

AraFinNLP 2024: The First Arabic Financial NLP Shared Task

Malaysha, S., El-Haj, M., Ezzini, S., Khalilia, M., Jarrar, M., Almujaiwel, S., Berrada, I. & Bouamor, H., 16/08/2024, Proceedings of The Second Arabic Natural Language Processing Conference. Habash, N. (ed.). Kerrville, Texas: Association for Computational Linguistics (ACL Anthology), p. 393-402 10 p.

Metric-Oriented Pretraining of Neural Source Code Summarisation Transformers to Enable more Secure Software Development

Phillips, J., El-Haj, M. & Hall, T., 30/07/2024, p. 17-31. 15 p.

The Multilingual Corpus of World’s Constitutions (MCWC): MCWC

El-Haj, M. & Ezzini, S., 25/03/2024, (Accepted/In press). 10 p.

The Financial Document Causality Detection Shared Task (FinCausal 2023)

Moreno-Sandoval, A., Porta-Zamorano, J., Carbajo-Coronado, B., Samy, D., Mariko, D. & El-Haj, M., 1/02/2024, 2023 IEEE International Conference on Big Data. He, J., Palpanas, T., Hu, X., Cuzzocrea, A., Dou, D., Slezak, D., Wang, W., Gruca, A., Lin, J. C.-W. & Agrawal, R. (eds.). Los Alamitos, CA, USA: IEEE Computer Society Press, p. 2855-2860 6 p. (Proceedings - 2023 IEEE International Conference on Big Data, BigData 2023).

The Financial Narrative Summarisation Shared Task (FNS 2023)

Zavitsanos, E., Kosmopoulos, A., Giannakopoulos, G., Litvak, M., Carbajo-Coronado, B., Moreno-Sandoval, A. & El-Haj, M., 1/02/2024, 2023 IEEE International Conference on Big Data (BigData). He, J., Palpanas, T., Hu, X., Cuzzocrea, A., Dou, D., Slezak, D., Wang, W., Gruca, A., Lin, J. C.-W. & Agrawal, R. (eds.). IEEE Computer Society Press, p. 2890-2896 7 p.

Advancements in Financial Document Structure Extraction: Insights from Five Years of FinTOC (2019-2023)

Kang, J., Patel, M., Agrawal, A., Sevitha, S., Srinivasa, R., Bellato, S., Kumar, M. A., Tsang, N. & El-Haj, M., 22/01/2024, Proceedings - 2023 IEEE International Conference on Big Data, BigData 2023. He, J., Palpanas, T., Hu, X., Cuzzocrea, A., Dou, D., Slezak, D., Wang, W., Gruca, A., Lin, J. C.-W. & Agrawal, R. (eds.). Los Alamitos, CA, USA: IEEE Computer Society Press, p. 2839-2844 6 p. (Proceedings - 2023 IEEE International Conference on Big Data, BigData 2023).

#Menopause: Examining the frequency of communications about menopause on twitter between 2014 and 2022

Hunter, M. S., El-Haj, M., Thorne, E., Griffiths, A. & Hardy, C., 30/11/2023, In: Maturitas. 177, 107806.

Open-Source Thesaurus Development for Under-Resourced Languages: a Welsh Case Study

Khallaf, N., Arfon, E., El-Haj, M., Morris, J., Knight, D., Rayson, P., Hammouda, T. & Jarrar, M., 14/09/2023.

Exploring Abstractive Text Summarisation for Podcasts: A Comparative Study of BART and T5 Models

Saxena, P. & El-Haj, M., 6/09/2023. 11 p.

FinAraT5: A text to text model for financial Arabic text understanding and generation

Zmandar, N., El-Haj, M. & Rayson, P., 1/09/2023, Proceedings of the 4th Conference on Language, Data and Knowledge. Carvalho, S., Khan, A. F., Anić, A. O., Spahiu, B., Gracia, J., McCrae, J. P., Gromann, D., Heinisch, B. & Salgado, A. (eds.). Vienna, Austria: NOVA CLUNL, Portugal, p. 262-273 12 p.

Analysing and visualising free-text comments: a corpus-based toolkit

Knight, D., Rayson, P., Khallaf, N., Morris, S., El-Haj, M. & Ezeani, I., 5/07/2023.

A Comparative Study of Evaluation Metrics for Long-Document Financial Narrative Summarization with Transformers

Zmandar, N., El-Haj, M. & Rayson, P., 21/06/2023, Natural Language Processing and Information Systems - 28th International Conference on Applications of Natural Language to Information Systems, NLDB 2023, Proceedings. Métais, E., Meziane, F., Manning, W., Reiff-Marganiec, S. & Sugumaran, V. (eds.). Cham: Springer, p. 391-403 13 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 13913 LNCS).

IgboNER 2.0: Expanding Named Entity Recognition Datasets via Projection

Chukwuneke, C., Rayson, P., Ezeani, I., El-Haj, M., Asogwa, D., Okpalla, C. & Mbonu, C., 3/03/2023, (Accepted/In press).

A Data-driven Latent Semantic Analysis for Automatic Text Summarization using LDA Topic Modelling

Onah, D. F. O., Pang, E. L. L. & El-Haj, M., 26/01/2023, 2022 IEEE International Conference on Big Data (Big Data). IEEE, p. 2771-2780 10 p. (2022 IEEE International Conference on Big Data (Big Data)).

Semantic domains across topics, genders and languages

Khallaf, N., de Souza, E., El-Haj, M. & Rayson, P., 23/12/2022, Bilingual Writers and Corpus Analysis. Palfreyman, D. M. & Habash, N. (eds.). London: Routledge, p. 96-120 25 p. (Routledge Studies in Applied Linguistics).

Improved Evaluation of Automatic Source Code Summarisation

Phillips, J., Bowes, D., El-Haj, M. & Hall, T., 7/12/2022, 2nd Wokshop on Natural Language Generation, Evaluation and Metrics: Proceedings of the Workshop. Stroudsberg, PA.: Association for Computational Linguistics (ACL Anthology), p. 326-335 10 p.

A Domain Based Approach to Semantic Lexicon Expansion

Prentice, S., Rayson, P., Knight, J., El-Haj, M. & Elstein, S., 30/09/2022, In: International Journal of Lexicography. 35, 3, p. 364-377 14 p.

CoFiF Plus: A French Financial Narrative Summarisation Corpus

Zmandar, N., Daudert, T., Ahmadi, S., El-Haj, M. & Rayson, P., 23/06/2022, Language Resources and Evaluation (LREC 2022). Calzolari, N. (ed.). Paris: European Language Resources Association (ELRA), p. 1622-1639 18 p.

IgboBERT Models: Building and Training Transformer Models for the Igbo Language

Chukwuneke, C., Rayson, P., Ezeani, I. & El-Haj, M., 20/06/2022, LREC 2022 Conference Proceedings. Calzolari, N. (ed.). Paris: European Language Resources Association (ELRA), p. 5114–5122 9 p.

Introducing the Welsh Text Summarisation Dataset and Baseline Systems

Ezeani, I., El-Haj, M., Morris, J. & Knight, D., 20/06/2022, LREC 2022 Conference Proceedings. Calzolari, N. (ed.). Paris: European Languages Resources Association, p. 5097-5106 10 p.

AraSAS: The Open Source Arabic Semantic Tagger

El-Haj, M., Rayson, P., de Souza, E., Khallaf, N. & Habash, N., 15/06/2022, p. 23-31. 8 p.

Creation of an Evaluation Corpus and Baseline Evaluation Scores for Welsh Text Summarisation

El-Haj, M., Ezeani, I., Morris, J. & Knight, D., 15/06/2022, p. 14-21. 8 p.

Financial Narrative Summarisation Using a Hybrid TF-IDF and Clustering Summariser: AO-Lancs System at FNS 2022

El-Haj, M. & Ogden, A., 15/06/2022, p. 88-91. 3 p.

Proceedings of the 4th Financial Narrative Processing Workshop (FNP 2022)

El-Haj, M., Rayson, P. & Zmandar, N., 15/06/2022, France: European Language Resources Association (ELRA). 161 p.

The Financial Causality Extraction Shared Task (FinCausal 2022)

Mariko, D., Trottier, K. & El-Haj, M., 15/06/2022, p. 114-116. 3 p.

The Financial Document Structure Extraction Shared Task (FinTOC 2022)

El-Haj, M., Kang, J., Azzi, A. A., Bellato, S., El Maarouf, I., Gan, M., Gisbert, A. & Sandoval, A., 15/06/2022, p. 92-97. 5 p.

The Financial Narrative Summarisation Shared Task (FNS 2022)

El-Haj, M., Zmandar, N., Rayson, P., AbuRa'ed, A., Litvak, M., Pittaras, N., Giannakopoulos, G., Kosmopoulos, A., Carbajo-Coronado, B. & Sandoval, A., 15/06/2022, Language Resources and Evaluation (LREC 2022). Calzolari, N. (ed.). Paris: European Language Resources Association (ELRA), p. 52-61 9 p.

MULDASA: Multifactor Lexical Sentiment Analysis of Social-Media Content in Nonstandard Arabic Social Media

Alwakid, G., Osman, T., El-Haj, M., Alanazi, S., Humayun, M. & Us Sama, N., 9/04/2022, In: Applied Sciences. 12, 8, 18 p., 3806.

Multilingual Financial Word Embeddings for Arabic, English and French

Zmandar, N., El-Haj, M. & Rayson, P., 13/01/2022, 2021 IEEE International Conference on Big Data (Big Data). IEEE, p. 4584-4589 6 p.

The Influence of Social Factors on Mental Health and Wellbeing during the COVID-19 Pandemic

El-Haj, M. & Sartain, A., 13/01/2022, 2021 IEEE International Conference on Big Data (Big Data). IEEE, p. 2818-2827 10 p.

Review of the State of the Art in Financial Narrative Processing

El-Haj, M., Rayson, P., El Maarouf, I., Bentabet, N.-I., Mariko, D., Labidurie, E., Litvak, M., Giannakopoulos, G., AbuRa'ed, A. & Zmandar, N., 13/12/2021, Financial Narrative Processing in Spanish. Moreno Sandoval, A. (ed.). Tirant lo Blanch, p. 51-98 48 p. (Tecnología, traducción y cultura).

Joint abstractive and extractive method for long financial documentsummarization

Zmandar, N., Singh, A., El-Haj, M. & Rayson, P., 26/10/2021, p. 99-105. 7 p.

Proceedings of the 3rd Financial Narrative Processing Workshop: FNP 2021

El-Haj, M. (Editor), Rayson, P. (Editor) & Zmandar, N. (Editor), 26/10/2021, Lancaster, United Kingdom: Association for Computational Linguistics (ACL Anthology).

The Financial Document Causality Detection Shared Task (FinCausal 2021)

Mariko, D., Akl, H. A., Labidurie, E., Durfort, S., de Mazancourt, H. & El-Haj, M., 26/10/2021, p. 58-60. 3 p.

The Financial Document Structure Extraction Shared Task (FinTOC2021): FinTOC 2021

Maarouf, I. E., Kang, J., Azzi, A. A., Bellato, S., Gan, M. & El-Haj, M., 26/10/2021, p. 111-119. 9 p.

The Financial Narrative Summarisation Shared Task FNS 2021

Zmandar, N., El-Haj, M., Rayson, P., Abura'Ed, A., Litvak, M., Giannakopoulos, G. & Pittaras, N., 26/10/2021, Proceedings of the 3rd Financial Narrative Processing Workshop: FNP 2021. Lancaster, United Kingdom: Association for Computational Linguistics (ACL Anthology), Vol. 3. p. 120-125 6 p.

Problematising Characteristicness: A Biomedical Association Case Study

Prentice, S., Knight, J., Rayson, P., El-Haj, M. & Rutherford, N., 31/08/2021, In: International Journal of Corpus Linguistics. 26, 3, p. 305-335 31 p.

Proceedings of the 1st Joint Workshop on Financial Narrative Processing and MultiLing Financial Summarisation

El-Haj, M., Rayson, P., Athanasakou, V., Bouamor, H., Salzedo, C., Giannakopoulos, G., Litvak, M., Pittaras, N., Elhag, A. & Ferradans, S., 1/12/2020, Association for Computational Linguistics.

Proceedings of the Fifth Arabic Natural Language Processing Workshop

Zitouni, I. (Editor), Abdul-Mageed, M. (Editor), Bouamor, H., Bougares, F. (Editor), El-Haj, M. (Editor), Tomeh, N. (Editor) & Zaghouani, W., 1/12/2020, Barcelona, Spain : Association for Computational Linguistics. 14 p.

The Financial Document Causality Detection Shared Task (FinCausal 2020)

Mariko, D., Abi-Akl, H., Labidurie, E., Durfort, S., De Mazancourt, H. & El-Haj, M., 1/12/2020, Proceedings of the 1st Joint Workshop on Financial Narrative Processing and MultiLing Financial Summarisation: FNP-FNS 2020. Barcelona, Spain (Online): COLING, p. 23-32 10 p.

The Financial Document Structure Extraction Shared task (FinToc 2020)

Bentabet, N.-I., Juge, R., El Maarouf, I., Mouilleron, V., Valsamou-Stanislawski, D. & El-Haj, M., 1/12/2020, Proceedings of the 1st Joint Workshop on Financial Narrative Processing and MultiLing Financial Summarisation. Barcelona, Spain (Online): COLING, p. 13-22 10 p.

The Financial Narrative Summarisation Shared Task (FNS 2020)

El Haj, M., Giannakopoulos, G., AbuRa'ed, A., Litvak, M. & Pittaras, N., 1/12/2020, The First Financial Narrative Processing Workshop: Proceedings of the 1st Joint Workshop on Financial Narrative Processing and MultiLing Financial Summarisation. El-Haj, M. (ed.). 12 p.

Habibi - a multi Dialect multi National Arabic Song Lyrics Corpus

El-Haj, M., 11/05/2020, LREC 2020, Twelfth International Conference on Language Resources and Evaluation: LREC'20. European Language Resources Association (ELRA), 9 p.

Infrastructure for Semantic Annotation in the Genomics Domain

El-Haj, M., Rutherford, N., Coole, M., Ezeani, I., Prentice, S., Ide, N., Knight, J., Piao, S., Mariani, J., Rayson, P. & Suderman, K., 11/05/2020, LREC 2020, Twelfth International Conference on Language Resources and Evaluation: LREC'20. Paris: European Language Resources Association (ELRA), p. 6921-6929 9 p.

Retrieving, Classifying and Analysing Narrative Commentary in Unstructured (Glossy) Annual Reports Published as PDF Files

El Haj, M., Alves, P., Rayson, P., Walker, M. & Young, S., 1/01/2020, In: Accounting and Business Research. 50, 1, p. 6-34 29 p.

Annual Report Commentary on the Value Creation Process

Athanasakou, V., El-Haj, M., Rayson, P., Walker, M. & Young, S., 2020, p. 1-63, 63 p.

Who’s the Fairest of them All? A Comparison of Methods for Classifying Tone and Attribution in Earnings-related Management Discourse

Young, S., Walker, M., Athanasakou, V., El-Haj, M., Rayson, P. & Schleicher, T., 2020, p. 1-47, 49 p.

Proceedings of the Second Financial Narrative Processing Workshop (FNP 2019)

El-Haj, M. (Editor), Rayson, P. (Editor), Young, S. (Editor), Bouamor, H. (Editor) & Ferradans, S. (Editor), 30/09/2019, Stroudsburg, PA: Association for Computational Linguistics. 87 p.

Readability of Patient Educational Materials in English Versus Arabic

Malik, A., El Haj, M. & Paasche-Orlow, M., 19/07/2019, In: HLRP: Health Literacy Research and Practice. 3, 3, p. e170-e173 4 p.

In Search of Meaning: Lessons, Resources and Next Steps for Computational Analysis of Financial Discourse

El Haj, M., Rayson, P. E., Walker, M., Young, S. E. & Simaki, V., 30/04/2019, In: Journal of Business Finance and Accounting. 46, 3-4, p. 265-306 42 p.

Multilingual Financial Narrative Processing: Analysing Annual Reports in English, Spanish and Portuguese

El Haj, M., Rayson, P. E., Young, S. E., Alves, P. & Herrero Zorita, C., 02/2019, Multilingual Text Analysis: Challenges, Models, and Approaches. Litvak, M. & Vanetik, N. (eds.). World Scientific Publishing

Profiling Medical Journal Articles Using a Gene Ontology Semantic Tagger

El Haj, M., Rayson, P. E., Piao, S. S. & Knight, J., 11/05/2018, LREC 2018, Eleventh International Conference on Language Resources and Evaluation. Calzolari, N., Choukri, K., Cieri, C., Declerck, T., Goggi, S., Hasida, K., Isahara, H., Maegaard, B., Mariani, J., Mazo, H., Moreno, A., Odijk, J., Piperidis, S. & Tokunaga, T. (eds.). European Language Resources Association (ELRA), p. 4593-4597 5 p.

Arabic Dialect Identification in the Context of Bivalency and Code-Switching

El Haj, M., Rayson, P. E. & Aboelezz, M., 9/05/2018, LREC 2018, Eleventh International Conference on Language Resources and Evaluation. Calzolari, N., Choukri, K., Cieri, C., Declerck, T., Goggi, S., Hasida, K., Isahara, H., Maegaard, B., Mariani, J., Mazo, H., Moreno, A., Odijk, J., Piperidis, S. & Tokunaga, T. (eds.). p. 3622-3627 6 p.

Towards a Multilingual Financial Narrative Processing System

El Haj, M., Rayson, P. E., Alves, P. & Young, S. E., 7/05/2018, The First Financial Narrative Processing Workshop: Proceedings of the 11th Edition of the Language Resources and Evaluation Conference - Miyazaki, Japan. El-Haj, M., Rayson, P. & Moore, A. (eds.). p. 52-58 7 p.

Readability of Arabic vs English Patient Educational Materials

El Haj, M., Malik, A. & Paasche-Orlow, M. K., 12/04/2018. 2 p.

Does equity analyst research lack rigour and objectivity? Evidence from conference call questions and research notes

Salzedo, C. J., Young, S. E. & El Haj, M., 2018, In: Accounting and Business Research. 48, 1, p. 5-36 32 p.

A Comparison Between Genetics Papers Relating to Immune Disorders and Psychiatric Disorders

El-Haj, M., Piao, S. S., Rayson, P. E. & Knight, J., 11/09/2017.

Creating and validating multilingual semantic representations for six languages: expert versus non-expert crowds

El-Haj, M., Rayson, P., Piao, S. & Wattam, S., 3/04/2017, Proceedings of the 1st Workshop on Sense, Concept and Entity Representations and their Applications: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics. Association for Computational Linguistics, p. 61-71 11 p.

Learning tone and attribution for financial text mining

El-Haj, M., Rayson, P. E., Young, S. E., Walker, M., Moore, A., Athanasakou, V. & Schleicher, T., 23/05/2016, Proceedings of LREC 2016, Tenth International Conference on Language Resources and Evaluation. Calzolari, N., Choukri, K., Declerck, T., Grobelnik, M., Maegaard, B., Mariani, J., Moreno, A., Odijk, J. & Piperidis, S. (eds.). European Language Resources Association (ELRA), p. 1820-1825 6 p.

Lexical coverage evaluation of large-scale multilingual semantic lexicons for twelve languages

Piao, S. S., Rayson, P. E., Archer, D., Bianchi, F., Dayrell , C., El-Haj, M., Jiménez, R.-M., Knight, D., Křen, M., Lofberg, L., Nawab, R. M. A., Shafi, J., Teh, P. L. & Mudraya, O., 23/05/2016, LREC 2016, Tenth International Conference on Language Resources and Evaluation. Calzolari, N., Choukri, K., Declerck, T., Grobelnik, M., Maegaard, B., Mariani, J., Moreno, A., Odijk, J. & Piperidis, S. (eds.). European Language Resources Association (ELRA), p. 2614-2619 6 p.

OSMAN: a novel Arabic readability metric

El-Haj, M. & Rayson, P. E., 23/05/2016, Proceedings of the Language Resources and Evaluation Conference 2016. Calzolari, N., Choukri, K., Declerck, T., Grobelnik, M., Maegaard, B., Mariani, J., Moreno, A., Odijk, J. & Piperidis, S. (eds.). 10 ed. Slovenia: European Language Resources Association (ELRA), p. 250-255 6 p. 77

Creating language resources for under-resourced languages: methodologies, and experiments with Arabic

El-Haj, M., Kruschwitz, U. & Fox, C., 09/2015, In: Language Resources and Evaluation. 49, 3, p. 549-580 32 p.

Does equity analyst research lack rigor and objectivity? Evidence from conference call questions and research notes

Salzedo, C., Young, S. & El-Haj, M., 6/08/2014, Lancaster University Management School, p. 1-50, 50 p. (Department of Accounting and Finance Working Paper Series; no. AF2014/15WP01).

Computer-based analysis of the strategic content of UK annual report narratives

El-Haj, M., Athanasakou, V., Rayson, P., Young, S. & Walker, M., 2014. 6 p.

Detecting document structure in a very large corpus of UK financial reports

El-Haj, M., Rayson, P., Young, S. & Walker, M., 2014, LREC'14 Ninth International Conference on Language Resources and Evaluation . Reykjavik, Iceland: European Language Resources Association (ELRA), p. 1335-1338 4 p. 402. (Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC-2014)).

Language independent evaluation of translation style and consistency: comparing human and machine translations of Camus’ novel “The Stranger”

El-Haj, M., Rayson, P. & Hall, D., 2014, Text, speech and dialogue: 17th International Conference, TSD 2014, Brno, Czech Republic, September 8-12, 2014. Proceedings. Sojka, P., Horák, A., Kopecek, I. & Pala, K. (eds.). Springer International Publishing, p. 116-124 9 p. (Lecture Notes in Computer Science; vol. 8655).

An experiment in automatic indexing using the HASSET thesaurus

El-Haj, M., Balkan, L., Barbalet, S., Bell, L. & Shepherdson, J., 17/09/2013, Computer Science and Electronic Engineering Conference (CEEC), 2013 5th. IEEE, p. 13-18 6 p.

Multi-document multilingual summarization corpus preparation, Part 1: Arabic, English, Greek, Chinese, Romanian

Li, L., Forascu, C., El-Haj, M. & Giannakopoulos, G., 08/2013, Proceedings of the MultiLing 2013 Workshop on Multilingual Multi-document Summarization. Sofia, Bulgaria: Association for Computational Linguistics, p. 1-12 12 p.

Using a keyness metric for single and multi document summarisation

El-Haj, M. & Rayson, P., 08/2013, Proceedings of the MultiLing 2013 Workshop on Multilingual Multi-document summarization . Sofia, Bulgaria: Association for Computational Linguistics, p. 64-71 8 p.

Arabic topic detection using automatic text summarisation

Koulali, R., El-Haj, M. & Meziane, A., 2013, Computer Systems and Applications (AICCSA), 2013 ACS International Conference on. IEEE Computer Society, p. 1-4 4 p.

KALIMAT a multipurpose Arabic corpus

El-Haj, M. & Koulali, R., 2013, p. 22-25. 4 p.

UKDA keyword indexing with a SKOS version of HASSET thesaurus

El-Haj, M., 2013, Cologne, Germany: iAssist.

Arabic multi-document text summarisation

El-Haj, M., 2012, Colchester, Essex: University of Essex. 165 p.

Assessing crowdsourcing quality through objective tasks

Aker, A., El-Haj, M., Albakour, M.-D. & Kruschwitz, U., 2012, Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12). Istanbul, Turkey: European Language Resources Association (ELRA), p. 1456-1461 6 p.

Experimenting with automatic text summarization for Arabic

El-Haj, M., Kruschwitz, U. & Fox, C., 2011, Human language technology - challenges for computer science and linguistics: 4th Language and Technology Conference, LTC 2009, Poznan, Poland, November 6-8, 2009, Revised Selected Papers. Vetulani, Z. (ed.). Berlin: Springer, p. 490-499 10 p. (Lecture Notes in Computer Science; vol. 6652).

Exploring clustering for multi-document Arabic summarisation

El-Haj, M., Kruschwitz, U. & Fox, C., 2011, Information Retrieval Technology: 7th Asia Information Retrieval Societies Conference, AIRS 2011, Dubai, United Arab Emirates, December 18-20, 2011. Proceedings. Salem, M. V. M., Shaalan, K., Oroumchian, F., Shakery, A. & Khelalfa, H. (eds.). Berlin: Springer, p. 550-561 12 p. (Lecture Notes in Computer Science; vol. 7097).

Multi-document Arabic text summarisation

El-Haj, M., Kruschwitz, U. & Fox, C., 2011, Computer Science and Electronic Engineering Conference (CEEC), 2011 3rd. IEEE, p. 365-369 5 p.

TAC 2011 MultiLing pilot overview

Giannakopoulos, G., El-Haj, M., Favre, B., Litvak, M., Steinberger, J. & Varma, V., 2011, Text Analysis Conference (TAC) 2011, MultiLing Summarisation Pilot. Maryland, USA: TAC, 17 p.

University of Essex at the TAC 2011 Multilingual Summarisation Pilot

El-Haj, M., Kruschwitz, U. & Fox, C., 2011, Proceedings of the Text Analysis Conference (TAC) 2011, MultiLing Summarisation Pilot, Maryland, USA. Maryland, USA: TAC, 7 p.

Understanding the Quran: a new grand challenge for computer science and artificial intelligence

Atwell, E., Habash, N., Louw, B., Abu Shawar, B., McEnery, T., Zaghouani, W. & El-Haj, M., 2010. 2 p.

Using mechanical Turk to create a corpus of Arabic summaries

El-Haj, M., Kruschwitz, U. & Fox, C., 2010, Language Resources (LRs) and Human Language Technologies (HLT) for Semitic Languages workshop held in conjunction with the 7th International Language Resources and Evaluation Conference (LREC 2010). Valletta, Malta: LREC 2010, p. 36-39 4 p.

Enhancing retrieval effectiveness of diacritisized Arabic passages using stemmer and thesaurus

Hammo, B., Sleit, A. & El-Haj, M., 2008, The 19th Midwest Artificial Intelligence And Cognitive Science Conference Maics2008. p. 189–196 8 p.

Evaluation of query-based Arabic text summarization system

El-Haj, M. & Hammo, B., 2008, Natural Language Processing and Knowledge Engineering, 2008. NLP-KE '08. International Conference on. Beijing, China: IEEE Computer Society, p. 1-7 7 p.

Experimenting with automatic summarization of Arabic text

El-Haj, M., 2008, Amman, Jordan: The University of Jordan.

Effectiveness of query expansion in searching the Holy Quran

Hammo, B., Sleit, A. & El-Haj, M., 2007, The Second International Conference on Arabic Language Processing CITALA'07. Rabat, Morocco, p. 1-10 10 p.