PetBERT: automated ICD-11 syndromic disease coding for outbreak detection in first opinion veterinary electronic health records

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

Standard

PetBERT: automated ICD-11 syndromic disease coding for outbreak detection in first opinion veterinary electronic health records. / Farrell, Sean; Appleton, Charlotte; Noble, Peter-John Mäntylä et al.
In: Scientific Reports, Vol. 13, No. 1, 18015, 21.10.2023.

Vancouver

Farrell S, Appleton C, Noble PJM, Al Moubayed N. PetBERT: automated ICD-11 syndromic disease coding for outbreak detection in first opinion veterinary electronic health records. Scientific Reports. 2023 Oct 21;13(1):18015. doi: 10.1038/s41598-023-45155-7

Author

Farrell, Sean; Appleton, Charlotte; Noble, Peter-John Mäntylä et al. / PetBERT: automated ICD-11 syndromic disease coding for outbreak detection in first opinion veterinary electronic health records. In: Scientific Reports. 2023; Vol. 13, No. 1.

Bibtex

@article{4bcdab4ca3f946b7b0de50a8a5a39b18,
title = "PetBERT: automated ICD-11 syndromic disease coding for outbreak detection in first opinion veterinary electronic health records",
abstract = "Effective public health surveillance requires consistent monitoring of disease signals such that researchers and decision-makers can react dynamically to changes in disease occurrence. However, whilst surveillance initiatives exist in production animal veterinary medicine, comparable frameworks for companion animals are lacking. First-opinion veterinary electronic health records (EHRs) have the potential to reveal disease signals and often represent the initial reporting of clinical syndromes in animals presenting for medical attention, highlighting their possible significance in early disease detection. Yet despite their availability, there are limitations surrounding their free text-based nature, inhibiting the ability for national-level mortality and morbidity statistics to occur. This paper presents PetBERT, a large language model trained on over 500 million words from 5.1 million EHRs across the UK. PetBERT-ICD is the additional training of PetBERT as a multi-label classifier for the automated coding of veterinary clinical EHRs with the International Classification of Disease 11 framework, achieving F1 scores exceeding 83% across 20 disease codings with minimal annotations. PetBERT-ICD effectively identifies disease outbreaks, outperforming current clinician-assigned point-of-care labelling strategies up to 3 weeks earlier. The potential for PetBERT-ICD to enhance disease surveillance in veterinary medicine represents a promising avenue for advancing animal health and improving public health outcomes.",
author = "Farrell, Sean and Appleton, Charlotte and Noble, {Peter-John M{\"a}ntyl{\"a}} and {Al Moubayed}, Noura",
year = "2023",
month = oct,
day = "21",
doi = "10.1038/s41598-023-45155-7",
language = "English",
volume = "13",
journal = "Scientific Reports",
issn = "2045-2322",
publisher = "Nature Publishing Group",
number = "1",
pages = "18015",
}

RIS

TY - JOUR

T1 - PetBERT

T2 - automated ICD-11 syndromic disease coding for outbreak detection in first opinion veterinary electronic health records

AU - Farrell, Sean

AU - Appleton, Charlotte

AU - Noble, Peter-John Mäntylä

AU - Al Moubayed, Noura

PY - 2023/10/21

Y1 - 2023/10/21

AB - Effective public health surveillance requires consistent monitoring of disease signals such that researchers and decision-makers can react dynamically to changes in disease occurrence. However, whilst surveillance initiatives exist in production animal veterinary medicine, comparable frameworks for companion animals are lacking. First-opinion veterinary electronic health records (EHRs) have the potential to reveal disease signals and often represent the initial reporting of clinical syndromes in animals presenting for medical attention, highlighting their possible significance in early disease detection. Yet despite their availability, there are limitations surrounding their free text-based nature, inhibiting the ability for national-level mortality and morbidity statistics to occur. This paper presents PetBERT, a large language model trained on over 500 million words from 5.1 million EHRs across the UK. PetBERT-ICD is the additional training of PetBERT as a multi-label classifier for the automated coding of veterinary clinical EHRs with the International Classification of Disease 11 framework, achieving F1 scores exceeding 83% across 20 disease codings with minimal annotations. PetBERT-ICD effectively identifies disease outbreaks, outperforming current clinician-assigned point-of-care labelling strategies up to 3 weeks earlier. The potential for PetBERT-ICD to enhance disease surveillance in veterinary medicine represents a promising avenue for advancing animal health and improving public health outcomes.

U2 - 10.1038/s41598-023-45155-7

DO - 10.1038/s41598-023-45155-7

M3 - Journal article

VL - 13

JO - Scientific Reports

JF - Scientific Reports

SN - 2045-2322

IS - 1

M1 - 18015

ER -