
Electronic data

  • EVOS_LargeMulticlassCOVID

    Accepted author manuscript, 9.29 MB, PDF document

    Embargo ends: 1/01/40

    Available under license: CC BY: Creative Commons Attribution 4.0 International License

A large multiclass dataset of CT scans for COVID-19 identification

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Journal publication date: 24/05/2023
Journal: Evolving Systems
Number of pages: 10
Pages (from-to): 1-10
Publication Status: Accepted/In press
Original language: English


Infection by SARS-CoV-2, the virus that causes COVID-19, has spread across the whole world since the beginning of 2020. Following the epidemic that started in Wuhan, China, the World Health Organization (WHO) declared a global health emergency on January 30, 2020, and subsequently a pandemic. In this paper, we describe a publicly available multiclass CT scan dataset for SARS-CoV-2 infection identification, which currently contains 4173 CT scans of 210 different patients, of which 2168 correspond to 80 patients infected with SARS-CoV-2 and confirmed by RT-PCR. These data were collected at the Public Hospital of the Government Employees of Sao Paulo and the Metropolitan Hospital of Lapa, both in Sao Paulo, Brazil. The aim of this dataset is to encourage the research and development of artificial intelligence methods that can identify SARS-CoV-2 or other diseases through the analysis of CT scans. As a baseline result for this dataset, we used the recently introduced eXplainable Deep Learning approach (xDNN), a transparent deep learning approach that allows users to inspect the decisions of the network.