Home > Research > Publications & Outputs > Learning from data with structured missingness

Links

Text available via DOI:

View graph of relations

Learning from data with structured missingness

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published
  • Robin Mitra
  • Sarah F. McGough
  • Tapabrata Chakraborti
  • Chris Holmes
  • Ryan Copping
  • Niels Hagenbuch
  • Stefanie Biedermann
  • Jack Noonan
  • Brieuc Lehmann
  • Aditi Shenvi
  • Xuan Vinh Doan
  • David Leslie
  • Ginestra Bianconi
  • Ruben Sanchez-Garcia
  • Alisha Davies
  • Maxine Mackintosh
  • Eleni-Rosalina Andrinopoulou
  • Anahid Basiri
  • Chris Harbron
  • Ben D. MacArthur
Close
<mark>Journal publication date</mark>25/01/2023
<mark>Journal</mark>Nature Machine Intelligence
Issue number1
Volume5
Number of pages11
Pages (from-to)13-23
Publication StatusPublished
<mark>Original language</mark>English

Abstract

Missing data are an unavoidable complication in many machine learning tasks. When data are ‘missing at random’ there exist a range of tools and techniques to deal with the issue. However, as machine learning studies become more ambitious, and seek to learn from ever-larger volumes of heterogeneous data, an increasingly encountered problem arises in which missing values exhibit an association or structure, either explicitly or implicitly. Such ‘structured missingness’ raises a range of challenges that have not yet been systematically addressed, and presents a fundamental hindrance to machine learning at scale. Here we outline the current literature and propose a set of grand challenges in learning from data with structured missingness.