Final published version
Licence: CC BY: Creative Commons Attribution 4.0 International License
Research output: Contribution to Journal/Magazine › Journal article › peer-review
Research output: Contribution to Journal/Magazine › Journal article › peer-review
}
TY - JOUR
T1 - Challenges and Opportunities for Statistics in the Era of Data Science
AU - Kirch, Claudia
AU - Lahiri, Soumendra
AU - Binder, Harald
AU - Brannath, Werner
AU - Cribben, Ivor
AU - Dette, Holger
AU - Doebler, Philipp
AU - Feng, Oliver
AU - Gandy, Axel
AU - Greven, Sonja
AU - Hammer, Barbara
AU - Harmeling, Stefan
AU - Hotz, Thomas
AU - Kauermann, Göran
AU - Krause, Joscha
AU - Krempl, Georg
AU - Nieto-Reyes, Alicia
AU - Okhrin, Ostap
AU - Ombao, Hernando
AU - Pein, Florian
AU - Pešta, Michal
AU - Politis, Dimitris
AU - Qin, Li-Xuan
AU - Rainforth, Tom
AU - Rauhut, Holger
AU - Reeve, Henry
AU - Salinas, David
AU - Schmidt-Hieber, Johannes
AU - Scott, Clayton
AU - Segers, Johan
AU - Spiliopoulou, Myra
AU - Wilhelm, Adalbert
AU - Wilms, Ines
AU - Yu, Yi
AU - Lederer, Johannes
PY - 2025/4/21
Y1 - 2025/4/21
N2 - Statistics as a scientific discipline is currently facing the great challenge of finding its place in data science once more. While at the beginning of the last century, the development of the discipline of statistics was initiated by data-related research questions, nowadays, it is often viewed to have not kept up with the current developments in data science, which are largely focused on algorithmic, exploratory and computational aspects and often driven by other disciplines, such as computer science. However, statistics can—and should—contribute to the advances of data science. Of most interest are the strengths of statistics, such as the mathematical focus that leads to theoretical guarantees. This includes methods for formal modeling, hypothesis tests, uncertainty quantification and statistical inference. Of particular interest are also established statistical frameworks to handle causality or data deficiencies such as dependence, missingness, biases or confounding. This paper summarizes the findings of a discussion workshop on the topic that was held in June 2023 in Hannover, Germany. The discussion centered around the following questions: How must statistics be set up so that it can contribute (more) to modern data science? In which direction should it develop further? Which strengths can already be used now? What conditions must be created so that this can succeed? What can be done to arrive at a common language? What is the added value of formal modeling, inference, and the mathematical perspective taken in statistics?
AB - Statistics as a scientific discipline is currently facing the great challenge of finding its place in data science once more. While at the beginning of the last century, the development of the discipline of statistics was initiated by data-related research questions, nowadays, it is often viewed to have not kept up with the current developments in data science, which are largely focused on algorithmic, exploratory and computational aspects and often driven by other disciplines, such as computer science. However, statistics can—and should—contribute to the advances of data science. Of most interest are the strengths of statistics, such as the mathematical focus that leads to theoretical guarantees. This includes methods for formal modeling, hypothesis tests, uncertainty quantification and statistical inference. Of particular interest are also established statistical frameworks to handle causality or data deficiencies such as dependence, missingness, biases or confounding. This paper summarizes the findings of a discussion workshop on the topic that was held in June 2023 in Hannover, Germany. The discussion centered around the following questions: How must statistics be set up so that it can contribute (more) to modern data science? In which direction should it develop further? Which strengths can already be used now? What conditions must be created so that this can succeed? What can be done to arrive at a common language? What is the added value of formal modeling, inference, and the mathematical perspective taken in statistics?
U2 - 10.1162/99608f92.abf14c9d
DO - 10.1162/99608f92.abf14c9d
M3 - Journal article
JO - Harvard Data Science Review
JF - Harvard Data Science Review
ER -