Machines Learn to Infer Stellar Parameters Just by Looking at a Large Number of Spectra

Physics

Associated organisational unit

Observational Astrophysics

Electronic data

2009.12872
Rights statement: This is a pre-copy-editing, author-produced PDF of an article accepted for publication in Monthly Notices of the Royal Astronomical Society following peer review. The definitive publisher-authenticated version Nima Sedaghat, Martino Romaniello, Jonathan E Carrick, François-Xavier Pineau, Machines learn to infer stellar parameters just by looking at a large number of spectra, Monthly Notices of the Royal Astronomical Society, Volume 501, Issue 4, March 2021, Pages 6026–6041, https://doi.org/10.1093/mnras/staa3540 is available online at: https://academic.oup.com/mnras/article-abstract/501/4/6026/6121645
Accepted author manuscript, 3.67 MB, PDF document
Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

Text available via DOI:

https://doi.org/10.1093/mnras/staa3540
Final published version

Keywords

methods: data analysis, methods: numerical, techniques: spectroscopic

View graph of relations

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

Standard

Machines Learn to Infer Stellar Parameters Just by Looking at a Large Number of Spectra. / Sedaghat, Nima; Romaniello, Martino; Carrick, Jon et al.
In: Monthly Notices of the Royal Astronomical Society, Vol. 501, No. 4, 01.03.2021, p. 6026-6041.

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Harvard

Sedaghat, N, Romaniello, M, Carrick, J & Pineau, F-X 2021, 'Machines Learn to Infer Stellar Parameters Just by Looking at a Large Number of Spectra', Monthly Notices of the Royal Astronomical Society, vol. 501, no. 4, pp. 6026-6041. https://doi.org/10.1093/mnras/staa3540

APA

Sedaghat, N., Romaniello, M., Carrick, J., & Pineau, F.-X. (2021). Machines Learn to Infer Stellar Parameters Just by Looking at a Large Number of Spectra. Monthly Notices of the Royal Astronomical Society, 501(4), 6026-6041. https://doi.org/10.1093/mnras/staa3540

Vancouver

Sedaghat N, Romaniello M, Carrick J, Pineau FX. Machines Learn to Infer Stellar Parameters Just by Looking at a Large Number of Spectra. Monthly Notices of the Royal Astronomical Society. 2021 Mar 1;501(4):6026-6041. Epub 2021 Jan 29. doi: 10.1093/mnras/staa3540

Author

Sedaghat, Nima ; Romaniello, Martino ; Carrick, Jon et al. / Machines Learn to Infer Stellar Parameters Just by Looking at a Large Number of Spectra. In: Monthly Notices of the Royal Astronomical Society. 2021 ; Vol. 501, No. 4. pp. 6026-6041.

Bibtex

@article{3e2a840fcbd348e6b5906c136eac5fa6,

title = "Machines Learn to Infer Stellar Parameters Just by Looking at a Large Number of Spectra",

abstract = "Machine learning has been widely applied to clearly defined problems of astronomy and astrophysics. However, deep learning and its conceptual differences to classical machine learning have been largely overlooked in these fields. The broad hypothesis behind our work is that letting the abundant real astrophysical data speak for itself, with minimal supervision and no labels, can reveal interesting patterns that may facilitate discovery of novel physical relationships. Here, as the first step, we seek to interpret the representations a deep convolutional neural network chooses to learn, and find correlations in them with current physical understanding. We train an encoder–decoder architecture on the self-supervised auxiliary task of reconstruction to allow it to learn general representations without bias towards any specific task. By exerting weak disentanglement at the information bottleneck of the network, we implicitly enforce interpretability in the learned features. We develop two independent statistical and information-theoretical methods for finding the number of learned informative features, as well as measuring their true correlation with astrophysical validation labels. As a case study, we apply this method to a data set of ∼270 000 stellar spectra, each of which comprising ∼300 000 dimensions. We find that the network clearly assigns specific nodes to estimate (notions of) parameters such as radial velocity and effective temperature without being asked to do so, all in a completely physics-agnostic process. This supports the first part of our hypothesis. Moreover, we find with high confidence that there are ∼4 more independently informative dimensions that do not show a direct correlation with our validation parameters, presenting potential room for future studies.",

keywords = "methods: data analysis, methods: numerical, techniques: spectroscopic",

author = "Nima Sedaghat and Martino Romaniello and Jon Carrick and Fran{\c c}ois-Xavier Pineau",

note = "This is a pre-copy-editing, author-produced PDF of an article accepted for publication in Monthly Notices of the Royal Astronomical Society following peer review. The definitive publisher-authenticated version Nima Sedaghat, Martino Romaniello, Jonathan E Carrick, Fran{\c c}ois-Xavier Pineau, Machines learn to infer stellar parameters just by looking at a large number of spectra, Monthly Notices of the Royal Astronomical Society, Volume 501, Issue 4, March 2021, Pages 6026–6041, https://doi.org/10.1093/mnras/staa3540 is available online at: https://academic.oup.com/mnras/article-abstract/501/4/6026/6121645",

year = "2021",

month = mar,

day = "1",

doi = "10.1093/mnras/staa3540",

language = "English",

volume = "501",

pages = "6026--6041",

journal = "Monthly Notices of the Royal Astronomical Society",

issn = "0035-8711",

publisher = "OXFORD UNIV PRESS",

number = "4",

}

RIS

TY - JOUR

T1 - Machines Learn to Infer Stellar Parameters Just by Looking at a Large Number of Spectra

AU - Sedaghat, Nima

AU - Romaniello, Martino

AU - Carrick, Jon

AU - Pineau, François-Xavier

N1 - This is a pre-copy-editing, author-produced PDF of an article accepted for publication in Monthly Notices of the Royal Astronomical Society following peer review. The definitive publisher-authenticated version Nima Sedaghat, Martino Romaniello, Jonathan E Carrick, François-Xavier Pineau, Machines learn to infer stellar parameters just by looking at a large number of spectra, Monthly Notices of the Royal Astronomical Society, Volume 501, Issue 4, March 2021, Pages 6026–6041, https://doi.org/10.1093/mnras/staa3540 is available online at: https://academic.oup.com/mnras/article-abstract/501/4/6026/6121645

PY - 2021/3/1

Y1 - 2021/3/1

N2 - Machine learning has been widely applied to clearly defined problems of astronomy and astrophysics. However, deep learning and its conceptual differences to classical machine learning have been largely overlooked in these fields. The broad hypothesis behind our work is that letting the abundant real astrophysical data speak for itself, with minimal supervision and no labels, can reveal interesting patterns that may facilitate discovery of novel physical relationships. Here, as the first step, we seek to interpret the representations a deep convolutional neural network chooses to learn, and find correlations in them with current physical understanding. We train an encoder–decoder architecture on the self-supervised auxiliary task of reconstruction to allow it to learn general representations without bias towards any specific task. By exerting weak disentanglement at the information bottleneck of the network, we implicitly enforce interpretability in the learned features. We develop two independent statistical and information-theoretical methods for finding the number of learned informative features, as well as measuring their true correlation with astrophysical validation labels. As a case study, we apply this method to a data set of ∼270 000 stellar spectra, each of which comprising ∼300 000 dimensions. We find that the network clearly assigns specific nodes to estimate (notions of) parameters such as radial velocity and effective temperature without being asked to do so, all in a completely physics-agnostic process. This supports the first part of our hypothesis. Moreover, we find with high confidence that there are ∼4 more independently informative dimensions that do not show a direct correlation with our validation parameters, presenting potential room for future studies.

AB - Machine learning has been widely applied to clearly defined problems of astronomy and astrophysics. However, deep learning and its conceptual differences to classical machine learning have been largely overlooked in these fields. The broad hypothesis behind our work is that letting the abundant real astrophysical data speak for itself, with minimal supervision and no labels, can reveal interesting patterns that may facilitate discovery of novel physical relationships. Here, as the first step, we seek to interpret the representations a deep convolutional neural network chooses to learn, and find correlations in them with current physical understanding. We train an encoder–decoder architecture on the self-supervised auxiliary task of reconstruction to allow it to learn general representations without bias towards any specific task. By exerting weak disentanglement at the information bottleneck of the network, we implicitly enforce interpretability in the learned features. We develop two independent statistical and information-theoretical methods for finding the number of learned informative features, as well as measuring their true correlation with astrophysical validation labels. As a case study, we apply this method to a data set of ∼270 000 stellar spectra, each of which comprising ∼300 000 dimensions. We find that the network clearly assigns specific nodes to estimate (notions of) parameters such as radial velocity and effective temperature without being asked to do so, all in a completely physics-agnostic process. This supports the first part of our hypothesis. Moreover, we find with high confidence that there are ∼4 more independently informative dimensions that do not show a direct correlation with our validation parameters, presenting potential room for future studies.

KW - methods: data analysis

KW - methods: numerical

KW - techniques: spectroscopic

U2 - 10.1093/mnras/staa3540

DO - 10.1093/mnras/staa3540

M3 - Journal article

VL - 501

SP - 6026

EP - 6041

JO - Monthly Notices of the Royal Astronomical Society

JF - Monthly Notices of the Royal Astronomical Society

SN - 0035-8711

IS - 4

ER -

Research

Associated organisational unit

Electronic data

Links

Text available via DOI:

Keywords