Final published version, 264 KB, PDF document
Available under license: CC BY: Creative Commons Attribution 4.0 International License
Final published version
Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper
Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper
}
TY - GEN
T1 - Discovering dynamical models of speech using physics-informed machine learning
AU - Kirkham, Sam
N1 - Conference code: 13th
PY - 2024/5/13
Y1 - 2024/5/13
N2 - Spoken language is characterised by a high-dimensional and highly variable set of physical movements that unfold over time. What are the fundamental dynamical principles that underlie this signal? In this study, we demonstrate the use of physics- informed machine learning (sparse symbolic regression) for discovering new dynamical models of speech articulation. We first demonstrate the model discovery procedure on simulated data and show that the algorithm is able to discover the original model with near-perfect accuracy, even when the data contain extensive variation in duration, initial conditions and tar- get positions, as well as in the presence of added noise. We then demonstrate a proof-of-concept applying the same technique to empirical data, which reveals a small set of candidate dynamical models with increasing levels of complexity and accuracy.
AB - Spoken language is characterised by a high-dimensional and highly variable set of physical movements that unfold over time. What are the fundamental dynamical principles that underlie this signal? In this study, we demonstrate the use of physics- informed machine learning (sparse symbolic regression) for discovering new dynamical models of speech articulation. We first demonstrate the model discovery procedure on simulated data and show that the algorithm is able to discover the original model with near-perfect accuracy, even when the data contain extensive variation in duration, initial conditions and tar- get positions, as well as in the presence of added noise. We then demonstrate a proof-of-concept applying the same technique to empirical data, which reveals a small set of candidate dynamical models with increasing levels of complexity and accuracy.
U2 - 10.21437/issp.2024-45
DO - 10.21437/issp.2024-45
M3 - Conference contribution/Paper
SP - 177
EP - 180
BT - Proceedings of the 13th International Seminar on Speech Production
A2 - Fougeron, Cécile
A2 - Perrier, Pascal
PB - International Speech Communication Association
CY - Autrans, France
T2 - ISSP 2024 : 13th International Seminar on Speech Production
Y2 - 13 May 2024 through 17 May 2024
ER -