Localising speech, footsteps and other sounds using resource-constrained devices

Computing and Communications

Keywords

Design, Experimentation , General Terms-Algorithms , Measurement , Performance

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review

Published

Standard

Localising speech, footsteps and other sounds using resource-constrained devices. / Guo, Yukang; Hazas, M.
Information Processing in Sensor Networks (IPSN), 2011 10th International Conference on. IEEE Press, 2011. p. 330-341.

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review

Harvard

Guo, Y & Hazas, M 2011, Localising speech, footsteps and other sounds using resource-constrained devices. in Information Processing in Sensor Networks (IPSN), 2011 10th International Conference on. IEEE Press, pp. 330-341. <http://ieeexplore.ieee.org/xpl/articleDetails.jsp?tp=&arnumber=5779049&contentType=Conference+Publications&refinements%3D4294017427%26sortType%3Dasc_p_Sequence%26filter%3DAND%28p_IS_Number%3A5779036%29>

APA

Guo, Y., & Hazas, M. (2011). Localising speech, footsteps and other sounds using resource-constrained devices. In Information Processing in Sensor Networks (IPSN), 2011 10th International Conference on (pp. 330-341). IEEE Press. http://ieeexplore.ieee.org/xpl/articleDetails.jsp?tp=&arnumber=5779049&contentType=Conference+Publications&refinements%3D4294017427%26sortType%3Dasc_p_Sequence%26filter%3DAND%28p_IS_Number%3A5779036%29

Vancouver

Guo Y, Hazas M. Localising speech, footsteps and other sounds using resource-constrained devices. In Information Processing in Sensor Networks (IPSN), 2011 10th International Conference on. IEEE Press. 2011. p. 330-341

Author

Guo, Yukang ; Hazas, M. / Localising speech, footsteps and other sounds using resource-constrained devices. Information Processing in Sensor Networks (IPSN), 2011 10th International Conference on. IEEE Press, 2011. pp. 330-341

Bibtex

@inproceedings{5a5cdf85ea3c459fa02bfea4a2a3851c,

title = "Localising speech, footsteps and other sounds using resource-constrained devices",

abstract = "While a number of acoustic localisation systems have been proposed over the last few decades, these have typically either relied on expensive dedicated microphone arrays and workstation-class processing, or have been developed to detect a very specific type of sound in a particular scenario. However, as people live and work indoors, they generate a wide variety of sounds as they interact and move about. These human-generated sounds can be used to infer the positions of people, without requiring them to wear trackable tags. In this paper, we take a practical yet general approach to localising a number of human-generated sounds. Drawing from signal processing literature, we identify methods for resource-constrained devices in a sensor network to detect, classify and locate acoustic events such as speech, footsteps and objects being placed onto tables. We evaluate the classification and time-of-arrival estimation algorithms using a data set of human-generated sounds we captured with sensor nodes in a controlled setting. We show that despite the variety and complexity of the sounds, their localisation is feasible for sensor networks, with typical accuracies of a half metre or better. We specifically discuss the processing and networking considerations, and explore the performance trade-offs which can be made to further conserve resources.",

keywords = "Design, Experimentation , General Terms-Algorithms , Measurement , Performance",

author = "Yukang Guo and M. Hazas",

year = "2011",

month = apr,

day = "1",

language = "English",

isbn = "978-1-61284-854-9",

pages = "330--341",

booktitle = "Information Processing in Sensor Networks (IPSN), 2011 10th International Conference on",

publisher = "IEEE Press",

}

RIS

TY - GEN

T1 - Localising speech, footsteps and other sounds using resource-constrained devices

AU - Guo, Yukang

AU - Hazas, M.

PY - 2011/4/1

Y1 - 2011/4/1

N2 - While a number of acoustic localisation systems have been proposed over the last few decades, these have typically either relied on expensive dedicated microphone arrays and workstation-class processing, or have been developed to detect a very specific type of sound in a particular scenario. However, as people live and work indoors, they generate a wide variety of sounds as they interact and move about. These human-generated sounds can be used to infer the positions of people, without requiring them to wear trackable tags. In this paper, we take a practical yet general approach to localising a number of human-generated sounds. Drawing from signal processing literature, we identify methods for resource-constrained devices in a sensor network to detect, classify and locate acoustic events such as speech, footsteps and objects being placed onto tables. We evaluate the classification and time-of-arrival estimation algorithms using a data set of human-generated sounds we captured with sensor nodes in a controlled setting. We show that despite the variety and complexity of the sounds, their localisation is feasible for sensor networks, with typical accuracies of a half metre or better. We specifically discuss the processing and networking considerations, and explore the performance trade-offs which can be made to further conserve resources.

AB - While a number of acoustic localisation systems have been proposed over the last few decades, these have typically either relied on expensive dedicated microphone arrays and workstation-class processing, or have been developed to detect a very specific type of sound in a particular scenario. However, as people live and work indoors, they generate a wide variety of sounds as they interact and move about. These human-generated sounds can be used to infer the positions of people, without requiring them to wear trackable tags. In this paper, we take a practical yet general approach to localising a number of human-generated sounds. Drawing from signal processing literature, we identify methods for resource-constrained devices in a sensor network to detect, classify and locate acoustic events such as speech, footsteps and objects being placed onto tables. We evaluate the classification and time-of-arrival estimation algorithms using a data set of human-generated sounds we captured with sensor nodes in a controlled setting. We show that despite the variety and complexity of the sounds, their localisation is feasible for sensor networks, with typical accuracies of a half metre or better. We specifically discuss the processing and networking considerations, and explore the performance trade-offs which can be made to further conserve resources.

KW - Design

KW - Experimentation

KW - General Terms-Algorithms

KW - Measurement

KW - Performance

UR - http://www.scopus.com/inward/record.url?scp=79959297255&partnerID=8YFLogxK

M3 - Conference contribution/Paper

SN - 978-1-61284-854-9

SP - 330

EP - 341

BT - Information Processing in Sensor Networks (IPSN), 2011 10th International Conference on

PB - IEEE Press

ER -

Research

Links

Keywords