Feature extraction for speech and music discrimination

Computing and Communications

Text available via DOI:

https://doi.org/10.1109/CBMI.2008.4564943
Final published version

View graph of relations

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review

Published

Huiyu Zhou
Abdul Sadka
Richard M. Jiang

More...

Publication date	22/09/2008
Host publication	2008 International Workshop on Content-Based Multimedia Indexing, CBMI 2008, Conference Proceedings
Publisher	IEEE
Pages	170-173
Number of pages	4
ISBN (print)	9781424420445
<mark>Original language</mark>	English
Event	2008 International Workshop on Content-Based Multimedia Indexing, CBMI 2008 - London, United Kingdom Duration: 18/06/2008 → 20/06/2008

Conference

Conference	2008 International Workshop on Content-Based Multimedia Indexing, CBMI 2008
Country/Territory	United Kingdom
City	London
Period	18/06/08 → 20/06/08

Publication series

Name	2008 International Workshop on Content-Based Multimedia Indexing, CBMI 2008, Conference Proceedings

Conference

Conference	2008 International Workshop on Content-Based Multimedia Indexing, CBMI 2008
Country/Territory	United Kingdom
City	London
Period	18/06/08 → 20/06/08

Abstract

Driven by the demand of information retrieval, video editing and human-computer interface, in this paper we propose a novel spectral feature for music and speech discrimination. This scheme attempts to simulate a biological model using the averaged cepstrum, where human perception tends to pick up the areas of large cepstral changes. The cepstrum data that is away from the mean value will be exponentially reduced in magnitude. We conduct experiments of music/speech discrimination by comparing the performance of the proposed feature with that of previously proposed features in classification. The dynamic time warping based classification verifies that the proposed feature has the best quality of music/speech classification in the test database.

Research

Links

Text available via DOI: