Paper data
Title:
Voice Activity Detection with Array Signal Processing in the Wavelet Domain
Author(s):
Hioka Yusuke, Department of System Design Engineering, Keio University
Hamada Nozomu, Department of System Design Engineering, Keio University
Page numbers in the proceedings:
Volume I, pp. 255-258
Session:
Segmentation and Voice Detection
Paper abstract
Many conventional voice activity detection (VAD) methods assume that the speech signal is acquired in high quality. However, speech-based human-machine interfaces are usually employed in indoor environments where various interferences exist, so VAD performance deteriorates seriously. In this paper, we propose a novel VAD method with array signal processing in the wavelet domain, which utilizes the time, frequency, and spatial information in the speech signal to separate interferences. In the proposed method, the speech signal acquired by a microphone array is first decomposed into appropriate subbands with a wavelet packet, and array signal processing is then executed on each subband to realize a VAD system for speech arriving from a particular direction.
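The abstract gives no implementation details, so the following is only a minimal sketch of the general pipeline it describes: decompose each microphone channel into wavelet packet subbands, then apply array processing per subband and decide whether speech from the look direction is present. The Haar filters, the broadside delay-and-sum beamformer (plain averaging across microphones), the energy-ratio decision rule, and all names and thresholds (`haar_packet`, `subband_vad`, `ratio_thresh`, `energy_thresh`) are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def haar_packet(x, levels):
    """Full Haar wavelet packet tree; returns 2**levels subband signals."""
    bands = [np.asarray(x, dtype=float)]
    for _ in range(levels):
        next_bands = []
        for b in bands:
            b = b[: len(b) // 2 * 2]                  # drop an odd trailing sample
            even, odd = b[0::2], b[1::2]
            next_bands.append((even + odd) / np.sqrt(2))  # low-pass branch
            next_bands.append((even - odd) / np.sqrt(2))  # high-pass branch
        bands = next_bands
    return bands


def subband_vad(frames, levels=3, ratio_thresh=0.8, energy_thresh=1e-3):
    """Return True if some subband holds energy arriving from broadside.

    frames: array of shape (n_mics, n_samples). The target is assumed to
    arrive with zero inter-microphone delay (an assumption of this sketch).
    """
    frames = np.asarray(frames, dtype=float)
    per_mic = [haar_packet(ch, levels) for ch in frames]
    votes = 0
    for k in range(2 ** levels):
        sub = np.stack([bands[k] for bands in per_mic])  # (n_mics, n_coeffs)
        beam = sub.mean(axis=0)              # delay-and-sum with zero delays
        e_beam = np.mean(beam ** 2)          # energy after beamforming
        e_mics = np.mean(sub ** 2)           # average energy before beamforming
        # A ratio near 1 means the channels agree, i.e. a broadside source.
        if e_mics > energy_thresh and e_beam / e_mics > ratio_thresh:
            votes += 1
    return votes > 0


t = np.arange(256)
target = np.tile(np.sin(2 * np.pi * 0.05 * t), (4, 1))   # identical on all mics
phases = np.array([0.0, 0.5, 1.0, 1.5]) * np.pi          # off-axis phase shifts
interferer = np.sin(2 * np.pi * 0.05 * t[None, :] + phases[:, None])

print(subband_vad(target))       # source from the look direction
print(subband_vad(interferer))   # source from another direction
```

In this toy setup the off-axis source cancels when the channels are averaged, so its beamformed energy is small relative to the per-microphone energy and no subband votes for speech. A practical system would need fractional steering delays, a longer wavelet filter, and thresholds tuned on real data.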