[1]周琳,赵小燕,程旭,等.基于子带信噪比估计和软判决的鲁棒双耳声源定位算法[J].东南大学学报(自然科学版),2015,45(4):619-624.[doi:10.3969/j.issn.1001-0505.2015.04.001]
 Zhou Lin,Zhao Xiaoyan,Cheng Xu,et al.Robust binaural sound source localization based on sub-band SNR estimation and soft decision[J].Journal of Southeast University (Natural Science Edition),2015,45(4):619-624.[doi:10.3969/j.issn.1001-0505.2015.04.001]
点击复制

基于子带信噪比估计和软判决的鲁棒双耳声源定位算法()
分享到:

《东南大学学报(自然科学版)》[ISSN:1001-0505/CN:32-1178/N]

卷:
45
期数:
2015年第4期
页码:
619-624
栏目:
信息与通信工程
出版日期:
2015-07-20

文章信息/Info

Title:
Robust binaural sound source localization based on sub-band SNR estimation and soft decision
作者:
周琳赵小燕程旭李拟珺吴镇扬
东南大学水声信号处理教育部重点实验室, 南京210096
Author(s):
Zhou Lin Zhao Xiaoyan Cheng Xu Li Nijun Wu Zhenyang
Key Laboratory of Underwater Acoustic Signal Processing of Ministry of Education, Southeast University, Nanjing 210096, China
关键词:
双耳声源定位 子带信噪比估计 软判决 耳间时间差
Keywords:
binaural sound source localization sub-band signal-to-noise ratio estimation soft decision inter-aural time difference
分类号:
TN912.3
DOI:
10.3969/j.issn.1001-0505.2015.04.001
摘要:
为了提高噪声和混响环境下的双耳声源定位算法性能,提出了一种基于子带信噪比估计和软判决的双耳互功率谱和耳间时间差估计算法.首先根据每帧中每个子带双耳声信号的自相关矩阵估计子带信噪比;其次,将子带信噪比映射为软判决值,并对双耳互功率谱进行加权;最后利用加权后的互功率谱估计耳间时间差,从而判断目标声源方位.仿真测试和实际环境测试均表明:与基于互相关函数、过零率的传统双耳声源定位算法相比,所提算法在噪声和混响的复杂声学环境下,显著提高了双耳声源定位性能.
Abstract:
In order to improve the localization performance in noisy and reverberation environments, a robust binaural sound source localization(SSL)algorithm based on sub-band signal-to-noise ratio(SNR)estimation and soft decision is proposed. First, sub-band SNR is estimated based on the autocorrelation matrix of sub-band binaural sound signals in each frame. Then, the sub-band SNR is mapped to soft decision value, and the cross power spectrum density(PSD)of binaural sound signal is weighted by soft decision. Finally, inter-aural time difference(ITD)is computed by weighted cross PSD, and the azimuth of sound source is estimated. Simulation and real environment test results show that, compared with the conventional binaural SSL algorithms based on cross correlation and zeros crossing, the localization performance of the proposed algorithm is significantly improved in complex acoustic environments.

参考文献/References:

[1] Rayleigh L. On our perception of sound direction [J]. Philosophical Magazine, 1907, 13(74):214-232.
[2] Raspaud M, Viste H, Evangelista G. Binaural source localization by joint estimation of ILD and ITD [J]. IEEE Transactions on Audio, Speech and Language Processing, 2010, 18(1):68-77.
[3] Kim Y I,Kil R M. Estimation of interaural time differences based on zero-crossings in noisy multisource environments [J]. IEEE Transactions on Audio, Speech and Language Processing, 2007, 15(2):734-743.
[4] Chau D T, Li J, Akagi M. A DOA estimation algorithm based on equalization cancellation theory [C]//Proceedings of INTERSPEECH-2010. Makuhari, Chiba, Japan, 2010:2770-2773.
[5] Parisi R, Camoes F, Scarpiniti M, et al. Cepstrum prefiltering for binaural source localization in reverberant environments [J]. IEEE Signal Processing Letters, 2012, 19(2): 99-102.
[6] May T, van de Par S, Kohlrausch A. A probabilistic model for robust localization based on a binaural auditory front-end [J]. IEEE Transactions on Audio, Speech and Language Processing, 2011, 19(1):1-13.
[7] May T, van de Par S, Kohlrausch A. A binaural scene analyzer for joint localization and recognition of speakers in the presence of interfering noise sources and reverberation [J]. IEEE Transactions on Audio, Speech and Language Processing, 2012, 20(7):2016-2030.
[8] Roman N, Wang D L. Binaural tracking of multiple moving sources [J]. IEEE Transactions on Audio, Speech and Language Processing, 2008, 16(4): 728-739.
[9] Karim Y, Sylvain A, Jean-Luc Z. A binaural sound source localization method using auditive cues and vision [C]//Proceedings of ICASSP-2012. Kyoto, Japan, 2012:217-220.
[10] Kim C, Kumar K, Stern R M. Binaural sound source separation motivated by auditory processing [C]//Proceedings of ICASSP-2011. Prague, Czech, 2011:5072-5075.
[11] Chau D T, Akagi M, Li J F. Improve equalization-cancellation-based sound localization in noisy reverberant environments using direct-to-reverberant energy ratio [C]//Proceedings of ChinaSIP-2013. Beijing, China, 2013:322-326.
[12] Woodruff J, Wang D L. Binaural detection, localization and segregation in reverberant environments based on joint pitch and azimuth cues [J]. IEEE Transactions on Audio, Speech and Language Processing, 2013, 21(4):806-815.

备注/Memo

备注/Memo:
收稿日期: 2014-12-31.
作者简介: 周琳(1978—),女,博士,副教授,Linzhou@seu.edu.cn.
基金项目: 国家自然科学基金资助项目(61201345)、中央高校基本科研业务费专项资金资助项目(2242013K30010).
引用本文: 周琳,赵小燕,程旭,等.基于子带信噪比估计和软判决的鲁棒双耳声源定位算法[J].东南大学学报:自然科学版,2015,45(4):619-624. [doi:10.3969/j.issn.1001-0505.2015.04.001]
更新日期/Last Update: 2015-07-20