[1] Cai Weiping, Wu Zhenyang. Robust speech source 3D localization method based on discrete time delay[J]. Journal of Southeast University (Natural Science Edition), 2009, 39(1): 1-5. [doi:10.3969/j.issn.1001-0505.2009.01.001] (in Chinese)

Robust speech source 3D localization method based on discrete time delay

Journal of Southeast University (Natural Science Edition) [ISSN: 1001-0505 / CN: 32-1178/N]

Volume:
39
Issue:
2009, No. 1
Pages:
1-5
Section:
Information and Communication Engineering
Publication date:
2009-01-20

Article Info

Title:
Robust speech source 3D localization method based on discrete time delay
Author(s):
Cai Weiping (蔡卫平), Wu Zhenyang (吴镇扬)
School of Information Science and Engineering, Southeast University, Nanjing 210096, China
Keywords:
microphone arrays; speech source localization; SRP-PHAT (steered response power-phase transform) algorithm
Classification number:
TN912.3
DOI:
10.3969/j.issn.1001-0505.2009.01.001
Abstract:
To reduce the computational load of the steered response power-phase transform (SRP-PHAT) algorithm, a robust speech source localization algorithm, an improved SRP-PHAT algorithm based on discrete time delays is presented. In this method, each frame of the signal received by the microphone array is transformed into the frequency domain by the fast Fourier transform (FFT), and the spectrum is then zero-padded to 16 times the frame length. Taking the inverse FFT (IFFT) yields the generalized cross-correlation (GCC) function of every microphone pair at a correspondingly higher sampling rate. Because all the GCCs are calculated before the search begins, the computational load is significantly reduced. Moreover, because of the high sampling rate of the GCC, the localization errors introduced by discretizing the time delays are small. Simulation results show that, in both far-field and near-field conditions, the method reduces the computational load by one order of magnitude while retaining the robustness of the original algorithm.
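
The steps described in the abstract can be sketched in Python with NumPy as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names `gcc_phat_upsampled` and `srp_lookup`, the speed of sound `c = 343.0` m/s, the circular (single-frame) correlation, and the nearest-lag rounding are all hypothetical choices made here; only the 16-fold frequency-domain zero-padding and the idea of computing every GCC before the search come from the abstract.

```python
from itertools import combinations

import numpy as np


def gcc_phat_upsampled(x_i, x_j, up=16):
    """PHAT-weighted GCC of one microphone pair on a lag grid `up` times finer.

    Index k of the result is the lag k / (up * fs); negative lags wrap around.
    The peak sits at the delay of x_j relative to x_i (circular correlation).
    """
    n = len(x_i)
    X_i = np.fft.rfft(x_i)
    X_j = np.fft.rfft(x_j)
    cross = X_j * np.conj(X_i)
    cross /= np.abs(cross) + 1e-12  # PHAT weighting: keep the phase only
    # Asking irfft for up * n output samples zero-pads the spectrum,
    # which interpolates the correlation in the lag domain.
    return np.fft.irfft(cross, up * n)


def srp_lookup(frames, mic_pos, candidates, fs, c=343.0, up=16):
    """SRP-PHAT search that only reads precomputed GCC values.

    frames: (M, n) one frame per microphone; mic_pos: (M, 3) in metres;
    candidates: (Q, 3) candidate source positions; fs: sampling rate in Hz.
    """
    n_mics, n = frames.shape
    n_up = up * n
    pairs = list(combinations(range(n_mics), 2))

    # Step 1: every pairwise GCC is computed once, before the search.
    gcc = {(i, j): gcc_phat_upsampled(frames[i], frames[j], up) for i, j in pairs}

    # Step 2: for each candidate, turn the geometric delay of each pair into
    # the nearest discrete lag and sum the stored GCC values.
    dist = np.linalg.norm(candidates[:, None, :] - mic_pos[None, :, :], axis=2)
    power = np.zeros(len(candidates))
    for i, j in pairs:
        tdoa = (dist[:, j] - dist[:, i]) / c  # delay of mic j relative to mic i
        lag = np.rint(tdoa * fs * up).astype(int) % n_up  # wrap negative lags
        power += gcc[(i, j)][lag]
    return candidates[np.argmax(power)], power
```

In this sketch the search loop contains only index arithmetic and table lookups, which is where the saving in computation would come from; the finer lag grid keeps the rounding error to at most half of one upsampled sample.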

Memo:
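About the authors: Cai Weiping (born 1973), male, Ph.D. candidate; Wu Zhenyang (corresponding author), male, professor, doctoral supervisor, zywu@seu.edu.cn.
Funding: National Basic Research Program of China (973 Program) (No. 2002CB312102).
Citation format: Cai Weiping, Wu Zhenyang. Robust speech source 3D localization method based on discrete time delay[J]. Journal of Southeast University (Natural Science Edition), 2009, 39(1): 1-5.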
Last Update: 2009-01-20