[1] Wang Lü, Wu Jiasong, Senhadji Lotfi, et al. Comparison of three IntMDCT algorithms in audio compression[J]. Journal of Southeast University (Natural Science Edition), 2012, 42(2): 259-264. [doi:10.3969/j.issn.1001-0505.2012.02.013]

Comparison of three IntMDCT algorithms in audio compression

Journal of Southeast University (Natural Science Edition) [ISSN:1001-0505/CN:32-1178/N]

Volume:
42
Issue:
No. 2, 2012
Pages:
259-264
Section:
Computer Science and Engineering
Publication date:
2012-03-20

Article Info

Title:
Comparison of three IntMDCT algorithms in audio compression
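Authors:
Wang Lü (王膂)1,3, Wu Jiasong (伍家松)1,2,3, Senhadji Lotfi2,3, Shu Huazhong (舒华忠)1,3
1 Laboratory of Image Science and Technology, Southeast University, Nanjing 210096; 2 Signal and Image Processing Laboratory, University of Rennes 1, Rennes 35042, France; 3 Sino-French Biomedical Information Research Center, Southeast University, Nanjing 210096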
Author(s):
Wang Lü1,3, Wu Jiasong1,2,3, Senhadji Lotfi2,3, Shu Huazhong1,3
1 Laboratory of Image Science and Technology, Southeast University, Nanjing 210096, China
2 Laboratoire Traitement du Signal et de l'Image, Université de Rennes 1, Rennes 35042, France
3 Centre de Recherche en
Keywords:
lifting scheme; modulo transform; infinity norm rotation; integer modified discrete cosine transform; audio compression
CLC number:
TP391
DOI:
10.3969/j.issn.1001-0505.2012.02.013
Abstract:
To compute the integer modified discrete cosine transform (IntMDCT) quickly, three algorithms for the 12-point IntMDCT are constructed, based on the lifting scheme, the modulo transform, and the infinity norm rotation transform, respectively. First, the 12-point MDCT is converted into a 6-point type-Ⅳ discrete cosine transform (DCT-Ⅳ), which is then factorized into a product of 7 Givens rotations. Each Givens rotation is then approximated in integer arithmetic using the lifting scheme, the modulo transform, or the infinity norm rotation transform. Finally, the speed and accuracy of the three algorithms are compared in lossless and lossy compression of speech signals. The experimental results show that, of the three, the IntMDCT algorithm based on the modulo transform runs fastest, while the IntMDCT algorithm based on the infinity norm rotation transform is the most accurate and achieves the highest signal-to-noise ratio (SNR) in lossy audio compression.
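The lifting-based integer approximation of a single Givens rotation, as mentioned in the abstract, can be sketched as follows. This is only an illustration under stated assumptions: it uses the textbook three-step lifting factorization of a rotation with round-to-nearest after each step, which may differ from the rounding rule and rotation angles used in the paper. The function names `lifting_rotate` and `lifting_rotate_inverse` are hypothetical.

```python
import math


def lifting_rotate(x, y, theta):
    """Integer approximation of the rotation [[c, -s], [s, c]] applied to (x, y).

    Assumes the standard factorization into three shears (lifting steps),
    with each step rounded to the nearest integer. Requires sin(theta) != 0.
    """
    c, s = math.cos(theta), math.sin(theta)
    p = (c - 1.0) / s      # shear coefficient, equal to -tan(theta / 2)
    x += round(p * y)      # lifting step 1: update x from y
    y += round(s * x)      # lifting step 2: update y from the new x
    x += round(p * y)      # lifting step 3: update x from the new y
    return x, y


def lifting_rotate_inverse(x, y, theta):
    """Exact inverse of lifting_rotate: undo the three steps in reverse order."""
    c, s = math.cos(theta), math.sin(theta)
    p = (c - 1.0) / s
    x -= round(p * y)      # undo step 3
    y -= round(s * x)      # undo step 2
    x -= round(p * y)      # undo step 1
    return x, y
```

Because each step adds a rounded function of the other coordinate, subtracting the same rounded value recovers the input exactly. This is what makes a transform built from such rotations usable for lossless coding, while the rounding error of the forward pass stays within a few units of the exact rotation.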

Memo:
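About the authors: Wang Lü (b. 1986), male, master's student; Shu Huazhong (corresponding author), male, Ph.D., professor, doctoral supervisor, shu.list@seu.edu.cn.
Funding: National Natural Science Foundation of China (61073138, 60873048); National Natural Science Foundation of China International Cooperation and Exchange Program (60911130370).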
Last Update: 2012-03-20