Distributions of audio DCT coefficients

Journal of University of Chinese Academy of Sciences, 2011, Vol. 28, Issue 6: 752-758. DOI: 10.7523/j.issn.2095-6134.2011.6.008

WANG Cui-Ping¹,², GUO Li¹, WANG Yu-Jie¹, CHEN Yun-Bi¹

1. Department of Electronic Science and Technology, University of Science and Technology of China (USTC), Hefei 230027, China;
2. Department of Electronics, College of Automation Engineering, Qingdao University, Qingdao 266071, Shandong, China

Received: 2010-09-08  Revised: 2010-11-15  Published: 2011-11-15

Funding: Supported by the National Natural Science Foundation of China (60772032)

Abstract

Considering the non-Gaussian property of the discrete cosine transform (DCT) coefficients of audio, and the limitations of modeling their distribution with a single distribution such as the generalized Gaussian distribution (GGD) or the alpha-stable distribution, we propose a mixed model. The model is a linearly weighted combination of a GGD and an alpha-stable distribution, with the weights obtained by a genetic algorithm. Model accuracy is measured with the Kullback-Leibler (KL) divergence, which quantifies the difference between two probability distributions. Experimental results show that the mixed model is close to the true distribution of the DCT coefficients and can be used in audio retrieval and steganalysis.

Key words: discrete cosine transform (DCT), mixture model, generalized Gaussian distribution, alpha-stable distribution, genetic algorithm, KL divergence
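
The mixed model described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and it makes three simplifying assumptions. The alpha-stable component is restricted to the Cauchy special case (characteristic exponent 1, zero skewness), because that case has a closed-form density. The genetic algorithm is replaced by a plain grid search over the weight. The target distribution is synthetic, not real audio DCT data. All function names and parameter values are hypothetical.

```python
import math

def ggd_pdf(x, alpha, beta, mu=0.0):
    """Generalized Gaussian density with scale alpha and shape beta."""
    coef = beta / (2.0 * alpha * math.gamma(1.0 / beta))
    return coef * math.exp(-((abs(x - mu) / alpha) ** beta))

def cauchy_pdf(x, gamma, delta=0.0):
    """Assumption: Cauchy density, the alpha-stable special case with exponent 1."""
    return gamma / (math.pi * (gamma ** 2 + (x - delta) ** 2))

def mixture_pdf(x, w, ggd_params, stable_params):
    """Linearly weighted combination: w * GGD + (1 - w) * stable component."""
    return w * ggd_pdf(x, *ggd_params) + (1.0 - w) * cauchy_pdf(x, *stable_params)

def discretize(pdf, centers, width):
    """Convert a density into bin probabilities, renormalized over the grid."""
    mass = [pdf(c) * width for c in centers]
    total = sum(mass)
    return [m / total for m in mass]

def kl_divergence(p, q, eps=1e-12):
    """Discrete KL divergence D(p || q) between two probability vectors."""
    return sum(pi * math.log(pi / max(qi, eps)) for pi, qi in zip(p, q) if pi > 0)

def best_weight(empirical, centers, width, ggd_params, stable_params, steps=101):
    """Grid search over w in [0, 1]; a stand-in for the genetic algorithm."""
    best = None
    for i in range(steps):
        w = i / (steps - 1)
        model = discretize(
            lambda x: mixture_pdf(x, w, ggd_params, stable_params), centers, width
        )
        d = kl_divergence(empirical, model)
        if best is None or d < best[1]:
            best = (w, d)
    return best

# Synthetic target: a known mixture with weight 0.7 (hypothetical parameters).
width = 0.1
centers = [-10.0 + width * k for k in range(201)]
ggd_params = (1.0, 1.0)   # scale 1, shape 1 (Laplacian)
stable_params = (0.5,)    # Cauchy scale 0.5
target = discretize(
    lambda x: mixture_pdf(x, 0.7, ggd_params, stable_params), centers, width
)

w_hat, kl_hat = best_weight(target, centers, width, ggd_params, stable_params)
print(w_hat, kl_hat)
```

Because the target is generated from the same family, the search should recover a weight of about 0.7 with a KL divergence near zero. With real DCT coefficients, the target vector would instead be a normalized histogram of the data, and the component parameters would have to be estimated first.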

CLC number: TP391

Cite this article: WANG Cui-Ping, GUO Li, WANG Yu-Jie, CHEN Yun-Bi. Distributions of audio DCT coefficients[J]. Journal of University of Chinese Academy of Sciences, 2011, 28(6): 752-758.

