An audio watermarking technique using Complementary Ensemble Empirical Mode Decomposition and group differential relations of average absolute amplitudes of the last Intrinsic Mode Function (IMF) is proposed. By using group differential relations, the relationship with neighboring samples in the last IMF is well preserved, and near-imperceptibility can be achieved. Placing a watermark on low-frequency components, the last IMF, which is perceptually significant, therefore makes the watermark difficult to be removed. The embedding watermark, which is a logo image in our experiment, is processed by Arnold transformation, secret key encryption, and Bose--Chaudhuri--Hocquenghem coding to enhance robustness and security. Experimental results of the signal-to-noise ratio fit the recommendations of imperceptibility of the International Federation of the Phonographic Industry. The average Objective Difference Grade (an objective measure that correlates very well with subjective assessment) and subjective quality assessment were performed to evaluate the imperceptibility. Furthermore, our method accomplishes robustness under 13 different categories of attacks, including noise corruption, amplitude scaling, echo addition, resampling, re-quantization, low-pass filtering, MPEG-1 Audio Layer III compression, Digital-to-Analog/Analog-to-Digital conversion, cropping, time shift, zero thresholding, jittering, and combined attacks.
Authors:
Lai, Wen-Hsing; Chou, Tsung-Yuan; Chou, Meng-Chen; Schuller, Björn W.
Affiliations:
Department of Computer and Communication Engineering, National Kaohsiung University of Science and Technology, No. 1, University Rd., Yanchao Dist., Kaohsiung City 82445, Taiwan; ZD.B Chair of Embedded Intelligence for Health Care and Wellbeing, University of Augsburg, Germany; Ph.D. Program in Engineering Science and Technology, College of Engineering, National Kaohsiung University of Science and Technology, Taiwan; Department of Computer and Communication Engineering, National Kaohsiung University of Science and Technology, No. 1, University Rd., Yanchao Dist., Kaohsiung City 82445, Taiwan; ZD.B Chair of Embedded Intelligence for Health Care and Wellbeing, University of Augsburg, Germany; GLAM – Group on Language, Audio & Music, Imperial College London, U.K.(See document for exact affiliation information.)
JAES Volume 71 Issue 3 pp. 100-117; March 2023
Publication Date:
March 7, 2023
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.