We propose two methods for coding and transmitting three-dimensional multichannel sound signals: scalable coding and transmission, and modeling of the quantization error. The first method converts N-channel sound signals into M-channel basic and (N-M)-channel additional signals using a matrix operation. The matrix is trained by simulated annealing to reduce its condition number and the energy of additional signals. The unmasking artifact may occur when the N-channel signals are restored from the decoded signals using the inverse matrix. The second method estimates the quantization error signals by the polynomial approximation of the decoded signals. Experimental results showed that the combination of both methods could realize a 1.2 Mbps scalable transmission of 22-channel sounds without a notable sound degradation.
Author:
Ando, Akio
Affiliations:
NHK Science and Technical Research Labratories, Setagaya, Tokyo, Japan; Tokyo Institute of Technology, Meguro, Tokyo, Japan(See document for exact affiliation information.)
AES Convention:
132 (April 2012)
Paper Number:
8647
Publication Date:
April 26, 2012
Subject:
Spatial Audio
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.