In teleconferencing application of Directional Audio Coding, the transmitted data consists of monophonic audio signal and directional metadata measured in frequency bands depending on time. In reproduction, each frequency channel of the signal is reproduced to corresponding direction with corresponding diﬀuseness. This paper examines methods for reducing the data rate of the metadata. The compression methods are based on psychoacoustic studies about the accuracy of directional hearing, and further developed and validated. Informal tests with one-way reproduction, as well as usability testing where an actual teleconference was arranged, were utilized for this purpose. The results indicate that the data rate can be as low as approx. 3 kbit/s without a signiﬁcant loss in the reproduced spatial quality.
Hirvonen, Toni; Ahonen, Jukka; Pulkki, Ville
Affiliations: Institute of Computer Science (ICS) of the Foundation for Research and Technology, Hellas, Greece; TKK, Espoo, Finland(See document for exact affiliation information.)
AES Convention: 126 (May 2009) Paper Number: 7706
Publication Date: May 1, 2009
Subject: Audio for Telecommunications
No AES members have commented on this paper yet.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.