In the teleconferencing application of Directional Audio Coding, the transmitted data consist of a monophonic audio signal and directional metadata measured in frequency bands as a function of time. In reproduction, each frequency channel of the signal is reproduced in the corresponding direction with the corresponding diffuseness. This paper examines methods for reducing the data rate of the metadata. The compression methods are based on psychoacoustic studies of the accuracy of directional hearing, and were further developed and validated in informal tests with one-way reproduction and in usability testing in which an actual teleconference was arranged. The results indicate that the data rate can be as low as approximately 3 kbit/s without a significant loss in the reproduced spatial quality.
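As a rough illustration of how a metadata rate of this order can arise, the sketch below multiplies the number of frequency bands, the bits spent per band on direction and diffuseness, and the update rate. All names and parameter values here (16 bands, 5 azimuth bits, 3 diffuseness bits, 25 updates per second, a uniform azimuth quantizer) are hypothetical and chosen only for illustration; they are not taken from the paper, whose actual quantization scheme may differ.

```python
def metadata_bitrate(num_bands, azimuth_bits, diffuseness_bits, updates_per_second):
    """Return the metadata rate in bit/s for a simple fixed-allocation scheme.

    Assumption (not from the paper): every band sends one azimuth value and
    one diffuseness value at every update, with no entropy coding.
    """
    bits_per_update = num_bands * (azimuth_bits + diffuseness_bits)
    return bits_per_update * updates_per_second


def quantize_azimuth(azimuth_deg, bits):
    """Uniformly quantize an azimuth angle to 2**bits levels around the circle.

    Returns the integer index that would be transmitted and the reconstructed
    angle in degrees. Uniform quantization is an illustrative assumption.
    """
    levels = 2 ** bits
    step = 360.0 / levels
    # Wrap into [0, 360) and round to the nearest level; the final modulo
    # maps angles just below 360 degrees back to index 0.
    index = int(round((azimuth_deg % 360.0) / step)) % levels
    return index, index * step


if __name__ == "__main__":
    # Hypothetical configuration: 16 bands, 5 + 3 bits per band, 25 updates/s.
    rate = metadata_bitrate(16, 5, 3, 25)
    print(f"metadata rate: {rate / 1000:.1f} kbit/s")
    print(quantize_azimuth(100.0, 5))
```

With these assumed values the metadata rate comes to 3200 bit/s, the same order of magnitude as the figure quoted in the abstract. Coarser angular resolution, fewer bands, or slower updates each reduce the rate proportionally.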
Authors:
Hirvonen, Toni; Ahonen, Jukka; Pulkki, Ville
Affiliations:
Institute of Computer Science (ICS) of the Foundation for Research and Technology, Hellas, Greece; TKK, Espoo, Finland (See document for exact affiliation information.)
AES Convention:
126 (May 2009)
Paper Number:
7706
Publication Date:
May 1, 2009
Subject:
Audio for Telecommunications