Audio enhancement is a signal processing method that improves the listening experience. Although most audio devices provide a variety of sound-enhancing effects, it is reported that very few people are active users of this feature. This lack of usability comes from insufficient sound improvement because of concerns about scene-rendering mismatch, which means that the processing applied to an unintended target may even damage the sound quality. The key solution to this problem is sound intelligence that provides an optimal sound effect with very low latency. The authors propose a real-time audio enhancement system based on a highly precise audio scene classifier using convolutional neural networks. The entire computation including convolutions is optimized for digital signal processing--level implementation, resulting in enhanced audio outputs for every audio frame.
Authors:
Hwang, Inwoo; Kim, Kibeom; Kim, Sunmin
Affiliations:
Sound Laboratory, Visual Display Division, Samsung Electronics, Suwon, South Korea; Sound Laboratory, Visual Display Division, Samsung Electronics, Suwon, South Korea; Sound Laboratory, Visual Display Division, Samsung Electronics, Suwon, South Korea(See document for exact affiliation information.)
JAES Volume 71 Issue 10 pp. 719-728; October 2023
Publication Date:
October 10, 2023
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this report yet.
To be notified of new comments on this report you can
subscribe to this RSS feed.
Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this report then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.