Since a cathode-ray tube television was introduced first to the consumers in the late 1920s, a variety of multimedia device form-factors has appeared in the consumer market until now. Although the values of multimedia devices had been mostly put on picture quality and sound quality in the past, it is undoubtedly told at this moment that user experience is the most important and attractive value for the multimedia products. As for now, almost all outstanding features of the brand-new products are the technologies about user convenience such as voice user interaction, device unlock, contents recommendation, and so on. Likewise, an unprecedented feature of smart TVs, the Active Voice Amplifier, was introduced in the Consumer Electronics Show 2020, and it detects disturbing noise and enhances voice clarity accordingly and automatically. To design this feature and make it work in real time on real devices, state-ofthe- art signal processing methods and deep learning technologies are integrated in a function for the novel approach of noisy environment detection and speech extraction from multimedia audio contents. This paper overviews what this function pursues in user experience, describes how it was designed in terms of signal processing methods, and demonstrates how effectively it works on real-time TV systems.
Cho, Jaeyoun; Kim, Sunmin; Hwang, Inwoo
Affiliation: Samsung Electronics Co., Ltd., Suwon, South Korea
JAES Volume 71 Issue 3 pp. 129-137; March 2023
Publication Date: March 7, 2023
No AES members have commented on this report yet.
If you are not yet an AES member and have something important to say about this report then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.