We present a novel approach to detecting infant cry in real-world outdoor and indoor settings. Using computationally inexpensive features such as Mel-frequency cepstral coefficients (MFCCs) and timbre-related features, the proposed algorithm yields very high recall rates for detecting infant cry in challenging settings such as café, street, playground, office, and home environments, even when the signal-to-noise ratio (SNR) is as low as 6 dB, while maintaining high precision. The results indicate that our approach is highly accurate and robust, and that it works in real time.
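To give a sense of what an SNR of 6 dB means, the following is a minimal sketch, not the authors' evaluation procedure. It assumes the usual power-ratio definition, SNR = 10 log10(P_signal / P_noise), under which 6 dB corresponds to signal power roughly four times the noise power. The helper names, the 16 kHz sample rate, the 450 Hz tone standing in for a cry, and the Gaussian noise standing in for background sound are all hypothetical choices made for illustration.

```python
import math
import random


def power(samples):
    """Mean squared amplitude of a sequence of samples."""
    return sum(s * s for s in samples) / len(samples)


def snr_db(signal, noise):
    """Signal-to-noise ratio in decibels: 10 * log10(P_signal / P_noise)."""
    return 10.0 * math.log10(power(signal) / power(noise))


def scale_noise_to_snr(signal, noise, target_snr_db):
    """Return a scaled copy of `noise` that yields the target SNR against `signal`."""
    target_ratio = 10.0 ** (target_snr_db / 10.0)
    gain = math.sqrt(power(signal) / (power(noise) * target_ratio))
    return [gain * n for n in noise]


# Hypothetical stand-ins (assumptions, not data from the paper):
# a pure tone in place of a cry, Gaussian noise in place of a cafe or street.
SAMPLE_RATE = 16000
rng = random.Random(0)
tone = [math.sin(2.0 * math.pi * 450.0 * t / SAMPLE_RATE) for t in range(SAMPLE_RATE)]
background = [rng.gauss(0.0, 1.0) for _ in range(SAMPLE_RATE)]

scaled = scale_noise_to_snr(tone, background, 6.0)
mixture = [s + n for s, n in zip(tone, scaled)]  # one second of audio at 6 dB SNR
measured = snr_db(tone, scaled)
```

Mixing at a fixed SNR in this way is one common way to build test material for a detector; the abstract does not say how the reported recordings were obtained.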
Authors:
Baijal, Anant; Kim, Jinsung; Jeong, Jae-hoon; Hwang, Inwoo; Park, JungEun; Ko, Byeong-Seob
Affiliation:
Samsung Electronics Co. Ltd., Suwon, Korea
AES Convention:
137 (October 2014)
Paper Number:
9142
Publication Date:
October 8, 2014
Subject:
Perception