Speech enhancement (SE) systems typically operate on monaural input and are used for applications including voice communications and capture cleanup for user-generated content. Recent advancements and changes in the devices used for these applications are likely to increase the amount of two-channel content for the same applications. However, because SE systems are typically designed for monaural input, stereo results produced using trivial methods such as channel-independent or mid-side processing may be unsatisfactory and may include substantial speech distortions. To address this, the authors propose a system that creates a novel representation of stereo signals called custom mid-side signals (CMSS). CMSS allow the benefits of mid-side signals for center-panned speech to be extended to a much larger class of input signals. This, in turn, allows any existing monaural SE system to operate as an efficient stereo system by processing the custom mid signal. This paper describes how the parameters needed for CMSS can be efficiently estimated by a component of the spatio-level-filtering source separation system. Subjective listening using state-of-the-art deep-learning-based SE systems on stereo content with various speech mixing styles shows that CMSS processing leads to improved speech quality at approximately half the cost of channel-independent processing.
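The abstract does not state the CMSS equations, so the following is only a minimal sketch of the general idea of running a monaural enhancer on a mid-like channel. It assumes, purely for illustration, that the custom mid and side signals are a rotation of the left and right channels by a panning angle `theta`; with `theta = pi/4` this reduces to conventional mid-side up to scaling and the sign of the side channel. The function names, the `theta` parameter, and the choice to pass the side channel through unchanged are all hypothetical and are not taken from the paper, which estimates its parameters with a component of the spatio-level-filtering system.

```python
import math

def to_custom_mid_side(left, right, theta):
    """Rotate (left, right) by an assumed panning angle theta (radians).

    Illustrative assumption only, not the paper's definition of CMSS.
    A source panned with gains (cos(theta), sin(theta)) lands entirely
    in the returned mid channel and cancels in the side channel.
    """
    c, s = math.cos(theta), math.sin(theta)
    mid = [c * l + s * r for l, r in zip(left, right)]
    side = [-s * l + c * r for l, r in zip(left, right)]
    return mid, side

def from_custom_mid_side(mid, side, theta):
    """Invert the rotation above to recover (left, right)."""
    c, s = math.cos(theta), math.sin(theta)
    left = [c * m - s * d for m, d in zip(mid, side)]
    right = [s * m + c * d for m, d in zip(mid, side)]
    return left, right

def enhance_stereo(left, right, theta, mono_enhancer):
    """Run a monaural enhancer once, on the mid channel only.

    The side channel is passed through unchanged here (an assumption),
    so the enhancer runs once rather than once per channel.
    """
    mid, side = to_custom_mid_side(left, right, theta)
    return from_custom_mid_side(mono_enhancer(mid), side, theta)

# Usage: speech panned off-center with an assumed angle of 0.3 rad.
theta = 0.3
speech = [1.0, -0.5, 0.25]
left = [math.cos(theta) * x for x in speech]
right = [math.sin(theta) * x for x in speech]
mid, side = to_custom_mid_side(left, right, theta)
out_left, out_right = enhance_stereo(left, right, theta, lambda m: m)
```

In this sketch the off-center speech appears entirely in `mid` and vanishes from `side`, which illustrates why a single monaural pass might suffice for a broader class of panned inputs than conventional mid-side processing handles.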

Authors: Master, Aaron S.; Lu, Lie; Swedlow, Nathan
Affiliation: Dolby Laboratories, Inc., San Francisco, CA
JAES Volume 71 Issue 7/8 pp. 431-440; July 2023
Publication Date: July 10, 2023

This paper is Open Access, which means you can download it for free.
