In This Section
Audibility of a CD-Standard A/DA/A Loop Inserted into High-Resolution Audio Playback - September 2007
Sound Board: Food for Thought, Aesthetics in Orchestra Recording - April 2015
Reflecting on Reflections - June 2014
AES Journal Forum
A Simple Hybrid Approach to the Time-Scale Modification of Speech
Time-domain methods of time-scale modification (TSM) are attractive from the point of view of computational effort. However, they suffer from audible artifacts for larger timestretch ratios (greater than 1.3 times the original duration). The occurrence of these artifacts is often the main justification for the use of more involved analysis/synthesis methods at these ratios. For speech signals these artifacts take the form of transient repetition—causing a “stuttering” effect and roughness due to spectral mismatch at segment boundaries—most obvious during voiced signal periods. These phenomena are not addressed by existing timedomain methods. A simple hybrid algorithm utilizing both time-domain and analysis/synthesis methods is presented which illustrates how these distortions may be minimized. Results of formal listening tests illustrate an improvement in basic audio quality for timestretched speech signals when compared to equivalent samples processed by the synchronized overlap and add (SOLA) algorithm.
No AES members have commented on this paper yet.
Subscribe to this discussion
Start a discussion!
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.