Time-domain methods of time-scale modification (TSM) are attractive from the point of view of computational effort. However, they suffer from audible artifacts for larger timestretch ratios (greater than 1.3 times the original duration). The occurrence of these artifacts is often the main justification for the use of more involved analysis/synthesis methods at these ratios. For speech signals these artifacts take the form of transient repetition—causing a “stuttering” effect and roughness due to spectral mismatch at segment boundaries—most obvious during voiced signal periods. These phenomena are not addressed by existing timedomain methods. A simple hybrid algorithm utilizing both time-domain and analysis/synthesis methods is presented which illustrates how these distortions may be minimized. Results of formal listening tests illustrate an improvement in basic audio quality for timestretched speech signals when compared to equivalent samples processed by the synchronized overlap and add (SOLA) algorithm.
Authors:
Knox, Don; Bailey, Nicholas; Stewart, Iain
Affiliations:
Glasgow Caledonian University, Glasgow, UK; University of Glasgow, Glasgow, UK(See document for exact affiliation information.)
JAES Volume 53 Issue 7/8 pp. 612-619; July 2005
Publication Date:
July 15, 2005
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.