Time-domain digital time-scale modification algorithms based on the overlap-and-add technique were developed mainly for speech signal applications and may not perform equally well for broad-band audio signals. Broad-band audio signals in general do not have a single 'pitch period' or similar structure to take advantage of, as exists in single-talker speech. Due to the basic assumption of quasi-periodicity in such methods, this means that the best alignment point between two successive frames in the overlapping region may not always be ideal. It results in a distortion in the form of repetitive waveforms, which produce echo and stuttering effects that are audible, especially after time-scale expansion. To overcome this problem, the broad-band signal is first divided into smaller subbands before the overlap-and-add technique is applied. The proposed subband analysis synchronous overlap-and-add (SASOLA) algorithm provides an alternative for time-scale modification of broad-band audio and speech signal applications without the various undesirable artifacts and high computational cost associated with the frequency-domain techniques.
Authors:
Tan, Roland K. C.; Lin, Amerson H. J.
Affiliations:
Kent Ridge Digital Labs (KRDL), Singapore ; Raffles Junior College, Singapore(See document for exact affiliation information.)
JAES Volume 48 Issue 5 pp. 437-449; May 2000
Publication Date:
May 1, 2000
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.