Time domain speech compression using the SDA (sample, discard, abut) procedure at compression ratios of 0.25 0.75 is studied by means of a new analog speech processor and minicomputer algorithms. Fourier transform methods have been used to establish correspondence between the quality of the reconstructed compressed speech waveforms and the subjective recognition of compressed speech. The result of two psychoacoustic experiments indicate that: 1. The interruption frequency should be equal to the pitch frequency of the voice waveform for the optimum recognition of the compressed speech. 2. Smoothing of the discontinuities with electronic techniques significantly improves the recognition of the compressed speech. The optimum smoothing parameters, window-width, and characteristic function are also obtained from this study.
Authors:
Bennett, Ian M.; Linvill, J. G.
Affiliation:
DEPARTMENT OF ELECTRICAL ENGINEERING, STANFORD UNIVERSITY, STANFORD, CA
AES Convention:
51 (May 1975)
Paper Number:
1044
Publication Date:
May 1, 1975
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.