The method of speaker recognition based on wavelet functions and neural networks is presented in this paper. The wavelet functions are used to obtain the approximation function and the details of the speaker’s averaged spectrum in order to extract speaker’s voice characteristics from the frequency spectrum. The approximation function and the details are then used as input data for decision-making neural networks. In this recognition process, not only the decision on the speaker’s identity is made, but also the probability that the decision is correct can be provided.
Authors:
Domitrovic, Hrvoje; Grubesa, Sanja; Grubesa, Tomislav
Affiliations:
Faculty of EE and Computing; RIZ Transmitters Co.(See document for exact affiliation information.)
AES Conference:
26th International Conference: Audio Forensics in the Digital Age (July 2005)
Paper Number:
2-2
Publication Date:
July 1, 2005
Subject:
Audio Forensics in the Digital Age
Click to purchase paper as a non-member or you can login as an AES member to see more options.
No AES members have commented on this paper yet.
To be notified of new comments on this paper you can subscribe to this RSS feed. Forum users should login to see additional options.
If you are not yet an AES member and have something important to say about this paper then we urge you to join the AES today and make your voice heard. You can join online today by clicking here.