Speech Recognition using Wavelets and Improved SVM
Abstract
Speaker recognition (identification/verification) is the computing task of validating a user’s claimed identity using speaker-specific information contained in speech waves; that is, it enables voice-based access control for various services. Systems based on the Discrete Wavelet Transform (DWT) have shown robust results for several years and are widely used in speaker recognition applications. This paper presents a text-independent speaker recognition system that uses the DWT for feature extraction and a kernel Support Vector Machine (SVM) for classification, with the decision taken by a simplified-Class Support Vector Machine approach. The proposed SVM approach converts local Euclidean distances between frame vectors to angles by projecting these multidimensional vectors together, and obtains the minimum global distance from the non-linearly aligned speech path in order to address audio classification and, hence, sound recognition. The DWT of each frame of the spoken word is used to extract the main features as data code vectors. These data are then normalized with the normalized power algorithm, which reduces the number of feature vector coefficients, and are then scaled and tested against the stored data of the training spoken words to accomplish speaker identification. The DWT also yields a fixed amount of data that the SVM can use at modest cost. Finally, the proposed method is trained and tested on a very large database, with results limited to ten speakers (5 male and 5 female) and words of at most 17 phonemes. Its performance is accurate and stable, which raises the algorithm's efficiency and reduces execution time, with 97% overall accuracy.
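The feature-extraction step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the wavelet family, the decomposition depth, the framing parameters, or the exact form of the normalized power algorithm, so everything below is an assumption: a Haar wavelet, three decomposition levels, 256-sample frames with a 128-sample hop, and "normalized power" taken to mean the share of frame energy in each DWT sub-band. All function names are hypothetical, and the SVM classification stage is not shown.

```python
import numpy as np

def haar_dwt(signal):
    """One level of the Haar DWT: returns (approximation, detail)."""
    x = np.asarray(signal, dtype=float)
    if len(x) % 2:  # pad odd-length input by repeating the last sample
        x = np.append(x, x[-1])
    even, odd = x[0::2], x[1::2]
    approx = (even + odd) / np.sqrt(2.0)
    detail = (even - odd) / np.sqrt(2.0)
    return approx, detail

def subband_power_features(frame, levels=3):
    """Share of the frame's energy in each DWT sub-band (assumed meaning
    of 'normalized power'). Returns levels + 1 values."""
    approx = np.asarray(frame, dtype=float)
    powers = []
    for _ in range(levels):
        approx, detail = haar_dwt(approx)
        powers.append(np.sum(detail ** 2))
    powers.append(np.sum(approx ** 2))
    powers = np.array(powers)
    total = powers.sum()
    # A silent frame has zero total energy; return zeros instead of dividing.
    return powers / total if total > 0 else powers

def frame_signal(signal, frame_len=256, hop=128):
    """Split a 1-D signal into overlapping frames of equal length."""
    x = np.asarray(signal, dtype=float)
    n_frames = 1 + max(0, (len(x) - frame_len) // hop)
    return np.stack([x[i * hop: i * hop + frame_len] for i in range(n_frames)])

def utterance_features(signal, frame_len=256, hop=128, levels=3):
    """One fixed-length feature vector per frame of the utterance."""
    frames = frame_signal(signal, frame_len, hop)
    return np.array([subband_power_features(f, levels) for f in frames])
```

Each frame is reduced to `levels + 1` coefficients regardless of its length, which is one way to obtain the fixed amount of data per frame that the abstract mentions. The resulting rows could then be scaled and passed to an SVM classifier.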
