Speaker Verification with the Constraint of Limited Data

Thyamagondlu Renukamurthy Jayanthi Kumari, Haradagere Siddaramaiah Jayanna, Journal of Information Processing Systems Vol. 14, No. 4, pp. 807-823, Aug. 2018  

Keywords: Gaussian Mixture Model (GMM), GMM-UBM, Multiple Frame Rate (MFR), Multiple Frame Size (MFS), MFSR, SFSR


The performance of a speaker verification system depends on the utterances available from each speaker, because the information needed to verify a speaker has to be captured from those utterances. Under the constraint of limited data, where the training and testing data each last only a few seconds, speaker verification becomes a challenging task. The feature vectors extracted by single frame size and rate (SFSR) analysis are not sufficient for training and testing speakers in this setting. This leads to poor speaker modeling during training and may not yield reliable decisions during testing. The problem can be addressed by increasing the number of feature vectors extracted from training and testing data of the same duration. To that end, we use multiple frame size (MFS), multiple frame rate (MFR), and multiple frame size and rate (MFSR) analysis techniques for speaker verification under the limited data condition. These techniques extract relatively more feature vectors during training and testing, which improves both modeling and testing with limited data. To demonstrate this, we use mel-frequency cepstral coefficients (MFCC) and linear prediction cepstral coefficients (LPCC) as features. The Gaussian mixture model (GMM) and the GMM-universal background model (GMM-UBM) are used for modeling the speaker. The database used is NIST-2003. The experimental results indicate that MFS, MFR, and MFSR analysis perform markedly better than SFSR analysis. They also show that LPCC-based MFSR analysis performs better than the other combinations of analysis technique and feature extraction technique.
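The way MFS, MFR, and MFSR analysis yield more feature vectors from the same few seconds of speech can be illustrated with a minimal framing sketch. The frame sizes, frame shifts, sampling rate, and the helper names `frame_signal` and `count_frames` below are illustrative assumptions and are not taken from the paper; in a full system each frame would then be passed to MFCC or LPCC extraction, and the resulting vectors would be pooled.

```python
import numpy as np

def frame_signal(signal, frame_len, frame_shift):
    """Split a 1-D signal into overlapping frames, dropping the incomplete tail."""
    if len(signal) < frame_len:
        return np.empty((0, frame_len))
    n_frames = 1 + (len(signal) - frame_len) // frame_shift
    # Row i holds samples [i * frame_shift, i * frame_shift + frame_len).
    idx = np.arange(frame_len)[None, :] + frame_shift * np.arange(n_frames)[:, None]
    return signal[idx]

def count_frames(signal, sizes_ms, shifts_ms, sample_rate):
    """Count the frames pooled over every (frame size, frame shift) pair."""
    total = 0
    for size_ms in sizes_ms:
        for shift_ms in shifts_ms:
            frame_len = int(round(sample_rate * size_ms / 1000))
            frame_shift = int(round(sample_rate * shift_ms / 1000))
            total += len(frame_signal(signal, frame_len, frame_shift))
    return total

# Assumed setup: 3 seconds of audio at 8 kHz (placeholder noise, not real speech).
sample_rate = 8000
speech = np.random.default_rng(0).standard_normal(3 * sample_rate)

# Assumed analysis settings, chosen only for illustration.
sfsr = count_frames(speech, [20], [10], sample_rate)                  # one size, one shift
mfs = count_frames(speech, [10, 15, 20], [10], sample_rate)           # several sizes, one shift
mfr = count_frames(speech, [20], [2.5, 5, 10], sample_rate)           # one size, several shifts
mfsr = count_frames(speech, [10, 15, 20], [2.5, 5, 10], sample_rate)  # several sizes and shifts

print(sfsr, mfs, mfr, mfsr)
```

Under these assumed settings, each multi-resolution variant produces several times as many frames as SFSR for the same utterance, which is the effect the abstract relies on when it speaks of increasing the feature vectors for data of the same duration.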

Cite this article
[APA Style]
Kumari, T. & Jayanna, H. (2018). Speaker Verification with the Constraint of Limited Data. Journal of Information Processing Systems, 14(4), 807-823. DOI: 10.3745/JIPS.01.0030.

[IEEE Style]
T. R. J. Kumari and H. S. Jayanna, "Speaker Verification with the Constraint of Limited Data," Journal of Information Processing Systems, vol. 14, no. 4, pp. 807-823, 2018. DOI: 10.3745/JIPS.01.0030.

[ACM Style]
Thyamagondlu Renukamurthy Jayanthi Kumari and Haradagere Siddaramaiah Jayanna. 2018. Speaker Verification with the Constraint of Limited Data. Journal of Information Processing Systems, 14, 4, (2018), 807-823. DOI: 10.3745/JIPS.01.0030.