Combination of Classifiers Decisions for Multilingual Speaker Identification

B. G. Nagaraja and H. S. Jayanna
Volume: 13, No: 4, Page: 928 ~ 940, Year: 2017
10.3745/JIPS.02.0025
Keywords: Classifier Combination, Cross-lingual, Monolingual, Multilingual, Speaker Identification
Full Text:

Abstract
State-of-the-art speaker recognition systems may work better for the English language. However, if the same system is used for recognizing those who speak different languages, the systems may yield a poor performance. In this work, the decisions of a Gaussian mixture model-universal background model (GMM- UBM) and a learning vector quantization (LVQ) are combined to improve the recognition performance of a multilingual speaker identification system. The difference between these classifiers is in their modeling techniques. The former one is based on probabilistic approach and the latter one is based on the fine-tuning of neurons. Since the approaches are different, each modeling technique identifies different sets of speakers for the same database set. Therefore, the decisions of the classifiers may be used to improve the performance. In this study, multitaper mel-frequency cepstral coefficients (MFCCs) are used as the features and the monolingual and cross-lingual speaker identification studies are conducted using NIST-2003 and our own database. The experimental results show that the combined system improves the performance by nearly 10% compared with that of the individual classifier.

Article Statistics
Multiple requests among the same broswer session are counted as one view (or download).
If you mouse over a chart, a box will show the data point's value.


Cite this article
IEEE Style
B. G. Nagaraja and H. S. Jayanna, "Combination of Classifiers Decisions for Multilingual Speaker Identification," Journal of Information Processing Systems, vol. 13, no. 4, pp. 928~940, 2017. DOI: 10.3745/JIPS.02.0025.

ACM Style
B. G. Nagaraja and H. S. Jayanna, "Combination of Classifiers Decisions for Multilingual Speaker Identification," Journal of Information Processing Systems, 13, 4, (2017), 928~940. DOI: 10.3745/JIPS.02.0025.