Text Independent Speaker Identification for Myanmar Speech

WCSE 2019 SPRING ISBN: 978-981-14-1455-8
DOI: 10.18178/wcse.2019.03.015

Win Lai Lai Phyu, Win Pa Pa

Abstract— Nowadays, speech signal processing is one of the emerging applicat ion areas of digital processing. There are many research areas related to speech processing such as speaker recognition, speech recognition, and speech synthesis. Speaker identificat ion is the task of analy zing the speakers’ characteristics in speech to exactly identify individuals. The identification task performs better when there is enough background training data. Mel Frequency Cepstral Coefficient (MFCC), Perceptual Linear Prediction (PLP) and Filter Bank features are extracted as front-end processing. Constructing Universal Background Model (UBM) is the main component of i-vector system as it is essential for collecting statistics from speech utterances and for clustering the speaker models. This paper indicates that the impacts of unlimited speech data in speaker identification by using i-vector method with probabilistic linear discriminative analysis (PLDA) approach and the import role of speaker models in identification process.

Index Terms— Speaker Identification (SI), Universal Background Model (UBM), MFCC, Filter Bank, PLP, ivectors, Speaker Model, PLDA, Myanmar Speech

Win Lai Lai Phyu, Win Pa Pa
Natural Language Processing Lab., University of Computer Studies, MYANMAR

[Download]

Cite: Win Lai Lai Phyu, Win Pa Pa, "Text Independent Speaker Identification for Myanmar Speech," Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering WCSE_2019_SPRING, pp. 86-89, Yangon, Myanmar, February 27-March 1, 2019.

PREVIOUS PAPER
Newly-Coined Words and Emoticons Dictionary Construction for Social Data Sentiment Analysis

NEXT PAPER
Anaphora Resolution for Myanmar Text Using K-Nearest Neighbor Algorithm