Anaphora Resolution for Myanmar Text Using K-Nearest Neighbor Algorithm
Abstract— Anaphora resolution, which most commonly appears as pronoun resolution, is the problem of resolving references to earlier or later items in the discourse. It is an active research topic with applications in areas such as text mining, text summarization, dialogue interpretation, and information extraction. Anaphora resolution for English and other European languages has been studied extensively, but it has not yet been sufficiently explored for the Myanmar language. This paper presents a Myanmar anaphora resolution system that combines rule-based part-of-speech tagging with a machine learning approach. A rule-based method using morphological information collects anaphors and their possible antecedents. A K-Nearest Neighbor (k-NN) approach then selects the most probable candidate as the antecedent of the anaphor.
Index Terms— anaphora resolution, machine learning, k-nearest neighbour, morphological features
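The selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the feature set, the value of k, or the distance metric, so the three features (sentence distance, number agreement, whether the candidate denotes a person), k = 3, Euclidean distance, and all of the toy data below are assumptions made purely for illustration. Each candidate antecedent is scored by the fraction of its k nearest labelled examples that were true antecedents, and the highest-scoring candidate is chosen.

```python
import math

# Hypothetical labelled examples: (features, label), label 1 = true antecedent.
# Features (assumed): (sentence_distance, number_agrees, is_person).
TRAIN = [
    ((0, 1, 1), 1),
    ((1, 1, 1), 1),
    ((2, 1, 1), 1),
    ((1, 0, 0), 0),
    ((3, 0, 1), 0),
    ((4, 1, 0), 0),
]

def knn_score(train, query, k=3):
    """Fraction of the k nearest training examples labelled as antecedents."""
    nearest = sorted(train, key=lambda ex: math.dist(ex[0], query))[:k]
    return sum(label for _, label in nearest) / k

def pick_antecedent(train, candidates, k=3):
    """Return the name of the candidate with the highest k-NN score."""
    best = max(candidates, key=lambda c: knn_score(train, c[1], k))
    return best[0]

# Hypothetical candidates for one anaphor, as the rule-based stage might supply.
CANDIDATES = [
    ("cand_a", (1, 1, 1)),  # near the anaphor, agrees in number, a person
    ("cand_b", (3, 0, 0)),  # farther away, no agreement, not a person
]

if __name__ == "__main__":
    print(pick_antecedent(TRAIN, CANDIDATES))
```

In a real system the feature vectors would come from the rule-based stage with morphological information, and the features would typically be scaled so that no single feature dominates the distance.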
Khin Theink Theink Soe, Tin Htar Nwe, Khin Thandar Nwet
Natural Language Processing Lab, University of Computer Studies, MYANMAR
Cite: Khin Theink Theink Soe, Tin Htar Nwe, Khin Thandar Nwet, "Anaphora Resolution for Myanmar Text Using K-Nearest Neighbor Algorithm," Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering WCSE_2019_SPRING, pp. 90-94, Yangon, Myanmar, February 27-March 1, 2019.