A Method of Chinese and Thai Cross-Lingual Query Expansion Based on Comparable Corpus

Peili Tang, Jing Zhao, Zhengtao Yu, Zhuo Wang and Yantuan Xian
Volume: 13, No: 4, Page: 805 ~ 817, Year: 2017
10.3745/JIPS.04.0039
Keywords: Comparable Corpus, Cross-Language Query Expansion, Cross-Language Information Retrieval, Words Relationship
Full Text:

Abstract
Cross-lingual query expansion is usually based on the relationship among monolingual words. Bilingual comparable corpus contains relationships among bilingual words. Therefore, this paper proposes a method based on these relationships to conduct query expansion. First, the word vectors which characterize the bilingual words are trained using Chinese and Thai bilingual comparable corpus. Then, the correlation between Chinese query words and Thai words are computed based on these word vectors, followed with selecting the Thai candidate expansion terms via the correlative value. Then, multi-group Thai query expansion sentences are built by the Thai candidate expansion words based on Chinese query sentence. Finally, we can get the optimal sentence using the Chinese and Thai query expansion method, and perform the Thai query expansion. Experiment results show that the cross-lingual query expansion method we proposed can effectively improve the accuracy of Chinese and Thai cross-language information retrieval.

Article Statistics
Multiple requests among the same broswer session are counted as one view (or download).
If you mouse over a chart, a box will show the data point's value.


Cite this article
IEEE Style
Peili Tang, Jing Zhao, Zhengtao Yu, Zhuo Wang, and Yantuan Xian, "A Method of Chinese and Thai Cross-Lingual Query Expansion Based on Comparable Corpus," Journal of Information Processing Systems, vol. 13, no. 4, pp. 805~817, 2017. DOI: 10.3745/JIPS.04.0039.

ACM Style
Peili Tang, Jing Zhao, Zhengtao Yu, Zhuo Wang, and Yantuan Xian, "A Method of Chinese and Thai Cross-Lingual Query Expansion Based on Comparable Corpus," Journal of Information Processing Systems, 13, 4, (2017), 805~817. DOI: 10.3745/JIPS.04.0039.