A Method of Chinese and Thai Cross-Lingual Query Expansion Based on Comparable Corpus


Peili Tang, Jing Zhao, Zhengtao Yu, Zhuo Wang, Yantuan Xian, Journal of Information Processing Systems Vol. 13, No. 4, pp. 805-817, Aug. 2017  

10.3745/JIPS.04.0039
Keywords: Comparable Corpus, Cross-Language Query Expansion, Cross-Language Information Retrieval, Words Relationship
Fulltext:

Abstract

Cross-lingual query expansion is usually based on the relationship among monolingual words. Bilingual comparable corpus contains relationships among bilingual words. Therefore, this paper proposes a method based on these relationships to conduct query expansion. First, the word vectors which characterize the bilingual words are trained using Chinese and Thai bilingual comparable corpus. Then, the correlation between Chinese query words and Thai words are computed based on these word vectors, followed with selecting the Thai candidate expansion terms via the correlative value. Then, multi-group Thai query expansion sentences are built by the Thai candidate expansion words based on Chinese query sentence. Finally, we can get the optimal sentence using the Chinese and Thai query expansion method, and perform the Thai query expansion. Experiment results show that the cross-lingual query expansion method we proposed can effectively improve the accuracy of Chinese and Thai cross-language information retrieval.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from November 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.




Cite this article
[APA Style]
Tang, P., Zhao, J., Yu, Z., Wang, Z., & Xian, Y. (2017). A Method of Chinese and Thai Cross-Lingual Query Expansion Based on Comparable Corpus. Journal of Information Processing Systems, 13(4), 805-817. DOI: 10.3745/JIPS.04.0039.

[IEEE Style]
P. Tang, J. Zhao, Z. Yu, Z. Wang, Y. Xian, "A Method of Chinese and Thai Cross-Lingual Query Expansion Based on Comparable Corpus," Journal of Information Processing Systems, vol. 13, no. 4, pp. 805-817, 2017. DOI: 10.3745/JIPS.04.0039.

[ACM Style]
Peili Tang, Jing Zhao, Zhengtao Yu, Zhuo Wang, and Yantuan Xian. 2017. A Method of Chinese and Thai Cross-Lingual Query Expansion Based on Comparable Corpus. Journal of Information Processing Systems, 13, 4, (2017), 805-817. DOI: 10.3745/JIPS.04.0039.