Topic Extraction and Classification Method Based on Comment Sets


Xiaodong Tan, Journal of Information Processing Systems Vol. 16, No. 2, pp. 329-342, Apr. 2020  

10.3745/JIPS.04.0165
Keywords: Comment Text Set, Emotional Classification, LDA Topic Model, Support Vector Machine
Fulltext:

Abstract

"In recent years, emotional text classification is one of the essential research contents in the field of natural language processing. It has been widely used in the sentiment analysis of commodities like hotels, and other commentary corpus. This paper proposes an improved W-LDA (weighted latent Dirichlet allocation) topic model to improve the shortcomings of traditional LDA topic models. In the process of the topic of word sampling and its word distribution expectation calculation of the Gibbs of the W-LDA topic model. An average weighted value is adopted to avoid topic-related words from being submerged by high-frequency words, to improve the distinction of the topic. It further integrates the highest classification of the algorithm of support vector machine based on the extracted high-quality document-topic distribution and topic-word vectors. Finally, an efficient integration method is constructed for the analysis and extraction of emotional words, topic distribution calculations, and sentiment classification. Through tests on real teaching evaluation data and test set of public comment set, the results show that the method proposed in the paper has distinct advantages compared with other two typical algorithms in terms of subject differentiation, classification precision, and F1- measure."


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from November 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.




Cite this article
[APA Style]
Xiaodong Tan (2020). Topic Extraction and Classification Method Based on Comment Sets. Journal of Information Processing Systems, 16(2), 329-342. DOI: 10.3745/JIPS.04.0165.

[IEEE Style]
X. Tan, "Topic Extraction and Classification Method Based on Comment Sets," Journal of Information Processing Systems, vol. 16, no. 2, pp. 329-342, 2020. DOI: 10.3745/JIPS.04.0165.

[ACM Style]
Xiaodong Tan. 2020. Topic Extraction and Classification Method Based on Comment Sets. Journal of Information Processing Systems, 16, 2, (2020), 329-342. DOI: 10.3745/JIPS.04.0165.