Predicting the Unemployment Rate Using Social Media Analysis


Pum-Mo Ryu, Journal of Information Processing Systems Vol. 14, No. 4, pp. 904-915, Aug. 2018  

DOI: 10.3745/JIPS.04.0079
Keywords: Google Index, Prediction, Sentiment Analysis, Social Media, Unemployment Rate

Abstract

We demonstrate how social media content can be used to predict the unemployment rate, a real-world indicator. We present a novel method for predicting the unemployment rate using social media analysis based on natural language processing and statistical modeling. The system collects social media content, including news articles, blogs, and tweets written in Korean, and then extracts data for modeling using part-of-speech tagging and sentiment analysis techniques. Autoregressive integrated moving average with exogenous variables (ARIMAX) and autoregressive with exogenous variables (ARX) models are then fit to the analyzed data to predict the unemployment rate. The proposed method quantifies the social moods expressed in social media content, whereas existing methods simply present social tendencies. Compared with a Google Index-based model, our model reduced the mean absolute percentage error by 27.9%.
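
As a rough illustration of the modeling and evaluation steps named in the abstract, the following Python sketch fits an ARX model by ordinary least squares and computes the mean absolute percentage error (MAPE). It is not the implementation used in the paper. The function names (`fit_arx`, `predict_arx`, `mape`, `relative_improvement`), the lag order `p`, and the choice of a single contemporaneous exogenous series (for example, a monthly sentiment score) are assumptions made for illustration only. Reading the reported 27.9% as a relative reduction in MAPE is likewise an assumption.

```python
import numpy as np


def mape(actual, predicted):
    """Mean absolute percentage error, in percent (actual values must be nonzero)."""
    actual = np.asarray(actual, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    return float(np.mean(np.abs((actual - predicted) / actual)) * 100.0)


def _design_matrix(y, x, p):
    # One row per time step t >= p: [1, y[t-1], ..., y[t-p], x[t]]
    rows = [
        np.concatenate(([1.0], y[t - p:t][::-1], [x[t]]))
        for t in range(p, len(y))
    ]
    return np.vstack(rows)


def fit_arx(y, x, p=1):
    """Least-squares fit of the assumed form
    y[t] = c + a_1*y[t-1] + ... + a_p*y[t-p] + b*x[t].
    Returns the coefficient vector [c, a_1, ..., a_p, b]."""
    y = np.asarray(y, dtype=float)
    x = np.asarray(x, dtype=float)
    coef, *_ = np.linalg.lstsq(_design_matrix(y, x, p), y[p:], rcond=None)
    return coef


def predict_arx(coef, y, x, p=1):
    """One-step-ahead in-sample predictions for t = p, ..., len(y) - 1."""
    y = np.asarray(y, dtype=float)
    x = np.asarray(x, dtype=float)
    return _design_matrix(y, x, p) @ coef


def relative_improvement(baseline_error, model_error):
    """Relative error reduction over a baseline, in percent."""
    return (baseline_error - model_error) / baseline_error * 100.0


if __name__ == "__main__":
    # Synthetic data only: x stands in for a hypothetical sentiment index.
    rng = np.random.default_rng(0)
    x = rng.normal(size=60)
    y = np.empty(60)
    y[0] = 3.5
    for t in range(1, 60):
        y[t] = 0.5 + 0.8 * y[t - 1] + 0.2 * x[t] + rng.normal(scale=0.05)
    coef = fit_arx(y, x, p=1)
    print("coefficients:", coef)
    print("in-sample MAPE (%):", mape(y[1:], predict_arx(coef, y, x, p=1)))
```

An ARIMAX model adds differencing and moving-average terms to this structure and is usually fit with a dedicated statistics package rather than plain least squares.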


Cite this article
[APA Style]
Ryu, P. (2018). Predicting the Unemployment Rate Using Social Media Analysis. Journal of Information Processing Systems, 14(4), 904-915. DOI: 10.3745/JIPS.04.0079.

[IEEE Style]
P. Ryu, "Predicting the Unemployment Rate Using Social Media Analysis," Journal of Information Processing Systems, vol. 14, no. 4, pp. 904-915, 2018. DOI: 10.3745/JIPS.04.0079.

[ACM Style]
Pum-Mo Ryu. 2018. Predicting the Unemployment Rate Using Social Media Analysis. Journal of Information Processing Systems, 14, 4, (2018), 904-915. DOI: 10.3745/JIPS.04.0079.