Can social media data be used to make reasonably accurate estimates of electoral outcomes and public opinion? Some studies indicate that such estimates are feasible (Skoric et al., 2012; Tumasjan et al., 2012), while others are skeptical about them (Metaxas & Mustafaraj, 2012). Given that social media users—particularly more active ones—are not representative of the general population, and that the data they generate is both unstructured and unsolicited, how could such analyses yield accurate predictions of public opinion?
In this meta-review of published research, we examine the three main approaches to social media-based predictions of elections and public opinion: (1) volume-based analysis; (2) sentiment analysis, based on lexicons and machine learning; and (3) network analysis. In comparing the predictive power of these three approaches, we find that network analysis outperforms both volume-based and sentiment analysis, while volume-based analysis outperforms sentiment analysis. Finally, we find that methods which combine network analysis with either volume- or sentiment-based analysis yield the most accurate predictions when benchmarked against voting results or public opinion surveys. We discuss the implications of these findings for future research and identify some potential challenges that lie ahead.
References
- Metaxas, P. T., & Mustafaraj, E. (2012). Social media and the elections. Science, 338(6106), 472-473.
- Skoric, M. M., Poor, N. D., Achananuparp, P, Lim, E. P., & Jiang, J. (2012). Tweets and votes: A study of the 2011 Singapore General Election. Proceedings of the Hawaii International Conference on System Sciences. Washington, D.C.: IEEE Computer Society. doi:10.1109/HICSS.2012.607
- Tumasjan, A., Sprenger, T. O., Sander, P. G., & Welpe, I. M. (2012). Predicting elections with Twitter: What 140 characters reveal about political sentiment. Social Science Computer Review, 30(2), 229-234.
Downloads