BINARY LOGISTIC REGRESSION BASED CLASSIFIER FOR FAKE NEWS

Authors

  • Joseph Meynard Gumahin Ogdol Northwestern Mindanao State College of Science and Technology http://orcid.org/0000-0003-3149-1183
  • Bill-Lawrence Tigol Samar Northwestern Mindanao State College of Science and Technology http://orcid.org/0000-0002-2240-9929
  • Charmalyn Catarroja Zaria International Inc., Chicago, USA

Keywords:

News, Fake, Classifier, Machine Learning, Binary Linear Regression, Big Data

Abstract

Addressing modern society's mutual need for reliable News from new media has been a challenge since the development of modern mobile and telecommunication technologies. This study attempts to contribute to current solutions in mitigating the spread of fake news by creating a classifier based on a logistic binary regression commonly prevalent in the field of machine learning and artificial intelligence. A dataset of 10,000 news data has been used in this study and was processed to measure the sentiment neutrality, page rank and content length to content structure error ratio for each data to create a model for the fake news classifier. The model was then subject to prediction tests and has shown to have an 80% accuracy rate, therefore implying that the model derived from this study is capable of classifying legitimate news apart from fake news.

References

Anderson, S. P., De Palma, A., & Thisse, J. F. (1992). Discrete choice theory of product
differentiation. MIT press.

Amarasingam, A. (Ed.). (2011). The Stewart/Colbert Effect: Essays on the real impacts of fake
news. McFarland.

Conroy, N. J., Rubin, V. L., & Chen, Y. (2015). Automatic deception detection: Methods for
finding fake news. Proceedings of the Association for Information Science and Technology,
52(1), 1-4.

Engle, R. F., & Ng, V. K. (1993). Measuring and testing the impact of news on volatility. The
journal of finance, 48(5), 1749-1778.

Gupta, A., Lamba, H., Kumaraguru, P., & Joshi, A. (2013, May). Faking sandy: characterizing
and identifying fake images on twitter during hurricane sandy. In Proceedings of the 22nd
international conference on World Wide Web (pp. 729-736). ACM.

Kucharski, A. (2016). Post-truth: Study epidemiology of fake news. Nature, 540(7634), 525.

McFadden, D. (1986). The choice theory approach to market research. Marketing science, 5(4),
275-297.

Rubin, V. L., Chen, Y., & Conroy, N. J. (2015). Deception detection for news: three types of
fakes. Proceedings of the Association for Information Science and Technology, 52(1), 1-4.

Rubin, V., Conroy, N., Chen, Y., & Cornwell, S. (2016). Fake news or truth? using satirical cues
to detect potentially misleading news. In Proceedings of the Second Workshop on Computational
Approaches to Deception Detection (pp. 7-17).

Tacchini, E., Ballarin, G., Della Vedova, M. L., Moret, S., & de Alfaro, L. (2017). Some like it
hoax: Automated fake news detection in social networks. arXiv preprint arXiv:1704.07506

Wang, W. Y. (2017). A New Benchmark Dataset for Fake News Detection. arXiv preprint arXiv:1705.00648.

Published

2018-06-29