Distinguishing True and Fake News by Using Text Mining and Machine Learning Algorithm
American Journal of Data Mining and Knowledge Discovery
Volume 5, Issue 2, December 2020, Pages: 20-26
Received: Jul. 6, 2020;
Accepted: Jul. 21, 2020;
Published: Sep. 19, 2020
Views 180 Downloads 64
Hyunseo Lee, Seoul International School, Gyeonggi-do, South Korea
Ian Paik Choe, Seoul Foreign School, Seoul, South Korea
Jioh In, Princeton International School of Mathematics and Science, New Jersey, the United States
Han Sol Kim, Fayston Preparatory School, Gyeonggi-do, South Korea
Follow on us
With recent advancements in social media and technology as a whole, online news sources have increased. Therefore there has been a higher demand of people wanting a convenient way to find recent, relevant and updated online news articles and posts from social media platforms. In the current status quo, many people feel comfortable with their main source of news being social media articles. Unfortunately, receiving news via social media platforms and unverified online sites has aroused many problems, one of which being fake news (news which contain incorrect or biased facts and statements). Many individuals all around the world are vulnerable and subject to fake news and becoming victims of propaganda and/or being misinformed. To solve this world-wide complication, we used word preprocessing skills to digest the content of articles, and used several mathematical vectors to pinpoint the legitimacy of a news article. To establish an accurate system, words used in examples of fake news and real news were collected using Python. Verifying fake and real news is an important process that all news should go through as it can result in immense consequences. Data on real news and fake news were collected from Kaggle. We had the conclusion that the trained machine learning algorithms showed high accuracy of distinguishing which indicates our research was successful.
Fake News, Preprocessing Data, Data Analysis, Text Mining, Machine Learning
To cite this article
Ian Paik Choe,
Han Sol Kim,
Distinguishing True and Fake News by Using Text Mining and Machine Learning Algorithm, American Journal of Data Mining and Knowledge Discovery.
Vol. 5, No. 2,
2020, pp. 20-26.
Copyright © 2020 Authors retain the copyright of this article.
This article is an open access article distributed under the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/
) which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Newspapers Statistics-Worldometer. (n.d.). Retrieved June 20, 2020, from https://www.worldometers.info/newspapers/.
How News Has Changed-News-Macalester College. (n.d.). Retrieved June 20, 2020, from https://www.macalester.edu/news/2017/04/how-news-has-changed/.
What Is Digital Media? All You Need to Know About New Media. (n.d.). Retrieved June 20, 2020, from https://online.maryville.edu/blog/what-is-digital-media/.
Pew Research: People prefer social media over print newspapers for news consumption-TechSpot. (n.d.). Retrieved June 20, 2020, from https://www.techspot.com/news/77816-pew-research-people-prefer-social-media-over-print.html.
Nguyen, A. (2010). Harnessing the potential of online news: Suggestions from a study on the relationship between online news advantages and its post-adoption consequences. Journalism: Theory, Practice & Criticism, 11 (2), 223-241. https://doi.org/10.1177/1464884909355910.
Why do people read online news? (Research summary) | Online Journalism Blog. (n.d.). Retrieved June 20, 2020, from https://onlinejournalismblog.com/2010/04/27/why-do-people-read-online-news-research-summary/.
How Social Media Has Changed How We Consume News. (n.d.). Retrieved June 20, 2020, from https://www.forbes.com/sites/nicolemartin1/2018/11/30/how-social-media-has-changed-how-we-consume-news/#3eea114f3c3c.
Hopkins, J. (n.d.). Research Guides: Fake News: Develop Your Fact-Checking Skills: What Kinds of Fake News Exist?
Types of ’Fake News’ and Why They Matter | Ogilvy. (n.d.). Retrieved June 20, 2020, from https://www.ogilvy.com/ideas/5-types-fake-news-why-they-matter.
Engle, M. (n.d.). LibGuides: Fake News, Propaganda, and Bad information: Learning to Critically Evaluate Media Sources.: What Is Fake News?
Types of online news content accessed worldwide 2016 | Statista. (n.d.). Retrieved June 20, 2020, from https://www.statista.com/statistics/262510/types-of-online-news-content-accessed-worldwide/.
Fake news and critical literacy resources | National Literacy Trust. (n.d.). Retrieved June 20, 2020, from https://literacytrust.org.uk/resources/fake-news-and-critical-literacy-resources/.
Fake News & Social Media EuropCom 2017-Media Literacy Workshop. Retrieved November, 2017, from https://cor.europa.eu/en/events/Documents/Europcom/I.%20Heijnen_Session%2014.pdf.
Logistic Regression-Detailed Overview-Towards Data Science. (n.d.). Retrieved July 4, 2020, from https://towardsdatascience.com/logistic-regression-detailed-overview-46c4da4303bc.
KNN Classification. (n.d.). Retrieved July 4, 2020, from https://www.saedsayad.com/k_nearest_neighbors.htm.
Support Vector Machine-Introduction to Machine Learning Algorithms. (n.d.). Retrieved July 4, 2020, from https://towardsdatascience.com/support-vector-machine-introduction-to-machine-learning-algorithms-934a444fca47.
Decision Tree Algorithm-Explained-Towards Data Science. (n.d.). Retrieved July 4, 2020, from https://towardsdatascience.com/decision-tree-algorithm-explained-83beb6e78ef4.
Learn Naive Bayes Algorithm | Naive Bayes Classifier Examples. (n.d.). Retrieved July 4, 2020, from https://www.analyticsvidhya.com/blog/2017/09/naive-bayes-explained/.