dCollection 디지털 학술정보 유통시스템

Constructing a User-Centered Fake News Detection Model by Using Classification Algorithms in Machine Learning Techniques

주제(키워드) INDEX TERMS Classification algorithms , fake news , fake news detection , feature selection , prediction algorithms , predictive models , XGBoost
주제(기타) Computer Science, Information Systems; Engineering, Electrical & Electronic; Telecommunications
설명문(일반) [Park, Minjung; Chai, Sangmi] Ewha Womans Univ, Ewha Sch Business, Seoul 03760, South Korea
등재 SCIE, SCOPUS
OA유형 gold
발행기관 IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
발행년도 2023
총서유형 Journal
URI http://www.dcollection.net/handler/ewha/000000211378
본문언어 영어
Published As https://doi.org/10.1109/ACCESS.2023.3294613

초록/요약

As fake news spreads rapidly in social media, attempts to develop detection technology to automatically identify fake news are actively being developed, recently. However, most of them focus only on the linguistic and compositional characteristics of fake news (e.g., source or authors indication, length of a message, frequency of negative words). Compared to them, this study proposes a fake news detection model based on machine learning that reflects the characteristics of users, news content, and social networks based on social capital. To comprehensively reflect the characteristics related to the spread of fake news, this study applied the XGBoost model to estimate the feature importance of each variable to derive the priority factors that preferentially affect fake news detection. Based on the derived variables, we established SVM, RF, LR, CART, and NNET, which are representative classification models of machine learning, and compared the performance rate of fake news detection. To generalize the established models (i.e., to avoid overfitting or underfitting), this study performed a cross-validation step, and to compare the predictive accuracy of the established models. As a result, the RF model indicated the highest prediction rate at about 94%, while the NNET had the lowest performance rate at about 92.1%. The results of this study are expected to contribute to improve the fake news detection system in preparation for the more sophisticated generation and spread of fake news.

반출 Meta View 목록

검색 상세

Constructing a User-Centered Fake News Detection Model by Using Classification Algorithms in Machine Learning Techniques

초록/요약