Document Type : Original Article

Author

Computer Dept, Engineering Faculty, Arak University, Arak, Iran

Abstract

The spread of fake news on social networks has become a serious challenge in the fields of information and cybersecurity, particularly in the realm of passive defense. Early detection of such news can play a crucial role in improving cybersecurity and controlling the dissemination of misinformation. This paper presents a novel approach that uses the correlation between headlines and news content to identify fake news. Using deep neural networks, the headline and body of news articles are analyzed as two independent components, and their correlation is measured. We fine-tuned two BERT language models on the headline and body text as the two constituent parts of the news to determine whether there is a correlation between the news headline and body text. The results showed that this approach to fake news can enhance model accuracy compared to similar models.

Keywords

Main Subjects

[1]. Meel P, Vishwakarma DK (2020) Fake news, rumor, information pollution in social media and web: A contemporary survey of state-of-the-arts, challenges and opportunities. Expert Syst Appl 153:112986
[2]. Kwon S, Cha M, Jung K, Chen W, Wang Y (2013) Prominent features of rumor propagation in online social media. In: Proceedings - IEEE International Conference on Data Mining, ICDM. pp 1103–1108
[3]. Shu K, Sliva A, Wang S, Tang J, Liu H (2017) Fake News Detection on Social Media. ACM SIGKDD Explorations Newsletter 19:22–36
[4]. Vosoughi S, Roy D, Aral S (2018) The spread of true and false news online. Science (1979) 359:1146–1151
[5]. Karimi H, Tang J (2019) Learning hierarchical discourse-level structure for fake news detection. NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference 1:3432–3442
[6]. Thorne J, Vlachos A, Christodoulopoulos C, Mittal A (2018) FEVER: A large-scale dataset for fact extraction and verification. In: NAACL HLT 2018 - 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference. pp 809–819
[7]. Bronstein MM, Bruna J, Lecun Y, Szlam A, Vandergheynst P (2017) Geometric Deep Learning: Going beyond Euclidean data. IEEE Signal Process Mag 34:18–42
[8]. Monti F, Frasca F, Eynard D, Mannion D, Bronstein MM (2019) Fake News Detection on Social Media using Geometric Deep Learning. Arxiv 1902-06673v1 1–15
9. Albawi S, Mohammed TA, Al-Zawi S (2018) Understanding of a convolutional neural network. In: Proceedings of 2017 International Conference on Engineering and Technology, ICET 2017. pp 1–6
[10]. Zhou X, Zafarani R (2020) A Survey of Fake News: Fundamental Theories, Detection Methods, and Opportunities. ACM Comput Surv. https://doi.org/10.1145/3395046
[11]. Wang B, Feng Y, Xiong X cai, Wang Y heng, Qiang B hua (2022) Multi-modal transformer using two-level visual features for fake news detection. Applied Intelligence 1–15
[12]. Wu K, Yang S, Zhu KQ (2015) False rumors detection on Sina Weibo by propagation structures. In: Proc Int Conf Data Eng. pp 651–662
[13]. Shu K, Wang S, Liu H (2019) Beyond news contents: The role of social context for fake news detection. In: WSDM 2019 - Proceedings of the 12th ACM International Conference on Web Search and Data Mining. pp 312–320
[14]. Zhou X, Cao J, Jin Z, Xie F, Su Y, Zhang J, Chu D, Cao X (2015) Real-time news certification system on sina weibo. In: WWW 2015 Companion - Proceedings of the 24th International Conference on World Wide Web. pp 983–988
[15]. Pan JZ, Pavlova S, Li C, Li N, Li Y, Liu J (2018) Content based fake news detection using knowledge graphs. In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Springer, pp 669–683
[16]. Ott M, Choi Y, Cardie C, Hancock JT (2011) Finding deceptive opinion spam by any stretch of the imagination. In: ACL-HLT 2011 - Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. pp 309–319
[17]. Abouelenien M, Pérez-Rosas V, Zhao B, Mihalcea R, Burzo M (2017) Gender-based multimodal deception detection. In: Proceedings of the ACM Symposium on Applied Computing. Association for Computing Machinery, New York, New York, USA, pp 137–144
[18]. Pérez-Rosas V, Kleinberg B, Lefevre A, Mihalcea R (2018) Automatic detection of fake news. COLING 2018 - 27th International Conference on Computational Linguistics, Proceedings 2789:3391–3401
[19]. Gravanis G, Vakali A, Diamantaras K, Karadais P (2019) Behind the cues: A benchmarking study for fake news detection. Expert Syst Appl 128:201–213
[20]. Alghamdi J, Lin Y, Luo S (2024) Unveiling the hidden patterns: A novel semantic deep learning approach to fake news detection on social media. Eng Appl Artif Intell 137:109240
[21]. Pierri F, Ceri S (2019) False news on social media: A data-driven survey. SIGMOD Rec 48:18–32
[22]. Panigrahi S, Nanda A, Swarnkar T (2021) A Survey on Transfer Learning. Smart Innovation, Systems and Technologies 194:781–789
[23]. Mikolov T, Sutskever I, Chen K, Corrado G, Dean J (2013) Distributed representations ofwords and phrases and their compositionality. Adv Neural Inf Process Syst
[24]. Pennington J, Socher R, Manning CD (2014) GloVe: Global vectors for word representation. In: EMNLP 2014 - 2014 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference. pp 1532–1543
[25]. Peters ME, Neumann M, Zettlemoyer L, Yih WT (2018) Dissecting contextual word embeddings: Architecture and representation. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018. pp 1499–1509
[26]. Devlin J, Chang MW, Lee K, Toutanova K (2019) BERT: Pre-training of deep bidirectional transformers for language understanding. NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference 1:4171–4186
[27]. Luong MT, Pham H, Manning CD (2015) Effective approaches to attention-based neural machine translation. In: Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing. pp 1412–1421
[28]. Rajpurkar P, Zhang J, Lopyrev K, Liang P (2016) SQuad: 100,000+ questions for machine comprehension of text. In: EMNLP 2016 - Conference on Empirical Methods in Natural Language Processing, Proceedings. pp 2383–2392
[29]. Williams A, Nangia N, Bowman SR (2018) A broad-coverage challenge corpus for sentence understanding through inference. In: NAACL HLT 2018 - 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference. pp 1112–1122
[30]. Shu K, Mahudeswaran D, Wang S, Lee D, Liu H (2020) FakeNewsNet: A Data Repository with News Content, Social Context, and Spatiotemporal Information for Studying Fake News on Social Media. Big Data 8:171–188
[31]. Singhania S, Fernandez N, Rao S (2017) 3HAN: A Deep Neural Network for Fake News Detection. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 10635 LNCS:572–581
[32]. Kaliyar RK, Goswami A, Narang P (2021) FakeBERT: Fake news detection in social media with a BERT-based deep learning approach. Multimed Tools Appl 80:11765–11788
[33]. Loshchilov I, Hutter F (2019) Decoupled weight decay regularization. 7th International Conference on Learning Representations,
Amirzdeh, M., Hosseini Moradi, S. A., & Ghobadi, N. (2023). Real Time Detection of Multi-Rotor Unmanned Aerial Vehicle Using YOLOv5 Optimized Algorithm. Journal of Advanced Defense Science & Technology14(1), 11-22.‏ICLR 2019