多模态多层次事件网络的谣言检测

李莎; 张怀文; 钱胜胜; 方全; 徐常胜

doi:10.11834/jig.200499

图像理解和计算机视觉 | 浏览量 : 0 下载量: 0 CSCD: 1

PDF
导出
分享
收藏
专辑

多模态多层次事件网络的谣言检测
Multi-modal multi-level event network for rumor detection
2021年26卷第7期页码：1648-1657
纸质出版日期： 2021-07-16 ，

录用日期： 2021-02-16
DOI： 10.11834/jig.200499
稿件说明：

移动端阅览

李莎, 张怀文, 钱胜胜, 方全, 徐常胜. 多模态多层次事件网络的谣言检测[J]. 中国图象图形学报, 2021,26(7):1648-1657.

Sha Li, Huaiwen Zhang, Shengsheng Qian, Quan Fang, Changsheng Xu. Multi-modal multi-level event network for rumor detection[J]. Journal of Image and Graphics, 2021,26(7):1648-1657.
李莎, 张怀文, 钱胜胜, 方全, 徐常胜. 多模态多层次事件网络的谣言检测[J]. 中国图象图形学报, 2021,26(7):1648-1657. DOI： 10.11834/jig.200499.

Sha Li, Huaiwen Zhang, Shengsheng Qian, Quan Fang, Changsheng Xu. Multi-modal multi-level event network for rumor detection[J]. Journal of Image and Graphics, 2021,26(7):1648-1657. DOI： 10.11834/jig.200499.

摘要

目的

自动检测谣言至关重要，目前已有多种谣言检测方法，但存在以下两点局限：1）只考虑文本内容，忽略了可用于判断谣言的辅助多模态信息；2）只关注时间序列模型捕捉谣言事件的时间特征，没有很好地研究事件的局部信息和全局信息。为了克服这些局限性，有效利用多模态帖子信息并联合多种编码策略构建每个新闻事件的表示，本文提出一种新颖的基于多模态多层次事件网络的社交媒体谣言检测方法。

方法

通过一个多模态的帖子嵌入层，同时利用文本内容和视觉内容；将多模态的帖子嵌入向量送入多层次事件编码网络，联合使用多种编码策略，以由粗到细的方式描述事件特征。

结果

在Twitter和Pheme数据集上的大量实验表明，本文提出的多模态多层次事件网络模型比现有的SVM-TS（support vector machine—time structure）、CNN（convolutional neural network）、GRU（gated recurrent unit）、CallAtRumors和MKEMN（multimodal knowledge-aware event memory network）等方法在准确率上提升了4 %以上。

结论

本文提出的谣言检测模型，对每个事件的全局、时间和局部信息进行建模，提升了谣言检测的性能。

Abstract

Objective

The proliferation of social media has revolutionized the way people acquire information. A growing number of people choose to share information

and express and exchange opinions through social media. Unfortunately

because a large number of users do not carefully verify the released content when posting information and sharing their opinions

various rumors have been fostered on social media platforms. The extensive spread of these rumors is expected to bring new threats to the political

economic

and cultural fields and affect people's lives. To strengthen the detection of rumors and prevent their spread

many approaches to rumor detection have been proposed. An early rumor detection platform (e.g.

snopes.com) mainly reported through users

and then invited experts or institutions in related fields to confirm. Although these methods can achieve the purpose of rumor detection

the timeliness of detection has obvious limitations. Thus

how to detect rumors automatically has become a key research direction in recent years. To date

many automatic detection approaches have been proposed to improve the efficiency of rumor detection

including feature construction-based and neural network-based methods. The feature construction-based methods rely on hand-craft features to train rumor classifiers and neural network-based methods using neural networks to automatically extract deep features. Compared with traditional methods

models based on deep neural networks can automatically learn the underlying deep representation of rumors and extract more effective semantic features. However

these methods may suffer from the following limitations. 1) At post level

many existing methods only consider the text content. In fact

posts often contain various types of information (e.g.

text and images)

and the visual information are often used as an auxiliary information to judge the credibility of posts in reality. Therefore

the key to detecting rumors is obtaining the multi-modal information of the posts and systematically integrating the textual and visual information. 2) At the event level

existing approaches typically only use the temporal sequence model to capture temporal features of events. Local and global information has not been well investigated yet. In practice

local and global features are important because the former helps distinguish between posts of subtle differences

and the latter helps capture features that repeatedly present in the event. Therefore

based on encoding the temporal information of the event

local and global information should be exploited to obtain a fine-grained feature of the event for event encoding collaboratively.

Method

To overcome these limitations

this paper presents a novel multi-modal multi-level event network (MMEN) for rumor detection

which can effectively use multi-modal post information and combine multi-level encoding strategies to construct a representation of each news event. MMEN employs an encoding network that jointly exploits multiple encoding strategies such as mean pooling

recurrent neural networks

and convolutional networks to model the global

temporal

and local information of each event. Then

these various types of information are combined into a unified deep model. Specifically

our model consists of the following three components: 1) The multi-modal post embedding layer employs bidirectional encode representations form transformers(BERT) to generate the text content embedding vector and use Visual Geometry Group-19(VGG-19) to obtain the visual content. 2) The multi-level event encoding network utilizes three-level encodings to capture global

temporal

and local information. The first level is a global encoder through the mean pooling

which represents the elements that are repeatedly present in the posts. The second is a temporal encoder that exploits a bidirectional recurrent neural network to use past and future information of a given post sequence. The third level is a local encoder by utilizing more subtle local representation of events. Then

the encoding results are combined to describe the events in a coarse-to-fine fashion. 3) The rumor detector layer aims to classify each event as either fake or authentic. The detector exploits a fully connected layer with corresponding activation function to generate predicted probability to determine whether the event is a rumor or not.

Result

In this study

the public datasets Pheme and Twitter are used to evaluate the effectiveness of the MMEN. The quantitative evaluation metrics included accuracy

precision

recall

and F1 score. We also perform five-fold cross-validation throughout all experiments. The experiments demonstrate that our proposed MMEN has improved accuracy by more than 4% over current best practices. MMEN has an accuracy of 82.2% on the Pheme dataset and 87.0% on the Twitter dataset. We compare our model MMEN with five state-of-the-art baseline models. Compared with all the baselines

the MMEN achieves the best performance and outperforms other rumor detection methods in most cases. To examine the usefulness of each component in the MMEN and demonstrate its effectiveness

we compare variants of MMEN. The experiment results show that the multi-modal features learned by the multimodal post embedding layer can improve the accuracy of rumor detection by nearly 0.2% on the two datasets. The experimental results also show that the temporal encoder has a stronger effect on detection accuracy.

Conclusion

In this study

we design a novel MMEN for rumor detection. Experiments and comparisons demonstrate that our model is more robust and effective than state-of-the-art baselines based on two public datasets for rumor detection. We attribute the superiority of MMEN to its two properties. The MMEN takes advantage of the multiple modalities of posts

and the proposed multi-level encoder jointly exploits multiple encoding strategies to generate powerful and complementary features progressively.

关键词

多模态谣言检测社交媒体多层次编码策略事件网络

Keywords

multi-modalrumor detectionsocial mediamulti-level encoding strategyevent network

references

Castillo C, Mendoza M and Poblete B. 2011. Information credibility on twitter//Proceedings of the 20th International Conference on World Wide Web. Hyderabad, India: ACM: 675-684 [DOI: 10.1145/1963405.1963500http://dx.doi.org/10.1145/1963405.1963500]

Chen T, Li X, Yin H Z and Zhang J. 2018. Call attention to rumors: deep attention based recurrent neural networks for early rumor detection//Proceedings of Pacific-Asia Conference on Knowledge Discovery and Data Mining. Melbourne, Australia: Springer: 40-52 [DOI: 10.1007/978-3-030-04503-6_4http://dx.doi.org/10.1007/978-3-030-04503-6_4]

Chen Y C, Liu Z Y and Kao H Y. 2017. IKM at SemEval-2017 task 8: convolutional neural networks for stance detection and rumor verification//Proceedings of the 11th International Workshop on Semantic Evaluation. Vancouver, Canada: Association for Computational Linguistics: 465-469 [DOI: 10.18653/v1/S17-2081http://dx.doi.org/10.18653/v1/S17-2081]

Devlin J, Chang M W, Lee K and Toutanova K. 2019. BERT: pre-training of deep bidirectional transformers for language understanding//Proceedings of 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Minneapolis, USA: Association for Computational Linguistics: 4171-4186 [DOI: 10.18653/v1/n19-1423http://dx.doi.org/10.18653/v1/n19-1423]

Jin Z W, Cao J, Guo H, Zhang Y D and Luo J B. 2017. Multimodal fusion with recurrent neural networks for rumor detection on microblogs//Proceedings of the 25th ACM international conference on Multimedia. Mountain View, USA: ACM: 795-816 [DOI: 10.1145/3123266.3123454http://dx.doi.org/10.1145/3123266.3123454]

Khattar D, Goud J S, Gupta M and Varma V. 2019. MVAE: multimodal variational autoencoder for fake news detection//Proceedings of World Wide Web Conference. San Francisco, USA: ACM: 2915-2921 [DOI: 10.1145/3308558.3313552http://dx.doi.org/10.1145/3308558.3313552]

Kwon S, Cha M, Jung K, Chen W and Wang Y J. 2013. Prominent features of rumor propagation in online social media//Proceedings of the 13th IEEE International Conference on Data Mining. Dallas, USA: IEEE: 1103-1108 [DOI: 10.1109/ICDM.2013.61http://dx.doi.org/10.1109/ICDM.2013.61]

Ma J, Gao W and Wei Z. 2015. Detect rumors using time series of social context information on microblogging websites//Proceedings of the 24th ACM International Conference on Information and Knowledge Management. Melbourne, Australia: 1751-1754 [DOI: 10.1142/9789813223615_0006http://dx.doi.org/10.1142/9789813223615_0006]

Ma J, Gao W, Mitra P, Kwon S, Jansen B J, Wong K F and Cha M. 2016. Detecting rumors from microblogs with recurrent neural networks//Proceedings of the 25th International Joint Conference on Artificial Intelligence. New York, USA: IJCAI/AAAI Press: 3818-3824

Ma J, Gao W and Wong K F. 2018a. Detect rumor and stance jointly by neural multi-task learning//Proceedings of Web Conference 2018. Lyon, France: ACM: 585-593 [DOI: 10.1145/3184558.3188729http://dx.doi.org/10.1145/3184558.3188729]

Ma J, Gao W and Wong K F. 2018b. Rumor detection on Twitter with tree-structured recursive neural networks//Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. Melbourne, Australia: ACL: 1980-1989 [DOI: 10.18653/v1/P18-1184http://dx.doi.org/10.18653/v1/P18-1184]

Sanh V, Debut L, Chaumond J and Wolf T. 2019. DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter [EB/OL]. [2020-08-06].https://arxiv.org/pdf/1910.01108.pdfhttps://arxiv.org/pdf/1910.01108.pdf

Simonyan K and Zisserman A. 2015. Very deep convolutional networks for large-scale image recognition//Proceedings of the 3rd International Conference on Learning Representations. San Diego, USA: ICLR

Singhal S, Shah R R, Chakraborty T, Kumaraguru P and Satoh S. 2019. SpotFake: a multi-modal framework for fake news detection//Proceedings of the 5th IEEE International Conference on Multimedia Big Data. Singapore, Singapore: IEEE: 39-47 [DOI: 10.1109/BigMM.2019.00-44http://dx.doi.org/10.1109/BigMM.2019.00-44]

Sun C, Qiu X P, Xu Y G and Huang X J. 2019. How to fine-tune BERT for text classification?//Proceedings of the 18th China National Conference on Chinese Computational Linguistics. Kunming, China: Springer: 194-206 [DOI: 10.1007/978-3-030-32381-3_16http://dx.doi.org/10.1007/978-3-030-32381-3_16]

Wu L W, Rao Y, Jin H L, Nazir A and Sun L. 2019. Different absorption from the same sharing: sifted multi-task learning for fake news detection//Proceedings of 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. Hong Kong, China: Association for Computational Linguistics: 4644-4653 [DOI: 10.18653/v1/D19-1471http://dx.doi.org/10.18653/v1/D19-1471]

Yang F, Liu Y, Yu X H and Yang M. 2012. Automatic detection of rumor on Sina Weibo//Proceedings of ACM SIGKDD Workshop on Mining Data Semantics. Beijing, China: ACM: 1-7 [DOI: 10.1145/2350190.2350203http://dx.doi.org/10.1145/2350190.2350203]

Yu F, Liu Q, Wu S, Wang L and Tan T N. 2017. A convolutional approach for misinformation identification//Proceedings of the 26th International Joint Conference on Artificial Intelligence. Melbourne, Australia: Ijcai. org: 3901-3907 [DOI: 10.24963/ijcai.2017/545http://dx.doi.org/10.24963/ijcai.2017/545]

Zhang H W, Fang Q, Qian S S and Xu C S. 2019. Multi-modal knowledge-aware event memory network for social media rumor detection//Proceedings of the 27th ACM International Conference on Multimedia. Nice, France: ACM: 1942-1951 [DOI: 10.1145/3343031.3350850http://dx.doi.org/10.1145/3343031.3350850]

Zubiaga A, Liakata M, Procter R, Hoi G W S and Tolmie P. 2016. Analysing how people orient to and spread rumours in social media by looking at conversational threads. PLoS One, 11(3): #e0150989 [DOI: 10.1371/journal.pone.0150989]

文章被引用时，请邮件提醒。

提交

生物特征识别学科发展报告