多模态多层次事件网络的谣言检测
Multi-modal multi-level event network for rumor detection
- 2021年26卷第7期 页码:1648-1657
纸质出版日期: 2021-07-16 ,
录用日期: 2021-02-16
DOI: 10.11834/jig.200499
移动端阅览
浏览全部资源
扫码关注微信
纸质出版日期: 2021-07-16 ,
录用日期: 2021-02-16
移动端阅览
李莎, 张怀文, 钱胜胜, 方全, 徐常胜. 多模态多层次事件网络的谣言检测[J]. 中国图象图形学报, 2021,26(7):1648-1657.
Sha Li, Huaiwen Zhang, Shengsheng Qian, Quan Fang, Changsheng Xu. Multi-modal multi-level event network for rumor detection[J]. Journal of Image and Graphics, 2021,26(7):1648-1657.
目的
2
自动检测谣言至关重要,目前已有多种谣言检测方法,但存在以下两点局限:1)只考虑文本内容,忽略了可用于判断谣言的辅助多模态信息;2)只关注时间序列模型捕捉谣言事件的时间特征,没有很好地研究事件的局部信息和全局信息。为了克服这些局限性,有效利用多模态帖子信息并联合多种编码策略构建每个新闻事件的表示,本文提出一种新颖的基于多模态多层次事件网络的社交媒体谣言检测方法。
方法
2
通过一个多模态的帖子嵌入层,同时利用文本内容和视觉内容;将多模态的帖子嵌入向量送入多层次事件编码网络,联合使用多种编码策略,以由粗到细的方式描述事件特征。
结果
2
在Twitter和Pheme数据集上的大量实验表明,本文提出的多模态多层次事件网络模型比现有的SVM-TS(support vector machine—time structure)、CNN(convolutional neural network)、GRU(gated recurrent unit)、CallAtRumors和MKEMN(multimodal knowledge-aware event memory network)等方法在准确率上提升了4 %以上。
结论
2
本文提出的谣言检测模型,对每个事件的全局、时间和局部信息进行建模,提升了谣言检测的性能。
Objective
2
The proliferation of social media has revolutionized the way people acquire information. A growing number of people choose to share information
and express and exchange opinions through social media. Unfortunately
because a large number of users do not carefully verify the released content when posting information and sharing their opinions
various rumors have been fostered on social media platforms. The extensive spread of these rumors is expected to bring new threats to the political
economic
and cultural fields and affect people's lives. To strengthen the detection of rumors and prevent their spread
many approaches to rumor detection have been proposed. An early rumor detection platform (e.g.
snopes.com) mainly reported through users
and then invited experts or institutions in related fields to confirm. Although these methods can achieve the purpose of rumor detection
the timeliness of detection has obvious limitations. Thus
how to detect rumors automatically has become a key research direction in recent years. To date
many automatic detection approaches have been proposed to improve the efficiency of rumor detection
including feature construction-based and neural network-based methods. The feature construction-based methods rely on hand-craft features to train rumor classifiers and neural network-based methods using neural networks to automatically extract deep features. Compared with traditional methods
models based on deep neural networks can automatically learn the underlying deep representation of rumors and extract more effective semantic features. However
these methods may suffer from the following limitations. 1) At post level
many existing methods only consider the text content. In fact
posts often contain various types of information (e.g.
text and images)
and the visual information are often used as an auxiliary information to judge the credibility of posts in reality. Therefore
the key to detecting rumors is obtaining the multi-modal information of the posts and systematically integrating the textual and visual information. 2) At the event level
existing approaches typically only use the temporal sequence model to capture temporal features of events. Local and global information has not been well investigated yet. In practice
local and global features are important because the former helps distinguish between posts of subtle differences
and the latter helps capture features that repeatedly present in the event. Therefore
based on encoding the temporal information of the event
local and global information should be exploited to obtain a fine-grained feature of the event for event encoding collaboratively.
Method
2
To overcome these limitations
this paper presents a novel multi-modal multi-level event network (MMEN) for rumor detection
which can effectively use multi-modal post information and combine multi-level encoding strategies to construct a representation of each news event. MMEN employs an encoding network that jointly exploits multiple encoding strategies such as mean pooling
recurrent neural networks
and convolutional networks to model the global
temporal
and local information of each event. Then
these various types of information are combined into a unified deep model. Specifically
our model consists of the following three components: 1) The multi-modal post embedding layer employs bidirectional encode representations form transformers(BERT) to generate the text content embedding vector and use Visual Geometry Group-19(VGG-19) to obtain the visual content. 2) The multi-level event encoding network utilizes three-level encodings to capture global
temporal
and local information. The first level is a global encoder through the mean pooling
which represents the elements that are repeatedly present in the posts. The second is a temporal encoder that exploits a bidirectional recurrent neural network to use past and future information of a given post sequence. The third level is a local encoder by utilizing more subtle local representation of events. Then
the encoding results are combined to describe the events in a coarse-to-fine fashion. 3) The rumor detector layer aims to classify each event as either fake or authentic. The detector exploits a fully connected layer with corresponding activation function to generate predicted probability to determine whether the event is a rumor or not.
Result
2
In this study
the public datasets Pheme and Twitter are used to evaluate the effectiveness of the MMEN. The quantitative evaluation metrics included accuracy
precision
recall
and F1 score. We also perform five-fold cross-validation throughout all experiments. The experiments demonstrate that our proposed MMEN has improved accuracy by more than 4% over current best practices. MMEN has an accuracy of 82.2% on the Pheme dataset and 87.0% on the Twitter dataset. We compare our model MMEN with five state-of-the-art baseline models. Compared with all the baselines
the MMEN achieves the best performance and outperforms other rumor detection methods in most cases. To examine the usefulness of each component in the MMEN and demonstrate its effectiveness
we compare variants of MMEN. The experiment results show that the multi-modal features learned by the multimodal post embedding layer can improve the accuracy of rumor detection by nearly 0.2% on the two datasets. The experimental results also show that the temporal encoder has a stronger effect on detection accuracy.
Conclusion
2
In this study
we design a novel MMEN for rumor detection. Experiments and comparisons demonstrate that our model is more robust and effective than state-of-the-art baselines based on two public datasets for rumor detection. We attribute the superiority of MMEN to its two properties. The MMEN takes advantage of the multiple modalities of posts
and the proposed multi-level encoder jointly exploits multiple encoding strategies to generate powerful and complementary features progressively.
多模态谣言检测社交媒体多层次编码策略事件网络
multi-modalrumor detectionsocial mediamulti-level encoding strategyevent network
Castillo C, Mendoza M and Poblete B. 2011. Information credibility on twitter//Proceedings of the 20th International Conference on World Wide Web. Hyderabad, India: ACM: 675-684 [DOI: 10.1145/1963405.1963500http://dx.doi.org/10.1145/1963405.1963500]
Chen T, Li X, Yin H Z and Zhang J. 2018. Call attention to rumors: deep attention based recurrent neural networks for early rumor detection//Proceedings of Pacific-Asia Conference on Knowledge Discovery and Data Mining. Melbourne, Australia: Springer: 40-52 [DOI: 10.1007/978-3-030-04503-6_4http://dx.doi.org/10.1007/978-3-030-04503-6_4]
Chen Y C, Liu Z Y and Kao H Y. 2017. IKM at SemEval-2017 task 8: convolutional neural networks for stance detection and rumor verification//Proceedings of the 11th International Workshop on Semantic Evaluation. Vancouver, Canada: Association for Computational Linguistics: 465-469 [DOI: 10.18653/v1/S17-2081http://dx.doi.org/10.18653/v1/S17-2081]
Devlin J, Chang M W, Lee K and Toutanova K. 2019. BERT: pre-training of deep bidirectional transformers for language understanding//Proceedings of 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Minneapolis, USA: Association for Computational Linguistics: 4171-4186 [DOI: 10.18653/v1/n19-1423http://dx.doi.org/10.18653/v1/n19-1423]
Jin Z W, Cao J, Guo H, Zhang Y D and Luo J B. 2017. Multimodal fusion with recurrent neural networks for rumor detection on microblogs//Proceedings of the 25th ACM international conference on Multimedia. Mountain View, USA: ACM: 795-816 [DOI: 10.1145/3123266.3123454http://dx.doi.org/10.1145/3123266.3123454]
Khattar D, Goud J S, Gupta M and Varma V. 2019. MVAE: multimodal variational autoencoder for fake news detection//Proceedings of World Wide Web Conference. San Francisco, USA: ACM: 2915-2921 [DOI: 10.1145/3308558.3313552http://dx.doi.org/10.1145/3308558.3313552]
Kwon S, Cha M, Jung K, Chen W and Wang Y J. 2013. Prominent features of rumor propagation in online social media//Proceedings of the 13th IEEE International Conference on Data Mining. Dallas, USA: IEEE: 1103-1108 [DOI: 10.1109/ICDM.2013.61http://dx.doi.org/10.1109/ICDM.2013.61]
Ma J, Gao W and Wei Z. 2015. Detect rumors using time series of social context information on microblogging websites//Proceedings of the 24th ACM International Conference on Information and Knowledge Management. Melbourne, Australia: 1751-1754 [DOI: 10.1142/9789813223615_0006http://dx.doi.org/10.1142/9789813223615_0006]
Ma J, Gao W, Mitra P, Kwon S, Jansen B J, Wong K F and Cha M. 2016. Detecting rumors from microblogs with recurrent neural networks//Proceedings of the 25th International Joint Conference on Artificial Intelligence. New York, USA: IJCAI/AAAI Press: 3818-3824
Ma J, Gao W and Wong K F. 2018a. Detect rumor and stance jointly by neural multi-task learning//Proceedings of Web Conference 2018. Lyon, France: ACM: 585-593 [DOI: 10.1145/3184558.3188729http://dx.doi.org/10.1145/3184558.3188729]
Ma J, Gao W and Wong K F. 2018b. Rumor detection on Twitter with tree-structured recursive neural networks//Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. Melbourne, Australia: ACL: 1980-1989 [DOI: 10.18653/v1/P18-1184http://dx.doi.org/10.18653/v1/P18-1184]
Sanh V, Debut L, Chaumond J and Wolf T. 2019. DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter [EB/OL]. [2020-08-06].https://arxiv.org/pdf/1910.01108.pdfhttps://arxiv.org/pdf/1910.01108.pdf
Simonyan K and Zisserman A. 2015. Very deep convolutional networks for large-scale image recognition//Proceedings of the 3rd International Conference on Learning Representations. San Diego, USA: ICLR
Singhal S, Shah R R, Chakraborty T, Kumaraguru P and Satoh S. 2019. SpotFake: a multi-modal framework for fake news detection//Proceedings of the 5th IEEE International Conference on Multimedia Big Data. Singapore, Singapore: IEEE: 39-47 [DOI: 10.1109/BigMM.2019.00-44http://dx.doi.org/10.1109/BigMM.2019.00-44]
Sun C, Qiu X P, Xu Y G and Huang X J. 2019. How to fine-tune BERT for text classification?//Proceedings of the 18th China National Conference on Chinese Computational Linguistics. Kunming, China: Springer: 194-206 [DOI: 10.1007/978-3-030-32381-3_16http://dx.doi.org/10.1007/978-3-030-32381-3_16]
Wu L W, Rao Y, Jin H L, Nazir A and Sun L. 2019. Different absorption from the same sharing: sifted multi-task learning for fake news detection//Proceedings of 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. Hong Kong, China: Association for Computational Linguistics: 4644-4653 [DOI: 10.18653/v1/D19-1471http://dx.doi.org/10.18653/v1/D19-1471]
Yang F, Liu Y, Yu X H and Yang M. 2012. Automatic detection of rumor on Sina Weibo//Proceedings of ACM SIGKDD Workshop on Mining Data Semantics. Beijing, China: ACM: 1-7 [DOI: 10.1145/2350190.2350203http://dx.doi.org/10.1145/2350190.2350203]
Yu F, Liu Q, Wu S, Wang L and Tan T N. 2017. A convolutional approach for misinformation identification//Proceedings of the 26th International Joint Conference on Artificial Intelligence. Melbourne, Australia: Ijcai. org: 3901-3907 [DOI: 10.24963/ijcai.2017/545http://dx.doi.org/10.24963/ijcai.2017/545]
Zhang H W, Fang Q, Qian S S and Xu C S. 2019. Multi-modal knowledge-aware event memory network for social media rumor detection//Proceedings of the 27th ACM International Conference on Multimedia. Nice, France: ACM: 1942-1951 [DOI: 10.1145/3343031.3350850http://dx.doi.org/10.1145/3343031.3350850]
Zubiaga A, Liakata M, Procter R, Hoi G W S and Tolmie P. 2016. Analysing how people orient to and spread rumours in social media by looking at conversational threads. PLoS One, 11(3): #e0150989 [DOI: 10.1371/journal.pone.0150989]
相关文章
相关作者
相关机构