融合知识表征的多模态Transformer场景文本视觉问答
Knowledge-representation-enhanced multimodal Transformer for scene text visual question answering
- 2022年27卷第9期 页码:2761-2774
收稿日期:2022-01-05,
修回日期:2022-06-01,
录用日期:2022-6-8,
纸质出版日期:2022-09-16
DOI: 10.11834/jig.211213
移动端阅览
