Views : 0 下载量: 6620 CSCD: 0
  • Export

  • Share

  • Collection

  • Album

    • Advancements in 3D vision understanding using multimodal large language models

    • Three dimensional visual perception and understanding have made significant progress in fields such as robot navigation and autonomous driving. The fusion of multimodal large models with 3D data presents unique advantages, paving the way for the development of spatial intelligence.
    • Vol. 30, Issue 6, Pages: 1744-1791(2025)   

      Received:29 September 2024

      Revised:2024-12-22

      Published:16 June 2025

    • DOI: 10.11834/jig.240588     

    移动端阅览

  • Feng Mingtao, Shen Junhao, Wu Zijie, Peng Weixing, Zhong Hang, Guo Yulan, Shu Xiangbo, Zhang Hui, Dong Weisheng, Wang Yaonan. 2025. Advancements in 3D vision understanding using multimodal large language models. Journal of Image and Graphics, 30(6):1744-1791 DOI: 10.11834/jig.240588.
  •  
  •  
Alert me when the article has been cited
提交

相关作者

Feng Mingtao 西安电子科技大学
Shen Junhao 西安电子科技大学
Wu Zijie 湖南大学
Peng Weixing 湖南大学
Zhong Hang 湖南大学
Guo Yulan 国防科技大学
Shu Xiangbo 南京理工大学
Zhang Hui 湖南大学

相关机构

National University of Defense Technology
Department of Automation, Tsinghua University
Department of Computer Science and Engineering, University of California, , La Jolla
Institute of Computing Technology, Chinese Academy of Sciences
School of Intelligence Science and Technology, Peking University
0