Large Vision and Multimodal Models
    • Iterative optimization for video retrieval data using large language model guidance

    • For video-text cross-modal retrieval, researchers have proposed an iterative data optimization method guided by large language models, which alleviates the one-to-many problem in datasets and significantly improves model performance.
    • Vol. 30, Issue 5, Pages: 1257-1271 (2025)

      Received: 04 September 2024

      Revised: 27 November 2024

      Published: 16 May 2025

    • DOI: 10.11834/jig.240545


  • Zeng Runhao, Li Jialiang, Zhuo Yishen, Duan Haihan, Chen Qi, Hu Xiping. 2025. Iterative optimization for video retrieval data using large language model guidance. Journal of Image and Graphics, 30(5): 1257-1271. DOI: 10.11834/jig.240545.
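Related authors

Hu Xiping: Artificial Intelligence Research Institute, Shenzhen MSU-BIT University; Guangdong-Hong Kong-Macao Joint Laboratory for Emotional Intelligence and Pervasive Computing
Chen Qi: School of Computer Science, University of Adelaide
Duan Haihan: Artificial Intelligence Research Institute, Shenzhen MSU-BIT University; Guangdong-Hong Kong-Macao Joint Laboratory for Emotional Intelligence and Pervasive Computing
Zhuo Yishen: College of Mechatronics and Control Engineering, Shenzhen University
Zeng Runhao: Artificial Intelligence Research Institute, Shenzhen MSU-BIT University; Guangdong-Hong Kong-Macao Joint Laboratory for Emotional Intelligence and Pervasive Computing
Li Guanbin: School of Computer Science and Engineering, Sun Yat-sen University
Zhang Ruifei: School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen
Xie Junlin: School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen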
