Large Vision and Multimodal Models | Views : 0 下载量: 290 CSCD: 0
  • Export

  • Share

  • Collection

  • Album

    • Information disentanglement-based self-supervised learning speech pretrained large model

    • In the field of speech interaction, experts have proposed a pre trained large model based on speech information decoupling strategy, which effectively improves the model's ability to parse and reconstruct speech information, providing a new research perspective and practical tool for speech interaction large models.
    • Vol. 30, Issue 5, Pages: 1272-1285(2025)   

      Received:31 December 2024

      Revised:2025-02-23

      Published:16 May 2025

    • DOI: 10.11834/jig.240607     

    移动端阅览

  • Wang Longbiao, Jiang Yu, Wang Tianrui, Wang Xiaobao, Dang Jianwu. 2025. Information disentanglement-based self-supervised learning speech pretrained large model. Journal of Image and Graphics, 30(5):1272-1285 DOI: 10.11834/jig.240607.
  •  
  •  
Alert me when the article has been cited
提交

相关作者

Wang Longbiao 天津大学智能与计算学部认知计算与应用重点实验室
Jiang Yu 天津大学智能与计算学部认知计算与应用重点实验室
Wang Tianrui 天津大学智能与计算学部认知计算与应用重点实验室
Wang Xiaobao 天津大学智能与计算学部认知计算与应用重点实验室
Dang Jianwu 中国科学院深圳先进技术研究院
Yu Hongzhi 西北民族大学语言与文化计算教育部重点实验室
Zhang Kehong 兰州财经大学信息工程与人工智能学院
Zhang Jinxi 兰州财经大学商务传媒学院

相关机构

Key Laboratory of Linguistic and Cultural Computing Ministry of Education, Northwest Minzu University
School of Information Engineering and Artificial Intelligence, Lanzhou University of Finance and Economics
School of Business and Media, Lanzhou University of Finance and Economics
School of Safety Science and Engineering, Anhui University of Science and Technology
School of Artificial Intelligence, Anhui University of Science and Technology
0