Comprehensive Evaluation of Nano Banana Pro Based on 14 Tasks and 40 Datasets

Zuo Jialong; Deng Haoyou; Zuo Haotong; Zhou Hanyu; Zhu Jiaxin; Zhang Yicheng; Zhang Yiwei; Yan Yongxin; Huang Kaixing; Chen Weisen; Deng Yongtai; Jin Rui; Sang Nong; Gao Changxin

doi:10.11834/jig.260029


Pages: 1-36(2026)
Received: 13 January 2026
Revised: 02 May 2026
Accepted: 07 May 2026
Online First: 07 May 2026

Zuo Jialong, Deng Haoyou, Zuo Haotong, Zhou Hanyu, Zhu Jiaxin, Zhang Yicheng, Zhang Yiwei, Yan Yongxin, Huang Kaixing, Chen Weisen, Deng Yongtai, Jin Rui, Sang Nong, Gao Changxin. Comprehensive Evaluation of Nano Banana Pro Based on 14 Tasks and 40 Datasets[J/OL]. Journal of Image and Graphics, 2026, 1-36. DOI: 10.11834/jig.260029.

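Abstract (translated from the Chinese)

The rapid development of text-to-image generation models has transformed visual content creation. Although commercial products such as Nano Banana Pro have attracted wide attention, their potential as general-purpose solutions for traditional low-level vision tasks remains underexplored. This paper addresses one core question: is Nano Banana Pro a low-level vision all-rounder? Using zero-shot evaluation, we test the model on 14 low-level vision tasks spanning 40 diverse datasets. Nano Banana Pro is compared against state-of-the-art specialist models using only simple text prompts and no fine-tuning. The analysis reveals a clear performance split: Nano Banana Pro delivers excellent subjective visual quality, and its "hallucinated" high-frequency details often surpass those of specialist models, yet it performs poorly on traditional reference-based quantitative metrics. We attribute this gap to the inherent stochasticity of generative models, which makes it difficult to meet the strict pixel-level consistency that traditional metrics demand. We affirm the potential of Nano Banana Pro as a zero-shot solution for low-level vision tasks, while noting that reaching the high fidelity of domain-specific models remains a major challenge.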

Abstract

The rapid evolution of large-scale text-to-image generation models has fundamentally transformed the landscape of visual content creation. Driven by advances in diffusion models, large multimodal pretraining, and scalable inference pipelines, modern generative systems have demonstrated unprecedented capabilities in synthesizing visually compelling images across a wide range of styles, scenes, and semantic conditions. Commercial models such as Nano Banana Pro have attracted significant attention due to their strong zero-shot generation ability, robust semantic understanding, and impressive perceptual quality. However, despite their success in creative image synthesis, a critical and largely underexplored question remains: Can such foundation generative models serve as general-purpose solvers for traditional low-level vision tasks? Low-level vision tasks, including dehazing, deblurring, and super-resolution, have historically been dominated by task-specific, regression-based models. These models are typically trained under strong supervision with paired data and optimized using pixel-aligned objectives such as PSNR (peak signal-to-noise ratio) and SSIM (structural similarity index measure). While highly effective within their target domains, such specialist models lack flexibility, often require costly retraining for new tasks, and struggle to generalize beyond their training distributions. In contrast, foundation generative models promise a unified alternative: a single pretrained model capable of addressing diverse vision tasks through natural language prompts, without task-specific fine-tuning. In this work, we present the first large-scale, systematic zero-shot evaluation of Nano Banana Pro across a broad spectrum of low-level vision tasks. Specifically, we investigate whether Nano Banana Pro can function as a low-level vision all-rounder, that is, a generalist model capable of producing high-quality results across heterogeneous restoration, enhancement, and fusion tasks.
To this end, we conduct an extensive evaluation covering 14 distinct low-level vision tasks across 40 datasets, encompassing both synthetic and real-world degradations. The evaluated tasks include deblurring (motion, defocus), super-resolution, image denoising, deraining, shadow removal, reflection removal, flare removal, low-light image enhancement, underwater image enhancement, HDR (high dynamic range) reconstruction, multi-focus image fusion, and infrared–visible image fusion, among others. All experiments are conducted under a standard zero-shot protocol. Nano Banana Pro is queried exclusively through simple, task-oriented natural language prompts, without any model fine-tuning, parameter adaptation, or task-specific post-processing. This setting is deliberately chosen to reflect realistic deployment scenarios and to assess the intrinsic capability of the model as a foundation visual system. For each task, we compare Nano Banana Pro against state-of-the-art specialist methods specifically designed for the corresponding task. Our comprehensive evaluation reveals a consistent and striking performance dichotomy. On one hand, Nano Banana Pro frequently produces results with superior perceptual quality, characterized by enhanced clarity, vivid textures, improved contrast, and visually pleasing color distributions. In many challenging scenarios, such as severe noise, extreme low-light conditions, heavy underwater color distortion, or strong atmospheric degradation, the model is able to hallucinate plausible high-frequency details and recover semantically coherent structures that rival or even surpass those generated by domain-specific methods. Across multiple tasks, Nano Banana Pro achieves competitive or leading performance on no-reference perceptual metrics and consistently receives favorable qualitative assessments.
On the other hand, when evaluated using traditional full-reference, pixel-aligned quantitative metrics, Nano Banana Pro systematically underperforms compared to specialist models. Metrics such as PSNR, SSIM, SCD (sum of correlations of differences), and VIF (visual information fidelity) consistently reveal notable gaps, particularly in tasks requiring strict structural alignment or physical signal fidelity. This discrepancy is especially pronounced in tasks like denoising, HDR reconstruction, and image fusion, where pixel-level consistency with the reference image is heavily rewarded. We attribute this behavior to the inherent stochastic and generative nature of diffusion-based models, which prioritize semantic plausibility and perceptual realism over deterministic pixel correspondence. As a result, even visually improved outputs may be penalized for global color shifts, localized texture synthesis, or subtle geometric deviations. Importantly, our analysis shows that these quantitative penalties do not necessarily indicate failure. In many datasets, the provided “ground-truth” images themselves contain residual noise, blur, or imperfect color balance. In such cases, Nano Banana Pro often generates cleaner, more visually appealing results that deviate from the reference but align better with human perception. This observation highlights a fundamental tension between regression-based evaluation paradigms and generative reconstruction behaviors, and suggests that current benchmarks may be insufficient for assessing foundation generative models. Beyond aggregate metrics, we conduct detailed task-wise and dataset-wise analyses to characterize the operational scope and limitations of Nano Banana Pro. The model excels in scenarios involving severe degradation, ambiguous structure, or incomplete information, where its strong semantic priors can compensate for missing signal.
Conversely, it struggles in applications demanding strict physical accuracy, such as forensic analysis, scientific imaging, or safety-critical perception, where hallucinated details or slight structural inconsistencies may be unacceptable. Collectively, our findings position Nano Banana Pro as a powerful zero-shot contender for low-level vision, capable of delivering high perceptual quality across a remarkably diverse set of tasks without retraining. At the same time, achieving the pixel-level fidelity of domain specialists remains a significant challenge. Rather than framing this as a binary competition between generative and regression paradigms, our results suggest a more promising direction: strategic integration. Future robust vision systems may combine the semantic imagination of foundation generative models with the physical constraints and precision of task-specific networks, leveraging the strengths of both. In summary, this study provides the first comprehensive empirical answer to the question: Is Nano Banana Pro a low-level vision all-rounder? Our answer is nuanced. Nano Banana Pro substantially raises the upper bound of perceptual quality in zero-shot low-level vision, but has yet to establish a stable lower bound suitable for high-fidelity, safety-critical applications. By systematically documenting these strengths and limitations across 14 tasks and 40 datasets, this report offers a detailed reference point for future research on foundation models in low-level vision, and calls for the development of new evaluation frameworks that better reflect perceptual realism, semantic consistency, and downstream utility.
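The abstract's point that pixel-aligned metrics can penalize a clean output for a global tone or color shift can be illustrated with a small worked example. The sketch below is not the authors' evaluation code. It uses the standard PSNR definition on a made-up one-dimensional 8-bit signal, and the names `reference`, `shifted`, and `noisy` are hypothetical.

```python
import math


def psnr(reference, output, peak=255.0):
    """Peak signal-to-noise ratio (dB) between two equal-length pixel sequences."""
    if len(reference) != len(output):
        raise ValueError("inputs must contain the same number of pixels")
    mse = sum((r - o) ** 2 for r, o in zip(reference, output)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals
    return 10.0 * math.log10(peak ** 2 / mse)


# Hypothetical 8-bit reference signal: a smooth ramp from 10 to 238.
reference = [float(v) for v in range(10, 240, 4)]

# Output A: structurally identical, but every pixel is 10 levels brighter
# (a stand-in for a global tone or color shift).
shifted = [v + 10.0 for v in reference]

# Output B: correct brightness, but with alternating +/-5 residual noise.
noisy = [v + (5.0 if i % 2 == 0 else -5.0) for i, v in enumerate(reference)]

psnr_shifted = psnr(reference, shifted)  # MSE = 100, about 28.1 dB
psnr_noisy = psnr(reference, noisy)      # MSE = 25, about 34.2 dB

print(f"global shift: {psnr_shifted:.2f} dB, residual noise: {psnr_noisy:.2f} dB")
```

In this toy case the noisy output scores about 6 dB higher than the noise-free but uniformly brighter one, because PSNR depends only on per-pixel error magnitude and not on whether the deviation is perceptually objectionable.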


references

A. Galdran ， D. Pardo ， A. Picon ， and A. Alvarez-Gila . Automatic red-channel underwater image restoration . Journal of Visual Communication and Image Representation ， 26 ： 132 – 145， 2015 .

A. S. A. Ghani and N. A. M. Isa . Underwater image quality enhancement through integrated color model with rayleigh distribution . Applied Soft Computing ， 27 ： 219 – 230， 2015 .

Abdelrahman Abdelhamed ， Stephen Lin ， and Michael S Brown . A high-quality denoising dataset for smartphone cameras . In Proceedings of the IEEE conference on CVPR ， pages 1692 – 1700 ， 2018 .

Abdullah Abuolaim and Michael S Brown . Defocus deblurring using dual-pixel data . In ECCV ， pages 111– 126 . Springer ， 2020.

Abdullah Abuolaim ， Mahmoud Afifi ， and Michael S Brown . Improving single-image defocus deblurring： How dual-pixel images help through multi-task learning . In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision ， pages 1231 – 1239 ， 2022 .

Anish Mittal ， Rajiv Soundararajan ， and Alan C Bovik . Making a completely blind image quality analyzer . IEEE Signal processing letters ， 20 （ 3 ）： 209 – 212， 2012 .

Armin Mehri ， Parichehr B Ardakani ， and Angel D Sappa . Mprnet： Multi-path residual network for lightweight image super resolution . In Proceedings of the IEEE/CVF winter conference on applications of computer vision ， pages 2704 – 2713 ， 2021 .

Bernd Jähne . Digital image processing . Springer ， 2005 .

Bin Xia ， Yulun Zhang ， Shiyin Wang ， Yitong Wang ， Xinglong Wu ， Yapeng Tian ， Wenming Yang ， and Luc Van Gool . Diffir： Efficient diffusion model for image restoration . In Proceedings of the IEEE/CVF ICCV ， pages 13095 – 13105 ， 2023 .

Bin Xiao ， Haifeng Wu ， and Xiuli Bi . Dtmnet： A discrete tchebichef moments-based deep neural network for multi-focus image fusion . In Proceedings of the IEEE/CVF ICCV ， pages 43 – 51 ， 2021 .

Boyi Li ， Wenqi Ren ， Dengpan Fu ， Dacheng Tao ， Dan Feng ， Wenjun Zeng ， and Zhangyang Wang . Benchmarking single-image dehazing and beyond . IEEE transactions on image processing ， 28 （ 1 ）： 492 – 505， 2018 .

Boyuan Ma ， Xiang Yin ， Di Wu ， Haokai Shen ， Xiaojuan Ban ， and Yu Wang . End-to-end learning for simultaneously generating decision map and multi-focus image fusion result . Neurocomputing ， 470 ： 204 – 216， 2022 .

Boyuan Ma ， Yu Zhu ， Xiang Yin ， Xiaojuan Ban ， Haiyou Huang ， and Michele Mukeshimana . Sesf-fuse： An unsupervised deep model for multi-focus image fusion . Neural Computing and Applications ， 33 （ 11 ）： 5793 – 5804， 2021 .

C. Ancuti ， C. O. Ancuti ， T. Haber ， and P. Bekaert . Enhancing underwater images and videos by fusion . In Proc. IEEE/CVF CVPR ， pages 81 – 88 ， 2012 .

C. Li ， J. Guo ， B. Wang ， R. Cong ， Y. Zhang ， and J. Wang . Single underwater image enhancement based on color cast removal and visibility restoration . Journal of Electronic Imaging ， 25 （ 3 ）： 033012 ， 2016 .

C. O. Ancuti ， C. Ancuti ， C. De Vleeschouwer ， and P. Bekaert . Color balance and fusion for underwater image enhancement . IEEE Trans. Image Process. ， 27 （ 1 ）： 379 – 393， 2018 .

C. Zhao ， W. Cai ， C. Dong ， and C. Hu . Wavelet-based fourier information interaction with frequency diffusion adjustment for underwater image restoration . In 2024 IEEE/CVF Conference on CVPR ， pages 8281– 8291 ， Seattle， WA， USA ， jun 2024 . IEEE/CVF. doi： 10.1109/CVPR52729. 2024.00813 http://dx.doi.org/10.1109/CVPR52729.2024.00813 .

C.-Y . Li ， J.-C. Guo ， R.-M. Cong ， Y.-W. Pang ， and B. Wang . Underwater image enhancement by dehazing with minimum information loss and histogram distribution prior . IEEE Transactions on Image Processing ， 25 （ 12 ）： 5664 – 5677， 2016 .

Chao Dong ， Chen Change Loy ， Kaiming He ， and Xiaoou Tang . Learning a deep convolutional network for image super-resolution . ECCV ， 2014 .

Chao Li ， Yixiao Yang ， Kun He ， Stephen Lin ， and John E Hopcroft . Single image reflection removal through cascaded refinement . In Proceedings of the IEEE/CVF conference on CVPR ， pages 3565 – 3574 ， 2020 .

Chen Wei ， Wenjing Wang ， Wenhan Yang ， and Jiaying Liu . Deep retinex decomposition for low-light enhancement . arXiv preprint arXiv： 1808.04560 ， 2018 .

Chengyu Fang ， Chunming He ， Fengyang Xiao ， Yulun Zhang ， Longxiang Tang ， Yuelin Zhang ， Kai Li ， and Xiu Li . Real-world image dehazing with coherence-based pseudo labeling and cooperative unfolding network . Advances in NeurIPS ， 37 ： 97859 – 97883， 2024 .

Chongyi Li ， Chunle Guo ， Wenqi Ren ， Runmin Cong ， Junhui Hou ， Sam Kwong ， and Dacheng Tao . An underwater image enhancement benchmark dataset and beyond . IEEE Transactions on Image Processing ， 29 ： 4376 – 4389， nov 2019. doi： 10.1109/TIP.2019.2955241 http://dx.doi.org/10.1109/TIP.2019.2955241 .

Christian Ledig ， Lucas Theis ， Ferenc Huszár ， Jose Caballero ， Andrew Cunningham ， Alejandro Acosta ， Alykhan Aitken ， Alykhan Tejani ， Johannes Totz ， Zehan Wang ， et al . Photo-realistic single image super-resolution using a generative adversarial network . In Proceedings of the IEEE conference on CVPR ， pages 4681 – 4690 ， 2017 .

Chun-Chieh Tsai . Standard images for multifocus image fusion ， 2025 .

Chunle Guo ， Chongyi Li ， Jichang Guo ， Chen Change Loy ， Junhui Hou ， Sam Kwong ， and Runmin Cong . Zeroreference deep curve estimation for low-light image enhancement . In Proceedings of the IEEE/CVF conference on CVPR ， pages 1780 – 1789 ， 2020 .

Chun-Le Guo ， Qixin Yan ， Saeed Anwar ， Runmin Cong ， Wenqi Ren ， and Chongyi Li . Image dehazing transformer with transmission-aware 3d position embedding . In Proceedings of the IEEE/CVF conference on CVPR ， pages 5812 – 5820 ， 2022 .

Chunyang Cheng ， Tianyang Xu ， and Xiao-Jun Wu . Mufusion： A general unsupervised image fusion network based on memory unit . Information Fusion ， 92 ： 80 – 92， 2023 .

Cui Yang ， Jian-Qi Zhang ， Xiao-Rui Wang ， and Xin Liu . A novel similarity based quality metric for image fusion . Information Fusion ， 9 （ 2 ）： 156 – 160， 2008 .

Dana Berman ， Shai Avidan ， et al . Non-local image dehazing . In Proceedings of the IEEE conference on CVPR ， pages 1674 – 1682 ， 2016 .

Daniyar Zakarin ， Thiemo Wandel ， Anton Obukhov ， and Dengxin Dai . Reflection removal through efficient adaptation of diffusion transformers . arXiv preprint arXiv： 2512.05000 ， 2025 .

Dongwei Ren ， Wangmeng Zuo ， Qinghua Hu ， Pengfei Zhu ， and Deyu Meng . Progressive image deraining networks： A better and simpler baseline . In Proceedings of the IEEE/CVF conference on CVPR ， pages 3937 – 3946 ， 2019 .

Eirikur Agustsson and Radu Timofte . Ntire 2017 challenge on single image super-resolution： Dataset and study . In Proceedings of the IEEE conference on CVPRW ， pages 126 – 135 ， 2017 .

Feng Zhang ， Haoyou Deng ， Zhiqiang Li ， Lida Li ， Bin Xu ， Qingbo Lu ， Zisheng Cao ， Minchen Wei ， Changxin Gao ， Nong Sang ， et al . High-resolution photo enhancement in real-time： A laplacian pyramid network . IEEE Transactions on Pattern Analysis and Machine Intelligence ， 2025 .

Fengyi Zhang ， Hui Zeng ， Tianjun Zhang ， and Lin Zhang . Clut-net： Learning adaptively compressed representations of 3dluts for lightweight image enhancement . In Proceedings of the 30th ACM International Conference on Multimedia ， pages 6493– 6501 . ACM ， 2022 .

Fu-Jen Tsai ， Yan-Tsung Peng ， Yen-Yu Lin ， Chung-Chi Tsai ， and Chia-Wen Lin . Stripformer： Strip transformer for fast image deblurring . In ECCV ， pages 146– 162 . Springer ， 2022.

Gemini Team ， Rohan Anil ， Sebastian Borgeaud ， Jean-Baptiste Alayrac ， Jiahui Yu ， Radu Soricut ， Johan Schalkwyk ， Andrew M Dai ， Anja Hauth ， Katie Millican ， et al . Gemini： a family of highly capable multimodal models . arXiv preprint arXiv： 2312.11805 ， 2023 .

Guangmang Cui ， Huajun Feng ， Zhihai Xu ， Qi Li ， and Yueting Chen . Detail preserved fusion of visible and infrared images using regional saliency extraction and multi-scale image decomposition . Optics Communications ， 341 ： 199 – 209， 2015 .

H. Li ， J. Li ， and W. Wang . A Fusion Adversarial Underwater Image Enhancement Network with a Public Test Dataset . arXiv preprint ， 2019 . doi： 10.48550/arXiv.1906.06819. arXiv： http://dx.doi.org/10.48550/arXiv.1906.06819.arXiv： 1906.06819 .

Han Xu ， Jiayi Ma ， Zhuliang Le ， Junjun Jiang ， and Xiaojie Guo . Fusiondn： A unified densely connected network for image fusion . In Proceedings of the AAAI conference on artificial intelligence ， volume 34， pages 12484 – 12491 ， 2020 .

Han Xu ， Jiteng Yuan ， and Jiayi Ma . Murf： Mutually reinforcing multi-modal image registration and fusion . IEEE transactions on pattern analysis and machine intelligence ， 45 （ 10 ）： 12148 – 12166， 2023 .

Hang Dong ， Jinshan Pan ， Lei Xiang ， Zhe Hu ， Xinyi Zhang ， Fei Wang ， and Ming-Hsuan Yang . Multi-scale boosted dehazing network with dense feature fusion . In Proceedings of the IEEE/CVF conference on CVPR ， pages 2157 – 2167 ， 2020 .

Hanshu Yan ， Jingfeng Zhang ， Jiashi Feng ， Masashi Sugiyama ， and Vincent YF Tan . Towards adversarially robust deep image denoising . arXiv preprint arXiv： 2201.04397 ， 2022 .

Hao Tang ， Chengcheng Yuan ， Zechao Li ， and Jinhui Tang . Learning attention-guided pyramidal features for few-shot fine-grained recognition . Pattern Recognition ， 130 ： 108792 ， 2022 .

Hao Zhai ， Wenyi Zheng ， Yuncan Ouyang ， Xin Pan ， and Wanli Zhang . Multi-focus image fusion via interactive transformer and asymmetric soft sharing . Engineering Applications of Artificial Intelligence ， 133 ： 107967 ， 2024 .

Hao Zhang and Jiayi Ma . Sdnet： A versatile squeeze-and-decomposition network for real-time image fusion . IJCV ， 129 （ 10 ）： 2761 – 2785， 2021 .

Hao Zhang ， Zhuliang Le ， Zhenfeng Shao ， Han Xu ， and Jiayi Ma . Mff-gan： An unsupervised generative adversarial network with adaptive and gradient joint constraints for multi-focus image fusion . Information Fusion ， 66 ： 40 – 53， 2021 .

Hao Zhao ， Mingjia Li ， Qiming Hu ， and Xiaojie Guo . Reversible decoupling network for single image reflection removal . In Proceedings of the CVPR Conference ， pages 26430 – 26439 ， 2025 .

Haotian Liu ， Chunyuan Li ， Qingyang Wu ， and Yong Jae Lee . Visual instruction tuning . Advances in NeurIPS ， 36 ： 34892 – 34916， 2023 .

Haoyou Deng ， Lida Li ， Feng Zhang ， Zhiqiang Li ， Bin Xu ， Qingbo Lu ， Changxin Gao ， and Nong Sang . Towards blind flare removal using knowledge-driven flare-level estimator . IEEE Transactions on Image Processing ， 2024 .

Haoyu Chen ， Jinjin Gu ， Yihao Liu ， Salma Abdel Magid ， Chao Dong ， Qiong Wang ， Hanspeter Pfister ， and Lei Zhu . Masked image training for generalizable deep image denoising . In Proceedings of the IEEE/CVF Conference on CVPR ， pages 1692 – 1703 ， 2023 .

Hong Wang ， Qi Xie ， Qian Zhao ， and Deyu Meng . A model-driven deep neural network for single image rain removal . In Proceedings of the IEEE/CVF conference on CVPR ， pages 3103 – 3112 ， 2020 .

Huafeng Li ， Dan Wang ， Yuxin Huang ， Yafei Zhang ， and Zhengtao Yu . Generation and recombination for multifocus image fusion with free number of inputs . IEEE Transactions on Circuits and Systems for Video Technology ， 34 （ 7 ）： 6009 – 6023， 2023 .

Huafeng Li ， Yitang Wang ， Zhao Yang ， Ruxin Wang ， Xiang Li ， and Dapeng Tao . Discriminative dictionary learning-based multiple component decomposition for detail-preserving noisy image fusion . IEEE Transactions on Instrumentation and Measurement ， 69 （ 4 ）： 1082 – 1102， 2019 .

Hui Li ， Tianyang Xu ， Xiao-Jun Wu ， Jiwen Lu ， and Josef Kittler . Lrrnet： A novel representation learning guided fusion network for infrared and visible images . IEEE transactions on pattern analysis and machine intelligence ， 45 （ 9 ）： 11040 – 11052， 2023 .

Hui Zeng ， Jianrui Cai ， Lida Li ， Zisheng Cao ， and Lei Zhang . Learning image-adaptive 3d lookup tables for high performance photo enhancement in real-time . IEEE Transactions on Pattern Analysis and Machine Intelligence ， 44 （ 4 ）： 2058 – 2073， 2020 .

Hyeongseok Son ， Junyong Lee ， Sunghyun Cho ， and Seungyong Lee . Single image defocus deblurring using kernel-sharing parallel atrous convolutions . In Proceedings of the IEEE/CVF ICCV ， pages 2642 – 2650 ， 2021 .

Ian Goodfellow ， Jean Pouget-Abadie ， Mehdi Mirza ， Bing Xu ， David Warde-Farley ， Sherjil Ozair ， Aaron Courville ， and Yoshua Bengio . Generative adversarial nets . In Advances in NeurIPS ， 2014 .

Jaesung Rim ， Haeyun Lee ， Jucheol Won ， and Sunghyun Cho . Real-world blur dataset for learning and benchmarking deblurring algorithms . In ECCV ， pages 184– 201 . Springer ， 2020.

Jia-Bin Huang ， Abhishek Singh ， and Narendra Ahuja . Single image super-resolution from transformed selfexemplars . In Proceedings of the IEEE conference on CVPR ， pages 5197 – 5206 ， 2015 .

Jianrui Cai ， Hui Zeng ， Hongwei Yong ， Zisheng Cao ， and Lei Zhang . Toward real-world single image superresolution： A new benchmark and a new model . In Proceedings of the IEEE/CVF ICCV ， pages 3086 – 3095 ， 2019 .

Jianrui Cai ， Shuhang Gu ， and Lei Zhang . Learning a deep single image contrast enhancer from multi-exposure images . IEEE Transactions on Image Processing ， 27 （ 4 ）： 2049 – 2062， 2018 .

Jianyi Wang ， Kelvin CK Chan ， and Chen Change Loy . Exploring clip for assessing the look and feel of images . In Proceedings of the AAAI Conference on Artificial Intelligence ， volume 37， pages 2555 – 2563 ， 2023 .

Jianyi Wang ， Zongsheng Yue ， Shangchen Zhou ， Kelvin CK Chan ， and Chen Change Loy . Exploiting diffusion prior for real-world image super-resolution . IJCV ， pages 1 – 21 ， 2024 .

Jiayi Ma ， Han Xu ， Junjun Jiang ， Xiaoguang Mei ， and Xiao-Ping Zhang . Ddcgan： A dual-discriminator conditional generative adversarial network for multi-resolution image fusion . IEEE Transactions on Image Processing ， 29 ： 4980 – 4995， 2020 .

Jiayi Ma ， Linfeng Tang ， Fan Fan ， Jun Huang ， Xiaoguang Mei ， and Yong Ma . Swinfusion： Cross-domain long-range learning for general image fusion via swin transformer . IEEE/CAA Journal of Automatica Sinica ， 9 （ 7 ）： 1200 – 1217， 2022 .

Jiayi Ma ， Pengwei Liang ， Wei Yu ， Chen Chen ， Xiaojie Guo ， Jia Wu ， and Junjun Jiang . Infrared and visible image fusion via detail preserving adversarial learning . Information Fusion ， 54 ： 85 – 98， 2020 .

Jiayi Ma ， Wei Yu ， Pengwei Liang ， Chang Li ， and Junjun Jiang . Fusiongan： A generative adversarial network for infrared and visible image fusion . Information fusion ， 48 ： 11 – 26， 2019 .

Jichen Hu ， Chen Yang ， Zanwei Zhou ， Jiemin Fang ， Xiaokang Yang ， Qi Tian ， and Wei Shen . Dereflection any image with diffusion priors and diversified data . arXiv preprint arXiv： 2503.17347 ， 2025 .

Jie Cai ， Kangning Yang ， Ling Ouyang ， Lan Fu ， Jiaming Ding ， Huiming Sun ， Chiu Man Ho ， and Zibo Meng . F2t2-hit： A u-shaped fft transformer and hierarchical transformer for reflection removal . arXiv preprint arXiv： 2506.05489 ， 2025 .

Jie Xiao ， Xueyang Fu ， Aiping Liu ， Feng Wu ， and Zheng-Jun Zha . Image de-raining transformer . IEEE transactions on pattern analysis and machine intelligence ， 45 （ 11 ）： 12978 – 12995， 2022 .

Jie Xiao ， Xueyang Fu ， Yurui Zhu ， Dong Li ， Jie Huang ， Kai Zhu ， and Zheng-Jun Zha . Homoformer： Homogenized transformer for image shadow removal . In Proceedings of the IEEE/CVF conference on CVPR ， pages 25617 – 25626 ， 2024 .

Jie Yang ， Dong Gong ， Lingqiao Liu ， and Qinfeng Shi . Seeing deeply and bidirectionally： A deep learning approach for single image reflection removal . In Proceedings of the ECCV ， pages 654 – 669 ， 2018 .

Jing Li ， Hongtao Huo ， Chang Li ， Renhua Wang ， and Qi Feng . Attentionfgan： Infrared and visible image fusion using attention-based generative adversarial networks . IEEE Transactions on Multimedia ， 23 ： 1383 – 1396， 2020 .

Jing Li ， Jianming Zhu ， Chang Li ， Xun Chen ， and Bin Yang . Cgtf： Convolution-guided transformer for infrared and visible image fusion . IEEE Transactions on Instrumentation and Measurement ， 71 ： 1 – 14， 2022 .

Jingwen He ， Yihao Liu ， Yu Qiao ， and Chao Dong . Conditional sequential modulation for efficient global image retouching . In ECCV ， pages 679– 695 . Springer ， 2020.

Jinhui Hou ， Zhiyu Zhu ， Junhui Hou ， Hui Liu ， Huanqiang Zeng ， and Hui Yuan . Global structure-aware diffusion process for low-light image enhancement . Advances in NeurIPS ， 36 ： 79734 – 79747， 2023 .

Jinxing Li ， Xiaobao Guo ， Guangming Lu ， Bob Zhang ， Yong Xu ， Feng Wu ， and David Zhang . Drpl： Deep regression pair learning for multi-focus image fusion . IEEE Transactions on Image Processing ， 29 ： 4816 – 4831， 2020 .

Jinyuan Liu ， Xin Fan ， Zhanbo Huang ， Guanyao Wu ， Risheng Liu ， Wei Zhong ， and Zhongxuan Luo . Target-aware dual adversarial learning and a multi-scenario multi-modality benchmark to fuse infrared and visible for object detection . In Proceedings of the IEEE/CVF conference on CVPR ， pages 5802 – 5811 ， 2022 .

Jinyuan Liu ， Zhu Liu ， Guanyao Wu ， Long Ma ， Risheng Liu ， Wei Zhong ， Zhongxuan Luo ， and Xin Fan . Multiinteractive feature learning and a full-time multi-modality benchmark for image fusion and segmentation . In Proceedings of the IEEE/CVF ICCV ， pages 8115 – 8124 ， 2023 .

Jonathan Ho ， Ajay Jain ， and Pieter Abbeel . Denoising diffusion probabilistic models . In Advances in NeurIPS ， volume 33， pages 6840 – 6851 ， 2020 .

Jun Xu ， Hui Li ， Zhetong Liang ， David Zhang ， and Lei Zhang . Real-world noisy image denoising： A new benchmark . arXiv preprint arXiv： 1804.02603 ， 2018 .

Juncheng Zhang ， Qingmin Liao ， Haoyu Ma ， Jing-Hao Xue ， Wenming Yang ， and Shaojun Liu . Exploit the best of both end-to-end and map-based methods for multi-focus image fusion . IEEE Transactions on Multimedia ， 26 ： 6411 – 6423， 2024 .

Junjie Ke ， Qifei Wang ， Yilin Wang ， Peyman Milanfar ， and Feng Yang . MUSIQ： Multi-scale image quality transformer . In IEEE/CVF ICCV ， pages 5148 – 5157 ， 2021 .

Junyong Lee ， Hyeongseok Son ， Jaesung Rim ， Sunghyun Cho ， and Seungyong Lee . Iterative filter adaptive network for single image defocus deblurring . In Proceedings of the IEEE/CVF conference on CVPR ， pages 2034 – 2042 ， 2021 .

K. Iqbal ， M. Odetayo ， A. James ， R. A. Salam ， and A. Z. H. Talib . Enhancing the low quality images using unsupervised colour correction method . In IEEE International Conference on SMC ， pages 1703 – 1709 ， 2010 .

K. Panetta ， C. Gao ， and S. Agaian . Human-visual-system-inspired underwater image quality measures . IEEE Journal of Oceanic Engineering ， 41 （ 3 ）： 541 – 551， 2015 . doi： 10.1109/JOE.2015.2410644 http://dx.doi.org/10.1109/JOE.2015.2410644 .

Kai Zhang ， Jingyun Liang ， Luc Van Gool ， and Radu Timofte . Designing a practical degradation model for deep blind image super-resolution . In Proceedings of the IEEE/CVF ICCV ， pages 4791 – 4800 ， 2021 .

Kai Zhang ， Wangmeng Zuo ， Yunjin Chen ， Deyu Meng ， and Lei Zhang . Beyond a gaussian denoiser： Residual learning of deep cnn for image denoising . IEEE transactions on image processing ， 26 （ 7 ）： 3142 – 3155， 2017 .

Kaihao Zhang ， Wenhan Luo ， Yiran Zhong ， Lin Ma ， Bjorn Stenger ， Wei Liu ， and Hongdong Li . Deblurring by realistic blurring . In Proceedings of the IEEE/CVF conference on CVPR ， pages 2737 – 2746 ， 2020 .

Kaiming He ， Jian Sun ， and Xiaoou Tang . Single image haze removal using dark channel prior . IEEE transactions on pattern analysis and machine intelligence ， 33 （ 12 ）： 2341 – 2353， 2010 .

Kaixuan Wei ， Jiaolong Yang ， Ying Fu ， David Wipf ， and Hua Huang . Single image reflection removal exploiting misaligned training data and network enhancements . In Proceedings of the IEEE/CVF Conference on CVPR ， pages 8178 – 8187 ， 2019 .

Kecheng Zheng ， Juan Cheng ， and Yu Liu . Unfolding coupled convolutional sparse representation for multi-focus image fusion . Information Fusion ， 118 ： 102974 ， 2025 .

Kui Jiang ， Zhongyuan Wang ， Peng Yi ， Chen Chen ， Baojin Huang ， Yimin Luo ， Jiayi Ma ， and Junjun Jiang . Multi-scale progressive fusion network for single image deraining . In Proceedings of the IEEE/CVF conference on CVPR ， pages 8346 – 8355 ， 2020 .

L. Peng ， C. Zhu ， and L. Bian . U-shape transformer for underwater image enhancement . IEEE Transactions on Image Processing ， 32 ： 3066 – 3079， 2023 . doi： 10.1109/TIP.2023.3276332 http://dx.doi.org/10.1109/TIP.2023.3276332 .

Lanqing Guo ， Chong Wang ， Wenhan Yang ， Siyu Huang ， Yufei Wang ， Hanspeter Pfister ， and Bihan Wen . Shadowdiffusion： When degradation prior meets diffusion model for shadow removal . In Proceedings of the IEEE/CVF conference on CVPR ， pages 14049 – 14058 ， 2023 .

Lanqing Guo ， Siyu Huang ， Ding Liu ， Hao Cheng ， and Bihan Wen . Shadowformer： Global context helps shadow removal . In Proceedings of the AAAI conference on artificial intelligence ， volume 37， pages 710 – 718 ， 2023 .

Lei Zhang ， Xiaolin Wu ， Antoni Buades ， and Xin Li . Color demosaicking by local directional interpolation and nonlocal adaptive thresholding . Journal of Electronic imaging ， 20 （ 2 ）： 023016 – 023016， 2011 .

Liangqiong Qu ， Jiandong Tian ， Shengfeng He ， Yandong Tang ， and Rynson WH Lau . Deshadownet： A multicontext embedding deep network for shadow removal . In Proceedings of the IEEE conference on CVPR ， pages 4067 – 4075 ， 2017 .

Linfeng Tang ， Jiteng Yuan ， Hao Zhang ， Xingyu Jiang ， and Jiayi Ma . Piafusion： A progressive infrared and visible image fusion network based on illumination aware . Information Fusion ， 83 ： 79 – 92， 2022 .

Lingyan Ruan ， Bin Chen ， Jizhou Li ， and Miu-Ling Lam . Aifnet： All-in-focus image restoration network using a light field-based dataset . IEEE Transactions on Computational Imaging ， 7 ： 675 – 688， 2021 .

Lingyan Ruan ， Bin Chen ， Jizhou Li ， and Miuling Lam . Learning to deblur using light field generated and real defocus images . In Proceedings of the IEEE/CVF conference on CVPR ， pages 16304 – 16313 ， 2022 .

M. Yang and A. Sowmya . An underwater color image quality evaluation metric . IEEE Transactions on Image Processing ， 24 （ 12 ）： 6062 – 6071， 2015 . doi： 10.1109/TIP.2015.2480136 http://dx.doi.org/10.1109/TIP.2015.2480136 .

Mansour Nejati ， Shadrokh Samavi ， and Shahram Shirani . Multi-focus image fusion using dictionary-based sparse representation . Information fusion ， 25 ： 72 – 84， 2015 .

Matthias Hullin ， Elmar Eisemann ， Hans-Peter Seidel ， and Sungkil Lee . Physically-based real-time lens flare rendering . ACM Trans. Graph. ， 30 （ 4 ）， 2011 .

Michaël Gharbi ， Jiawen Chen ， Jonathan T Barron ， Samuel W Hasinoff ， and Frédo Durand . Deep bilateral learning for real-time image enhancement . ACM Transactions on Graphics （TOG）， 36 （ 4 ）： 1 – 12， 2017 .

Mining Li ， Ronghao Pei ， Tianyou Zheng ， Yang Zhang ， and Weiwei Fu . Fusiondiff： Multi-focus image fusion using denoising diffusion probabilistic models . Expert Systems with Applications ， 238 ： 121664 ， 2024 .

Mohammed Hossny ， Saeid Nahavandi ， and Douglas Creighton . Comments on ‘information measure for performance of image fusion’ . Electronics letters ， 44 （ 18 ）： 1066 – 1067， 2008 .

Orest Kupyn ， Tetiana Martyniuk ， Junru Wu ， and Zhangyang Wang . Deblurgan-v2： Deblurring （orders-ofmagnitude） faster and better . In Proceedings of the IEEE/CVF ICCV ， pages 8878 – 8887 ， 2019 .

Orest Kupyn ， Volodymyr Budzan ， Mykola Mykhailych ， Dmytro Mishkin ， and Jiří Matas . Deblurgan： Blind motion deblurring using conditional adversarial networks . In Proceedings of the IEEE conference on CVPR ， pages 8183 – 8192 ， 2018 .

P. L. Drews ， E. R. Nascimento ， S. S. Botelho ， and M. F. M. Campos . Underwater depth estimation and image restoration based on single images . IEEE Computer Graphics and Applications ， 36 （ 2 ）： 24 – 35， 2016 .

Pengwei Liang ， Junjun Jiang ， Xianming Liu ， and Jiayi Ma . Fusion from decomposition： A self-supervised decomposition approach for image fusion . In ECCV ， pages 719 – 735 . Springer ， 2022 .

Pengxu Wei ， Ziwei Xie ， Hannan Lu ， Zongyuan Zhan ， Qixiang Ye ， Wangmeng Zuo ， and Liang Lin . Component divide-and-conquer for real-world image super-resolution . In ECCV ， pages 101 – 117 ， 2020 .

Qiaosi Yi ， Juncheng Li ， Qinyan Dai ， Faming Fang ， Guixu Zhang ， and Tieyong Zeng . Structure-preserving deraining with residue channel prior guidance . In Proceedings of the IEEE/CVF ICCV ， pages 4238 – 4247 ， 2021 .

Qiming Hu and Xiaojie Guo . Single image reflection separation via component synergy . In Proceedings of the IEEE/CVF ICCV ， pages 13138 – 13147 ， 2023 .

Qiming Hu and Xiaojie Guo . Trash or treasure？ an interactive dual-stream strategy for single image reflection separation. Advances in NeurIPS ， 34 ： 24683 – 24694， 2021 .

Qiming Hu ， Hainuo Wang ， and Xiaojie Guo . Single image reflection separation via dual-stream interactive transformers . Advances in NeurIPS ， 37 ： 55228 – 55248， 2024 .

R. Cong ， W. Yang ， W. Zhang ， C. Li ， C.-L. Guo ， Q. Huang ， and S. Kwong . Pugan： Physical model-guided underwater image enhancement using gan with dual-discriminators . IEEE Transactions on Image Processing ， 32 ： 4472 – 4485， 2023 .

Raanan Fattal . Dehazing using color-lines . ACM transactions on graphics （TOG）， 34 （ 1 ）： 1 – 14， 2014 .

Renjie Wan ， Boxin Shi ， Ling-Yu Duan ， Ah-Hwee Tan ， and Alex C Kot . Benchmarking single-image reflection removal algorithms . In Proceedings of the IEEE ICCV ， pages 3922 – 3930 ， 2017 .

Rich Franzen . Kodak lossless true color image suite， volume 5 . https://r0k.us/graphics/kodak/ ， 1999 .

Richard Zhang ， Phillip Isola ， Alexei A Efros ， Eli Shechtman ， and Oliver Wang . The unreasonable effectiveness of deep features as a perceptual metric . In Proceedings of the IEEE conference on CVPR ， pages 586 – 595 ， 2018 .

Risheng Liu ， Long Ma ， Jiaao Zhang ， Xin Fan ， and Zhongxuan Luo . Retinex-inspired unrolling with cooperative prior architecture search for low-light image enhancement . In Proceedings of the IEEE/CVF conference on CVPR ， pages 10561 – 10570 ， 2021 .

Robin Rombach ， Andreas Blattmann ， Dominik Lorenz ， Patrick Esser ， and Björn Ommer . High-resolution image synthesis with latent diffusion models . In IEEE/CVF Conference on CVPR ， pages 10684 – 10695 ， 2022 .

Ruiqi Guo ， Qieyun Dai ， and Derek Hoiem . Paired regions for shadow detection and removal . IEEE transactions on pattern analysis and machine intelligence ， 35 （ 12 ）： 2956 – 2967， 2012 .

Rui-Qi Wu ， Zheng-Peng Duan ， Chun-Le Guo ， Zhi Chai ， and Chongyi Li . Ridcp： Revitalizing real image dehazing via high-quality codebook priors . In Proceedings of the IEEE/CVF conference on CVPR ， pages 22282 – 22291 ， 2023 .

Ruixing Wang ， Qing Zhang ， Chi-Wing Fu ， Xiaoyong Shen ， Wei-Shi Zheng ， and Jiaya Jia . Underexposed photo enhancement using deep illumination estimation . In Proceedings of the IEEE/CVF Conference on CVPR ， pages 6849 – 6857 ， 2019 .

S.-B. Gao ， M. Zhang ， Q. Zhao ， X.-S. Zhang ， and Y.-J. Li . Underwater image enhancement using adaptive retinal mechanisms . IEEE Trans. Image Process. ， 28 （ 11 ）： 5580 – 5595， 2019 .

Sean Moran ， Pierre Marza ， Steven McDonagh ， Sarah Parisot ， and Gregory Slabaugh . Deeplpf： Deep local parametric filters for image enhancement . In Proceedings of the IEEE/CVF Conference on CVPR ， pages 12826 – 12835 ， 2020 .

Seungjun Nah ， Tae Hyun Kim ， and Kyoung Mu Lee . Deep multi-scale convolutional neural network for dynamic scene deblurring . In Proceedings of the IEEE conference on CVPR ， pages 3883 – 3891 ， 2017 .

Shuai Bai ， Keqin Chen ， Xuejing Liu ， Jialin Wang ， Wenbin Ge ， Sibo Song ， Kai Dang ， Peng Wang ， Shijie Wang ， Jun Tang ， et al . Qwen2.5-vl technical report . arXiv preprint arXiv： 2502.13923 ， 2025 .

Shuang Xu ， Xiaoli Wei ， Chunxia Zhang ， Junmin Liu ， and Jiangshe Zhang . Mffw： A new dataset for multi-focus image fusion . arXiv preprint arXiv： 2002.04780 ， 2020 .

Simin Luan ， Cong Yang ， Zeyd Boukhers ， Xue Qin ， Dongfeng Cheng ， Wei Sui ， and Zhijun Li . Gyroscope-assisted motion deblurring network . CoRR ， 2024 .

Syed Waqas Zamir ， Aditya Arora ， Salman Khan ， Munawar Hayat ， Fahad Shahbaz Khan ， Ming-Hsuan Yang ， and Ling Shao . Multi-stage progressive image restoration . In Proceedings of the IEEE/CVF conference on CVPR ， pages 14821 – 14831 ， 2021 .

Syed Waqas Zamir ， Aditya Arora ， Salman Khan ， Munawar Hayat ， Fahad Shahbaz Khan ， and Ming-Hsuan Yang . Restormer： Efficient transformer for high-resolution image restoration . In Proceedings of the IEEE/CVF conference on CVPR ， pages 5728 – 5739 ， 2022 .

Tao Wang ， Kaihao Zhang ， Tianrun Shen ， Wenhan Luo ， Bjorn Stenger ， and Tong Lu . Ultra-high-definition low-light image enhancement： A benchmark and transformer-based method . In Proceedings of the AAAI conference on artificial intelligence ， volume 37， pages 2654 – 2662 ， 2023 .

Tao Wang ， Yong Li ， Jingyang Peng ， Yipeng Ma ， Xian Wang ， Fenglong Song ， and Youliang Yan . Real-time image enhancer via learnable spatial-aware 3d lookup tables . In Proceedings of the IEEE/CVF ICCV ， pages 2471 – 2480 ， 2021 .

Tianyu Wang ， Xin Yang ， Ke Xu ， Shaozhe Chen ， Qiang Zhang ， and Rynson W. H. Lau . Spatial attentive single-image deraining with a high quality real rain dataset . In The IEEE Conference on CVPR ， June 2019 .

Vibashan Vs ， Jeya Maria Jose Valanarasu ， Poojan Oza ， and Vishal M Patel . Image fusion transformer . In 2022 IEEE International Conference on Image Processing （ICIP）， pages 3566 – 3570 . IEEE ， 2022 .

Vineeth Murali and PV Sudeep . Image denoising using dncnn： An exploration study . In Advances in Communication Systems and Networks ： Select Proceedings of ComNet 2019 ， pages 847–859 . Springer ， 2020 .

Vladimir Bychkovsky ， Sylvain Paris ， Eric Chan ， and Fredo Durand . Learning photographic global tonal adjustment with a database of input / output image pairs . In CVPR 2011 ， pages 97 – 104 ， 2011 . doi： 10.1109/CVPR.2011.5995332 http://dx.doi.org/10.1109/CVPR.2011.5995332 .

W. Zhang ， Y. Wang ， and C. Li . Underwater image enhancement by attenuated color channel correction and detail preserved contrast enhancement . IEEE Journal of Oceanic Engineering ， pages 1 – 18 ， 2022 .

Wenda Zhao ， Shigeng Xie ， Fan Zhao ， You He ， and Huchuan Lu . Metafusion： Infrared and visible image fusion via meta-feature embedding from object detection . In Proceedings of the IEEE/CVF Conference on CVPR ， pages 13955 – 13965 ， 2023 .

Wenhan Yang ， Robby T Tan ， Jiashi Feng ， Jiaying Liu ， Zongming Guo ， and Shuicheng Yan . Deep joint rain detection and removal from a single image . In Proceedings of the IEEE conference on CVPR ， pages 1357 – 1366 ， 2017 .

Wenhan Yang ， Wenjing Wang ， Haofeng Huang ， Shiqi Wang ， and Jiaying Liu . Sparse gradient regularized deep retinex network for robust low-light image enhancement . IEEE Transactions on Image Processing ， 30 ： 2072 – 2086， 2021 .

Wenjing Wang ， Huan Yang ， Jianlong Fu ， and Jiaying Liu . Zero-reference low-light enhancement via physical quadruple priors . In Proceedings of the IEEE/CVF conference on CVPR ， pages 26057 – 26066 ， 2024 .

Xia Li ， Jianlong Wu ， Zhouchen Lin ， Hong Liu ， and Hongbin Zha . Recurrent squeeze-and-excitation context aggregation net for single image deraining . In Proceedings of the ECCV ， pages 254 – 269 ， 2018 .

Xiang Chen ， Hao Li ， Mingqiang Li ， and Jinshan Pan . Learning a sparse transformer network for effective image deraining . In Proceedings of the IEEE/CVF conference on CVPR ， pages 5896 – 5905 ， 2023 .

Xiang Chen ， Jinshan Pan ， and Jiangxin Dong . Bidirectional multi-scale implicit neural representations for image deraining . In Proceedings of the IEEE/CVF Conference on CVPR ， June 2024 .

Xiaodong Cun ， Chi-Man Pun ， and Cheng Shi . Towards ghost-free shadow removal via dual hierarchical aggregation network and shadow matting gan . In Proceedings of the AAAI conference on artificial intelligence ， volume 34， pages 10680 – 10687 ， 2020 .

Xiaogang Xu ， Ruixing Wang ， Chi-Wing Fu ， and Jiaya Jia . Snr-aware low-light image enhancement . In Proceedings of the IEEE/CVF conference on CVPR ， pages 17714 – 17724 ， 2022 .

Xiaojie Guo and Qiming Hu . Low-light image enhancement via breaking down the darkness . IJCV ， 131 （ 1 ）： 48 – 66， 2023 .

Xiaowei Hu ， Lei Zhu ， Chi-Wing Fu ， Jing Qin ， and Pheng-Ann Heng . Direction-aware spatial context features for shadow detection . In Proceedings of the IEEE conference on CVPR ， pages 7454 – 7462 ， 2018 .

Xiaoyu Li ， Bo Zhang ， Jing Liao ， and Pedro V. Sander . Let’s see clearly： Contaminant artifact removal for moving cameras . In 2021 IEEE/CVF ICCV ， pages 1991 – 2000 ， 2021 . doi： 10.1109/ICCV48922.2021.00202 http://dx.doi.org/10.1109/ICCV48922.2021.00202 .

Xin Li ， Bingchen Li ， Xin Jin ， Cuiling Lan ， and Zhibo Chen . Learning distortion invariant representation for image restoration from a causality perspective . In Proceedings of the IEEE/CVF Conference on CVPR ， pages 1714 – 1724 ， 2023 .

Xingyu Hu ， Junjun Jiang ， Xianming Liu ， and Jiayi Ma . Zmff： Zero-shot multi-focus image fusion . Information Fusion ， 92 ： 127 – 138， 2023 .

Xinqi Lin ， Jingwen He ， Ziyan Chen ， Zhaoyang Lyu ， Bo Dai ， Fanghua Yu ， Wanli Ouyang ， Yu Qiao ， and Chao Dong . Diffbir： Towards blind image restoration with generative diffusion prior . arXiv preprint arXiv： 2308.15070 ， 2023 .

Xintao Wang ， Liangbin Xie ， Chao Dong ， and Ying Shan . Real-esrgan： Training real-world blind super-resolution with pure synthetic data . In Proceedings of the IEEE/CVF ICCV Workshops ， pages 1905 – 1914 ， 2021 .

Xuaner Zhang ， Ren Ng ， and Qifeng Chen . Single image reflection separation with perceptual losses . In Proceedings of the IEEE conference on CVPR ， pages 4786 – 4794 ， 2018 .

Xueyang Fu ， Jiabin Huang ， Delu Zeng ， Yue Huang ， Xinghao Ding ， and John Paisley . Removing rain from single images via a deep detail network . In Proceedings of the IEEE conference on CVPR ， pages 3855 – 3863 ， 2017 .

Xueyang Fu ， Qi Qi ， Zheng-Jun Zha ， Yurui Zhu ， and Xinghao Ding . Rain streak removal via dual graph convolutional network . In Proceedings of the AAAI Conference on Artificial Intelligence ， volume 35， pages 1352 – 1360 ， 2021 .

Yael Vinker ， Inbar Huberman-Spiegelglas ， and Raanan Fattal . Unpaired learning for high dynamic range image tone mapping . In 2021 IEEE/CVF ICCV ， pages 14637 – 14646 ， 2021 . doi： 10.1109/ICCV48922.2021.01439 http://dx.doi.org/10.1109/ICCV48922.2021.01439 .

Yang Yang ， Chaoyue Wang ， Risheng Liu ， Lin Zhang ， Xiaojie Guo ， and Dacheng Tao . Self-augmented unpaired image dehazing via density and depth decomposition . In Proceedings of the IEEE/CVF conference on CVPR ， pages 2037 – 2046 ， 2022 .

Yanzuo Lu ， Xin Xia ， Manlin Zhang ， Huafeng Kuang ， Jianbin Zheng ， Yuxi Ren ， and Xuefeng Xiao . Hyper-bagel： A unified acceleration framework for multimodal understanding and generation . arXiv preprint arXiv： 2509.18824 ， 2025 .

Yi Tang ， Hiroshi Kawasaki ， and Takafumi Iwaguchi . Underwater image enhancement by transformer-based diffusion model with non-uniform sampling for skip strategy . In Proceedings of the 31st ACM International Conference on Multimedia （MM ’23 ）， pages 5419 – 5427 ， New York ， NY ， USA ， 2023 . Association for Computing Machinery. doi： 10.1145/3581783.3612475 http://dx.doi.org/10.1145/3581783.3612475 .

Yicheng Wu ， Qiurui He ， Tianfan Xue ， Rahul Garg ， Jiawen Chen ， Ashok Veeraraghavan ， and Jonathan T. Barron . How to train neural networks for flare removal . In 2021 IEEE/CVF ICCV ， pages 2219 – 2227 ， 2021 . doi： 10.1109/ICCV48922.2021.00224 http://dx.doi.org/10.1109/ICCV48922.2021.00224 .

Yifan Jiang ， Xinyu Gong ， Ding Liu ， Yu Cheng ， Chen Fang ， Xiaohui Shen ， Jianchao Yang ， Pan Zhou ， and Zhangyang Wang . Enlightengan： Deep light enhancement without paired supervision . IEEE transactions on image processing ， 30 ： 2340 – 2349， 2021 .

Yihang Huang ， Yuanfei Huang ， Junhui Lin ， and Hua Huang . Deflaremamba： Hierarchical vision mamba for contextually consistent lens flare removal . In Proceedings of the 33rd ACM International Conference on Multimedia ， pages 8028 – 8037 ， 2025 .

Yin Chen and Rick S Blum . A new automated quality assessment algorithm for image fusion . Image and vision computing ， 27 （ 10 ）： 1421 – 1432， 2009 .

Yochai Blau and Tomer Michaeli . The perception-distortion tradeoff . In Proceedings of the IEEE conference on CVPR ， pages 6228 – 6237 ， 2018 .

Yonghua Zhang ， Jiawan Zhang ， and Xiaojie Guo . Kindling the darkness： A practical low-light image enhancer . In Proceedings of the 27th ACM international conference on multimedia ， pages 1632 – 1640 ， 2019 .

Yu Li ， Ming Liu ， Yaling Yi ， Qince Li ， Dongwei Ren ， and Wangmeng Zuo . Two-stage single image reflection removal with reflection-aware guidance . Applied Intelligence ， 53 （ 16 ）： 19433 – 19448， 2023 .

Yu Li ， Robby T Tan ， Xiaojie Guo ， Jiangbo Lu ， and Michael S Brown . Rain streak removal using layer priors . In Proceedings of the IEEE conference on CVPR ， pages 2736 – 2744 ， 2016 .

Yu Liu ， Shuping Liu ， and Zengfu Wang . A general framework for image fusion based on multi-scale transform and sparse representation . Information fusion ， 24 ： 147 – 164， 2015 .

Yu Liu ， Xun Chen ， Hu Peng ， and Zengfu Wang . Multi-focus image fusion with a deep convolutional neural network . Information Fusion ， 36 ： 191 – 207， 2017 .

Yu Luo ， Yong Xu ， and Hui Ji . Removing rain from a single image via discriminative sparse coding . In Proceedings of the IEEE ICCV ， pages 3397 – 3405 ， 2015 .

Yu Zhang ， Yu Liu ， Peng Sun ， Han Yan ， Xiaolin Zhao ， and Li Zhang . Ifcnn： A general image fusion framework based on convolutional neural network . Information Fusion ， 54 ： 99 – 118， 2020 .

Yuanhao Cai ， Hao Bian ， Jing Lin ， Haoqian Wang ， Radu Timofte ， and Yulun Zhang . Retinexformer： One-stage retinex-based transformer for low-light image enhancement . In Proceedings of the IEEE/CVF ICCV ， pages 12504 – 12513 ， 2023 .

Yuanjie Shao ， Lerenhan Li ， Wenqi Ren ， Changxin Gao ， and Nong Sang . Domain adaptation for image dehazing . In Proceedings of the IEEE/CVF conference on CVPR ， pages 2808 – 2817 ， 2020 .

Yuchen Hong ， Haofeng Zhong ， Shuchen Weng ， Jinxiu Liang ， and Boxin Shi . L-differ： Single image reflection removal with language-based diffusion model . In ECCV ， pages 58 – 76 . Springer ， 2024 .

Yuda Song ， Zhuqing He ， Hui Qian ， and Xin Du . Vision transformers for single image dehazing . IEEE Transactions on Image Processing ， 32 ： 1927 – 1941， 2023 .

Yudong Wang ， Jichang Guo ， Huan Gao ， and Huihui Yue . Uiec2-net： Cnn-based underwater image enhancement using two color space . Signal Processing ： Image Communication ， 96 ： 116250 ， 2021 . ISSN 0923-5965 .

Yue Huang ， Zi’ang Li ， Tianle Hu ， Jie Wen ， Guanbin Li ， Jinglin Zhang ， Guoxu Zhou ， and Xiaozhao Fang . Single image reflection removal via inter-layer complementarity . arXiv preprint arXiv： 2505.12641 ， 2025 .

Yuekun Dai ， Chongyi Li ， Shangchen Zhou ， Ruicheng Feng ， Yihang Luo ， and Chen Change Loy . Flare7k++： Mixing synthetic and real datasets for nighttime flare removal and beyond . IEEE Transactions on Pattern Analysis and Machine Intelligence ， 46 （ 11 ）： 7041 – 7055， 2024 . doi： 10.1109/TPAMI.2024.3406821 http://dx.doi.org/10.1109/TPAMI.2024.3406821 .

Yuekun Dai ， Dafeng Zhang ， Xiaoming Li ， Zongsheng Yue ， Chongyi Li ， Shangchen Zhou ， Ruicheng Feng ， Peiqing Yang ， Zhezhu Jin ， Guanqun Liu ， and Chen Change Loy . Mipi 2024 challenge on nighttime flare removal： Methods and results . In 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops （CVPRW）， pages 1144 – 1152 ， 2024 . doi： 10.1109/CVPRW63382.2024.00121 http://dx.doi.org/10.1109/CVPRW63382.2024.00121 .

Yufei Wang ， Renjie Wan ， Wenhan Yang ， Haoliang Li ， Lap-Pui Chau ， and Alex Kot . Low-light image enhancement with normalizing flow . In Proceedings of the AAAI conference on artificial intelligence ， volume 36， pages 2604 – 2612 ， 2022 .

Yufei Wang ， Wenhan Yang ， Xinyuan Chen ， Yaohui Wang ， Lanqing Guo ， Lap-Pui Chau ， Ziwei Liu ， Yu Qiao ， Alex C Kot ， and Bihan Wen . Sinsr： diffusion-based image super-resolution in a single step . In Proceedings of the IEEE/CVF Conference on CVPR ， pages 25796 – 25805 ， 2024 .

Yufeng Zheng ， Edward A Essock ， Bruce C Hansen ， and Andrew M Haun . A new metric based on extended spatial frequency and its application to dwt based fusion algorithms . Information Fusion ， 8 （ 2 ）： 177 – 192， 2007 .

Yuhui Quan ， Xi Wan ， Zitao Tang ， Jinxiu Liang ， and Hui Ji . Multi-focus image fusion via explicit defocus blur modelling . In Proceedings of the AAAI Conference on Artificial Intelligence ， volume 39， pages 6657 – 6665 ， 2025 .

Yuhui Quan ， Xin Yao ， and Hui Ji . Single image defocus deblurring via implicit neural inverse kernels . In Proceedings of the IEEE/CVF ICCV ， pages 12600 – 12610 ， 2023 .

Yuhui Quan ， Zicong Wu ， and Hui Ji . Gaussian kernel mixture network for single image defocus deblurring . Advances in NeurIPS ， 34 ： 20812 – 20824， 2021 .

Yuhui Quan ， Zicong Wu ， and Hui Ji . Neumann network with recursive kernels for single image defocus deblurring . In Proceedings of the IEEE/CVF conference on CVPR ， pages 5754 – 5763 ， 2023 .

Yuhui Quan ， Zicong Wu ， Ruotao Xu ， and Hui Ji . Deep single image defocus deblurring via gaussian kernel mixture learning . IEEE Transactions on Pattern Analysis and Machine Intelligence ， 2024 .

Yulun Zhang ， Kunpeng Li ， Kai Li ， Lichen Wang ， Bineng Zhong ， and Yun Fu . Image super-resolution using very deep residual channel attention networks . In Proceedings of the ECCV ， pages 286 – 301 ， 2018 .

Yurui Zhu ， Jie Huang ， Xueyang Fu ， Feng Zhao ， Qibin Sun ， and Zheng-Jun Zha . Bijective mapping network for shadow removal . In Proceedings of the IEEE/CVF Conference on CVPR ， pages 5627 – 5636 ， 2022 .

Yurui Zhu ， Xueyang Fu ， Peng-Tao Jiang ， Hao Zhang ， Qibin Sun ， Jinwei Chen ， Zheng-Jun Zha ， and Bo Li . Revisiting single image reflection removal in the wild . In Proceedings of the IEEE/CVF Conference on CVPR ， pages 25468 – 25478 ， 2024 .

Yuwei Qiu ， Kaihao Zhang ， Chenxi Wang ， Wenhan Luo ， Hongdong Li ， and Zhi Jin . Mb-taylorformer： Multi-branch efficient transformer expanded by taylor formula for image dehazing . In Proceedings of the IEEE/CVF ICCV ， pages 12802 – 12813 ， 2023 .

Zeyuan Chen ， Yangchao Wang ， Yang Yang ， and Dong Liu . Psd： Principled synthetic-to-real dehazing guided by physical priors . In Proceedings of the IEEE/CVF conference on CVPR ， pages 7180 – 7189 ， 2021 .

Zhaohan Wang ， Chengjun Chen ， and Chenggang Dai . Zero-shot realistic image deblurring with consistency model . Complex & Intelligent Systems ， 12 （ 1 ）： 29 ， 2026 .

Zhendong Wang ， Xiaodong Cun ， Jianmin Bao ， Wengang Zhou ， Jianzhuang Liu ， and Houqiang Li . Uformer： A general u-shaped transformer for image restoration . In Proceedings of the IEEE/CVF conference on CVPR ， pages 17683 – 17693 ， 2022 .

Zheng Chen ， Yulun Zhang ， Ding Liu ， Jinjin Gu ， Linghe Kong ， Xin Yuan ， et al . Hierarchical integration diffusion model for realistic image deblurring . Advances in NeurIPS ， 36 ： 29114 – 29125， 2023 .

Zheng Dong ， Ke Xu ， Yin Yang ， Hujun Bao ， Weiwei Xu ， and Rynson WH Lau . Location-aware single image reflection removal . In Proceedings of the IEEE/CVF ICCV ， pages 5017 – 5026 ， 2021 .

Zhengyang Lu ， Weifan Wang ， Tianhao Guo ， and Feng Wang . Single-image reflection removal via self-supervised diffusion models . The Journal of Supercomputing ， 81 （ 1 ）： 338 ， 2025 .

Zhenqi Fu ， Yan Yang ， Xiaotong Tu ， Yue Huang ， Xinghao Ding ， and Kai-Kuang Ma . Learning a simple low-light image enhancer from paired low-light instances . In Proceedings of the IEEE/CVF conference on CVPR ， pages 22252 – 22261 ， 2023 .

Zhishe Wang ， Yanlin Chen ， Wenyu Shao ， Hui Li ， and Lei Zhang . Swinfuse： A residual swin transformer fusion network for infrared and visible images . IEEE Transactions on Instrumentation and Measurement ， 71 ： 1 – 12， 2022 .

Zhou Wang ， Alan C Bovik ， Hamid R Sheikh ， and Eero P Simoncelli . Image quality assessment： from error visibility to structural similarity . IEEE transactions on image processing ， 13 （ 4 ）： 600 – 612， 2004 .

Ziwei Luo ， Fredrik K Gustafsson ， Zheng Zhao ， Jens Sjölund ， and Thomas B Schön . Image restoration with mean-reverting stochastic differential equations . In Proceedings of the 40th International Conference on Machine Learning ， pages 23045 – 23066 ， 2023 .

Zixiang Zhao ， Haowen Bai ， Jiangshe Zhang ， Yulun Zhang ， Kai Zhang ， Shuang Xu ， Dongdong Chen ， Radu Timofte ， and Luc Van Gool . Equivariant multi-modality image fusion . In Proceedings of the IEEE/CVF conference on CVPR ， pages 25912 – 25921 ， 2024 .

Zixiang Zhao ， Haowen Bai ， Jiangshe Zhang ， Yulun Zhang ， Shuang Xu ， Zudi Lin ， Radu Timofte ， and Luc Van Gool . Cddfuse： Correlation-driven dual-branch feature decomposition for multi-modality image fusion . In Proceedings of the IEEE/CVF conference on CVPR ， pages 5906 – 5916 ， 2023 .

Zixiang Zhao ， Haowen Bai ， Yuanzhi Zhu ， Jiangshe Zhang ， Shuang Xu ， Yulun Zhang ， Kai Zhang ， Deyu Meng ， Radu Timofte ， and Luc Van Gool . Ddfm： denoising diffusion model for multi-modality image fusion . In Proceedings of the IEEE/CVF ICCV ， pages 8082 – 8093 ， 2023 .

Zixiang Zhao ， Shuang Xu ， Jiangshe Zhang ， Chengyang Liang ， Chunxia Zhang ， and Junmin Liu . Efficient and model-based infrared and visible image fusion via algorithm unrolling . IEEE Transactions on Circuits and Systems for Video Technology ， 32 （ 3 ）： 1186 – 1196， 2021 .

Ziyi Shen ， Wenguan Wang ， Xiankai Lu ， Jianbing Shen ， Haibin Ling ， Tingfa Xu ， and Ling Shao . Human-aware motion deblurring . In Proceedings of the IEEE/CVF ICCV ， pages 5572 – 5581 ， 2019 .
