I am currently a lecturer at the School of Computer Science and Engineering at Huizhou University. I received the B.Sc. degree in electronic information science and technology from Sun Yat-sen University, Guangzhou, China, and the B.Eng. degree in electronic engineering from the Chinese University of Hong Kong, Hong Kong, China, both in 2016, and the M.Eng. degree in electrical engineering and the M.Sc. degree in computer and information technology from the University of Pennsylvania, Philadelphia, USA, in 2019. I am also a Ph.D. candidate in computer science at the IPPRLab, University of Macau, Macao, China, and the Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, co-supervised by Prof. Chi-Man Pun and Prof. Shuqiang Wang.
Film, a classic image style, is culturally significant to the whole photographic industry since it marks the birth of photography. However, film photography is time-consuming and expensive, necessitating a more efficient method for collecting film-style photographs. The numerous datasets that have emerged in the field of image enhancement so far are not film-specific. To facilitate film-based image stylization research, we construct FilmSet, a large-scale and high-quality film style dataset. Our dataset includes three different film types and more than 5000 in-the-wild high-resolution images. Inspired by the characteristics of FilmSet images, we propose a novel framework called FilmNet, based on the Laplacian pyramid, for stylizing images across frequency bands and achieving film-style results. Experiments reveal that the performance of our model is superior to state-of-the-art techniques. Our dataset and code are available at https://github.com/CXH-Research/FilmNet.
@inproceedings{Li:2023,author={Li, Zinuo and Chen, Xuhang and Wang, Shuqiang and Pun, Chi-Man},booktitle={Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI)},pages={1160-1168},title={A Large-Scale Film Style Dataset for Learning Multi-frequency Driven Film Enhancement},year={2023},publisher={International Joint Conferences on Artificial Intelligence Organization},address={Macao, China},doi={10.24963/IJCAI.2023/129},}
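To make the multi-frequency idea concrete, below is a minimal PyTorch sketch of Laplacian pyramid decomposition and reconstruction, the operation FilmNet builds its band-wise stylization on. The blur kernel, pyramid depth, and function names are illustrative assumptions rather than the paper's implementation; in FilmNet each band would be processed by a learned sub-network before the pyramid is collapsed.

```python
import torch
import torch.nn.functional as F

def _blur(x):
    # Depthwise 5x5 binomial (Gaussian-like) blur, one filter per channel.
    k = torch.tensor([1., 4., 6., 4., 1.], device=x.device)
    k = torch.outer(k, k)
    k = (k / k.sum()).view(1, 1, 5, 5).repeat(x.shape[1], 1, 1, 1)
    return F.conv2d(x, k, padding=2, groups=x.shape[1])

def laplacian_pyramid(img, levels=3):
    """Split an image into band-pass detail layers plus a low-frequency residual."""
    bands, current = [], img
    for _ in range(levels):
        down = F.interpolate(_blur(current), scale_factor=0.5,
                             mode="bilinear", align_corners=False)
        up = F.interpolate(down, size=current.shape[-2:],
                           mode="bilinear", align_corners=False)
        bands.append(current - _blur(up))   # high/mid-frequency band at this scale
        current = down
    bands.append(current)                   # coarsest low-frequency residual
    return bands

def reconstruct(bands):
    """Collapse the pyramid; band-wise stylization would happen before this step."""
    img = bands[-1]
    for band in reversed(bands[:-1]):
        img = F.interpolate(img, size=band.shape[-2:],
                            mode="bilinear", align_corners=False)
        img = _blur(img) + band
    return img

x = torch.rand(1, 3, 256, 256)
# Decomposition followed by reconstruction is lossless up to floating point error.
assert torch.allclose(reconstruct(laplacian_pyramid(x)), x, atol=1e-5)
```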
Vignetting commonly occurs as an image degradation resulting from factors such as lens design, improper lens hood usage, and limitations in camera sensors. This degradation affects image details and color accuracy and poses challenges for computational photography. Existing vignetting removal algorithms predominantly rely on idealized physical assumptions and hand-crafted parameters, resulting in ineffective removal of irregular vignetting and suboptimal results. Moreover, the substantial lack of real-world vignetting datasets hinders the objective and comprehensive evaluation of vignetting removal. To address these challenges, we present VigSet, a pioneering dataset for vignetting removal. VigSet includes 983 pairs of vignetting and vignetting-free high-resolution (over 4K) real-world images captured under various conditions. In addition, we introduce DeVigNet, a novel frequency-aware Transformer architecture designed for vignetting removal. Through Laplacian pyramid decomposition, we propose the Dual Aggregated Fusion Transformer to handle global features and remove vignetting in the low-frequency domain. Additionally, we propose the Adaptive Channel Expansion Module to enhance details in the high-frequency domain. The experiments demonstrate that the proposed model outperforms existing state-of-the-art methods. The code, models, and dataset are available at https://github.com/CXH-Research/DeVigNet.
@inproceedings{Luo:2024,title={Devignet: High-Resolution Vignetting Removal via a Dual Aggregated Fusion Transformer with Adaptive Channel Expansion},booktitle={Proceedings of the AAAI Conference on Artificial Intelligence (AAAI)},author={Luo, Shenghong and Chen, Xuhang and Chen, Weiwen and Li, Zinuo and Wang, Shuqiang and Pun, Chi-Man},year={2024},publisher={Association for the Advancement of Artificial Intelligence},address={Vancouver, Canada},pages={4000-4008},doi={10.1609/AAAI.V38I5.28193},}
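The sketch below illustrates, under assumptions, one way an adaptive channel-expansion block could enhance a high-frequency band: expand the channels, re-weight them with squeeze-and-excitation style channel attention, and project back through a residual connection. The class name, expansion ratio, and layer choices are hypothetical and only meant to convey the idea, not DeVigNet's actual module.

```python
import torch
import torch.nn as nn

class ChannelExpansionBlock(nn.Module):
    """Illustrative sketch: expand channels, gate them with channel attention,
    then project back. The exact DeVigNet module may differ."""
    def __init__(self, channels, expansion=4):
        super().__init__()
        hidden = channels * expansion
        self.expand = nn.Conv2d(channels, hidden, kernel_size=1)
        self.depthwise = nn.Conv2d(hidden, hidden, kernel_size=3,
                                   padding=1, groups=hidden)
        # Squeeze-and-excitation style attention over the expanded channels.
        self.attn = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(hidden, hidden // 4, kernel_size=1),
            nn.GELU(),
            nn.Conv2d(hidden // 4, hidden, kernel_size=1),
            nn.Sigmoid(),
        )
        self.project = nn.Conv2d(hidden, channels, kernel_size=1)

    def forward(self, x):
        y = self.depthwise(self.expand(x))
        y = y * self.attn(y)          # re-weight the expanded channels
        return x + self.project(y)    # residual connection preserves input detail

block = ChannelExpansionBlock(channels=32)
high_freq = torch.rand(1, 32, 128, 128)   # e.g. a high-frequency Laplacian band
print(block(high_freq).shape)             # torch.Size([1, 32, 128, 128])
```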
Shadows often occur when we capture documents with casual equipment, which degrades the visual quality and readability of the digital copies. Unlike algorithms for natural shadow removal, algorithms for document shadow removal need to preserve the details of fonts and figures in high-resolution input. Previous works ignore this problem and remove shadows via approximate attention and small datasets, which might not work in real-world situations. We handle high-resolution document shadow removal directly via a larger-scale real-world dataset and a carefully designed frequency-aware network. For the dataset, we acquire over 7k pairs of high-resolution (2462 × 3699) real-world document images with various samples under different lighting conditions, which is 10 times larger than existing datasets. For the network design, we decouple the high-resolution images in the frequency domain, where the low-frequency details and high-frequency boundaries can be effectively learned via the carefully designed network structure. Powered by our network and dataset, the proposed method clearly outperforms previous methods in terms of visual quality and numerical results. The code, models, and dataset are available at https://github.com/CXH-Research/DocShadow-SD7K.
@inproceedings{Li:2024,author={Li, Zinuo and Chen, Xuhang and Pun, Chi-Man and Cun, Xiaodong},booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},pages={12449-12458},title={High-Resolution Document Shadow Removal via A Large-Scale Real-World Dataset and A Frequency-Aware Shadow Erasing Net},year={2023},publisher={IEEE},address={Paris, France},doi={10.1109/ICCV51070.2023.01144},}
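A minimal sketch of the frequency decoupling idea, assuming a simple bilinear low/high split and placeholder convolutional branches (the paper's actual frequency-aware network is more elaborate): shadows and illumination are corrected on a heavily downsampled low-frequency image, while fonts and boundaries are refined at full resolution from the high-frequency residual.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def small_cnn(in_ch, out_ch, width=32):
    # Placeholder backbone; stands in for the carefully designed sub-networks.
    return nn.Sequential(
        nn.Conv2d(in_ch, width, 3, padding=1), nn.ReLU(inplace=True),
        nn.Conv2d(width, width, 3, padding=1), nn.ReLU(inplace=True),
        nn.Conv2d(width, out_ch, 3, padding=1),
    )

class FrequencyDecoupledRemoval(nn.Module):
    """Illustrative two-branch design: shadow/illumination correction on a
    downsampled low-frequency image, detail refinement at full resolution."""
    def __init__(self, scale=4):
        super().__init__()
        self.scale = scale
        self.low_branch = small_cnn(3, 3)    # global illumination / color correction
        self.high_branch = small_cnn(6, 3)   # sees the residual and the coarse result

    def forward(self, x):
        h, w = x.shape[-2:]
        low = F.interpolate(x, scale_factor=1 / self.scale,
                            mode="bilinear", align_corners=False)
        high = x - F.interpolate(low, size=(h, w),
                                 mode="bilinear", align_corners=False)
        low_out = self.low_branch(low)       # shadow removed at low resolution
        coarse = F.interpolate(low_out, size=(h, w),
                               mode="bilinear", align_corners=False)
        return coarse + self.high_branch(torch.cat([high, coarse], dim=1))

model = FrequencyDecoupledRemoval()
doc = torch.rand(1, 3, 512, 384)   # a (downscaled) document photo with shadows
print(model(doc).shape)            # torch.Size([1, 3, 512, 384])
```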
Specular highlights are a common issue in images captured under direct light sources. They are caused by the reflection of light sources on the surface of objects, which can lead to overexposure and loss of detail. Existing methods for specular highlight removal often rely on hand-crafted features and heuristics, which limits their effectiveness. In this paper, we propose a dual-hybrid attention network for specular highlight removal. The network consists of two branches: a spatial attention branch and a channel attention branch. The spatial attention branch focuses on the spatial distribution of specular highlights, while the channel attention branch emphasizes the importance of different channels. The two branches are combined to form a dual-hybrid attention network, which effectively removes specular highlights while preserving image details. Experimental results show that the proposed network outperforms state-of-the-art methods in terms of both visual quality and quantitative metrics.
@inproceedings{Guo:2024,title={Dual-Hybrid Attention Network for Specular Highlight Removal},author={Guo, Xiaojiao and Chen, Xuhang and Luo, Shenghong and Wang, Shuqiang and Pun, Chi-Man},booktitle={Proceedings of the ACM International Conference on Multimedia (MM)},year={2024},address={Melbourne, VIC, Australia},publisher={ACM},pages={10173--10181},doi={10.1145/3664647.3680745},}
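Below is a hedged PyTorch sketch of a dual attention block with parallel spatial and channel branches, as the abstract describes; the pooling choices, kernel sizes, and fusion layer are CBAM-style illustrative assumptions rather than the paper's exact dual-hybrid design.

```python
import torch
import torch.nn as nn

class SpatialAttention(nn.Module):
    """Predicts a per-pixel gate, e.g. to focus on highlight regions."""
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(2, 1, kernel_size=7, padding=3)

    def forward(self, x):
        pooled = torch.cat([x.mean(dim=1, keepdim=True),
                            x.amax(dim=1, keepdim=True)], dim=1)
        return x * torch.sigmoid(self.conv(pooled))

class ChannelAttention(nn.Module):
    """Predicts a per-channel gate emphasising informative feature maps."""
    def __init__(self, channels, reduction=8):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(channels, channels // reduction, 1), nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, 1), nn.Sigmoid(),
        )

    def forward(self, x):
        return x * self.mlp(x)

class DualAttentionBlock(nn.Module):
    """Illustrative fusion of the two branches with a 1x1 convolution."""
    def __init__(self, channels):
        super().__init__()
        self.spatial = SpatialAttention()
        self.channel = ChannelAttention(channels)
        self.fuse = nn.Conv2d(2 * channels, channels, kernel_size=1)

    def forward(self, x):
        return x + self.fuse(torch.cat([self.spatial(x), self.channel(x)], dim=1))

feat = torch.rand(1, 64, 64, 64)
print(DualAttentionBlock(64)(feat).shape)   # torch.Size([1, 64, 64, 64])
```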
Document images are often degraded by various stains, significantly impacting their readability and hindering downstream applications such as document digitization and analysis. The absence of a comprehensive stained document dataset has limited the effectiveness of existing document enhancement methods in removing stains while preserving fine-grained details. To address this challenge, we construct StainDoc, the first large-scale, high-resolution (2145 × 2245) dataset specifically designed for document stain removal. StainDoc comprises over 5,000 pairs of stained and clean document images across multiple scenes. The dataset encompasses a diverse range of stain types, severities, and document backgrounds, facilitating robust training and evaluation of document stain removal algorithms. Furthermore, we propose StainRestorer, a Transformer-based document stain removal approach. StainRestorer employs a memory-augmented Transformer architecture that captures hierarchical stain representations at the part, instance, and semantic levels via the DocMemory module. The Stain Removal Transformer (SRTransformer) leverages these feature representations through a dual attention mechanism: an enhanced spatial attention with an expanded receptive field, and a channel attention that captures channel-wise feature importance. This combination enables precise stain removal while preserving document content integrity. Extensive experiments demonstrate StainRestorer's superior performance over state-of-the-art methods on the StainDoc dataset and its variants StainDoc_Mark and StainDoc_Seal, establishing a new benchmark for document stain removal. Our work highlights the potential of memory-augmented Transformers for this task and contributes a valuable dataset to advance future research.
@inproceedings{Li:2025,title={High-Fidelity Document Stain Removal via A Large-Scale Real-World Dataset and A Memory-Augmented Transformer},author={Li, Mingxian and Sun, Hao and Lei, Yingtie and Zhang, Xiaofeng and Dong, Yihang and Zhou, Yilin and Li, Zimeng and Chen, Xuhang},booktitle={Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},year={2025},publisher={IEEE},address={Tucson, AZ, USA},}
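The following is a minimal sketch of memory-augmented attention, assuming a single flat learnable memory bank queried by image tokens via cross-attention; the paper's DocMemory is hierarchical (part, instance, and semantic levels), and the dimensions and class name here are arbitrary illustrations.

```python
import torch
import torch.nn as nn

class MemoryAugmentedAttention(nn.Module):
    """Illustrative sketch: image tokens query a learned memory of stain
    prototypes through cross-attention, then the retrieved features are
    added back residually."""
    def __init__(self, dim=64, memory_slots=256, heads=4):
        super().__init__()
        self.memory = nn.Parameter(torch.randn(memory_slots, dim))  # learned prototypes
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm = nn.LayerNorm(dim)

    def forward(self, tokens):                 # tokens: (B, N, dim) flattened patches
        mem = self.memory.unsqueeze(0).expand(tokens.size(0), -1, -1)
        out, _ = self.attn(query=tokens, key=mem, value=mem)
        return self.norm(tokens + out)         # retrieved memory refines the features

layer = MemoryAugmentedAttention()
patches = torch.rand(2, 32 * 32, 64)           # a 32x32 patch grid with 64-dim embeddings
print(layer(patches).shape)                    # torch.Size([2, 1024, 64])
```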