-->

Xiaoyan Zhang


I am a senior student in the Artificial Intelligence College at Anhui University, expecting to graduate in 2025. I currently serve as a research assistant at the Anhui Provincial International Joint Research Center for Advanced Technology in Medical Imaging, under the guidance of Prof. Zhe Jin and Dr. Xingbo Dong. In the summer of 2023, I had the opportunity to visit Nanyang Technological University, Singapore, where I completed a summer course on "Machine Learning & Deep Learning Methodologies". I am also honored to be working as a research assistant at the Video and Image Processing Laboratory (VIPER) at Purdue University, starting in 2024, under the supervision of Prof. Fengqing Maggie Zhu.

My research interests include computer vision and its applications to medical imaging, continual learning, pattern recognition, and compression. I am always open to exploring new research areas and welcome potential collaborations. Currently, I am applying for Ph.D. positions for Fall 2025. Please feel free to contact me for any inquiries or collaboration opportunities.

Email: wa2114214 AT stu D0t ahu DOt edu Dot cn

CV Google Scholar

Xiaoyan Zhang Picture

Education

PU
Purdue University 2024-present
Intern in Electrical and Computer Engineering
Advisor: Prof. Fengqing Maggie Zhu
NTU
Nanyang Technological UniversityJuly 2023-August 2023
2023 Summer School Program
Theme: Machine Learning & Deep Learning Methodologies
AHU
Anhui University 2021-2025
Bachelor of Engineering in Artificial Intelligence
GPA: 4.33/5.00, Rank: 1/251
Advisor: Prof. Zhe Jin and Dr. Xingbo Dong

Publications

Project image
Learning Frequency and Structure in UDA for Medical Object Detection
Zhang Xiaoyan*, Liwen Wang*, Guannan He, Ying Tan, Shengli Li, Bin Pu, Zhe Jin, Wen Sha, Xingbo Dong.
Chinese Conference on Pattern Recognition and Computer Vision (PRCV), 2024
Paper /
@InProceedings{10.1007/978-981-97-8496-7_36, author="Wang, Liwen and Zhang, Xiaoyan and He, Guannan and Tan, Ying and Li, Shengli and Pu, Bin and Jin, Zhe and Sha, Wen and Dong, Xingbo",
                    editor="Lin, Zhouchen and Cheng, Ming-Ming and He, Ran and Ubul, Kurban and Silamu, Wushouer and Zha, Hongbin and Zhou, Jie and Liu, Cheng-Lin",
                    title="Learning Frequency and Structure in UDA for Medical Object Detection",
                    booktitle="Pattern Recognition and Computer Vision",
                    year="2025",
                    publisher="Springer Nature Singapore",
                    address="Singapore",
                    pages="518--532",
                    abstract="In medical imaging applications, particularly in cardiac and skeletal analysis, the anatomical structure detection is crucial for diagnosing cardiac disease and other disease. However, the domain gap between images acquired from different sources or modalities poses a significant challenge and impedes model generalization across diverse patient populations and imaging conditions. Bridging this gap is particularly essential in image-based diagnosis, where subtle variations in anatomical structures and imaging characteristics can profoundly impact diagnostic performance. Take fetal cardiac ultrasound images as an example, this paper proposes a novel method for unsupervised domain adaptive fetal cardiac structure detection. The method integrates both the frequency-based distributional properties and anatomical structural information inherent in medical images. Specifically, we introduce a Frequency Distribution Alignment (FDA) module and an Organ Structure Alignment (OSA) module to mitigate detection misalignment across different hospital settings. We demonstrates the effectiveness of these modules through extensive experiments. Our method significantly improves the performance of fetal cardiac structure detection tasks, enabling adaptation to diverse hospital scenarios and showcasing its potential in addressing domain gaps in medical imaging.",
                    isbn="978-981-97-8496-7"
                    }

We proposed a novel method for unsupervised domain adaptive fetal cardiac structure detection to address the domain gap between different sources or modalities in medical imaging.
Project image
Validating Privacy-Preserving Face Recognition under a Minimum Assumption
Zhang Hui, Dong Xingbo, Lai YenLung, Zhou Ying, Zhang Xiaoyan, Lv Xingguo, Jin Zhe, Li Xuejun.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
Paper / Code /
@inproceedings{zhang2024validating,
                    title={Validating Privacy-Preserving Face Recognition under a Minimum Assumption},
                    author={Zhang, Hui and Dong, Xingbo and Lai, YenLung and Zhou, Ying and Zhang, Xiaoyan and Lv, Xingguo and Jin, Zhe and Li, Xuejun},
                    booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
                    pages={12205--12214},
                    year={2024},
                    publisher={CVPR}
                  }

We proposed the Map2V, a novel privacy validation method using deep image priors and zeroth-order gradient estimation for privacy-preserving face recognition (PPFR) schemes. Map2V not only exposes the vulnerabilities of SOTA PPFRs but can also be employed to validate the privacy protection capacity of naive FR and PPFR systems.
Project image
Long-Tailed Continual Learning For Visual Food Recognition
Jiangpeng He, Xiaoyan Zhang, Luotao Lin, Jack Ma, Heather A. Eicher-Miller, Fengqing Zhu.
IEEE Transactions on Multimedia (TMM)
Paper /
@misc{he2025longtailedcontinuallearningvisual,
                        title={Long-Tailed Continual Learning For Visual Food Recognition}, 
                        author={Jiangpeng He and Xiaoyan Zhang and Luotao Lin and Jack Ma and Heather A. Eicher-Miller and Fengqing Zhu},
                        year={2025},
                        eprint={2307.00183},
                        archivePrefix={arXiv},
                        primaryClass={cs.CV},
                        url={https://arxiv.org/abs/2307.00183}, 
                  }

We provide the VFN186 dataset, along with a novel framework that enhances generalization on long-tailed food distribution using knowledge distillation and a CAM-CutMix-based augmentation technique.
Project image
MFP3D: Monocular Food Portion Estimation Leveraging 3D Point Clouds (Oral)
Jinge Ma, Xiaoyan Zhang, Gautham Vinod, Siddeshwar Raghavan, Jiangpeng He, Fengqing Zhu.
International Conference on Pattern Recognition (ICPR) MADiMa workshop, 2024
Paper /
@article{ma2024mfp3d,
                      title={MFP3D: Monocular Food Portion Estimation Leveraging 3D Point Clouds},
                      author={Ma, Jinge and Zhang, Xiaoyan and Vinod, Gautham and Raghavan, Siddeshwar and He, Jiangpeng and Zhu, Fengqing},
                      journal={arXiv preprint arXiv:2411.10492},
                      year={2024}
                  }

We propose MFP3D, a new framework for accurate food portion estimation from a single image. It uses 3D reconstruction, feature extraction, and deep learning to estimate food volume and energy content, showing improved accuracy on the MetaFood3D dataset compared to existing approaches.
Project image
Single Source Domain Generalization for Palm Biometrics
Congcong Jia, Xingbo Dong, Yen Lung Lai, Andrew Beng Jin Teoh, Ziyuan Yang, Xiaoyan Zhang, Liwen Wang, Zhe Jin, Lianqiang Yang.
Pattern Recognition
Paper /
@article{JIA2025111620,
                    title = {Single source domain generalization for palm biometrics},
                    journal = {Pattern Recognition},
                    volume = {165},
                    pages = {111620},
                    year = {2025},
                    issn = {0031-3203},
                    doi = {https://doi.org/10.1016/j.patcog.2025.111620},
                    url = {https://www.sciencedirect.com/science/article/pii/S0031320325002808},
                    author = {Congcong Jia and Xingbo Dong and Yen Lung Lai and Andrew Beng Jin Teoh and Ziyuan Yang and Xiaoyan Zhang and Liwen Wang and Zhe Jin and Lianqiang Yang},
                    keywords = {Palmprint recognition, Single source domain generalization, Open-set recognition, Low-level frequencies, Histogram matching},
                   }
                  }

We proposed to improve palmprint recognition by addressing the challenge of single-source domain generalization through data alignment techniques: Fourier Alignment Transform and Histogram Matching.

Teaching Experience

  • 24 Fall: ZJ52014 Introduction to Artificial Intelligence, Undergraduate Teaching Assistant Anhui University
  • 24 Fall: ZX52340 Java Technology and Its Application (Practice), Undergraduate Teaching Assistant Anhui University
  • May 23 -- May 24: Guoyuan Dream Plan, Academic Peer Mentor Anhui University

Awards

  • Outstanding Graduate of Anhui Province, Ministry of Education of Anhui Province 2025
  • National Scholarship, Ministry of Education of the People's Republic of China 2024
  • Pacemaker to Merit Student, Anhui University 2022, 2023, 2024
  • Merit Student, Anhui University 2022, 2023, 2024
  • Song Qingling Future Grant for Discipline Focus Students (0.05%), The China Song Qingling Foundation 2024
  • Excellent Student Scholarship, Anhui University 2023
  • Academic Science and Technology Scholarship, Anhui University 2023
  • National Encouragement Scholarship, Ministry of Education of Anhui Province 2022