Welcome! My name is Han Fang, I work at TeleAI (China Telecom) as a Senior Research Engineer now in Beijing.
I am now working on video-text retrieval, video understanding, text-to-image generation, and multimodal large language models. If you are seeking any form of academic cooperation, please feel free to email me at fanghan1996@outlook.com. We are hiring interns!
I obtained my Master’s degree in Information and Communication Engineering from BUPT in 2022 and Bachelor’s degree in Telecommunications Engineering with Management from BUPT in 2019.
My research interest includes face recognition, video/image-text understanding, and text-to-image generation. I have published 10+ papers at the top international AI journals and conferences such as TMM, ECCV, AAAI, ACM MM, and ICME.
🔥 News
- 2025.11: 🎉🎉 One paper is accepted by AAAI 2026.
- 2025.09: 🎉🎉 One paper is accepted by ICDAR 2025.
- 2025.04: 🎉🎉 One paper is accepted by IEEE Transactions on Affective Computing 2025.
- 2024.12: 🎉🎉 Two papers are accepted by ICASSP 2025.
- 2024.12: 🎉🎉 One paper is accepted by AAAI 2025.
- 2024.07: 🎉🎉 One paper is accepted by ACM MM 2024.
- 2024.03: 🎉🎉 Two papers are accepted by ICME 2024 (oral presentation).
- 2023.07: 🎉🎉 Two papers are accepted by ACM MM 2023.
- 2022.12: 🎉🎉 Our paper about video-text retrieval (CLIP2Video) is accedpted by TMM 2022.
📝 Publications

Trusted Unified Feature-Neighborhood Dynamics for Multi-View Classification Haojian Huang, Chuanyu Qin, Zhe Liu, Kaijing Ma, Jin Chen, Han Fang, Chao Ban, Hao Sun, Zhongjiang He Project

Transferring Image-CLIP to Video-Text Retrieval via Temporal Relations Han Fang, Pengfei Xiong, Luhui Xu, Wenhan Luo Project

MLFW: A Database for Face Recognition on Masked Faces Chengrui Wang, Han Fang, Yaoyao Zhong, Weihong Deng Dataset
AAAI 2026Adaptive Evidential Learning for Temporal-Semantic Robustness in Moment Retrieval, Haojian Huang, Kaijing Ma, Jin Chen, Haodong Chen, Zhou Wu, Xianghao Zang, Han Fang, et al.ICDAR 2025SelectVision: Adaptive Vision Resolution Selection for Visual Document Understanding, Zhongjiang He, An Zhao, Ye Yuan, Han Fang, et al.TAFFC 2025DDL: Dynamic Direction Learning for Semi-Supervised Facial Expression Recognition, Yaqi Li, Jing Jiang, Yuhang Zhang, Han Fang, et al.ICASSP 2025FASTER: Face Attribute Sliders with Semantic Rewards, Jingyan Chen, Lanxiang Zhou, Han Fang, et al.ICASSP 2025ViCo: A Multitask Video-enhanced and Cognition-preserving Modality Alignment Training Framework, Zhenda Yu, Jin Chen, Jiayu Shen, Lanxiang Zhou, Han Fang, et al.AAAI 2025Trusted Unified Feature-Neighborhood Dynamics for Multi-View Classification, Haojian Huang, Chuanyu Qin, Zhe Liu, Kaijing Ma, Jin Chen, Han Fang, et al.ACM MM 2024GOAL: Grounded text-to-image Synthesis with Joint Layout Alignment Tuning, Yaqi Li, Han Fang, et al.ICME 2024(Oral) ProTA: Probabilistic Token Aggregation for Text-Video Retrieval, Han Fang, et al.ICME 2024(Oral) Disentangle and Denoise: Tackling Context Misalignment for Video Moment Retrieval, Kaijing Ma, Han Fang, et al.ICCVW 2023Alignment and Generation Adapter for Efficient Video-text Understanding, Han Fang, et al.ICCVW 2023LLaViLo: Boosting Video Moment Retrieval via Adapter-Based Multimodal Modeling, Kaijing Ma, Han Fang, et al.ACM MM 2023Mask to Reconstruct: Cooperative Semantics Completion for Video-text Retrieval, Han Fang, et al.ACM MM 2023A Baseline Investigation: Transformer-based Cross-view Baseline for Text-based Person Search, Xianghao Zang, Wei Gao, Ge Li, Han Fang, et al.TMM 2022Transferring image-clip to video-text retrieval via temporal relations, Han Fang, et al.CCBR 2022Mlfw: A database for face recognition on masked faces, ChengRui Wang, Han Fang, et al.FG 2021(Oral) Augmented Face Representation Learning via Transitive Distillation, Han Fang, et al.TMM 2021Dynamic training data dropout for robust deep face recognition, Yaoyao Zhong, Han Fang, et al.ICASSP 2021Adaptive Re-Balancing Network with Gate Mechanism for Long-Tailed Visual Question Answering, Hongyu Chen, Ruifang Liu, Han Fang, et al.ECCV 2020Generate to adapt: Resolution adaption network for surveillance face recognition, Han Fang, et al.CVPRW 2020Triple-GAN: Progressive face aging with triple translation loss, Han Fang, et al.IGTA 2018Semantic Segmentation of Aerial Image Using Fully Convolutional Network, Junli Yang, Yiran Jiang, Han Fang, et al.
🎖 Honors and Awards
- 2019, 2022 Beijing Excellent Graduate Award (Top 1%).
- 2019.05 Beijing Excellent Bachelor Dissertation Award (Top 3%).
- 2016, 2017, 2018, 2019, 2020, 2021 First-Class Scholarship of Beijing University of Posts and Telecommunications.
📖 Educations
- 2019.09 - 2022.06, Master, Beijing University Of Posts And Telecommunications, Beijing.
- 2015.09 - 2019.06, Undergraduate, Beijing University Of Posts And Telecommunications and Queen Mary University of London, Beijing.
💻 Internships
- 2021.03 - 2021.09, PCG, Tencent, Beijing.
- 2020.12 - 2021.02, MIG, SenseTime, Beijing.