I am a doctoral student majoring in Software Engineering at University College London, co-supervised by Dr. He Ye and Prof. Federica Sarro. Iβm currently engaged in research related to deep learning and software engineering. My research interests mainly focus on fields such as computer vision, natural language processing, and the application of AI Agents.
In the past few years, I have focused on developing new deep learning algorithms to solve practical problems, particularly making progress in multi-modal generation. I believe that artificial intelligence technology can bring positive changes to society and am committed to combining theoretical research with practical applications.
My research interest includes multimodal learning, cross-domain transfer and AI for SSE.
π Publications

MultiModal Large Language Model with RAG Strategies in Soccer Commentary Generation
Xiang Li, Shuaishuai Zu, Kevin Zhang, et al.

SCBench: A Sports Commentary Benchmark for Video LLMs
Kuangzhi Ge, Lingjun Chen, Kevin Zhang, Yulin Luo, Tianyu Shi, Liaoyuan Fan, Xiang Li, Guanqun Wang, Shanghang Zhang

MCRE: Multimodal Conditional Representation and Editing for Text Motion Generation
Tengjiao Sun, Xiang Li, Tianyu Shi, et al.

Uniform Text-Motion Generation and Editing via Diffusion Model.
Ruoyu Wang, Xiang Li(Co-first author), Tengjiao Sun, et al.

Haiyu Zhou, Xiang Li(Co-first author), Yufeng Jiang, et al.
π Educations and Work Experience
- 2025.10 - present, University College London, Software Engineering, PhD Student
- 2024.08 - 2025.06, Southern University of Science and Technology, Statistics, Research Assistant.
- 2022.07 - 2024.06, Li Auto Inc., Algorithm Engineer.
- 2019.09 - 2022.06, Guangxi University, Computer Science and Technology, Masterβs Degree.
- 2014.09 - 2018.06, Zhengzhou University, Biology Engineering, Bachelorβs Degree.