Xiangyue ZHANG

M.Sc. student in Computer Applications, Wuhan University

my_portrait4_1m.jpg

Hi there 👋. Welcome to Xiangyue ZHANG (章湘粤) Page. I’m an incoming Ph.D. student in Mechano-Informatics at The University of Tokyo, supervised by Prof. Tatsuya Harada. I’m also a final-year M.Sc. at HAVPR Lab(Human Activity & Visual Perception Research Lab) at Wuhan University, supervised by Prof. Zhigang Tu. My research interests mainly focus on Embodied AI, 3D&2D motion generation, and World Model. Currently I’m a research intern at Galbot, working on Vision-Language-Action Modeling for Humanoid Robots, supervised by Dr. Jinlu Zhang and Prof. Yi Li.



If you want to get to know me faster, here is my CV !

I’m also looking for a remote or on-site internship/visiting position. If you are interested in my research, please feel free to contact me.

News

Mar 16, 2026
🤖 Joined Galbot as a Research Intern, working on Vision-Language-Action Modeling for Humanoid Robots.
Nov 8, 2025
🎉 GlobalDiff was accepted by AAAI 2026.
Jul 5, 2025
🎉 EchoMask was accepted by ACM MM 2025.
Jun 26, 2025
🎉 SemTalk was accepted by ICCV 2025.
Apr 16, 2025
🎉 One paper has been accepted by IEEE T-CSVT.
Dec 22, 2024
🥂 Our band performed successfully at the Hua Young Music Festival! Cheers! Check More to see pictures.


Selected Publications

  1. GlobalDiff.png
    AAAI 2026
    Mitigating Error Accumulation in Co-Speech Motion Generation via Global Rotation Diffusion and Multi-Level Constraints
    Xiangyue Zhang*, Jianfang Li*†, Jianqiang Ren, and Jiaxu Zhang
    Annual AAAI Conference on Artificial Intelligence (AAAI), 2026
  2. SemTalk.png
    ICCV 2025
    SemTalk: Holistic Co-speech Motion Generation with Frame-level Semantic Emphasis
    Xiangyue Zhang*, Jianfang Li*, Jiaxu Zhang, Ziqiang Dang, Jianqiang Ren, Liefeng Bo, and Zhigang Tu†
    International Conference on Computer Vision (ICCV), 2025
  3. EchoMask.png
    ACM MM 2025
    EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation
    Xiangyue Zhang*, Jianfang Li*, Jiaxu Zhang, Jianqiang Ren, Liefeng Bo, and Zhigang Tu†
    ACM International Conference on Multimedia (ACM MM), 2025
  4. 2D3_SkelAct.png
    T-CSVT 2025
    Robust 2D Skeleton Action Recognition via Decoupling and Distilling 3D Latent Features
    Xiangyue Zhang*, Yifan Jia*, Jiaxu Zhang, Yijie Yang, and Zhigang Tu†
    IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT), 2025


Experience & Education

Galbot
2026.03 - now, Beijing
Research Intern in Vision-Language-Action Modeling for Humanoid Robots.
Advisor: Dr. Jinlu Zhang and Prof. Yi Li
ByteDance
2025.12 - 2026.03, Shenzhen
Research Intern in Intelligent Creation Team.
Advisor: Youjiang Xu
Working on Large Streaming Motion Generation Model.
Alibaba
2024.06 - 2025.11, Hangzhou
Research Intern in Tongyi Lab at Alibaba Group.
Advisor: Dr. Liefeng Bo and Dr. Jianfang Li
Working on Co-speech Motion Generation.
The University of Tokyo
Expected 2026.10 - 2029, Tokyo
Ph.D. Student in Mechano-Informatics, IST.
Advisor: Prof. Tatsuya Harada
Wuhan University
2023.09 - Present, Wuhan
Master Student in LIEMSARS.
Research Advisor: Prof. Zhigang Tu
Central South University
2019.09 - 2023.06, Changsha
I received my B.S Degree of Geomatics in 2023.
Minored in Data Science and Big Data Technology.


Awards & Honors

OCT, 2025
National Scholarship (top 3%)
JUN, 2023
Outstanding Graduate Award