Publications

Check the latest through Google Scholar.

View All publications

Open-Source Systems

  1. auto-deep-researcher-banner.png
    Technical Report
    Deep Researcher Agent: An Autonomous Framework for 24/7 Deep Learning Experimentation with Zero-Cost Monitoring
    Xiangyue Zhang
    arXiv preprint arXiv:2604.05854, 2026

2026

  1. MACE-Dance.png
    SIGGRAPH 2026
    MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation
    Kaixing Yang, Jiashu Zhu, Xulong Tang, Ziqiao Peng, Xiangyue Zhang, Puwei Wang, Jiahong Wu, Xiangxiang Chu, Hongyan Liu, and Jun He
    ACM Transactions on Graphics (SIGGRAPH), 2026
  2. PersonaGesture.png
    arXiv 2026
    PersonaGesture: Single-Reference Co-Speech Gesture Personalization for Unseen Speakers
    Xiangyue Zhang, Yiyi Cai, Kunhang Li, Kaixing Yang, You Zhou, Zhengqing Li, Xuangeng Chu, Jiaxu Zhang, and Haiyang Liu
    arXiv, 2026
  3. DynMask.jpg
    arXiv 2026
    Not All Frames Are Equal: Complexity-Aware Masked Motion Generation via Motion Spectral Descriptors
    Pengfei Zhou*, Xiangyue Zhang*, Xukun Shen, and Yong Hu†
    arXiv preprint arXiv:2603.29655, 2026
  4. SemTalk.png
    AAAI 2026
    Mitigating Error Accumulation in Co-Speech Motion Generation via Global Rotation Diffusion and Multi-Level Constraints
    Xiangyue Zhang*, Jianfang Li*†, Jianqiang Ren, and Jiaxu Zhang
    Annual AAAI Conference on Artificial Intelligence (AAAI), 2026
  5. FlowerDance.gif
    arXiv 2025
    FlowerDance: MeanFlow for Efficient and Refined 3D Dance Generation
    Kaixing Yang*, Xulong Tang*, Ziqiao Peng*, Xiangyue Zhang, Puwei Wang, Jun He, and Hongyan Liu
    arXiv preprint arXiv:2511.21029, 2025

2025

  1. SemTalk.png
    ICCV 2025
    SemTalk: Holistic Co-speech Motion Generation with Frame-level Semantic Emphasis
    Xiangyue Zhang*, Jianfang Li*, Jiaxu Zhang, Ziqiang Dang, Jianqiang Ren, Liefeng Bo, and Zhigang Tu†
    International Conference on Computer Vision (ICCV), 2025
  2. EchoMask.png
    ACM MM 2025
    EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation
    Xiangyue Zhang*, Jianfang Li*, Jiaxu Zhang, Jianqiang Ren, Liefeng Bo, and Zhigang Tu†
    ACM International Conference on Multimedia (ACM MM), 2025
  3. 2D3_SkelAct.png
    T-CSVT 2025
    Robust 2D Skeleton Action Recognition via Decoupling and Distilling 3D Latent Features
    Xiangyue Zhang*, Yifan Jia*, Jiaxu Zhang, Yijie Yang, and Zhigang Tu†
    IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT), 2025