Yuchen Zhu

Machine Learning PhD @ Georgia Tech 🍀

prof_pic.jpg

140 Skiles

686 Cherry St NW

Atlanta, GA 30332

Hi, I am Yuchen Zhu, a 3rd year Machine Learning PhD at Georgia Tech.

My interests lie in the broad aspects of GenAI, wich applications in vision, language, and sciences (e.g. single cell genomics, protein sciences, etc). My research currently focuses on diffusion models/flow-based methods (especially discrete diffusion and diffusion LLMs), multimodal foundation models (such as VLMs, Multimodal LLMs), and their RL post-training algorithms.

At Georgia Tech, I am fortunate to be advised by Molei Tao and Yongxin Chen, and working with a group of incredible researchers. Prior to that, I graduated with MA in Statistics from Yale University and BS in Honors Mathematics with highest honor from NYU Shanghai. During those time, I had the privilege to work with Zhuoran Yang and Mathieu Laurière on theory/numerics of RL and mean-field system.

I am actively looking for PhD research internship opportunities for Summer 2026. Feel free to reach out if you think there might be a good fit! You can find my CV here.

Contact: yzhu738 [at] gatech [dot] edu


Updates
[10/2025] DMPO is online! Check out a completely new RL paradigm that uniquely leverages characteristics of diffusion LLMs for effective post-training! Code is available at here.
[10/2025] TR2-D2 is online! Discover the SOTA method for finetuning MDM for bio seqs design with tree search + off-policy RL!
[10/2025] PDNS is online! See our new upgrade of MDNS with additional proximal gradient steps!
[09/2025] MDNS and Discrete Fast Solvers got accepted to NeurIPS 2025, see you in San Diego!
[08/2025] MDNS is online! Come find out our new work on ways to doing RL with masked discrete diffusion!
[06/2025] Mimicking or Reasoning is public! Check out our new work on evaluating MM-ICL for VLM reasoners!
[05/2025] Learning to Stop and Diffuse Everything got accepted to ICML 2025, see you in Vancouver!
[04/2025] I wrote a new blog on how group structures aid generative modeling of manifold data.
[01/2025] TDM and STEM got accepted to ICLR 2025, see you in Singapore!

Talks
[03/2026] INFORMS Optimization Society Conference 2026
[09/2025] GT ML Student Conference
[08/2025] MolSS Reading Group
[11/2024] GT ML Student Seminar
[10/2024] SIAM MDS 2024
[04/2024] Southeast ACM Student Workshop 2024

Selected Publications

(* Equal contribution, Alphabetical order)
  1. dmpo.png
    Enhancing Reasoning for Diffusion LLMs via Distribution Matching Policy Optimization
    Yuchen Zhu*, Wei Guo*, Jaemoo Choi, Petr Molodyk, Bo Yuan, Molei Tao, and Yongxin Chen
    Preprint, 2025
  2. mdns.png
    MDNS: Masked Diffusion Neural Sampler via Stochastic Optimal Control
    Yuchen Zhu*, Wei Guo*, Jaemoo Choi, Guan-Horng Liu, Yongxin Chen, and Molei Tao
    Advances in Neural Information Processing Systems (NeurIPS), 2025
  3. tr2d2.png
    TR2-D2: Tree Search Guided Trajectory-Aware Fine-Tuning for Discrete Diffusion
    Sophia Tang*, Yuchen Zhu*, Molei Tao, and Pranam Chatterjee
    Preprint, 2025
  4. fast-solver.png
    Fast Solvers for Discrete Diffusion Models: Theory and Applications of High-Order Algorithms
    Yinuo Ren*, Haoxuan Chen*, Yuchen Zhu*, Wei Guo*, Yongxin Chen, Grant M Rotskoff, Molei Tao, and Lexing Ying
    Advances in Neural Information Processing Systems (NeurIPS), 2025
  5. diffuse-everything.jpg
    Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces
    Kevin Rojas*, Yuchen Zhu*, Sichen Zhu, Felix Ye, and Molei Tao
    International Conference on Machine Learning (ICML), 2025
  6. diffusion-gene-expression.png
    Diffusion Generative Modeling for Spatially Resolved Gene Expression Inference from Histology Images
    Sichen Zhu*, Yuchen Zhu*, Molei Tao, and Peng Qiu
    International Conference on Learning Representations (ICLR), 2025