Richard Zhuang

Current: Stanford MSCSđŸŒČ, Prev: UC Berkeley CS + Applied MathđŸ», Research Intern at Bespoke Labs

profile-photo.jpg
profile-photo-2.jpg

(Last Updated: 2026.06)

Welcome to my personal space! I am currently a first-year Master’s student in Computer Science at Stanford University, where I am a core contributor of the OpenThoughts-Agent project with Prof. Ludwig Schmidt, working on data recipe for post-training agents. Before Stanford, I graduated from UC Berkeley double majoring in Applied Math and Computer Science. During my time at Cal, I researched on LLM routing (EmbedLLM) with Jiantao Jiao and Tianhao Wu, as well as LLM + Game (PokerBench) with Akshat Gupta. I have also interned at Bespoke Labs in Spring 2025 where I worked on enhancing tool-use capability of LLM agents through RL (blog).

I’m broadly interested in understanding and improving the capabilities of Large Language Models (LLMs) in a data-centric way. Specifically, I’m intrigued by how certain data “foster” skills that are essential for LLM agents (e.g. reasoning and planning). I have also had a long-standing passion in Sports Analytics.

Outside the realm of AI, you will usually find me playing basketball🏀 or immersing myself in Chinese Hip-hop musicđŸ”„.

News

Jun 25, 2026 Announcing OpenThoughts-Agent and OpenThinkerAgent-32B — the strongest Qwen-3-based open-data agentic model for terminal use and coding, reaching 44.8% average accuracy across seven agentic benchmarks. We openly share the full stack: paper, model, data, and code. Read the X thread for the highlights.

Selected Work

  1. open_thoughts_agent.jpg
    OpenThoughts-Agent: Data Recipes for Agentic Models (500K+ views on X)
    Negin Raoof*, Richard Zhuang*, Marianna Nezhurina*, Etash Guha*, and 46 more authors
    2026
  2. rl_logo.jpg
    Improving Multi-Turn Tool Use with Reinforcement Learning (200K+ Views on X)
    Richard Zhuang*, Trung Vu*, Alex Dimakis, and 1 more author
    2025
  3. embedllm_logo.png
    EmbedLLM: Learning Compact Representations of Large Language Models (ICLR 2025 Spotlight🌟)
    Richard Zhuang, Tianhao Wu, Zhaojin Wen, and 3 more authors
    In The Thirteenth International Conference on Learning Representations (ICLR) , 2025
  4. pokerbench_logo.jpg
    PokerBench: Training Large Language Models to become Professional Poker Players (AAAI 2025)
    Richard Zhuang, Akshat Gupta, Richard Yang, and 3 more authors
    In The 39th Annual AAAI Conference on Artificial Intelligence (AAAI) , 2025