Biography
🔭 I’m currently a Ph.D. student at Tsinghua University, fortunate to work closely with Dr. Chenjia Bai and Prof. Chongjie Zhang. I received my Bachelor’s degree in Automation from Tsinghua University, advised by Prof. Li Li. See my homepage for the full publication list.
Research Interests
✨ I aim to develop a general world model that can empower agents with intelligent, generalizable and interpretable decision-making capabilities. To this end, I mainly focus on:
- Reinforcement Learning and its applications in the real world
- Cooperative Multi-Agent Reinforcement Learning
- Generative Modeling in RL, especially Transformer and Diffusion Model
- Foundation models for reasoning and decision-making (i.e. Embodied AI)
- Building Efficient World Models
😄 I'm open to any kind of collaboration.
😊 Feel free to contact me by email ([email protected]) or WeChat (YangZhang9470).
- MARIE (pinned)
  Official Implementation of "Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models" (accepted at Transactions on Machine Learning Research, TMLR)
  Python · ★ 18 · 6mo ago
- DIMA (pinned)
  [NIPS'25] Official Implementation of "Revisiting Multi-Agent World Modeling from a Diffusion-Inspired Perspective" in PyTorch.
  Python · ★ 15 · 7mo ago
- TACO (pinned)
  Official Implementation of "Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach"
  Python · ★ 39 · 2mo ago
- mamba-smax
  MAMBA with smax evaluation
  Python · ★ 2 · 4mo ago
- breez3young
  No description.
  ★ 1 · 6mo ago
- dsrl_pi0_dev (fork)
  Official implementation for pi0 steering via DSRL, Steering Your Diffusion Policy with Latent Space Reinforcement Learning (CoRL 2025)
  Python · ★ 1 · 7mo ago
- molmospaces (fork)
  An end-to-end open ecosystem for robot learning
  Python · ★ 0 · 14d ago
- research-template (fork)
  An ML research template with good documentation by Boyuan Chen, an MIT PhD student
  ★ 0 · 1y ago
- flash-attention (fork)
  Fast and memory-efficient exact attention
  Python · ★ 0 · 2mo ago
- claude-code (fork)
  An independent Python feature port of Claude Code, entirely rewritten from scratch using oh-my-codex. For educational purposes only.
  Rust · ★ 0 · 2mo ago
- claude-code-sourcemap (fork)
  No description.
  ★ 0 · 2mo ago
- breez3young.github.io
  Personal homepage of Yang Zhang
  JavaScript · ★ 0 · 6mo ago
- BEHAVIOR-1K-eval (fork)
  BEHAVIOR-1K: a platform for accelerating Embodied AI research. Join our Discord for support: https://discord.gg/bccR5vGFEx
  Python · ★ 0 · 7mo ago
- openpi-reason (fork)
  No description.
  Python · ★ 0 · 9mo ago
- RoboTwin (fork)
  RoboTwin 2.0 Official Repo
  ★ 0 · 11mo ago
- diamond (fork)
  DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
  Python · ★ 0 · 1y ago
- Paper-List
  Records the waiting/done list of papers to read
  ★ 0 · 1y ago
- daydreamer (fork)
  Variant of DayDreamer: World Models for Physical Robot Learning
  ★ 0 · 3y ago
- perceiver-io (fork)
  A PyTorch implementation of Perceiver, Perceiver IO and Perceiver AR with PyTorch Lightning scripts for distributed training
  ★ 0 · 3y ago
- iris (fork)
  Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.
  Python · ★ 0 · 2y ago
- minerl (fork)
  MineRL Competition for Sample Efficient Reinforcement Learning - Python Package
  ★ 0 · 2y ago
- tdmpc (fork)
  Code for "Temporal Difference Learning for Model Predictive Control"
  ★ 0 · 3y ago
- transformers (fork)
  🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
  ★ 0 · 3y ago