Ethan (Yusheng) Su

@yushengsu-thu ·California, USA ·yushengsu-thu.github.io

By the community, for the community.

42 repos
202 followers
58 following

Python 25%
Shell 25%
Jupyter Notebook 17%
Lua 8%
Vim Script 8%

About Me Learning in LLMs and MLsys, recently focused on RL training <!-- My primary work and research focus on RL: Design better architectures and algorithms to conduct efficient/effective RL…

All public repos (42)

Show forks Show archived

sglang-miles-hand-on

No description.

Jupyter Notebook ★ 4 1mo ago
Explain →
torch_memory_saver ⑂

Allow torch tensor memory to be released and resumed later

Python ★ 2 6mo ago
Explain →
PET_Scaling

Exploring the Impact of Model Scaling on Parameter-efficient Tuning Methods

Python ★ 2 1y ago
Explain →
miles ⑂

No description.

Python ★ 1 17h ago
Explain →
lora_perf_lora_profile

LoRA vs no-LoRA perf benchmarks + torch profiles (GB300 dev runs)

Python ★ 1 4d ago
Explain →
tune-lora-perf

No description.

Shell ★ 1 6d ago
Explain →
sglang ⑂

SGLang is a fast serving framework for large language models and vision language models.

Python ★ 1 2d ago
Explain →
lora-dev-script

No description.

Shell ★ 1 27d ago
Explain →
yushengsu-thu.github.io ⑂

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

JavaScript ★ 1 1mo ago
Explain →
miles-dev-script

No description.

Python ★ 1 2mo ago
Explain →
claude-vim-IDE

No description.

Lua ★ 1 2mo ago
Explain →
hack-vimrc

my vim configure

Vim Script ★ 1 2mo ago
Explain →
Megatron-Bridge ⑂

HuggingFace conversion and training library for Megatron-based models

Python ★ 1 2mo ago
Explain →
yushengsu-thu

No description.

★ 1 4mo ago
Explain →
verl ⑂

verl: Volcano Engine Reinforcement Learning for LLMs

Python ★ 1 11mo ago
Explain →
Pai-Megatron-Patch-amd_version ⑂

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python ★ 1 11mo ago
Explain →
Megatron-LM-amd_version ⑂

Ongoing research training transformer models at scale

Python ★ 1 11mo ago
Explain →
Awesome-ML-SYS-Tutorial ⑂

My learning notes/codes for ML SYS.

Python ★ 1 1y ago
Explain →
FacebookChatBot

No description.

JavaScript ★ 1 3y ago
Explain →
miles_dev

No description.

★ 0 18h ago
Explain →
merge_ci

No description.

Shell ★ 0 25d ago
Explain →
LeetCUDA ⑂

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

★ 0 4mo ago
Explain →
lm-evaluation-harness ⑂

A framework for few-shot evaluation of language models.

★ 0 4mo ago
Explain →
tilelang ⑂

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

★ 0 4mo ago
Explain →
slime ⑂

slime is an LLM post-training framework for RL Scaling.

Python ★ 0 5mo ago
Explain →
nano-sglang ⑂

No description.

★ 0 8mo ago
Explain →
mbridge ⑂

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python ★ 0 8mo ago
Explain →
batch_invariant_ops ⑂

No description.

★ 0 9mo ago
Explain →
llm.c ⑂

LLM training in simple, raw C/CUDA

Cuda ★ 0 1y ago
Explain →
Megatron-LLM ⑂

distributed trainer for LLMs

Python ★ 0 1y ago
Explain →
verl_training_log

verl_training_log

★ 0 1y ago
Explain →
datasciencecoursera ⑂

for Data Science class on Coursera

★ 0 6y ago
Explain →
LunarVim ⑂

An IDE layer for Neovim with sane defaults. Completely free and community driven.

★ 0 3y ago
Explain →
lua-nvim-config ⑂

No description.

★ 0 3y ago
Explain →
ProKil ⑂

No description.

★ 0 3y ago
Explain →
nvimdots ⑂

A well configured and structured Neovim.

★ 0 3y ago
Explain →
ModelCenter ⑂

Efficient, Low-Resource, Distributed transformer implementation based on BMTrain

Python ★ 0 4y ago
Explain →
PromptPapers ⑂

Must-read papers on prompt-based tuning for pre-trained language models.

★ 0 4y ago
Explain →
House-Renting ⑂

No description.

HTML ★ 0 8y ago
Explain →
nccucs-se2016-dapp-example ⑂

105學年度國立政治大學資訊科學系軟體工程 DApp 範例

JavaScript ★ 0 9y ago
Explain →
Click_Through_Rate_Prediction

Predict (User pattern): Predict every user's Commercial Click-through-rate with random forest model and kdd-naggle cvs data

Jupyter Notebook ★ 0 9y ago
Explain →
OOP_assignment4_RPG_Game

RPG_Game

C++ ★ 0 8y ago
Explain →

Ethan (Yusheng) Su

About Me

Working on open-source Projects:

Maintain and work on:

Contact:

Github Stats:

All public repos (42)