gitmyhub

LPLB

Python ★ 504 updated 7mo ago

An early-research-stage expert-parallel load balancer for MoE models, based on linear programming.
