gitmyhub

DualPipe

Python ★ 3.0k updated 5mo ago

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

No plain-English explanation yet — one is being written right now. Check back in a minute.