gitmyhub

fast-molmobot

Python ★ 0 updated 2mo ago

6.5x speedup to MolmoBot inference code (graph capture, torch compile, flash attention, kv caching)

No plain-English explanation yet — one is being written right now. Check back in a minute.