fast-molmobot
Python
★ 0
updated 2mo ago
6.5x speedup to MolmoBot inference code (graph capture, torch compile, flash attention, kv caching)
No plain-English explanation yet — one is being written right now. Check back in a minute.