Mini-Omni-Reasoner
★ 165
updated 9mo ago
Mini-Omni-Reasoner: a real-time speech reasoning framework that interleaves silent reasoning tokens with spoken response tokens (“thinking-in-speaking”), exploiting the LLM–audio throughput gap to keep speech fluent and low-latency while maintaining structured internal reasoning.
No plain-English explanation yet — one is being written right now. Check back in a minute.