
Mini-Omni-Reasoner

★ 165 updated 9mo ago

Mini-Omni-Reasoner is a real-time speech reasoning framework that interleaves silent reasoning tokens with spoken response tokens (“thinking-in-speaking”). It exploits the LLM–audio throughput gap to keep speech fluent and low-latency while maintaining structured internal reasoning.
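The interleaving idea can be illustrated with a toy sketch. This is not the project's implementation: the function names `interleave` and `spoken_only`, the `"think"`/`"speak"` labels, and the default chunk sizes are all hypothetical, chosen only to show how a single token stream might alternate silent reasoning tokens with spoken ones while only the spoken ones reach the audio output.

```python
from itertools import islice
from typing import Iterable, Iterator, List, Tuple


def interleave(
    reasoning: Iterable[str],
    response: Iterable[str],
    n_reason: int = 2,  # hypothetical chunk size for silent tokens
    n_speak: int = 1,   # hypothetical chunk size for spoken tokens
) -> Iterator[Tuple[str, str]]:
    """Yield (kind, token) pairs, alternating reasoning and response chunks."""
    r_it, s_it = iter(reasoning), iter(response)
    while True:
        r_chunk = list(islice(r_it, n_reason))
        s_chunk = list(islice(s_it, n_speak))
        if not r_chunk and not s_chunk:
            return  # both streams are exhausted
        for tok in r_chunk:
            yield ("think", tok)  # silent: never sent to the speech output
        for tok in s_chunk:
            yield ("speak", tok)  # spoken: forwarded to the speech output


def spoken_only(stream: Iterable[Tuple[str, str]]) -> List[str]:
    """Keep only the tokens a listener would actually hear."""
    return [tok for kind, tok in stream if kind == "speak"]


stream = list(interleave(["r1", "r2", "r3"], ["s1", "s2"]))
heard = spoken_only(stream)
```

In this sketch the listener hears only `s1` and `s2`, while the reasoning tokens occupy the slots in between. The assumption behind the design is that a model can emit tokens faster than audio is played back, so the spare slots can carry reasoning without stalling speech.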