6-day longest streak
Audio-Reasoner ★ PINNED
The first Large Audio Language Model to enable native in-depth thinking, trained on large-scale audio Chain-of-Thought data.
Python ★ 297 1y ago
Mega-ASR ★ PINNED
First foundation ASR built for the real world - 7 atomic acoustic conditions, 54 compound scenarios, 2.6M samples, and up to ~30% gains over SOTA where every other model falls apart. **You'll come back to MEGA-ASR, after the rest fail in the wild. ⭐**
Python ★ 978 9d ago
Mini-Omni-Reasoner ★ PINNED
Mini-Omni-Reasoner: a real-time speech reasoning framework that interleaves silent reasoning tokens with spoken response tokens (“thinking-in-speaking”), exploiting the LLM–audio throughput gap to keep speech fluent and low-latency while maintaining structured internal reasoning.
★ 165 9mo ago
Pask ★ PINNED
Towards Self-Evolving Proactive AI with Perpetual Memory
Python ★ 197 1mo ago
Audio-Interaction
No description.
Python ★ 352 8d ago
Voices-in-the-Wild-Bench
No description.
Python ★ 24 21d ago
xzf-thu.github.io
I'm Xie Zhifei
★ 0 2d ago
MMRC
Measuring Massive-Computational Math Reasoning with Code in LLMs
HTML ★ 0 7mo ago