CARE ★ PINNED
CARE: Covariance-Aware and Rank-Enhanced Decomposition for Enabling Multi-Head Latent Attention
Python ★ 7 25d ago
OSCAR ★ PINNED
OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization
Python ★ 488 4d ago
RUD
Repeat Until Done
Python ★ 4 8d ago
FutureMLS-Lab.github.io
No description.
HTML ★ 0 11h ago