-
AdaptiveGEMM ⑂
AdaptiveGEMM: FP8 GEMM with Adaptation to Various Lengths of Group M
Cuda ★ 1 3mo agoExplain → -
acp-ui ⑂
A modern, cross-platform client for the Agent Client Protocol (ACP) on desktop, mobile, and the web — connect to any ACP-compatible AI agent (Claude, Codex, Copilot, Qwen, Gemini, OpenCode, OpenClaw and more)
★ 0 1mo agoExplain → -
lmdeploy ⑂
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Python ★ 0 2mo agoExplain → -
kimi-cli ⑂
Kimi Code CLI is your next CLI agent.
Python ★ 0 2mo agoExplain → -
xtuner ⑂
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Python ★ 0 17d agoExplain → -
deko3d ⑂
Homebrew low level graphics API for Nintendo Switch (Nvidia Tegra X1)
★ 0 4mo agoExplain → -
melonDS ⑂
DS emulator, sorta
★ 0 6mo agoExplain → -
mmengine ⑂
OpenMMLab Foundational Library for Training Deep Learning Models
★ 0 4mo agoExplain → -
DeepEP ⑂
DeepEP: an efficient expert-parallel communication library
Cuda ★ 0 6mo agoExplain → -
nputop ⑂
An interactive Ascend-NPU process viewer
Python ★ 0 7mo agoExplain → -
GroupedGEMM ⑂
PyTorch bindings for CUTLASS and CUBLAS Grouped GEMM.
Cuda ★ 0 5mo agoExplain → -
dlinfer ⑂
No description.
Python ★ 0 7mo agoExplain → -
accelerate ⑂
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Python ★ 0 2y agoExplain →
No repos match these filters.