Members
-
openfang
Open-source Agent Operating System
Rust ★ 18k 1mo agoExplain → -
picolm
Run a 1-billion parameter LLM on a $10 board with 256MB RAM
C ★ 1.7k 3mo agoExplain → -
autokernel
Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.
Python ★ 1.4k 3mo agoExplain → -
qwen3.5-triton
Pure Triton kernels for Qwen3.5-27B inference on NVIDIA B200
Python ★ 115 3mo agoExplain → -
rightnow-cli
Claude Code for CUDA. Free AI assistant that actually understands GPU architecture
Python ★ 110 8mo agoExplain → -
RightNow-GPU-Database
Comprehensive GPU specifications database with 2,824 GPUs across NVIDIA, AMD, and Intel
★ 87 5mo agoExplain → -
AutoMegaKernel
An agent harness that compiles a model into one provably-correct, self-retargeting CUDA megakernel and self-tunes it past cuBLAS at batch-1 LLM decode, paper: https://arxiv.org/abs/2606.09682
Python ★ 67 3d agoExplain → -
tiny-tpu
Minimal TPU implementation with 8x8 systolic array and PyTorch integration
Python ★ 63 4mo agoExplain → -
ouroboros
Dynamic weight generation for recursive transformers via input-conditioned LoRA modulation
Python ★ 33 2mo agoExplain → -
TIDE
Dynamic per-token early exit for LLM inference. Skip layers tokens don't need
Python ★ 32 3mo agoExplain → -
StreamIndex
Memory-bounded compressed sparse attention via streaming top-k. Triton kernels for the DeepSeek-V4 lightning indexer. 32x regime extension on a single H200 | by RightNow https://www.rightnowai.co/
Python ★ 20 1mo agoExplain → -
RightNow-Tile
Open-source transpiler for CUDA Tile (13.1) migration
TypeScript ★ 20 6mo agoExplain → -
gpuci
GPU CI/CD tool that tests CUDA kernels across multiple GPUs in parallel - Part of RightNow
Python ★ 15 4mo agoExplain → -
gpu-profiler
Open-source web-based GPU performance visualization tool that transforms NVIDIA profiling data into interactive insights for CUDA engineers. Features timeline views, flame graphs, heatmaps, and AI-powered bottleneck detection.
TypeScript ★ 13 9mo agoExplain → -
hclsm
Hierarchical Causal Latent State Machines for Object-Centric World Modeling
Python ★ 7 2mo agoExplain → -
rightnow-arabic-llm-corpus
RightNow Arabic LLM Corpus - One of the largest high-quality Arabic text datasets for LLM training
★ 6 10mo agoExplain → -
runinfra-sdk
Official RunInfra SDK (TypeScript + Python) | optimized inference deployments
TypeScript ★ 3 2d agoExplain →
No repos match these filters.