gitmyhub

cutlass

C++ · ★ 9.9k · updated 16h ago

CUDA Templates and Python DSLs for High-Performance Linear Algebra
