AITemplate

AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. It is specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
