TransformerEngine
Python
★ 0
updated 2y ago
⑂ fork
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in both training and inference.
No plain-English explanation yet — one is being written right now. Check back in a minute.