sagemaker-hyperpod-cluster-setup
Python
★ 10
updated 2d ago
This repository provides setup assets to create Amazon SageMaker HyperPod clusters orchestrated with either Slurm or Amazon EKS. These clusters help you quickly scale model development tasks such as training, fine-tuning, or inference across a cluster of hundreds or thousands of AI accelerators.
No plain-English explanation yet — one is being written right now. Check back in a minute.