Members
-
gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Python ★ 6.2k 10mo agoExplain → -
torchtune
PyTorch native post-training library
Python ★ 5.8k 17h agoExplain → -
captum
Model interpretability and understanding for PyTorch
Python ★ 5.7k 3d agoExplain → -
botorch
Bayesian optimization in PyTorch
Jupyter Notebook ★ 3.6k 10h agoExplain → -
torchrec
Pytorch domain library for recommendation systems
Python ★ 2.6k 3h agoExplain → -
opacus
Training PyTorch models with differential privacy
Python ★ 1.9k 11d agoExplain → -
tnt
A lightweight library for PyTorch training tools and utilities
Python ★ 1.7k 13d agoExplain → -
segment-anything-fast
A batched offline inference oriented version of segment-anything
Python ★ 1.3k 10mo agoExplain → -
data
A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.
Python ★ 1.3k 17h agoExplain → -
attention-gym
Helpful tools and examples for working with flex-attention
Python ★ 1.2k 29d agoExplain → -
torchcodec
PyTorch media decoding and encoding
Python ★ 1.1k 17h agoExplain → -
monarch
PyTorch Single Controller
Rust ★ 1.1k 3h agoExplain → -
LeanRL
LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.
Python ★ 694 10mo agoExplain → -
torchforge
PyTorch-native post-training at scale
Python ★ 689 17h agoExplain → -
torchft
Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)
Python ★ 514 7h agoExplain → -
KernelAgent
Autonomous GPU Kernel Generation & Optimization via Deep Agents
Python ★ 456 20d agoExplain → -
torchx
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.
Python ★ 426 1h agoExplain → -
torchcomms
torchcomms: a modern PyTorch communications API
C++ ★ 373 9m agoExplain → -
tritonbench
Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.
Python ★ 359 3h agoExplain → -
applied-ai
Applied AI experiments and examples for PyTorch
Python ★ 323 10mo agoExplain → -
torcheval
A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to facilitate metric computation in distributed training and tools for PyTorch model evaluations.
Python ★ 248 1mo agoExplain → -
workshops
This is a repository for all workshop related materials.
Jupyter Notebook ★ 240 2y agoExplain → -
hydra-torch
Configuration classes enabling type-safe PyTorch configuration for Hydra apps
Python ★ 230 3mo agoExplain → -
float8_experimental ▣
This repository contains the experimental PyTorch native float8 training UX
Python ★ 226 1y agoExplain → -
tritonparse
TritonParse: A Compiler Tracer, Visualizer, and Reproducer for Triton Kernels
Python ★ 212 3h agoExplain → -
multipy
torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters in a single C++ process.
C++ ★ 179 6mo agoExplain → -
torchsnapshot
A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind.
Python ★ 164 16d agoExplain → -
BackendBench
Ship correct and fast LLM kernels to PyTorch
Python ★ 151 5mo agoExplain → -
torchfix ▣
TorchFix - a linter for PyTorch-using code with autofix support
Python ★ 151 10mo agoExplain → -
csprng
Cryptographically secure pseudorandom number generators for PyTorch
Batchfile ★ 127 3mo agoExplain → -
MSLK
MSLK (Meta Superintelligence Labs Kernels) is a collection of PyTorch GPU operator libraries that are designed and optimized for GenAI training and inference, such as FP8 row-wise quantization and collective communications.
Python ★ 112 5h agoExplain → -
kraken
Triton-based Symmetric Memory operators and examples
Python ★ 103 1mo agoExplain → -
tlparse
TORCH_TRACE parser for PT2
Rust ★ 87 1mo agoExplain → -
autoparallel
An experimental implementation of compiler-driven automatic sharding of models across a given device mesh.
Python ★ 84 11h agoExplain → -
executorch-examples
Example apps and demos using PyTorch's ExecuTorch framework
Kotlin ★ 80 5h agoExplain → -
torchstore
A storage solution for PyTorch tensors with distributed tensor support.
Python ★ 80 3h agoExplain → -
tokenizers
C++ implementations for various tokenizers (sentencepiece, tiktoken etc).
C++ ★ 50 1d agoExplain → -
triton-cpu
An experimental CPU backend for Triton (https//github.com/openai/triton)
C++ ★ 48 10mo agoExplain → -
superblock ▣
A block oriented training approach for inference time optimization.
Python ★ 34 1y agoExplain → -
export-python
Conveniently export torch.compile compiled products into self-contained Python files
Python ★ 33 20d agoExplain → -
spmd_types
This module defines a type system for distributed training code, based off of JAX's sharding in types, but adapted for the PyTorch ecosystem.
Python ★ 32 1d agoExplain → -
FACTO
Framework for Algorithmic Correctness Testing of Operators
Python ★ 16 3mo agoExplain → -
remat
torch_remat fine-grained activation checkpointing API
Python ★ 13 17d agoExplain → -
chakra_replay
Chakra et_replay is a tool designed for replaying Chakra Execution Traces (ET) from machine learning models using PyTorch.
Python ★ 13 16d agoExplain → -
popcorn-kernels
For building the world's largest dataset of GPU kernels.
Python ★ 10 15d agoExplain → -
torchvision-extra-decoders
An extension of `torchvision` for decoding AVIF and HEIC images.
C++ ★ 9 17h agoExplain → -
monarch-kubernetes
Monarch CRD and operator for native kubernetes support
Go ★ 8 1mo agoExplain → -
compile-graph-break-site
This repository contains the source code for a static website that provides documentation for each "Graph Break" identified by a Graph Break ID (GBID).
Python ★ 7 2h agoExplain → -
OpenEnvZoo
A curated collection of OpenEnv environments, versioned against OpenEnv core
★ 4 4mo agoExplain → -
torch-release-notes
Staging ground for pytorch/pytorch maintainers to collaborate on release notes for PyTorch releases. Houses the artifacts that chronicle each release.
Python ★ 3 7h agoExplain → -
ethos-u-core-driver-mirror ▣
This is a mirror of https//git.mlplatform.org/ml/ethos-u/ethos-u-core-driver.git/
C++ ★ 2 1y agoExplain → -
AccelKernels
The accel_kernels repository is under pytorch-labs, which houses experimental GPU and accelerator kernels implemented in Triton or other DSLs, for SoTA algorithms.
Python ★ 1 9mo agoExplain → -
skills
User facing skill definitions for PyTorch users
Python ★ 0 17d agoExplain → -
bsdp
A data parallelism that shards a model block-wise to support optimizers like Shampoo and Muon
Python ★ 0 7mo agoExplain → -
meta-pytorch.github.io
The website for Meta-PyTorch.org
Python ★ 0 3mo agoExplain → -
test-infra
No description.
★ 0 9mo agoExplain → -
tosa_serialization_lib-mirror ▣
This is a mirror of https//git.mlplatform.org/tosa/serialization_lib.git/
C++ ★ 0 1y agoExplain →
No repos match these filters.