Members
-
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Jupyter Notebook ★ 54k 1y agoExplain → -
faiss
A library for efficient similarity search and clustering of dense vectors.
C++ ★ 40k 9h agoExplain → -
detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Python ★ 35k 12d agoExplain → -
fairseq ▣
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python ★ 32k 8mo agoExplain → -
fastText ▣
Library for fast text representation and classification.
HTML ★ 27k 2y agoExplain → -
Detectron ▣
FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.
Python ★ 26k 2y agoExplain → -
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Jupyter Notebook ★ 23k 3mo agoExplain → -
sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Jupyter Notebook ★ 19k 20d agoExplain → -
detr ▣
End-to-End Object Detection with Transformers
Python ★ 15k 2y agoExplain → -
vggt
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
Python ★ 13k 1mo agoExplain → -
dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
Jupyter Notebook ★ 13k 16d agoExplain → -
AnimatedDrawings ▣
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
Python ★ 13k 9mo agoExplain → -
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Jupyter Notebook ★ 12k 2mo agoExplain → -
dinov3
Reference PyTorch implementation and models for DINOv3
Jupyter Notebook ★ 11k 4d agoExplain → -
sam3
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Python ★ 11k 4d agoExplain → -
xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Python ★ 11k 1d agoExplain → -
hydra
Hydra is a framework for elegantly configuring complex applications
Python ★ 10k 4m agoExplain → -
demucs ▣
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Python ★ 10k 2y agoExplain → -
nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
Python ★ 10k 1y agoExplain → -
pytorch3d
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
Python ★ 9.9k 6d agoExplain → -
pifuhd ▣
High-Resolution 3D Human Digitization from A Single Image.
Python ★ 9.7k 1y agoExplain → -
ImageBind
ImageBind One Embedding Space to Bind Them All
Python ★ 9.0k 6mo agoExplain → -
DiT ▣
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Python ★ 8.6k 2y agoExplain → -
mae ▣
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
Python ★ 8.3k 1y agoExplain → -
dino ▣
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Python ★ 7.6k 1y agoExplain → -
sam-3d-objects
SAM 3D Objects
Python ★ 7.0k 16d agoExplain → -
metaseq ▣
Repo for external large-scale work
Python ★ 6.5k 2y agoExplain → -
ConvNeXt ▣
Code release for ConvNeXt model
Python ★ 6.4k 3y agoExplain → -
Kats
Kats, a kit to analyze time series data, a lightweight, easy-to-use, generalizable, and extendable framework to perform time series analysis, from understanding the key statistics and characteristics, detecting change points and anomalies, to forecasting future trends.
Python ★ 6.3k 9d agoExplain → -
pytext ▣
A natural language modeling framework based on PyTorch
Python ★ 6.3k 3y agoExplain → -
mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Python ★ 5.6k 2d agoExplain → -
sapiens
High-resolution models for human tasks.
Python ★ 5.4k 23d agoExplain → -
moco
PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722
★ 5.1k 4mo agoExplain → -
AugLy
A data augmentations library for audio, image, text, and video.
Python ★ 5.1k 17d agoExplain → -
co-tracker
CoTracker is a model for tracking any point (pixel) on a video.
Jupyter Notebook ★ 5.0k 3mo agoExplain → -
flow_matching
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
Python ★ 4.5k 5mo agoExplain → -
nevergrad
A Python toolbox for performing gradient-free optimization
Python ★ 4.2k 3mo agoExplain → -
vjepa2
PyTorch code and models for VJEPA2 self-supervised learning from video.
Python ★ 4.2k 2mo agoExplain → -
esm ▣
Evolutionary Scale Modeling (esm): Pretrained language models for proteins
Python ★ 4.1k 2y agoExplain → -
VideoPose3D ▣
Efficient 3D human pose estimation in video using 2D keypoint trajectories
Python ★ 4.0k 3y agoExplain → -
jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
Python ★ 4.0k 1y agoExplain → -
encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Python ★ 4.0k 2y agoExplain → -
habitat-sim
A flexible, high-performance 3D simulator for Embodied AI research.
C++ ★ 3.7k 1mo agoExplain → -
map-anything
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Python ★ 3.5k 16d agoExplain → -
ijepa ▣
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."
Python ★ 3.4k 2y agoExplain → -
Mask2Former ▣
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
Python ★ 3.4k 1y agoExplain → -
sam-3d-body
The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the model.
Python ★ 3.3k 4mo agoExplain → -
MUSE ▣
A library for Multilingual Unsupervised or Supervised word Embeddings
Python ★ 3.2k 3y agoExplain → -
vggt-omega
[CVPR 2026 Oral] VGGT Omega
Python ★ 3.1k 1mo agoExplain → -
habitat-lab
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
Python ★ 3.0k 1mo agoExplain → -
Pearl
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.
Jupyter Notebook ★ 3.0k 9d agoExplain → -
tribev2
This repository contains the code to train and evaluate TRIBE v2, a multimodal model for brain response prediction
Jupyter Notebook ★ 2.9k 8d agoExplain → -
audio2photoreal ▣
Code and dataset for photorealistic Codec Avatars driven from audio
Python ★ 2.9k 1y agoExplain → -
omnilingual-asr
Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages
Python ★ 2.8k 5mo agoExplain → -
hiplot ▣
HiPlot makes understanding high dimensional data easy
TypeScript ★ 2.8k 2y agoExplain → -
HyperAgents
Self-referential self-improving agents that can optimize for any computable task
Python ★ 2.6k 1mo agoExplain → -
schedule_free
Schedule-Free Optimization in PyTorch
Python ★ 2.3k 1d agoExplain → -
perception_models
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
Jupyter Notebook ★ 2.3k 2mo agoExplain → -
fvcore
Collection of common code that's shared among different research projects in FAIR computer vision team.
Python ★ 2.2k 16d agoExplain → -
fairchem
FAIR Chemistry's library of machine learning methods for chemistry
Python ★ 2.2k 17h agoExplain → -
SparseConvNet ▣
Submanifold sparse convolutional networks
C++ ★ 2.1k 2y agoExplain → -
SentEval ▣
A python tool for evaluating the quality of sentence embeddings.
Python ★ 2.1k 2y agoExplain → -
ConvNeXt-V2 ▣
Code release for ConvNeXt V2 model
Python ★ 2.0k 1y agoExplain → -
blt
Code for BLT research paper
Python ★ 2.0k 7mo agoExplain → -
video-nonlocal-net ▣
Non-local Neural Networks for Video Classification
Python ★ 2.0k 4y agoExplain → -
ai4animationpy
A Python framework for AI-driven character animation using neural networks.
Python ★ 2.0k 18d agoExplain → -
TimeSformer ▣
The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"
Python ★ 1.9k 2y agoExplain → -
votenet ▣
Deep Hough Voting for 3D Object Detection in Point Clouds
Python ★ 1.8k 4y agoExplain → -
TransCoder ▣
Public release of the TransCoder research project https://arxiv.org/pdf/2006.03511.pdf
Python ★ 1.7k 4y agoExplain → -
coconut
Training Large Language Model to Reason in a Continuous Latent Space
Python ★ 1.6k 8d agoExplain → -
DomainBed ▣
DomainBed is a suite to test domain generalization algorithms
Python ★ 1.6k 6mo agoExplain → -
DeepSDF ▣
Learning Continuous Signed Distance Functions for Shape Representation
Python ★ 1.6k 4y agoExplain → -
MaskFormer ▣
Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)
Python ★ 1.5k 4y agoExplain → -
diplomacy_cicero ▣
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
Python ★ 1.4k 1y agoExplain → -
vggsfm
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Python ★ 1.4k 1y agoExplain → -
Replica-Dataset
The Replica Dataset v1 as published in https://arxiv.org/abs/1906.05797 .
C++ ★ 1.3k 1y agoExplain → -
ToMe ▣
A method to increase the speed and lower the memory footprint of existing vision transformers.
Python ★ 1.2k 2y agoExplain → -
House3D ▣
a Realistic and Rich 3D Environment
C++ ★ 1.2k 6y agoExplain → -
mixup-cifar10 ▣
mixup: Beyond Empirical Risk Minimization
Python ★ 1.2k 4y agoExplain → -
shumai
Fast Differentiable Tensor Library in JavaScript and TypeScript with Bun + Flashlight
TypeScript ★ 1.2k 1y agoExplain → -
watermark-anything
Official implementation of the paper "Watermark Anything with Localized Messages"
Jupyter Notebook ★ 1.1k 1y agoExplain → -
barlowtwins ▣
PyTorch implementation of Barlow Twins.
Python ★ 1.0k 4y agoExplain → -
EdgeTAM
[CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"
Jupyter Notebook ★ 934 4mo agoExplain → -
mobile-vision
Mobile vision models and code
Python ★ 920 2d agoExplain → -
sapiens2
1K resolution vision transformers pretrained on 1B human images.
Python ★ 808 26d agoExplain → -
projectaria_tools
projectaria_tools is an C++/Python open-source toolkit to interact with Project Aria data.
C++ ★ 799 7h agoExplain → -
ShapeR
Code for the ShapeR research paper
Python ★ 793 1mo agoExplain → -
ocean
Ocean is the in-house framework for Computer Vision (CV) and Augmented Reality (AR) applications at Meta. It is platform independent and is mainly implemented in C/C++.
C++ ★ 779 1h agoExplain → -
supervision-by-registration ▣
Supervision-by-Registration: An Unsupervised Approach to Improve the Precision of Facial Landmark Detectors
Python ★ 775 6y agoExplain → -
metamotivo
The first behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks.
Python ★ 770 1y agoExplain → -
ProgramBench
Can Language Models Rebuild Programs From Scratch?
Python ★ 769 22h agoExplain → -
sonata
[CVPR'25 Highlight] Official repository of Sonata: Self-Supervised Learning of Reliable Point Representations
Python ★ 753 1y agoExplain → -
balance
The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to some target population of interest.
Python ★ 749 1d agoExplain → -
audioseal
Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
Python ★ 741 1mo agoExplain → -
audiobox-aesthetics
Unified automatic quality assessment for speech, music, and sound.
Python ★ 733 1y agoExplain → -
MHR
Momentum Human Rig is an anatomically-inspired parametric full-body digital human model developed at Meta. It includes: A parametric body skeletal model; A realistic 3D mesh skinned to the skeleton with levels of detail;A body blendshape and pose corrective model; A facial blendshape model.Its design is friendly for both CG and CV communities.
Python ★ 721 8d agoExplain → -
tuna-2
Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation
Python ★ 717 10d agoExplain → -
eb_jepa
An open source library designed to provide community examples of Joint Embedding Predictive Architectures (JEPAs). It contains code and examples for learning representations from images, video, and action-conditioned video, as well as planning using JEPA-based models.
Python ★ 712 8d agoExplain → -
rebel ▣
An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.
C++ ★ 697 2y agoExplain → -
videoseal
Open and efficient video and image watermarking
Python ★ 682 1mo agoExplain → -
AudioMAE ▣
This repo hosts the code and models of "Masked Autoencoders that Listen".
Python ★ 668 2y agoExplain → -
EUPE
Efficient Universal Perception Encoder: a single on-device vision encoder with versatile representations that match or exceed specialized experts across multiple task domains.
Python ★ 666 2mo agoExplain → -
MCC ▣
Multiview Compressive Coding for 3D Reconstruction
Python ★ 661 3y agoExplain → -
pippo
Pippo: High-Resolution Multi-View Humans from a Single Image
Python ★ 646 8d agoExplain → -
nwm
Official code for the CVPR 2025 paper "Navigation World Models".
Python ★ 638 6mo agoExplain → -
BenchMARL
BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL allows to quickly compare different MARL algorithms, tasks, and models while being systematically grounded in its two core tenets: reproducibility and standardization.
Python ★ 632 4mo agoExplain → -
Ego4d
Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset
Jupyter Notebook ★ 608 1mo agoExplain → -
MLGym
MLGym A New Framework and Benchmark for Advancing AI Research Agents
Python ★ 606 10mo agoExplain → -
optimizers
For optimization algorithm research and development.
Python ★ 576 1mo agoExplain → -
boxer
Code for the Boxer research paper
Python ★ 574 14d agoExplain → -
vicreg ▣
VICReg official code base
Python ★ 574 2y agoExplain → -
OrienterNet
Source Code for Paper "OrienterNet Visual Localization in 2D Public Maps with Neural Matching"
Python ★ 568 1y agoExplain → -
meta-agents-research-environments
Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environments where agents must adapt their strategies as new information becomes available, mirroring real-world challenges.
Python ★ 520 2d agoExplain → -
minihack ▣
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Python ★ 519 1y agoExplain → -
eai-vc
The repository for the largest and most comprehensive empirical study of visual foundation models for Embodied AI (EAI).
Python ★ 506 2y agoExplain → -
brainmagick ▣
Training and evaluation pipeline for MEG and EEG brain signal encoding and decoding using deep learning. Code for our paper "Decoding speech perception from non-invasive brain recordings" published in Nature Machine Intelligence, 2023.
Python ★ 478 2y agoExplain → -
spider
A general physic-based retargeting framework.
Python ★ 474 6d agoExplain → -
Action100M
A Large-scale Video Action Dataset
Python ★ 474 5mo agoExplain → -
sound-spaces ▣
A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple tasks and applications.
Python ★ 464 2y agoExplain → -
4DGT
[NeurIPS 2025 (Spotlight)] The implementation for the paper "4DGT Learning a 4D Gaussian Transformer Using Real-World Monocular Videos"
Python ★ 460 9mo agoExplain → -
phyre ▣
PHYRE is a benchmark for physical reasoning.
Python ★ 459 2y agoExplain → -
locate-3d
Open source repo for Locate 3D Model, 3D-JEPA and Locate 3D Dataset
Python ★ 448 1y agoExplain → -
DocAgent
DocAgent is a system designed to generate high-quality, context-aware code documentation for Python codebases using a multi-agent approach and hierarchical processing.
Python ★ 444 1y agoExplain → -
vrs
VRS is a file format optimized to record & playback streams of sensor data, such as images, audio samples, and any other discrete sensors (IMU, temperature, etc), stored in per-device streams of timestamped records.
C++ ★ 428 23h agoExplain → -
ContactPose
Large dataset of hand-object contact, hand- and object-pose, and 2.9 M RGB-D grasp images.
Jupyter Notebook ★ 421 1y agoExplain → -
Cupcake ▣
A Rust library for lattice-based additive homomorphic encryption.
Rust ★ 419 2y agoExplain → -
jepa-wms
Code, data and weights for the paper **What drives success in physical planning with Joint-Embedding Predictive World Models?**
Python ★ 409 2mo agoExplain → -
sscd-copy-detection ▣
Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).
Python ★ 408 3y agoExplain → -
SpinQuant
Code repo for the paper "SpinQuant LLM quantization with learned rotations"
Python ★ 405 1y agoExplain → -
dietgpu ▣
GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compression of numerical and other data types in HPC/ML applications.
Cuda ★ 395 3mo agoExplain → -
seamless_interaction
Foundation Models and Data for Human-Human and Human-AI interactions.
Python ★ 392 6mo agoExplain → -
spdl
Scalable and Performant Data Loading
Python ★ 392 1d agoExplain → -
lagernvs
Official code for "LagerNVS Latent Geometry for Fully Neural Real-time Novel View Synthesis" (CVPR 2026)
Python ★ 388 8d agoExplain → -
actionmesh
🎬ActionMesh: A fast video to animated mesh model with unprecedented quality. Generate animated mesh seamlessly importable into any 3D software in less than a minute.
Python ★ 386 21d agoExplain → -
ViewDiff ▣
ViewDiff generates high-quality, multi-view consistent images of a real-world 3D object in authentic surroundings. (CVPR2024).
Python ★ 383 2mo agoExplain → -
momentum
A library for human kinematic motion and numerical optimization solvers to apply human motion
C++ ★ 382 11h agoExplain → -
searchformer ▣
Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".
Jupyter Notebook ★ 375 2y agoExplain → -
LayerSkip
Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024
Python ★ 372 2mo agoExplain → -
RAM
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
Python ★ 369 9d agoExplain → -
goliath ▣
Goliath Dataset and Official PyTorch Implementation of RelightableHands, Relightable Gaussian Codec Avatars, and Driving-Signal Aware Full-Body Avatars.
Python ★ 359 1y agoExplain → -
VLM3
Official implementation of paper "VLM³: Vision Language Models Are Native 3D Learners".
Jupyter Notebook ★ 318 17d agoExplain → -
CRAG
Comprehensive benchmark for RAG
Jupyter Notebook ★ 290 1y agoExplain → -
rlmeta ▣
RLMeta is a light-weight flexible framework for Distributed Reinforcement Learning Research.
Python ★ 285 3y agoExplain → -
matrix
Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data generation.
Python ★ 279 7d agoExplain → -
meshflow
Repository for the CVPR 2026 paper MeshFlow Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer by Weiyu Li, Antoine Toisoul, Tom Monnier, Roman Shapovalov, Rakesh Ranjan, Ping Tan and Andrea Vedaldi.
Python ★ 278 8d agoExplain → -
digit360
Digit 360 is a modular platform that unlocks new capabilities, and enables future research on the nature of touch.
Python ★ 256 11mo agoExplain → -
neuroai
Python suite for neuroscience research across all modalities.
Python ★ 250 9h agoExplain → -
Mixture-of-Transformers
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.
Python ★ 248 9mo agoExplain → -
atlas-lean
ATLAS Autoformalized Textbook Library At Scale
Lean ★ 247 17d agoExplain → -
EgoBlur
This repository contains a command-line interface(CLI) that can detect and blur out faces and license plates(PII) from images and videos. The CLI takes an image or video file as input, runs an anonymization algorithm on it, and writes the blurred output to a specified path.
Python ★ 246 8d agoExplain → -
generic-neuromotor-interface
Code for exploring surface electromyography (sEMG) data and training models associated with Reality Labs' paper
Jupyter Notebook ★ 242 10mo agoExplain → -
diffq ▣
DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight or group of weights, in order to achieve a given trade-off between model size and accuracy.
Python ★ 239 3y agoExplain → -
VICRegL ▣
VICRegL official code base
Python ★ 236 3y agoExplain → -
gcm
GPU Cluster Monitoring (GCM): Large-Scale AI Research Cluster Monitoring
Python ★ 226 32m agoExplain → -
ava-256
Train universal codec avatars
Jupyter Notebook ★ 220 1y agoExplain → -
nymeria_dataset
Nymeria: a massive collection of multimodal egocentric daily motion in the wild
Python ★ 214 1d agoExplain → -
GloRe ▣
Global Reasoning module for visual recognition
Python ★ 209 4y agoExplain → -
SustainableConcrete
Repository to track versions of concrete strength data, models, and active learning proposals.
Jupyter Notebook ★ 197 1d agoExplain → -
GeoRT
Geometric Retargeting A Principled, Ultrafast Neural Hand Retargeting Algorithm
C ★ 195 9mo agoExplain → -
flowmm ▣
Code for “FlowMM Generating Materials with Riemannian Flow Matching” and "FlowLLM: Flow Matching for Material Generation with Large Language Models as Base Distributions"
Python ★ 185 1y agoExplain → -
FashionPlus ▣
Fashion++: Minimal Edits for Outfit Improvement
Python ★ 174 4y agoExplain → -
mtm
MTM Masked Trajectory Models for Prediction, Representation, and Control.
Python ★ 165 6mo agoExplain → -
MMRB2
Data and sample evaluation codes for Multimodal Rewardbench 2
Python ★ 144 6mo agoExplain → -
assemblyhands-toolkit
AssemblyHands Toolkit is a Python package that provides data loader, visualization, and evaluation tools for the AssemblyHands dataset (CVPR 2023).
Python ★ 132 25d agoExplain → -
ParetoQ
This repository contains the training code of ParetoQ introduced in our work "ParetoQ Scaling Laws in Extremely Low-bit LLM Quantization"
Python ★ 129 8mo agoExplain → -
baspacho
Direct solver for sparse SPD matrices for nonlinear optimization. Implements supernodal Cholesky decomposition algorithm, and supports GPU (CUDA).
C++ ★ 111 8mo agoExplain → -
OpenApps
An open source environment for digital agents.
CSS ★ 103 17h agoExplain → -
gotrack
GoTrack: Generic 6DoF Object Pose Refinement and Tracking, CV4MR 2025
Python ★ 94 8mo agoExplain → -
rela
Reinforcement Learning Assembly
C++ ★ 94 4y agoExplain → -
wasp
Official implementation of the WASP web agent security benchmark
Python ★ 93 2mo agoExplain → -
sira
Superintelligent Retrieval Agent (SIRA)
Rust ★ 92 15d agoExplain → -
iGSM
The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Process" (arxiv 2407.20311) and "Physics of Language Models Part 2.2, How to Learn From Mistakes on Grade-School Math Problems" (arxiv 2408.16293)
Python ★ 86 1y agoExplain → -
MobileLLM-R1
MobileLLM-R1
Python ★ 86 1mo agoExplain → -
dexwm
Official code and data from DexWM ("World Models Can Leverage Human Videos for Dexterous Manipulation").
Python ★ 83 2d agoExplain → -
autoform-bot
Autoform Bot
Python ★ 82 18d agoExplain → -
WMReward
[CVPR 2026 Highlight] Leveraging latent world model's physics understanding to improve the physics plausibility of video generation
Python ★ 67 8d agoExplain → -
MemoryMosaics ▣
Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.
Python ★ 63 1y agoExplain → -
pyvrs
Python interface for https//github.com/facebookresearch/vrs.
C++ ★ 56 23h agoExplain → -
metadepth
Efficient image to 3D geometry foundation models from Meta Reality Labs for monocular depth, point maps, and surface normals. Featuring HyDen (ICLR 2026).
Python ★ 55 1mo agoExplain → -
prompt-siren
A research workbench for developing and testing attacks against large language models, with a focus on prompt injection vulnerabilities and defenses.
Python ★ 52 2d agoExplain → -
td_jepa
A framework for training, evaluating, and comparing state-of-the-art zero-shot reinforcement learning methods. It allows reproducing the experiments of the paper "TD-JEPA Latent-predictive Representations for Zero-Shot Reinforcement Learning"
Python ★ 44 5mo agoExplain → -
diffh2o
We introduce DiffH2O, a diffusion-based framework to synthesize dexterous hand-object interactions. DiffH2O generates realistic hand-object motion from natural language, generalizes to unseen objects at test time and enables fine-grained control over the motion with detailed textual descriptions.
Python ★ 42 7mo agoExplain → -
LAMP
[CVPR 26'] Code for the LAMP research paper.
Python ★ 40 21h agoExplain → -
GAN-optimization-landscape ▣
code to reproduce the empirical results in the research paper
Jupyter Notebook ★ 40 4y agoExplain → -
cca-swebench
swebench repro script for running confucius-code-agent (CCA)
Python ★ 38 28d agoExplain → -
ATEK
Aria Training and Evaluation Kit
Python ★ 34 3mo agoExplain → -
CRV
Code for the paper "Verifying Chain-of-Thought Reasoning via its Computational Graph".
Python ★ 32 6mo agoExplain → -
dance
Dance is an end-to-end framework that detects and classifies events in EEG signals. In a single forward pass, it extracts a set of events directly from the raw, unaligned recording.
Python ★ 30 23d agoExplain → -
SelfCite ▣
Code for the ICML 2025 paper "SelfCite Self-Supervised Alignment for Context Attribution in Large Language Models"
Python ★ 30 3mo agoExplain → -
omnisealbench ▣
This repository provides a comprehensive benchmark for evaluating the performance of neural watermarking techniques. The benchmark includes a variety of datasets, evaluation metrics, and tools for training and testing neural networks for watermarking.
Python ★ 26 5mo agoExplain → -
MetaEmbed
[ICLR 2026 Oral] Official Implementation of the paper "MetaEmbed: Scaling Multimodal Retrieval at Test-Time with Flexible Late Interactions"
Python ★ 18 8d agoExplain → -
DuoMo
Body motion estimation from monocular videos via two stage diffusion (CVPR 2026).
Python ★ 16 8d agoExplain → -
Large-Sparse-Reconstruction-Model
LSRM is a SOTA, feed-forward 3D reconstruction model that generates high-fidelity, relightable 3D digital twins from sparse 2D views.
Python ★ 12 14d agoExplain → -
Group-MATES
Code repository for Group-MATES Group-Level Data Selection for Efficient Pretraining
Python ★ 12 1y agoExplain → -
arithmetic ▣
PyTorch original implementation of "Making Hard Problems Easier with Custom Data Distributions and Loss Regularization A Case Study in Modular Arithmetic" (ICML 2025).
Python ★ 9 8mo agoExplain → -
egobabyvlm
Repository for the EgoBabyVLM Challenge
Python ★ 6 21d agoExplain → -
compute-optimal-tokenization
The repository contains raw data results and code for scaling laws fitting and visualization used in "Compute Optimal Tokenization" paper.
Python ★ 4 25d agoExplain → -
STyMo
The repository provides code for running STyMo Fast and Controllable Few-Shot Motion Style Transfer.
Python ★ 3 1d agoExplain → -
shapcpm
Efficient Calculation of Shapley Values in Critical Path Method (CPM) Networks for Concurrent Delay Analysis
Python ★ 2 22h agoExplain → -
hst-bench
HST-Bench evaluation dataset contains 753 agentic tasks along with the time taken by human annotators to solve each task. This dataset was collected as part of our ICML 2026 paper on Scaling Small Agents Through Strategy Auctions https//arxiv.org/pdf/2602.02751
Python ★ 0 2d agoExplain →
No repos match these filters.