Meta Research — gitmyhub

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook ★ 54k 1y ago

Explain →

faiss

A library for efficient similarity search and clustering of dense vectors.

C++ ★ 40k 9h ago

Explain →

detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python ★ 35k 12d ago

Explain →

fairseq ▣

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python ★ 32k 8mo ago

Explain →

fastText ▣

Library for fast text representation and classification.

HTML ★ 27k 2y ago

Explain →

Detectron ▣

FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.

Python ★ 26k 2y ago

Explain →

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Jupyter Notebook ★ 23k 3mo ago

Explain →

sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook ★ 19k 20d ago

Explain →

detr ▣

End-to-End Object Detection with Transformers

Python ★ 15k 2y ago

Explain →

vggt

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python ★ 13k 1mo ago

Explain →

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook ★ 13k 16d ago

Explain →

AnimatedDrawings ▣

Code to accompany "A Method for Animating Children's Drawings of the Human Figure"

Python ★ 13k 9mo ago

Explain →

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook ★ 12k 2mo ago

Explain →

dinov3

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook ★ 11k 4d ago

Explain →

sam3

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Python ★ 11k 4d ago

Explain →

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python ★ 11k 1d ago

Explain →

hydra

Hydra is a framework for elegantly configuring complex applications

Python ★ 10k 4m ago

Explain →

demucs ▣

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python ★ 10k 2y ago

Explain →

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python ★ 10k 1y ago

Explain →

pytorch3d

PyTorch3D is FAIR's library of reusable components for deep learning with 3D data

Python ★ 9.9k 6d ago

Explain →

pifuhd ▣

High-Resolution 3D Human Digitization from A Single Image.

Python ★ 9.7k 1y ago

Explain →

ImageBind

ImageBind One Embedding Space to Bind Them All

Python ★ 9.0k 6mo ago

Explain →

DiT ▣

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python ★ 8.6k 2y ago

Explain →

mae ▣

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Python ★ 8.3k 1y ago

Explain →

dino ▣

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Python ★ 7.6k 1y ago

Explain →

sam-3d-objects

SAM 3D Objects

Python ★ 7.0k 16d ago

Explain →

metaseq ▣

Repo for external large-scale work

Python ★ 6.5k 2y ago

Explain →

ConvNeXt ▣

Code release for ConvNeXt model

Python ★ 6.4k 3y ago

Explain →

Kats

Kats, a kit to analyze time series data, a lightweight, easy-to-use, generalizable, and extendable framework to perform time series analysis, from understanding the key statistics and characteristics, detecting change points and anomalies, to forecasting future trends.

Python ★ 6.3k 9d ago

Explain →

pytext ▣

A natural language modeling framework based on PyTorch

Python ★ 6.3k 3y ago

Explain →

mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Python ★ 5.6k 2d ago

Explain →

sapiens

High-resolution models for human tasks.

Python ★ 5.4k 23d ago

Explain →

moco

PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722

★ 5.1k 4mo ago

Explain →

AugLy

A data augmentations library for audio, image, text, and video.

Python ★ 5.1k 17d ago

Explain →

co-tracker

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook ★ 5.0k 3mo ago

Explain →

flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python ★ 4.5k 5mo ago

Explain →

nevergrad

A Python toolbox for performing gradient-free optimization

Python ★ 4.2k 3mo ago

Explain →

vjepa2

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python ★ 4.2k 2mo ago

Explain →

esm ▣

Evolutionary Scale Modeling (esm): Pretrained language models for proteins

Python ★ 4.1k 2y ago

Explain →

VideoPose3D ▣

Efficient 3D human pose estimation in video using 2D keypoint trajectories

Python ★ 4.0k 3y ago

Explain →

jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

Python ★ 4.0k 1y ago

Explain →

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python ★ 4.0k 2y ago

Explain →

habitat-sim

A flexible, high-performance 3D simulator for Embodied AI research.

C++ ★ 3.7k 1mo ago

Explain →

map-anything

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python ★ 3.5k 16d ago

Explain →

ijepa ▣

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."

Python ★ 3.4k 2y ago

Explain →

Mask2Former ▣

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Python ★ 3.4k 1y ago

Explain →

sam-3d-body

The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the model.

Python ★ 3.3k 4mo ago

Explain →

MUSE ▣

A library for Multilingual Unsupervised or Supervised word Embeddings

Python ★ 3.2k 3y ago

Explain →

vggt-omega

[CVPR 2026 Oral] VGGT Omega

Python ★ 3.1k 1mo ago

Explain →

habitat-lab

A modular high-level library to train embodied AI agents across a variety of tasks and environments.

Python ★ 3.0k 1mo ago

Explain →

Pearl

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Jupyter Notebook ★ 3.0k 9d ago

Explain →

tribev2

This repository contains the code to train and evaluate TRIBE v2, a multimodal model for brain response prediction

Jupyter Notebook ★ 2.9k 8d ago

Explain →

audio2photoreal ▣

Code and dataset for photorealistic Codec Avatars driven from audio

Python ★ 2.9k 1y ago

Explain →

omnilingual-asr

Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages

Python ★ 2.8k 5mo ago

Explain →

hiplot ▣

HiPlot makes understanding high dimensional data easy

TypeScript ★ 2.8k 2y ago

Explain →

HyperAgents

Self-referential self-improving agents that can optimize for any computable task

Python ★ 2.6k 1mo ago

Explain →

schedule_free

Schedule-Free Optimization in PyTorch

Python ★ 2.3k 1d ago

Explain →

perception_models

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook ★ 2.3k 2mo ago

Explain →

fvcore

Collection of common code that's shared among different research projects in FAIR computer vision team.

Python ★ 2.2k 16d ago

Explain →

fairchem

FAIR Chemistry's library of machine learning methods for chemistry

Python ★ 2.2k 17h ago

Explain →

SparseConvNet ▣

Submanifold sparse convolutional networks

C++ ★ 2.1k 2y ago

Explain →

SentEval ▣

A python tool for evaluating the quality of sentence embeddings.

Python ★ 2.1k 2y ago

Explain →

ConvNeXt-V2 ▣

Code release for ConvNeXt V2 model

Python ★ 2.0k 1y ago

Explain →

blt

Code for BLT research paper

Python ★ 2.0k 7mo ago

Explain →

video-nonlocal-net ▣

Non-local Neural Networks for Video Classification

Python ★ 2.0k 4y ago

Explain →

ai4animationpy

A Python framework for AI-driven character animation using neural networks.

Python ★ 2.0k 18d ago

Explain →

TimeSformer ▣

The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"

Python ★ 1.9k 2y ago

Explain →

votenet ▣

Deep Hough Voting for 3D Object Detection in Point Clouds

Python ★ 1.8k 4y ago

Explain →

TransCoder ▣

Public release of the TransCoder research project https://arxiv.org/pdf/2006.03511.pdf

Python ★ 1.7k 4y ago

Explain →

coconut

Training Large Language Model to Reason in a Continuous Latent Space

Python ★ 1.6k 8d ago

Explain →

DomainBed ▣

DomainBed is a suite to test domain generalization algorithms

Python ★ 1.6k 6mo ago

Explain →

DeepSDF ▣

Learning Continuous Signed Distance Functions for Shape Representation

Python ★ 1.6k 4y ago

Explain →

MaskFormer ▣

Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)

Python ★ 1.5k 4y ago

Explain →

diplomacy_cicero ▣

Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.

Python ★ 1.4k 1y ago

Explain →

vggsfm

VGGSfM: Visual Geometry Grounded Deep Structure From Motion

Python ★ 1.4k 1y ago

Explain →

Replica-Dataset

The Replica Dataset v1 as published in https://arxiv.org/abs/1906.05797 .

C++ ★ 1.3k 1y ago

Explain →

ToMe ▣

A method to increase the speed and lower the memory footprint of existing vision transformers.

Python ★ 1.2k 2y ago

Explain →

House3D ▣

a Realistic and Rich 3D Environment

C++ ★ 1.2k 6y ago

Explain →

mixup-cifar10 ▣

mixup: Beyond Empirical Risk Minimization

Python ★ 1.2k 4y ago

Explain →

shumai

Fast Differentiable Tensor Library in JavaScript and TypeScript with Bun + Flashlight

TypeScript ★ 1.2k 1y ago

Explain →

watermark-anything

Official implementation of the paper "Watermark Anything with Localized Messages"

Jupyter Notebook ★ 1.1k 1y ago

Explain →

barlowtwins ▣

PyTorch implementation of Barlow Twins.

Python ★ 1.0k 4y ago

Explain →

EdgeTAM

[CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"

Jupyter Notebook ★ 934 4mo ago

Explain →

mobile-vision

Mobile vision models and code

Python ★ 920 2d ago

Explain →

sapiens2

1K resolution vision transformers pretrained on 1B human images.

Python ★ 808 26d ago

Explain →

projectaria_tools

projectaria_tools is an C++/Python open-source toolkit to interact with Project Aria data.

C++ ★ 799 7h ago

Explain →

ShapeR

Code for the ShapeR research paper

Python ★ 793 1mo ago

Explain →

ocean

Ocean is the in-house framework for Computer Vision (CV) and Augmented Reality (AR) applications at Meta. It is platform independent and is mainly implemented in C/C++.

C++ ★ 779 1h ago

Explain →

supervision-by-registration ▣

Supervision-by-Registration: An Unsupervised Approach to Improve the Precision of Facial Landmark Detectors

Python ★ 775 6y ago

Explain →

metamotivo

The first behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks.

Python ★ 770 1y ago

Explain →

ProgramBench

Can Language Models Rebuild Programs From Scratch?

Python ★ 769 22h ago

Explain →

sonata

[CVPR'25 Highlight] Official repository of Sonata: Self-Supervised Learning of Reliable Point Representations

Python ★ 753 1y ago

Explain →

balance

The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to some target population of interest.

Python ★ 749 1d ago

Explain →

audioseal

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

Python ★ 741 1mo ago

Explain →

audiobox-aesthetics

Unified automatic quality assessment for speech, music, and sound.

Python ★ 733 1y ago

Explain →

MHR

Momentum Human Rig is an anatomically-inspired parametric full-body digital human model developed at Meta. It includes: A parametric body skeletal model; A realistic 3D mesh skinned to the skeleton with levels of detail;A body blendshape and pose corrective model; A facial blendshape model.Its design is friendly for both CG and CV communities.

Python ★ 721 8d ago

Explain →

tuna-2

Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation

Python ★ 717 10d ago

Explain →

eb_jepa

An open source library designed to provide community examples of Joint Embedding Predictive Architectures (JEPAs). It contains code and examples for learning representations from images, video, and action-conditioned video, as well as planning using JEPA-based models.

Python ★ 712 8d ago

Explain →

rebel ▣

An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.

C++ ★ 697 2y ago

Explain →

videoseal

Open and efficient video and image watermarking

Python ★ 682 1mo ago

Explain →

AudioMAE ▣

This repo hosts the code and models of "Masked Autoencoders that Listen".

Python ★ 668 2y ago

Explain →

EUPE

Efficient Universal Perception Encoder: a single on-device vision encoder with versatile representations that match or exceed specialized experts across multiple task domains.

Python ★ 666 2mo ago

Explain →

MCC ▣

Multiview Compressive Coding for 3D Reconstruction

Python ★ 661 3y ago

Explain →

pippo

Pippo: High-Resolution Multi-View Humans from a Single Image

Python ★ 646 8d ago

Explain →

nwm

Official code for the CVPR 2025 paper "Navigation World Models".

Python ★ 638 6mo ago

Explain →

BenchMARL

BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). BenchMARL allows to quickly compare different MARL algorithms, tasks, and models while being systematically grounded in its two core tenets: reproducibility and standardization.

Python ★ 632 4mo ago

Explain →

Ego4d

Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset

Jupyter Notebook ★ 608 1mo ago

Explain →

MLGym

MLGym A New Framework and Benchmark for Advancing AI Research Agents

Python ★ 606 10mo ago

Explain →

optimizers

For optimization algorithm research and development.

Python ★ 576 1mo ago

Explain →

boxer

Code for the Boxer research paper

Python ★ 574 14d ago

Explain →

vicreg ▣

VICReg official code base

Python ★ 574 2y ago

Explain →

OrienterNet

Source Code for Paper "OrienterNet Visual Localization in 2D Public Maps with Neural Matching"

Python ★ 568 1y ago

Explain →

meta-agents-research-environments

Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environments where agents must adapt their strategies as new information becomes available, mirroring real-world challenges.

Python ★ 520 2d ago

Explain →

minihack ▣

MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research

Python ★ 519 1y ago

Explain →

eai-vc

The repository for the largest and most comprehensive empirical study of visual foundation models for Embodied AI (EAI).

Python ★ 506 2y ago

Explain →

brainmagick ▣

Training and evaluation pipeline for MEG and EEG brain signal encoding and decoding using deep learning. Code for our paper "Decoding speech perception from non-invasive brain recordings" published in Nature Machine Intelligence, 2023.

Python ★ 478 2y ago

Explain →

spider

A general physic-based retargeting framework.

Python ★ 474 6d ago

Explain →

Action100M

A Large-scale Video Action Dataset

Python ★ 474 5mo ago

Explain →

sound-spaces ▣

A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple tasks and applications.

Python ★ 464 2y ago

Explain →

4DGT

[NeurIPS 2025 (Spotlight)] The implementation for the paper "4DGT Learning a 4D Gaussian Transformer Using Real-World Monocular Videos"

Python ★ 460 9mo ago

Explain →

phyre ▣

PHYRE is a benchmark for physical reasoning.

Python ★ 459 2y ago

Explain →

locate-3d

Open source repo for Locate 3D Model, 3D-JEPA and Locate 3D Dataset

Python ★ 448 1y ago

Explain →

DocAgent

DocAgent is a system designed to generate high-quality, context-aware code documentation for Python codebases using a multi-agent approach and hierarchical processing.

Python ★ 444 1y ago

Explain →

vrs

VRS is a file format optimized to record & playback streams of sensor data, such as images, audio samples, and any other discrete sensors (IMU, temperature, etc), stored in per-device streams of timestamped records.

C++ ★ 428 23h ago

Explain →

ContactPose

Large dataset of hand-object contact, hand- and object-pose, and 2.9 M RGB-D grasp images.

Jupyter Notebook ★ 421 1y ago

Explain →

Cupcake ▣

A Rust library for lattice-based additive homomorphic encryption.

Rust ★ 419 2y ago

Explain →

jepa-wms

Code, data and weights for the paper **What drives success in physical planning with Joint-Embedding Predictive World Models?**

Python ★ 409 2mo ago

Explain →

sscd-copy-detection ▣

Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).

Python ★ 408 3y ago

Explain →

SpinQuant

Code repo for the paper "SpinQuant LLM quantization with learned rotations"

Python ★ 405 1y ago

Explain →

dietgpu ▣

GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compression of numerical and other data types in HPC/ML applications.

Cuda ★ 395 3mo ago

Explain →

seamless_interaction

Foundation Models and Data for Human-Human and Human-AI interactions.

Python ★ 392 6mo ago

Explain →

spdl

Scalable and Performant Data Loading

Python ★ 392 1d ago

Explain →

lagernvs

Official code for "LagerNVS Latent Geometry for Fully Neural Real-time Novel View Synthesis" (CVPR 2026)

Python ★ 388 8d ago

Explain →

actionmesh

🎬ActionMesh: A fast video to animated mesh model with unprecedented quality. Generate animated mesh seamlessly importable into any 3D software in less than a minute.

Python ★ 386 21d ago

Explain →

ViewDiff ▣

ViewDiff generates high-quality, multi-view consistent images of a real-world 3D object in authentic surroundings. (CVPR2024).

Python ★ 383 2mo ago

Explain →

momentum

A library for human kinematic motion and numerical optimization solvers to apply human motion

C++ ★ 382 11h ago

Explain →

searchformer ▣

Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".

Jupyter Notebook ★ 375 2y ago

Explain →

LayerSkip

Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024

Python ★ 372 2mo ago

Explain →

RAM

A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).

Python ★ 369 9d ago

Explain →

goliath ▣

Goliath Dataset and Official PyTorch Implementation of RelightableHands, Relightable Gaussian Codec Avatars, and Driving-Signal Aware Full-Body Avatars.

Python ★ 359 1y ago

Explain →

VLM3

Official implementation of paper "VLM³: Vision Language Models Are Native 3D Learners".

Jupyter Notebook ★ 318 17d ago

Explain →

CRAG

Comprehensive benchmark for RAG

Jupyter Notebook ★ 290 1y ago

Explain →

rlmeta ▣

RLMeta is a light-weight flexible framework for Distributed Reinforcement Learning Research.

Python ★ 285 3y ago

Explain →

matrix

Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data generation.

Python ★ 279 7d ago

Explain →

meshflow

Repository for the CVPR 2026 paper MeshFlow Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer by Weiyu Li, Antoine Toisoul, Tom Monnier, Roman Shapovalov, Rakesh Ranjan, Ping Tan and Andrea Vedaldi.

Python ★ 278 8d ago

Explain →

digit360

Digit 360 is a modular platform that unlocks new capabilities, and enables future research on the nature of touch.

Python ★ 256 11mo ago

Explain →

neuroai

Python suite for neuroscience research across all modalities.

Python ★ 250 9h ago

Explain →

Mixture-of-Transformers

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.

Python ★ 248 9mo ago

Explain →

atlas-lean

ATLAS Autoformalized Textbook Library At Scale

Lean ★ 247 17d ago

Explain →

EgoBlur

This repository contains a command-line interface(CLI) that can detect and blur out faces and license plates(PII) from images and videos. The CLI takes an image or video file as input, runs an anonymization algorithm on it, and writes the blurred output to a specified path.

Python ★ 246 8d ago

Explain →

generic-neuromotor-interface

Code for exploring surface electromyography (sEMG) data and training models associated with Reality Labs' paper

Jupyter Notebook ★ 242 10mo ago

Explain →

diffq ▣

DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight or group of weights, in order to achieve a given trade-off between model size and accuracy.

Python ★ 239 3y ago

Explain →

VICRegL ▣

VICRegL official code base

Python ★ 236 3y ago

Explain →

gcm

GPU Cluster Monitoring (GCM): Large-Scale AI Research Cluster Monitoring

Python ★ 226 32m ago

Explain →

ava-256

Train universal codec avatars

Jupyter Notebook ★ 220 1y ago

Explain →

nymeria_dataset

Nymeria: a massive collection of multimodal egocentric daily motion in the wild

Python ★ 214 1d ago

Explain →

GloRe ▣

Global Reasoning module for visual recognition

Python ★ 209 4y ago

Explain →

SustainableConcrete

Repository to track versions of concrete strength data, models, and active learning proposals.

Jupyter Notebook ★ 197 1d ago

Explain →

GeoRT

Geometric Retargeting A Principled, Ultrafast Neural Hand Retargeting Algorithm

C ★ 195 9mo ago

Explain →

flowmm ▣

Code for “FlowMM Generating Materials with Riemannian Flow Matching” and "FlowLLM: Flow Matching for Material Generation with Large Language Models as Base Distributions"

Python ★ 185 1y ago

Explain →

FashionPlus ▣

Fashion++: Minimal Edits for Outfit Improvement

Python ★ 174 4y ago

Explain →

mtm

MTM Masked Trajectory Models for Prediction, Representation, and Control.

Python ★ 165 6mo ago

Explain →

MMRB2

Data and sample evaluation codes for Multimodal Rewardbench 2

Python ★ 144 6mo ago

Explain →

assemblyhands-toolkit

AssemblyHands Toolkit is a Python package that provides data loader, visualization, and evaluation tools for the AssemblyHands dataset (CVPR 2023).

Python ★ 132 25d ago

Explain →

ParetoQ

This repository contains the training code of ParetoQ introduced in our work "ParetoQ Scaling Laws in Extremely Low-bit LLM Quantization"

Python ★ 129 8mo ago

Explain →

baspacho

Direct solver for sparse SPD matrices for nonlinear optimization. Implements supernodal Cholesky decomposition algorithm, and supports GPU (CUDA).

C++ ★ 111 8mo ago

Explain →

OpenApps

An open source environment for digital agents.

CSS ★ 103 17h ago

Explain →

gotrack

GoTrack: Generic 6DoF Object Pose Refinement and Tracking, CV4MR 2025

Python ★ 94 8mo ago

Explain →

rela

Reinforcement Learning Assembly

C++ ★ 94 4y ago

Explain →

wasp

Official implementation of the WASP web agent security benchmark

Python ★ 93 2mo ago

Explain →

sira

Superintelligent Retrieval Agent (SIRA)

Rust ★ 92 15d ago

Explain →

iGSM

The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Process" (arxiv 2407.20311) and "Physics of Language Models Part 2.2, How to Learn From Mistakes on Grade-School Math Problems" (arxiv 2408.16293)

Official code and data from DexWM ("World Models Can Leverage Human Videos for Dexterous Manipulation").

Autoform Bot

[CVPR 2026 Highlight] Leveraging latent world model's physics understanding to improve the physics plausibility of video generation

Python ★ 67 8d ago

Explain →

MemoryMosaics ▣

Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.

Python ★ 63 1y ago

Explain →

pyvrs

Python interface for https//github.com/facebookresearch/vrs.

C++ ★ 56 23h ago

Explain →

metadepth

Efficient image to 3D geometry foundation models from Meta Reality Labs for monocular depth, point maps, and surface normals. Featuring HyDen (ICLR 2026).

Python ★ 55 1mo ago

Explain →

prompt-siren

A research workbench for developing and testing attacks against large language models, with a focus on prompt injection vulnerabilities and defenses.

Python ★ 52 2d ago

Explain →

td_jepa

A framework for training, evaluating, and comparing state-of-the-art zero-shot reinforcement learning methods. It allows reproducing the experiments of the paper "TD-JEPA Latent-predictive Representations for Zero-Shot Reinforcement Learning"

Python ★ 44 5mo ago

Explain →

diffh2o

We introduce DiffH2O, a diffusion-based framework to synthesize dexterous hand-object interactions. DiffH2O generates realistic hand-object motion from natural language, generalizes to unseen objects at test time and enables fine-grained control over the motion with detailed textual descriptions.

Python ★ 42 7mo ago

Explain →

LAMP

[CVPR 26'] Code for the LAMP research paper.

Python ★ 40 21h ago

Explain →

GAN-optimization-landscape ▣

code to reproduce the empirical results in the research paper

Jupyter Notebook ★ 40 4y ago

Explain →

cca-swebench

swebench repro script for running confucius-code-agent (CCA)

Python ★ 38 28d ago

Explain →

ATEK

Aria Training and Evaluation Kit

Python ★ 34 3mo ago

Explain →

CRV

Code for the paper "Verifying Chain-of-Thought Reasoning via its Computational Graph".

Python ★ 32 6mo ago

Explain →

dance

Dance is an end-to-end framework that detects and classifies events in EEG signals. In a single forward pass, it extracts a set of events directly from the raw, unaligned recording.

Python ★ 30 23d ago

Explain →

SelfCite ▣

Code for the ICML 2025 paper "SelfCite Self-Supervised Alignment for Context Attribution in Large Language Models"

Python ★ 30 3mo ago

Explain →

omnisealbench ▣

This repository provides a comprehensive benchmark for evaluating the performance of neural watermarking techniques. The benchmark includes a variety of datasets, evaluation metrics, and tools for training and testing neural networks for watermarking.

Python ★ 26 5mo ago

Explain →

MetaEmbed

[ICLR 2026 Oral] Official Implementation of the paper "MetaEmbed: Scaling Multimodal Retrieval at Test-Time with Flexible Late Interactions"

Python ★ 18 8d ago

Explain →

DuoMo

Body motion estimation from monocular videos via two stage diffusion (CVPR 2026).

Python ★ 16 8d ago

Explain →

Large-Sparse-Reconstruction-Model

LSRM is a SOTA, feed-forward 3D reconstruction model that generates high-fidelity, relightable 3D digital twins from sparse 2D views.

Python ★ 12 14d ago

Explain →

Group-MATES

Code repository for Group-MATES Group-Level Data Selection for Efficient Pretraining

Python ★ 12 1y ago

Explain →

arithmetic ▣

PyTorch original implementation of "Making Hard Problems Easier with Custom Data Distributions and Loss Regularization A Case Study in Modular Arithmetic" (ICML 2025).

Python ★ 9 8mo ago

Explain →

egobabyvlm

Repository for the EgoBabyVLM Challenge

Python ★ 6 21d ago

Explain →

compute-optimal-tokenization

The repository contains raw data results and code for scaling laws fitting and visualization used in "Compute Optimal Tokenization" paper.

Python ★ 4 25d ago

Explain →

STyMo

The repository provides code for running STyMo Fast and Controllable Few-Shot Motion Style Transfer.

Python ★ 3 1d ago

Explain →

shapcpm

Efficient Calculation of Shapley Values in Critical Path Method (CPM) Networks for Concurrent Delay Analysis

Python ★ 2 22h ago

Explain →

hst-bench

HST-Bench evaluation dataset contains 753 agentic tasks along with the time taken by human annotators to solve each task. This dataset was collected as part of our ICML 2026 paper on Scaling Small Agents Through Strategy Auctions https//arxiv.org/pdf/2602.02751

Python ★ 0 2d ago

Explain →

Meta Research ORG

Members

All public repos (200)