Members
-
AdelaiDet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
Python ★ 3.5k 1y agoExplain → -
AdelaiDepth
This repo contains the projects: 'Virtual Normal', 'DiverseDepth', and '3D Scene Shape'. They aim to solve the monocular depth estimation, 3D scene reconstruction from single image problems.
Python ★ 1.1k 2y agoExplain → -
Matcher
[ICLR'24 & IJCV‘25] Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching
Python ★ 565 6mo agoExplain → -
Framer
[ICLR'25] Official PyTorch implementation of "Framer: Interactive Frame Interpolation".
Python ★ 499 1y agoExplain → -
MovieDreamer
[ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences
★ 324 1y agoExplain → -
Diception
[NeurIPS 2025 Spotlight] A Generalist Diffusion Model for Vision Perception
Python ★ 316 9mo agoExplain → -
GenPercept
[ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models
Python ★ 228 1y agoExplain → -
StyleDrop-PyTorch
This is an unofficial PyTorch implementation of StyleDrop: Text-to-Image Generation in Any Style.
Python ★ 226 2y agoExplain → -
Poseur
[ECCV 2022] The official repo for the paper "Poseur: Direct Human Pose Regression with Transformers".
Python ★ 186 2y agoExplain → -
FreeCustom
[CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
Python ★ 178 9mo agoExplain → -
Tinker
One-shot and Few-shot 3D Editing without Per-Scene Optimization
★ 175 10mo agoExplain → -
PM-Loss
[3DV 2026] Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting
Python ★ 161 6mo agoExplain → -
AutoStory
[IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort
Jupyter Notebook ★ 148 3mo agoExplain → -
FrozenRecon
[ICCV2023] 🧊FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models
Python ★ 131 1y agoExplain → -
DyCo3D
No description.
Python ★ 128 2y agoExplain → -
Omni-R1
[NeurIPS 2025] Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration
Python ★ 123 6mo agoExplain → -
SegPrompt
Official Implementation of ICCV 2023 Paper - SegPrompt: Boosting Open-World Segmentation via Category-level Prompt Learning
Python ★ 112 1y agoExplain → -
SegAgent
[CVPR2025] SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories
Python ★ 105 10mo agoExplain → -
GVM
[SIGGRAPH2025] Generative Video Matting
Python ★ 90 10mo agoExplain → -
OIR
[ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"
Python ★ 87 1y agoExplain → -
Active-o3
[ICML2026] ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO
★ 81 1mo agoExplain → -
RGM
No description.
★ 70 2y agoExplain → -
SINE
[NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examples
Python ★ 67 1y agoExplain → -
dLLM-MidTruth
[ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".
Python ★ 66 3mo agoExplain → -
GeoBench
A toolbox for benchmarking SOTA discriminative and generative geometry estimation models.
Python ★ 65 1y agoExplain → -
LoRAPrune
No description.
Python ★ 63 1y agoExplain → -
SurfaceSplat
SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting
★ 57 11mo agoExplain → -
DiverGen
DiverGen (CVPR 2024) & BSGAL (ICML 2024)
Python ★ 53 11mo agoExplain → -
DiffewS
[NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)
Python ★ 52 1y agoExplain → -
FreeCompose
No description.
Jupyter Notebook ★ 49 1y agoExplain → -
EvoTokenDLM
[ACL'26] EvoToken-DLM (Beyond Hard Masks: Progressive Token Evolution for Diffusion Language)
Python ★ 48 2mo agoExplain → -
BA-DDG
[ICLR 2025 Spotlight] Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions
Python ★ 45 1y agoExplain → -
model-quantization ⑂
Collections of model quantization algorithms. Any issues, please contact Peng Chen ([email protected])
★ 45 4y agoExplain → -
AGILE
No description.
★ 43 1mo agoExplain → -
GenDeF
No description.
Python ★ 39 3mo agoExplain → -
StaMo
Unsupervised Learning of Generalizable Robot Motion from Compact State Representation
Python ★ 38 9d agoExplain → -
MARBLE
Multi-Aspect Reward Balance for Diffusion RL
★ 36 1mo agoExplain → -
OmniJigsaw
No description.
HTML ★ 34 2mo agoExplain → -
FADiff
[ICML 2024] Floating Anchor Diffusion Model for Multi-motif Scaffolding
Python ★ 34 1y agoExplain → -
VFN
[ICLR 2024 Spotlight] The official repo for the paper "De novo Protein Design using Geometric Vector Field Networks".
Python ★ 31 1y agoExplain → -
GSI-Bench
[CVPR2026] Exploring Spatial Intelligence from a Generative Perspective
Python ★ 29 16d agoExplain → -
TVRBench
TVRBench: Target Viewpoint Reproduction Benchmark for Active Spatial Intelligence
Python ★ 21 17d agoExplain → -
partially-labelled
Learning to segment multi-organ and tumorsfrom multiple partially labeled datasets
★ 19 5y agoExplain → -
PerturboLLaVA
No description.
Python ★ 17 1y agoExplain → -
COSINE
[ICCV'25] Unified Open-World Segmentation with Multi-Modal Prompts
Python ★ 16 3d agoExplain → -
VLModel
Repo of HawkLlama.
Python ★ 16 1y agoExplain → -
ReasonMatch
[CVPR2026] Eliciting Complex Spatial Reasoning in MLLMs through Wide-Baseline Matching
Python ★ 15 15d agoExplain → -
STAIR
No description.
Python ★ 15 6d agoExplain → -
ConvNova
No description.
Python ★ 13 1y agoExplain → -
aim-uofa.github.io
code for aim-uofa.github.io
JavaScript ★ 10 3d agoExplain → -
CARVE
[CVPR2026] Unlocking the Power of Critical Factors for 3D Visual Geometry Estimation
★ 10 1mo agoExplain → -
Depth3D
No description.
Python ★ 10 1y agoExplain → -
MMControl
No description.
★ 8 1mo agoExplain → -
NRD_decoder ⑂
Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation
Python ★ 7 4y agoExplain → -
ETC-VideoSeg ⑂
Enforcing temporal consistency in real-time per-frame semantic video segmentation
Python ★ 6 4y agoExplain → -
OIR-Diffusion
No description.
JavaScript ★ 1 1y agoExplain → -
SegVit ⑂
Official Pytorch Implementation of SegViT: Semantic Segmentation with Plain Vision Transformers
Python ★ 0 2y agoExplain → -
ademxapp ⑂
Code for https://arxiv.org/abs/1611.10080
Python ★ 0 2y agoExplain →
No repos match these filters.