Advanced Intelligent Machines (AIM) ORG

@aim-uofa ·China

A research team at Zhejiang University, focusing on Computer Vision and broad AI research ...

58 repos
303 followers
0 following

Python 88%
JavaScript 5%
Jupyter Notebook 5%
HTML 2%

Members

tianzhi0549
tonghe90
WXinlong
encounter1997
zjuKeLiu
Jxzh2020
Z-MU-Z
MingyuLau
haoz0206

All public repos (58)

Show forks Show archived

AdelaiDet

AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

Python ★ 3.5k 1y ago
Explain →
AdelaiDepth

This repo contains the projects: 'Virtual Normal', 'DiverseDepth', and '3D Scene Shape'. They aim to solve the monocular depth estimation, 3D scene reconstruction from single image problems.

Python ★ 1.1k 2y ago
Explain →
Matcher

[ICLR'24 & IJCV‘25] Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching

Python ★ 565 6mo ago
Explain →
Framer

[ICLR'25] Official PyTorch implementation of "Framer: Interactive Frame Interpolation".

Python ★ 499 1y ago
Explain →
MovieDreamer

[ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences

★ 324 1y ago
Explain →
Diception

[NeurIPS 2025 Spotlight] A Generalist Diffusion Model for Vision Perception

Python ★ 316 9mo ago
Explain →
GenPercept

[ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models

Python ★ 228 1y ago
Explain →
StyleDrop-PyTorch

This is an unofficial PyTorch implementation of StyleDrop: Text-to-Image Generation in Any Style.

Python ★ 226 2y ago
Explain →
Poseur

[ECCV 2022] The official repo for the paper "Poseur: Direct Human Pose Regression with Transformers".

Python ★ 186 2y ago
Explain →
FreeCustom

[CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition

Python ★ 178 9mo ago
Explain →
Tinker

One-shot and Few-shot 3D Editing without Per-Scene Optimization

★ 175 10mo ago
Explain →
PM-Loss

[3DV 2026] Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting

Python ★ 161 6mo ago
Explain →
AutoStory

[IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort

Jupyter Notebook ★ 148 3mo ago
Explain →
FrozenRecon

[ICCV2023] 🧊FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models

Python ★ 131 1y ago
Explain →
DyCo3D

No description.

Python ★ 128 2y ago
Explain →
Omni-R1

[NeurIPS 2025] Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration

Python ★ 123 6mo ago
Explain →
SegPrompt

Official Implementation of ICCV 2023 Paper - SegPrompt: Boosting Open-World Segmentation via Category-level Prompt Learning

Python ★ 112 1y ago
Explain →
SegAgent

[CVPR2025] SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories

Python ★ 105 10mo ago
Explain →
GVM

[SIGGRAPH2025] Generative Video Matting

Python ★ 90 10mo ago
Explain →
OIR

[ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"

Python ★ 87 1y ago
Explain →
Active-o3

[ICML2026] ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO

★ 81 1mo ago
Explain →
RGM

No description.

★ 70 2y ago
Explain →
SINE

[NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examples

Python ★ 67 1y ago
Explain →
dLLM-MidTruth

[ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".

Python ★ 66 3mo ago
Explain →
GeoBench

A toolbox for benchmarking SOTA discriminative and generative geometry estimation models.

Python ★ 65 1y ago
Explain →
LoRAPrune

No description.

Python ★ 63 1y ago
Explain →
SurfaceSplat

SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting

★ 57 11mo ago
Explain →
DiverGen

DiverGen (CVPR 2024) & BSGAL (ICML 2024)

Python ★ 53 11mo ago
Explain →
DiffewS

[NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)

Python ★ 52 1y ago
Explain →
FreeCompose

No description.

Jupyter Notebook ★ 49 1y ago
Explain →
EvoTokenDLM

[ACL'26] EvoToken-DLM (Beyond Hard Masks: Progressive Token Evolution for Diffusion Language)

Python ★ 48 2mo ago
Explain →
BA-DDG

[ICLR 2025 Spotlight] Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions

Python ★ 45 1y ago
Explain →
model-quantization ⑂

Collections of model quantization algorithms. Any issues, please contact Peng Chen ([email protected])

★ 45 4y ago
Explain →
AGILE

No description.

★ 43 1mo ago
Explain →
GenDeF

No description.

Python ★ 39 3mo ago
Explain →
StaMo

Unsupervised Learning of Generalizable Robot Motion from Compact State Representation

Python ★ 38 9d ago
Explain →
MARBLE

Multi-Aspect Reward Balance for Diffusion RL

★ 36 1mo ago
Explain →
OmniJigsaw

No description.

HTML ★ 34 2mo ago
Explain →
FADiff

[ICML 2024] Floating Anchor Diffusion Model for Multi-motif Scaffolding

Python ★ 34 1y ago
Explain →
VFN

[ICLR 2024 Spotlight] The official repo for the paper "De novo Protein Design using Geometric Vector Field Networks".

Python ★ 31 1y ago
Explain →
GSI-Bench

[CVPR2026] Exploring Spatial Intelligence from a Generative Perspective

Python ★ 29 16d ago
Explain →
TVRBench

TVRBench: Target Viewpoint Reproduction Benchmark for Active Spatial Intelligence

Python ★ 21 17d ago
Explain →
partially-labelled

Learning to segment multi-organ and tumorsfrom multiple partially labeled datasets

★ 19 5y ago
Explain →
PerturboLLaVA

No description.

Python ★ 17 1y ago
Explain →
COSINE

[ICCV'25] Unified Open-World Segmentation with Multi-Modal Prompts

Python ★ 16 3d ago
Explain →
VLModel

Repo of HawkLlama.

Python ★ 16 1y ago
Explain →
ReasonMatch

[CVPR2026] Eliciting Complex Spatial Reasoning in MLLMs through Wide-Baseline Matching

Python ★ 15 15d ago
Explain →
STAIR

No description.

Python ★ 15 6d ago
Explain →
ConvNova

No description.

Python ★ 13 1y ago
Explain →
aim-uofa.github.io

code for aim-uofa.github.io

JavaScript ★ 10 3d ago
Explain →
CARVE

[CVPR2026] Unlocking the Power of Critical Factors for 3D Visual Geometry Estimation

★ 10 1mo ago
Explain →
Depth3D

No description.

Python ★ 10 1y ago
Explain →
MMControl

No description.

★ 8 1mo ago
Explain →
NRD_decoder ⑂

Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

Python ★ 7 4y ago
Explain →
ETC-VideoSeg ⑂

Enforcing temporal consistency in real-time per-frame semantic video segmentation

Python ★ 6 4y ago
Explain →
OIR-Diffusion

No description.

JavaScript ★ 1 1y ago
Explain →
SegVit ⑂

Official Pytorch Implementation of SegViT: Semantic Segmentation with Plain Vision Transformers

Python ★ 0 2y ago
Explain →
ademxapp ⑂

Code for https://arxiv.org/abs/1611.10080

Python ★ 0 2y ago
Explain →