-
Recap-DataComp-1B ★ PINNED
[ICML 2025] This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"
★ 151 2y agoExplain → -
MedTrinity-25M ★ PINNED
[ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine“
Python ★ 410 11mo agoExplain → -
story-iter ★ PINNED
[ICLR 2026] A Training-free Iterative Framework for Long Story Visualization
Python ★ 958 2mo agoExplain → -
MedReason ★ PINNED
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs
Python ★ 277 1y agoExplain → -
VLAA-Thinking ★ PINNED
[TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models
Python ★ 149 8mo agoExplain → -
OpenVision ★ PINNED
OpenVision (ICCV 2025), OpenVision 2 (CVPR 2026), and OpenVision 3
Python ★ 484 3mo agoExplain → -
CLIPA
[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"
Python ★ 321 2y agoExplain → -
RobustCNN
[ICLR 2023] This repository includes the official implementation our paper "Can CNNs Be More Robust Than Transformers?"
Python ★ 144 3y agoExplain → -
SwinMM
[MICCAI 2023] This repository includes the official implementation our paper "SwinMM: Masked Multi-view with Swin Transformers for 3D Medical Image Segmentation"
Python ★ 123 2y agoExplain → -
HQ-Edit
[ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing
Python ★ 113 2y agoExplain → -
DMAE
[CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"
Python ★ 109 2y agoExplain → -
vllm-safety-benchmark
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
Python ★ 88 2y agoExplain → -
CIK-Bench
Official repository for Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw
Shell ★ 69 1mo agoExplain → -
MedVLThinker
[ML4H'25] MedVLThinker: Simple Baselines for Multimodal Medical Reasoning
Jupyter Notebook ★ 59 6mo agoExplain → -
MicroDiffusion
[CVPR 2024] This repository includes the official implementation our paper "MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections"
Python ★ 55 2y agoExplain → -
m1
[ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models
Jupyter Notebook ★ 49 6mo agoExplain → -
o1_medical
No description.
Python ★ 48 1y agoExplain → -
CRATE-alpha
This repository includes the official implementation our paper "Scaling White-Box Transformers for Vision"
Python ★ 47 2y agoExplain → -
ReasoningEval
Official repo of Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains.
Python ★ 43 1y agoExplain → -
EVP
[TMLR'24] This repository includes the official implementation our paper "Unleashing the Power of Visual Prompting At the Pixel Level"
Python ★ 42 2y agoExplain → -
CLIPS
An Enhanced CLIP Framework for Learning with Synthetic Captions
Python ★ 40 1y agoExplain → -
STAR-1
[AAAI'26 Oral] Official Implementation of STAR-1: Safer Alignment of Reasoning LLMs with 1K Data
Python ★ 37 1y agoExplain → -
MixCon3D
[CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"
Python ★ 35 2y agoExplain → -
VLAA-GUI
Official implementation of VLAA-GUI series
Python ★ 32 1mo agoExplain → -
MeDiM
No description.
Python ★ 32 6mo agoExplain → -
Complex-Edit
Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark
Python ★ 28 1y agoExplain → -
VisualClaw
Official Implementation of VisualClaw: A Real-Time, Personalized Agent for the Physical World
Python ★ 27 3d agoExplain → -
EpiFoundation
Pytorch implementation of EpiFoundation
Python ★ 27 1y agoExplain → -
AttnGCG-attack
[TMLR 2025] Official implementation of AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation
Python ★ 26 1y agoExplain → -
ClinSeekAgent
No description.
Python ★ 25 18d agoExplain → -
FedConv
[TMLR'24] This repository includes the official implementation our paper "FedConv: Enhancing Convolutional Neural Networks for Handling Data Heterogeneity in Federated Learning"
Python ★ 25 2y agoExplain → -
AdvXL
[CVPR 2024] This repository includes the official implementation our paper "Revisiting Adversarial Training at Scale"
Python ★ 20 2y agoExplain → -
Sight-Beyond-Text
[TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"
Python ★ 20 2y agoExplain → -
MedVLSynther
[ICLR'26] MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs
Python ★ 19 7mo agoExplain → -
Image-Pretraining-for-Video
[ECCV 2022] This repository includes the official implementation our paper "In Defense of Image Pre-Training for Spatiotemporal Recognition".
Python ★ 19 3y agoExplain → -
EarthWhere
No description.
Python ★ 16 7mo agoExplain → -
VLM-CapCurriculum
No description.
Python ★ 13 1mo agoExplain → -
Redteaming_Challenge
No description.
Python ★ 7 1y agoExplain → -
AgentPressureBench
No description.
Python ★ 5 1mo agoExplain → -
AQA-Bench
Algorithmic-Q&A-Bench: An Interactive Benchmark for Evaluating LLMs’ Sequential Reasoning Ability
Python ★ 4 2y agoExplain → -
ViLBench
[EMNLP'25] Official Python Implementation of ViLBench: A Suite for Vision-Language Process Reward Modeling
Python ★ 3 6mo agoExplain → -
vit_cert
[ECCV 2022] This repository includes the official implementation our paper "ViP: Unified Certified Detection and Recovery for Patch Attack with Vision Transformers"
Python ★ 3 3y agoExplain → -
Compress-Align
This repository includes the official implementation and dataset of our paper "Compress & Align: Curating Image-Text Data with Human Knowledge".
★ 2 2y agoExplain → -
UCSC-VLAA.github.io
No description.
JavaScript ★ 0 1d agoExplain → -
o1_medicine
No description.
JavaScript ★ 0 1y agoExplain →
No repos match these filters.