-
recogdrive
[ICLR 2026] ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving
Python ★ 548 5h agoExplain → -
dggt
DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images
Python ★ 538 5mo agoExplain → -
onevl
No description.
Python ★ 430 22d agoExplain → -
dasheng-lm
Efficient audio understanding with general audio captions
Python ★ 426 1mo agoExplain → -
r1-aqa
🤗 R1-AQA Model: mispeech/r1-aqa
Python ★ 326 1y agoExplain → -
unidrivevla
UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving
Python ★ 191 2mo agoExplain → -
controlfoley
ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handling
Python ★ 136 8d agoExplain → -
lego-edit
No description.
Jupyter Notebook ★ 130 9mo agoExplain → -
diffrhythm2
No description.
Python ★ 120 7mo agoExplain → -
svor
SVOR - Stable Video Object Removal
Python ★ 108 1mo agoExplain → -
colar
[NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains
Python ★ 97 2mo agoExplain → -
time-r1
[NeurIPS'25] Time-R1: Post-Training Large Vision Language Model for Temporal Video Grounding
Python ★ 95 6mo agoExplain → -
genesis
[NeurIPS 2025]Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency
★ 88 9mo agoExplain → -
shuffle-r1
Official code repository of Shuffle-R1
Python ★ 82 3mo agoExplain → -
q-frame
[ICCV 2025] Implementation of the paper "Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs"
Python ★ 80 7mo agoExplain → -
dasheng-denoiser
Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative Audioencoders
Python ★ 80 1y agoExplain → -
dasheng-glap
Official Implementation of GLAP - General Language Audio Pretraining
Python ★ 73 1mo agoExplain → -
drivelaw
[CVPR2026] DriveLaW: Unifying Planning and Video Generation in a Latent Driving World
Python ★ 69 23d agoExplain → -
xares-llm
XARES-LLM
Python ★ 55 2mo agoExplain → -
worldsplat
[ICLR 2026] WorldSplat: Gaussian-Centric Feed-Forward 4D Scene Generation for Autonomous Driving
Python ★ 51 2mo agoExplain → -
gemmax
Gemma-based Multilingual Machine Translation Models
Shell ★ 48 4mo agoExplain → -
tts-prism
No description.
Python ★ 47 1mo agoExplain → -
dasheng-audiogen
end-to-end text to audio scene generation model
★ 42 3d agoExplain → -
mecat
No description.
Python ★ 41 1mo agoExplain → -
dasheng-tokenizer
State-of-the-art continious audio tokenization
★ 39 3mo agoExplain → -
traqpoint
No description.
Python ★ 31 1mo agoExplain → -
acavcaps
No description.
★ 30 2mo agoExplain → -
timeviper
[CVPR'26] TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding
Python ★ 26 5mo agoExplain → -
guievalkit
GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents
Python ★ 23 3mo agoExplain → -
btl-ui
[NeurIPS 2025] Implementation of the paper "BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent"
Python ★ 19 6mo agoExplain → -
dar
DAR introduces the diagonal scanning order for next-token prediction and proposes a direction-aware autoregressive transformer framework.
Python ★ 19 1y agoExplain → -
pixel-perfect-depth
[NeurIPS 2025] Pixel-Perfect Depth
Python ★ 11 8mo agoExplain → -
mobilebench-ol
No description.
Python ★ 9 2mo agoExplain → -
prove
PROVE: A Perceptual RemOVal cohErence Benchmark for Visual Media
Python ★ 9 1mo agoExplain → -
proactive-mobile
ProactiveMobile
★ 6 3mo agoExplain → -
backtrackagent
No description.
Python ★ 4 7mo agoExplain → -
reachagent
No description.
Python ★ 4 1y agoExplain → -
drivemrp
DriveMRP: Enhancing Vision-Language Models with Synthetic Motion Data for Motion Risk Prediction
JavaScript ★ 4 11mo agoExplain → -
iot_spec_llm
No description.
Python ★ 3 2mo agoExplain → -
automine
No description.
Python ★ 2 9d agoExplain → -
ufo
No description.
Python ★ 2 1mo agoExplain → -
icpo
No description.
Python ★ 2 23d agoExplain → -
xiaomi-iot-suggest-llm
No description.
Python ★ 1 6mo agoExplain → -
uni-gaussians
No description.
HTML ★ 1 1y agoExplain → -
hyperclick
No description.
★ 0 5mo agoExplain → -
cogen
No description.
HTML ★ 0 1y agoExplain →
No repos match these filters.