2-day current streak·6-day longest streak
🌈 I am Xuyang Liu (刘旭洋), an incoming PhD student at PolyU, where I will join the VC Lab under the supervision of Prof. Lei Zhang (IEEE Fellow). I am…
🌈 I am Xuyang Liu (刘旭洋), an incoming PhD student at PolyU, where I will join the VC Lab under the supervision of Prof. Lei Zhang (IEEE Fellow). I am also currently working as a research intern at OPPO Research Institute. Previously, I earned my M.S. from Sichuan University and spent a wonderful year interning at Alibaba Group and Ant Group. I am fortunate to work closely with Dr. Siteng Huang and Prof. Linfeng Zhang.
📌 My research centers on Efficient Multimodal Large Language Models (MLLMs), including:
- 🖼️ Image Understanding: high-resolution understanding via context compression and fast decoding, including GlobalCom2[AAAI'26], V2Drop[CVPR'26], FiCoCo[AAAI'26], and MixKV[ICLR'26].
- 🎬 Video Understanding: long/audio-video, and streaming reasoning via efficient encoding and compression, including VidCom2[EMNLP'25], STC[CVPR'26], V-CAST, and OmniSIFT[ICML'26].
- 🎨 Content Generation: lightweight and efficient AIGC via feature caching, pruning, and fast decoding, including ToCa[ICLR'25], Flash-Unified[CVPR'26 Findings], and STDec.
- ⚙️ Efficiency Toolbox: efficient transfer/fine-tuning and benchmarking for downstream task adaptation, including M2IST[TCSVT'25], V-PETL[NeurIPS'24], and AutoGnothi[ICLR'25].
[email protected].-
Awesome-Generation-Acceleration ★ PINNED
📚 Collection of awesome generation acceleration resources.
★ 400 11mo agoExplain → -
Awesome-Token-level-Model-Compression ★ PINNED
📚 Collection of token-level model compression resources.
★ 198 9mo agoExplain → -
VidCom2 ★ PINNED
[EMNLP 2025 Main] Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models
Python ★ 126 1mo agoExplain → -
GlobalCom2 ★ PINNED
[AAAI 2026] Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models
Python ★ 42 4mo agoExplain → -
MixKV ★ PINNED
[ICLR 2026] Mixing Importance with Diversity: Joint Optimization for KV Cache Compression in Large Vision-Language Models
Python ★ 29 3mo agoExplain → -
V2Drop
[CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models
Python ★ 30 23d agoExplain → -
hermes-code-bridge
Use Hermes Agent as the control plane for local coding agents like Codex, Kimi Code, Claude Code, OpenCode, and Gemini CLI.
Python ★ 18 22d agoExplain → -
VGDiffZero
[ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders
Python ★ 17 1y agoExplain → -
M2IST
[TCSVT 2025] M2IST: Multi-Modal Interactive Side-Tuning for Efficient Referring Expression Comprehension
Python ★ 8 1y agoExplain → -
VALSE-24-Notes
VALSE 2024 | Useful Slides
★ 7 2y agoExplain → -
GLMLP-TRANS
[COMCOM 2022] GLMLP-TRANS: A transportation mode detection model using lightweight sensors integrated in smartphones
Python ★ 3 2y agoExplain → -
xuyang-liu16.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
JavaScript ★ 1 1h agoExplain → -
xuyang-liu16
No description.
★ 1 4d agoExplain → -
matplotlib-examples
Various codes of figures created by matplotlib in my paper
Python ★ 1 1y agoExplain → -
skills-2 ⑂
Give your agents the power of the Hugging Face ecosystem
★ 0 22d agoExplain → -
skills-1 ⑂
Public repository for Agent Skills
★ 0 22d agoExplain → -
skills ⑂
Skills Catalog for Codex
★ 0 22d agoExplain → -
Academic-project-page-template ⑂
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
★ 0 9mo agoExplain →
No repos match these filters.