Xuyang Liu

M.S. @ Sichuan University | Intern @ OPPO Research Institute

432 contributions in the last year

2-day current streak·6-day longest streak

Jun 2025

15161718192021222324252627282930

Jul 2025

12345678910111213141516171819202122232425262728293031

Aug 2025

12345678910111213141516171819202122232425262728293031

Sep 2025

123456789101112131415161718192021222324252627282930

Oct 2025

12345678910111213141516171819202122232425262728293031

Nov 2025

123456789101112131415161718192021222324252627282930

Dec 2025

12345678910111213141516171819202122232425262728293031

Jan 2026

12345678910111213141516171819202122232425262728293031

Feb 2026

12345678910111213141516171819202122232425262728

Mar 2026

12345678910111213141516171819202122232425262728293031

Apr 2026

123456789101112131415161718192021222324252627282930

May 2026

12345678910111213141516171819202122232425262728293031

Jun 2026

12345678910111213141516171819

🌈 I am Xuyang Liu (刘旭洋), an incoming PhD student at PolyU, where I will join the VC Lab under the supervision of Prof. Lei Zhang (IEEE Fellow). I am…

🌈 I am Xuyang Liu (刘旭洋), an incoming PhD student at PolyU, where I will join the VC Lab under the supervision of Prof. Lei Zhang (IEEE Fellow). I am also currently working as a research intern at OPPO Research Institute. Previously, I earned my M.S. from Sichuan University and spent a wonderful year interning at Alibaba Group and Ant Group. I am fortunate to work closely with Dr. Siteng Huang and Prof. Linfeng Zhang.

📌 My research centers on Efficient Multimodal Large Language Models (MLLMs), including:

🖼️ Image Understanding: high-resolution understanding via context compression and fast decoding, including GlobalCom2[AAAI'26], V2Drop[CVPR'26], FiCoCo[AAAI'26], and MixKV[ICLR'26].
🎬 Video Understanding: long/audio-video, and streaming reasoning via efficient encoding and compression, including VidCom2[EMNLP'25], STC[CVPR'26], V-CAST, and OmniSIFT[ICML'26].
🎨 Content Generation: lightweight and efficient AIGC via feature caching, pruning, and fast decoding, including ToCa[ICLR'25], Flash-Unified[CVPR'26 Findings], and STDec.
⚙️ Efficiency Toolbox: efficient transfer/fine-tuning and benchmarking for downstream task adaptation, including M2IST[TCSVT'25], V-PETL[NeurIPS'24], and AutoGnothi[ICLR'25].

📢 If you find these directions interesting, feel free to reach out via email: [email protected].

Show forks Show archived