👤 About Me

I am currently a first-year Ph.D. student in Computer Science at the University of Western Australia (UWA), advised by Prof. Mohammed Bennamoun and Prof. Farid Boussaid, and jointly advised by Dr. Qiuhong Ke at Monash University. My research interests include Video Understanding, Multimodal Large Language Models (MLLMs), Model Generalization, and Image/Video Restoration. I love anime, so I am also looking for ACG-related topics.

📝 Selected Publications

ICCV 2023

High-resolution Document Shadow Removal via A Large-scale Real-world Dataset and A Frequency-aware Shadow Erasing Net

Zinuo Li, Xuhang Chen, Chi-Man Pun and Xiaodong Cun

[Paper]  [Code]

IJCAI 2023

A Large-scale Film Style Dataset for Learning Multi-frequency Driven Film Enhancement

Zinuo Li, Xuhang Chen, Chi-Man Pun and Shuqiang Wang

[Paper]  [Code]

AAAI 2024

Devignet: High-Resolution Vignetting Removal via a Dual Aggregated Fusion Transformer With Adaptive Channel Expansion

Shenghong Luo, Xuhang Chen, Weiwen Chen, Zinuo Li, Shuqiang Wang and Chi-Man Pun

[Paper]  [Code]

📝 arXiv Papers

arXiv

Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM

Zinuo Li, Xian Zhang, Yongxin Guo, Mohammed Bennamoun, Farid Boussaid, Girish Dwivedi, Luqi Gong and Qiuhong Ke

[Paper]  [Code]