👤 About Me

I am currently a first-year Ph.D. student in Computer Science at the University of Western Australia (UWA), advised by Prof. Mohammed Bennamoun and Prof. Farid Boussaid, and jointly advised by Dr. Qiuhong Ke at Monash University. My research interests include Video Understanding, Multimodal Large Language Models (MLLMs), Model Generalization, and Image/Video Restoration. I love anime, so I am also looking for ACG-related topics.

📝 Selected Publications

*: Equal Contribution; †: Corresponding Author

ICCV 2023

High-resolution Document Shadow Removal via A Large-scale Real-world Dataset and A Frequency-aware Shadow Erasing Net

Zinuo Li*, Xuhang Chen*, Chi-Man Pun† and Xiaodong Cun†

[Paper]  [Code]

IJCAI 2023

A Large-scale Film Style Dataset for Learning Multi-frequency Driven Film Enhancement

Zinuo Li*, Xuhang Chen*, Chi-Man Pun† and Shuqiang Wang†

[Paper]  [Code]

AAAI 2024

Devignet: High-Resolution Vignetting Removal via a Dual Aggregated Fusion Transformer With Adaptive Channel Expansion

Shenghong Luo*, Xuhang Chen*, Weiwen Chen, Zinuo Li, Shuqiang Wang† and Chi-Man Pun†

[Paper]  [Code]

📝 arXiv Papers

*: Equal Contribution; †: Corresponding Author

arXiv

Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM

Zinuo Li, Xian Zhang, Yongxin Guo, Mohammed Bennamoun, Farid Boussaid, Girish Dwivedi, Luqi Gong† and Qiuhong Ke†

[Paper]  [Code]