Publications

Action Recognition

Video Understanding

2025

  1. Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM
    Zinuo Li, Xian Zhang, Yongxin Guo, Mohammed Bennamoun, Farid Boussaid, Girish Dwivedi, Luqi Gong, and Qiuhong Ke
    🏆 NeurIPS 2025 | Advances in Neural Information Processing Systems | CCF-A | CORE-A*

Image/Video Generation

Other Publications