Zhixi Cai
Monash University, Melbourne, Australia.
I'm currently a Research Fellow (Post-Doctoral) at the VL4AI Lab, Faculty of IT, Monash University, supervised by Dr. Hamid Rezatofighi. My core research area is video understanding and reasoning using neurosymbolic methods and large language models.
We believe the next wave of AI will be agentic, verifiable systems that plan, act, and explain their decisions by composing neural perception with symbolic representations and logic. Progress will come from combining neural and symbolic methods with built-in verification. Our aim is AI that is safe by default, auditable, and robust.
I obtained my PhD from Monash University in the artificial intelligence domain, supervised by A/Prof. Kalin Stefanov, A/Prof. Abhinav Dhall and Dr. Munawar Hayat. My thesis, Content-Driven Multimodal Deepfake Generation and Temporal Localization, focuses mainly on deepfakes and video understanding.
I have published papers in CVPR, ECCV, ICCV, ACM MM, and other venues, and received two best paper awards during my PhD. Please refer to the publication page for more details.
I'm an Associate Editor (Area Chair) of IROS. I have also been invited to review for CVPR, ICCV, ECCV, ACM MM, ICRA, TPAMI, TMM, TAFFC, and more.
I enjoy programming and implementing cool ideas. I have developed several interesting open-source applications and libraries in my spare time. Please refer to the projects page for more details.
I also love discovering and fine-tuning the tools I use, both software and physical.
news
| Jan 27, 2026 | A paper is accepted by ICLR 2026. |
|---|---|
| Nov 08, 2025 | A paper is accepted by AAAI 2026. |
| Oct 13, 2025 | Hosted a tutorial (Multimodal Deepfake Generation and Detection: Challenges, Methods, and Future Directions) at ICMI 2025. |
| Jul 07, 2025 | A paper is accepted by RA-L. |
| Jun 26, 2025 | Two papers are accepted by ICCV 2025. |