publications

publications by category in reverse chronological order. generated by jekyll-scholar.

2024

  1. JRDB-Social: A Multifaceted Robotic Dataset for Understanding of Context and Dynamics of Human Interactions Within Social Groups
    Simindokht Jahangard, Zhixi Cai, Shiki Wen, and Hamid Rezatofighi
    arXiv preprint arXiv:2404.04458, 2024
  2. HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning
    Fucai Ke, Zhixi Cai, Simindokht Jahangard, Weiqing Wang, Pari Delir Haghighi, and Hamid Rezatofighi
    arXiv preprint arXiv:2403.12884, 2024

2023

  1. AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset
    Zhixi Cai, Shreya Ghosh, Aman Pankaj Adatia, Munawar Hayat, Abhinav Dhall, and Kalin Stefanov
    arXiv preprint arXiv:2311.15308, 2023
  2. A Multi-Label Speech Emotion Recognition for Cross Cultural Communication
    Tassadaq Hussain, Islam Nassar, Zhixi Cai, Hamid Rezatofighi, Munawar Hayat, and Nicholas Cummins
    In UKSPEECH, 2023
  3. Emolysis: A Multimodal Open-Source Group Emotion Analysis and Visualization Toolkit
    Shreya Ghosh, Zhixi Cai, Parul Gupta, Garima Sharma, Abhinav Dhall, Munawar Hayat, and Tom Gedeon
    arXiv preprint arXiv:2305.05255, 2023
  4. Pavlok-Nudge: A Feedback Mechanism for Atomic Behaviour Modification with Snoring Usecase
    Shreya Ghosh, Rakibul Hasan, Pradyumna Agrawal, Zhixi Cai, Susannah Soon, Abhinav Dhall, and Tom Gedeon
    arXiv preprint arXiv:2305.06110, 2023
  5. Glitch in the Matrix: A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization
    Computer Vision and Image Understanding, 2023
  6. MARLIN: Masked Autoencoder for facial video Representation LearnINg
    Zhixi Cai, Shreya Ghosh, Kalin Stefanov, Abhinav Dhall, Jianfei Cai, Hamid Rezatofighi, Reza Haffari, and Munawar Hayat
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023

2022

  1. Do You Really Mean That? Content Driven Audio-Visual Deepfake Dataset and Multimodal Method for Temporal Forgery Localization
    In 2022 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2022