Anindya Mondal
PhD Student in Computer Vision and AI
Surrey Institute for People-Centred AI (CVSSP), University of Surrey, UK
I am a final-year PhD student advised by Dr. Anjan Dutta, Dr. Xiatian Zhu, and Dr. Joaquin M. Prada. My research focuses on vision-language representation learning with applications in action recognition, object counting, and text-to-image synthesis. I aim to develop practical algorithms that enable robust scene understanding and generation in real-world settings by effectively integrating multi-modal signals.
Email: a.mondal [at] surrey.ac.uk
Research Interests
- Vision-Language Models: Multi-modal representation learning for robust scene understanding
- Generative AI: Text-to-image synthesis with precise instance control and counting
- Action Recognition: Actor-agnostic video understanding using multi-modal queries
- Object Counting: Multi-label counting with semantic-geometric priors for real-world applications
Publications
News
| Jan 2025 | Paper on multi-label object counting accepted at AAAI 2025! |
| Dec 2024 | Awarded AAAI 2025 Conference Travel Grant (worth $1200) |
| Oct 2023 | Presented work on actor-agnostic action recognition at ICCV Workshop 2023 in Paris |
| Sep 2022 | Started PhD at University of Surrey with full studentship funding |
Education
-
PhD in Computer Vision and AI (2022 – Present)
University of Surrey, UK
Awards & Recognition
- AAAI 2025 Conference Travel Grant (worth $1200), Philadelphia, USA
- ICCV 2023 Conference Grant, Paris, France
- University of Surrey Postgraduate Studentship (2022 – 2025), Full PhD funding
- Uplink Research Internship Award, ACM SIGKDD India Chapter
Teaching
-
Teaching Assistant (2023 – 2025), University of Surrey
Academic Service
- Peer Reviewer: ICASSP, ICCV, CVPR, ECCV, NeurIPS, ICPR, IEEE Transactions on Signal Processing (TSP), IEEE Transactions on Signal and Information Processing over Networks (TSIPN)