The Sound of Water: Inferring Physical Properties from Pouring Liquids
Piyush Bagad, Makarand Tapaswi, Cees G. M. Snoek, Andrew Zisserman
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
No Detail Left Behind: Revisiting Self-Retrieval for Fine-Grained Image Captioning
Manu Gaur, Darshan Singh S, Makarand Tapaswi
Transactions on Machine Learning Research, TMLR, 2025
Core Rank : - Google Rank :-
Seeing Eye to AI: Comparing Human Gaze and Model Attention in Video Memorability
Prajneya Kumar, Eshika Khandelwal, Makarand Tapaswi, Vishnu Sreekumar
Winter Conference on Applications of Computer Vision, WACV, 2025
Core Rank : - Google Rank :109
Major Entity Identification: A Generalizable Alternative to Coreference Resolution
S Kawshik Manikantan, Shubham Toshniwal, Makarand Tapaswi, Vineet Gandhi
Conference on Empirical Methods in Natural Language Processing, EMNLP, 2024
Core Rank : A* Google Rank :193
MICap: A Unified Model for Identity-aware Movie Descriptions
Haran S K Raajesh, Naveen Reddy Desanur, Zeeshan Khan, Makarand Tapaswi
Computer Vision and Pattern Recognition, CVPR, 2024
Core Rank : A* Google Rank :440
Previously On ... From Recaps to Story Summarization
Aditya Kumar Singh, Dhruv Srivastava, Makarand Tapaswi
Computer Vision and Pattern Recognition, CVPR, 2024
Core Rank : A* Google Rank :440
How you feelin'? Learning Emotions and Mental States in Movie Scenes
Dhruv Srivastava, Aditya Kumar Singh, Makarand Tapaswi
Computer Vision and Pattern Recognition, CVPR, 2023
Core Rank : A* Google Rank :440
GrapeQA: GRaph Augmentation and Pruning to Enhance Question-Answering
Dhaval Taunk, Lakshya Khanna, Kandru Siri Venkata Pavan Kumar, Vasudeva Varma Kalidindi, Charu Sharma, Makarand Tapaswi
WWW Workshop on Natural Language Processing for Knowledge Graph Construction, NLP4KGc, 2023
Core Rank : - Google Rank :-
Do Video-Language Foundation Models Have a Sense of Time?
Piyush Bagad, Makarand Tapaswi, Cees G. M. Snoek
Workshop at the International Conference on Learning Representations, ICLR-W, 2023
Core Rank : - Google Rank :-
Test of Time: Instilling Video-Language Models with a Sense of Time
Piyush Bagad, Makarand Tapaswi, Cees G. M. Snoek
Computer Vision and Pattern Recognition, CVPR, 2023
Core Rank : A* Google Rank :440