IMG

Major Entity Identification: A Generalizable Alternative to Coreference Resolution
S Kawshik Manikantan, Shubham Toshniwal, Makarand Tapaswi, Vineet Gandhi
Conference on Empirical Methods in Natural Language Processing, EMNLP, 2024
Core Rank : A* Google Rank :193
MICap: A Unified Model for Identity-aware Movie Descriptions
Haran S K Raajesh, Naveen Reddy Desanur, Zeeshan Khan, Makarand Tapaswi
Computer Vision and Pattern Recognition, CVPR, 2024
Core Rank : A* Google Rank :440
Previously On ... From Recaps to Story Summarization
Aditya Kumar Singh, Dhruv Srivastava, Makarand Tapaswi
Computer Vision and Pattern Recognition, CVPR, 2024
Core Rank : A* Google Rank :440
Eye vs. AI: Human Gaze and Model Attention in Video Memorability
Prajneya Kumar, Eshika Khandelwal, Makarand Tapaswi, Vishnu Sreekumar
Technical Report, arXiv, 2023
Core Rank : - Google Rank :-
How you feelin? Learning Emotions and Mental States in Movie Scenes
Dhruv Srivastava, Aditya Kumar Singh, Makarand Tapaswi
Computer Vision and Pattern Recognition, CVPR, 2023
Core Rank : A* Google Rank :440
GrapeQA: GRaph Augmentation and Pruning to Enhance Question-Answering
Dhaval Taunk, Lakshya Khanna, Kandru Siri Venkata Pavan Kumar, Vasudeva Varma Kalidindi, Charu Sharma, Makarand Tapaswi
WWW Workshop on Natural Language Processing for Knowledge Graph Construction, NLP4KGc, 2023
Core Rank : - Google Rank :-
DO VIDEO-LANGUAGE FOUNDATION MODELS HAVE A SENSE OF TIME?
Piyush Bagad, Makarand Tapaswi, Cees G. M. Snoek
workshop on International Conference on Learning Representations, ICLR-W, 2023
Core Rank : - Google Rank :-
Test of Time: Instilling Video-Language Models with a Sense of Time
Piyush Bagad, Makarand Tapaswi, Cees G. M. Snoek
Computer Vision and Pattern Recognition, CVPR, 2023
Core Rank : A* Google Rank :440
Unsupervised Audio-Visual Lecture Segmentation
Darshan Singh S, Anchit Gupta, Jawahar C V, Makarand Tapaswi
Winter Conference on Applications of Computer Vision, WACV, 2023
Core Rank : - Google Rank :109
Learning from Unlabeled 3D Environments for Vision-and-Language Navigation
Shizhe Chen, Pierre-louis Guhur, Makarand Tapaswi, Cordelia Schmid, Ivan Laptev
European Conference on Computer Vision, ECCV, 2022
Core Rank : A* Google Rank :206