MICap: A Unified Model for Identity-aware Movie Descriptions
Haran S K Raajesh, Naveen Reddy Desanur, Zeeshan Khan, Makarand Tapaswi
Computer Vision and Pattern Recognition, CVPR, 2024
Core Rank : A* Google Rank :440
Previously On ... From Recaps to Story Summarization
Aditya Kumar Singh, Dhruv Srivastava, Makarand Tapaswi
Computer Vision and Pattern Recognition, CVPR, 2024
Core Rank : A* Google Rank :440
How you feelin'? Learning Emotions and Mental States in Movie Scenes
Dhruv Srivastava, Aditya Kumar Singh, Makarand Tapaswi
Computer Vision and Pattern Recognition, CVPR, 2023
Core Rank : A* Google Rank :440
GrapeQA: GRaph Augmentation and Pruning to Enhance Question-Answering
Dhaval Taunk, Lakshya Khanna, Kandru Siri Venkata Pavan Kumar, Vasudeva Varma Kalidindi, Charu Sharma, Makarand Tapaswi
WWW Workshop on Natural Language Processing for Knowledge Graph Construction, NLP4KGc, 2023
Core Rank : - Google Rank :-
Do Video-Language Foundation Models Have a Sense of Time?
Piyush Bagad, Makarand Tapaswi, Cees G. M. Snoek
Workshop at the International Conference on Learning Representations, ICLR-W, 2023
Core Rank : - Google Rank :-
Test of Time: Instilling Video-Language Models with a Sense of Time
Piyush Bagad, Makarand Tapaswi, Cees G. M. Snoek
Computer Vision and Pattern Recognition, CVPR, 2023
Core Rank : A* Google Rank :440
Unsupervised Audio-Visual Lecture Segmentation
Darshan Singh S, Anchit Gupta, Jawahar C V, Makarand Tapaswi
Winter Conference on Applications of Computer Vision, WACV, 2023
Core Rank : - Google Rank :109
Learning from Unlabeled 3D Environments for Vision-and-Language Navigation
Shizhe Chen, Pierre-Louis Guhur, Makarand Tapaswi, Cordelia Schmid, Ivan Laptev
European Conference on Computer Vision, ECCV, 2022
Core Rank : A* Google Rank :206
Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
Shizhe Chen, Pierre-Louis Guhur, Makarand Tapaswi, Cordelia Schmid, Ivan Laptev
Computer Vision and Pattern Recognition, CVPR, 2022
Core Rank : A* Google Rank :440
Learning Object Manipulation Skills from Video via Approximate Differentiable Physics
Vladimír Petrík, Mohammad Nomaan Qureshi, Josef Sivic, Makarand Tapaswi
International Conference on Intelligent Robots and Systems, IROS, 2022
Core Rank : A Google Rank :86