IMG

Total Publications: 36
Character-aware Subtitling through Improved Alignment and Diverse Priors
Adhiraj Anil Deshmukh,Makarand Tapaswi,Vineet Gandhi
International Conference on Signal Processing and Communications, SPCOM, 2026
Core Rank : - Google Rank : 10
The Sound of Water: Inferring Physical Properties from Pouring Liquids
Piyush Bagad,Makarand Tapaswi,Cees G. M. Snoek,Andrew Zisserman
IEEE Transactions on Pattern Analysis and Machine Intelligence, IEEE-PAMI, 2026
Google Rank : 460
SRL-CLIP: Efficient CLIP Video Adaptation via Structured Semantic Role Labels
Darshan Singh S,Zeeshan Khan,Makarand Tapaswi
Computer Vision and Pattern Recognition Conference workshops, CVPR-W, 2026
Core Rank : - Google Rank : -
One Identity, Many Roles: Multimodal Entity Coreference for Enhanced Video Situation Recognition
Balaji Darur,Amanmeet Garg,Makarand Tapaswi
Computer Vision and Pattern Recognition - Findings, CVPR-F, 2026
STRinGS: Selective Text Refinement in Gaussian Splatting
Abhinav Digambar Raundhal,Gaurav Behera,Narayanan P J,Ravi Kiran Sarvadevabhatla,Makarand Tapaswi
Winter Conference on Applications of Computer Vision, WACV, 2026
Core Rank : A Google Rank : 109
Auditory CNN Analysis: What Do Layers Encode?
Pratyaksh Gautam,Makarand Tapaswi,Vinoo Alluri R
International Conference on Music Perception and Cognition, ICMPC, 2025
Core Rank : - Google Rank : -
MALeR: Improving Compositional Fidelity in Layout-Guided Generation
Shivank Saxena,Dhruv Srivastava,Makarand Tapaswi
ACM Transactions on Graphics, ACM-TG, 2025
Google Rank : 271
What You See is What You Ask: Evaluating Audio Descriptions
Divy Kala,Eshika Khandelwal,Makarand Tapaswi
Conference on Empirical Methods in Natural Language Processing, EMNLP, 2025
Core Rank : A* Google Rank : 193
Investigating Mechanisms for In-Context Vision Language Binding
Darshana S,Makarand Tapaswi,Vineet Gandhi
Computer Vision and Pattern Recognition Conference workshops, CVPR-W, 2025
Core Rank : - Google Rank : -
IdentifyMe: A Challenging Mention Resolution Benchmark for LLMs
S Kawshik Manikantan,Makarand Tapaswi,Vineet Gandhi,Shubham Toshniwal
North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL- HLT, 2025
Core Rank : A Google Rank : 132