IIITH

Character-aware Subtitling through Improved Alignment and Diverse Priors

Adhiraj Anil Deshmukh,Makarand Tapaswi,Vineet Gandhi

International Conference on Signal Processing and Communications, SPCOM, 2026

Core Rank : - Google Rank : 10

Abs PDF bibTex

The Sound of Water: Inferring Physical Properties from Pouring Liquids

Piyush Bagad,Makarand Tapaswi,Cees G. M. Snoek,Andrew Zisserman

IEEE Transactions on Pattern Analysis and Machine Intelligence, IEEE-PAMI, 2026

Google Rank : 460

Abs PDF DOI bibTex

SRL-CLIP: Efficient CLIP Video Adaptation via Structured Semantic Role Labels

Darshan Singh S,Zeeshan Khan,Makarand Tapaswi

Computer Vision and Pattern Recognition Conference workshops, CVPR-W, 2026

Core Rank : - Google Rank : -

Abs PDF bibTex

One Identity, Many Roles: Multimodal Entity Coreference for Enhanced Video Situation Recognition

Balaji Darur,Amanmeet Garg,Makarand Tapaswi

Computer Vision and Pattern Recognition - Findings, CVPR-F, 2026

Abs PDF DOI bibTex

STRinGS: Selective Text Refinement in Gaussian Splatting

Abhinav Digambar Raundhal,Gaurav Behera,Narayanan P J,Ravi Kiran Sarvadevabhatla,Makarand Tapaswi

Winter Conference on Applications of Computer Vision, WACV, 2026

Core Rank : A Google Rank : 109

Abs PDF DOI bibTex

Auditory CNN Analysis: What Do Layers Encode?

Pratyaksh Gautam,Makarand Tapaswi,Vinoo Alluri R

International Conference on Music Perception and Cognition, ICMPC, 2025

Core Rank : - Google Rank : -

Abs PDF bibTex

MALeR: Improving Compositional Fidelity in Layout-Guided Generation

Shivank Saxena,Dhruv Srivastava,Makarand Tapaswi

ACM Transactions on Graphics, ACM-TG, 2025

Google Rank : 271

Abs PDF bibTex

What You See is What You Ask: Evaluating Audio Descriptions

Divy Kala,Eshika Khandelwal,Makarand Tapaswi

Conference on Empirical Methods in Natural Language Processing, EMNLP, 2025

Core Rank : A* Google Rank : 193

Abs PDF bibTex

Investigating Mechanisms for In-Context Vision Language Binding

Darshana S,Makarand Tapaswi,Vineet Gandhi

Computer Vision and Pattern Recognition Conference workshops, CVPR-W, 2025

Core Rank : - Google Rank : -

Abs PDF bibTex

IdentifyMe: A Challenging Mention Resolution Benchmark for LLMs

S Kawshik Manikantan,Makarand Tapaswi,Vineet Gandhi,Shubham Toshniwal

North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL- HLT, 2025

Core Rank : A Google Rank : 132

Abs PDF DOI bibTex