IIITH

Investigating Mechanisms for In-Context Vision Language Binding

Darshana S, Makarand Tapaswi, Vineet Gandhi

Computer Vision and Pattern Recognition Conference workshops, CVPR-W, 2025

Core Rank : - Google Rank :-

Abs PDF bibTex

Pseudo-labelling meets Label Smoothing for Noisy Partial Label Learning

Darshana S, Naresh Manwani, Vineet Gandhi

Computer Vision and Pattern Recognition Conference workshops, CVPR-W, 2025

Core Rank : - Google Rank :-

Abs PDF bibTex

TIDE: Training Locally Interpretable Domain Generalization Models Enables Test-time Correction

Aishwarya Agarwal, Srikrishna Karanam, Vineet Gandhi

Computer Vision and Pattern Recognition, CVPR, 2025

Core Rank : A* Google Rank :440

Abs PDF bibTex

IdentifyMe: A Challenging Mention Resolution Benchmark for LLMs

S Kawshik Manikantan, Makarand Tapaswi, Vineet Gandhi, Shubham Toshniwal

North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL- HLT, 2025

Core Rank : A Google Rank :132

Abs PDF DOI bibTex

VELOCITI: Benchmarking Video-Language Compositional Reasoning with Strict Entailment

Darshana S, Varun Gupta, Darshan Singh S, Zeeshan Khan, Vineet Gandhi, Makarand Tapaswi

Computer Vision and Pattern Recognition, CVPR, 2025

Core Rank : A* Google Rank :440

Abs PDF bibTex

Prompt-to-Correct: Automated Test-Time Pronunciation Correction with Voice Prompts

Ayan Kashyap, Neilkumar Milankumar Shah, Vineet Gandhi

International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025

Core Rank : B Google Rank :129

Abs PDF bibTex

EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues

Girmaji Rohit, Bhav Beri, Ramanathan Subramanian, Vineet Gandhi

International Conference on Intelligent User Interfaces, IUI, 2025

Core Rank : A Google Rank :52

Abs PDF DOI bibTex

MRI2Speech: Speech Synthesis from Articulatory Movements Recorded by Real-time MRI

Neilkumar Milankumar Shah, Ayan Kashyap, Shirish Karande, Vineet Gandhi

International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025

Core Rank : B Google Rank :129

Abs PDF DOI bibTex

Advancing NAM-to-Speech Conversion with Novel Methods and the MultiNAM Dataset

Neilkumar Milankumar Shah, Shirish Karande, Vineet Gandhi

International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025

Core Rank : B Google Rank :129

Abs PDF bibTex

Minimalistic Video Saliency Prediction via Efficient Decoder & Spatio Temporal Action Cues

Girmaji Rohit, Siddharth Jain, Bhav Beri, Sarthak Bansal, Vineet Gandhi

International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025

Core Rank : B Google Rank :129

Abs PDF DOI bibTex