IMG

Investigating Mechanisms for In-Context Vision Language Binding
Darshana S, Makarand Tapaswi, Vineet Gandhi
Computer Vision and Pattern Recognition Conference workshops, CVPR-W, 2025
Core Rank : - Google Rank :-
Pseudo-labelling meets Label Smoothing for Noisy Partial Label Learning
Darshana S, Naresh Manwani, Vineet Gandhi
Computer Vision and Pattern Recognition Conference workshops, CVPR-W, 2025
Core Rank : - Google Rank :-
TIDE: Training Locally Interpretable Domain Generalization Models Enables Test-time Correction
Aishwarya Agarwal, Srikrishna Karanam, Vineet Gandhi
Computer Vision and Pattern Recognition, CVPR, 2025
Core Rank : A* Google Rank :440
IdentifyMe: A Challenging Mention Resolution Benchmark for LLMs
S Kawshik Manikantan, Makarand Tapaswi, Vineet Gandhi, Shubham Toshniwal
North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL- HLT, 2025
Core Rank : A Google Rank :132
VELOCITI: Benchmarking Video-Language Compositional Reasoning with Strict Entailment
Darshana S, Varun Gupta, Darshan Singh S, Zeeshan Khan, Vineet Gandhi, Makarand Tapaswi
Computer Vision and Pattern Recognition, CVPR, 2025
Core Rank : A* Google Rank :440
Prompt-to-Correct: Automated Test-Time Pronunciation Correction with Voice Prompts
Ayan Kashyap, Neilkumar Milankumar Shah, Vineet Gandhi
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues
Girmaji Rohit, Bhav Beri, Ramanathan Subramanian, Vineet Gandhi
International Conference on Intelligent User Interfaces, IUI, 2025
Core Rank : A Google Rank :52
MRI2Speech: Speech Synthesis from Articulatory Movements Recorded by Real-time MRI
Neilkumar Milankumar Shah, Ayan Kashyap, Shirish Karande, Vineet Gandhi
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
Advancing NAM-to-Speech Conversion with Novel Methods and the MultiNAM Dataset
Neilkumar Milankumar Shah, Shirish Karande, Vineet Gandhi
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
Minimalistic Video Saliency Prediction via Efficient Decoder & Spatio Temporal Action Cues
Girmaji Rohit, Siddharth Jain, Bhav Beri, Sarthak Bansal, Vineet Gandhi
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129