IMG

Prompt-to-Correct: Automated Test-Time Pronunciation Correction with Voice Prompts
Ayan Kashyap, Neilkumar Milankumar Shah, Vineet Gandhi
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues
Girmaji Rohit, Bhav Beri, Ramanathan Subramanian, Vineet Gandhi
International Conference on Intelligent User Interfaces, IUI, 2025
Core Rank : A Google Rank :52
MRI2Speech: Speech Synthesis from Articulatory Movements Recorded by Real-time MRI
Neilkumar Milankumar Shah, Ayan Kashyap, Shirish Karande, Vineet Gandhi
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
Advancing NAM-to-Speech Conversion with Novel Methods and the MultiNAM Dataset
Neilkumar Milankumar Shah, Shirish Karande, Vineet Gandhi
Technical Report, arXiv, 2025
Core Rank : - Google Rank :-
Minimalistic Video Saliency Prediction via Efficient Decoder & Spatio Temporal Action Cues
Girmaji Rohit, Siddharth Jain, Bhav Beri, Sarthak Bansal, Vineet Gandhi
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
Major Entity Identification: A Generalizable Alternative to Coreference Resolution
S Kawshik Manikantan, Shubham Toshniwal, Makarand Tapaswi, Vineet Gandhi
Conference on Empirical Methods in Natural Language Processing, EMNLP, 2024
Core Rank : A* Google Rank :193
ParrotTTS: Text-to-speech synthesis exploiting disentangled self-supervised representations
Neilkumar Milankumar Shah, K Saiteja, Vishal Thambrahalli, Neha S, Anil Kumar Nelakanti, Vineet Gandhi
Conference of the European Chapter of the Association for Computational Linguistics (EACL), EACL, 2024
Core Rank : A Google Rank :56
StethoSpeech: Speech Generation Through a Clinical Stethoscope Attached to the Skin
Neilkumar Milankumar Shah, Neha S, Vishal Thambrahalli, Ramanathan Subramanian, Vineet Gandhi
international joint conference on pervasive and ubiquitous computing, Ubicomp, 2024
Core Rank : A* Google Rank :27
Towards Improving NAM-to-Speech Synthesis Intelligibility using Self-Supervised Speech Models
Neilkumar Milankumar Shah, Shirish Karande, Vineet Gandhi
Annual Conference of the International Speech Communication Association, INTERSPEECH, 2024
Core Rank : A Google Rank :111
Real Time GAZED: Online Shot Selection and Editing of Virtual Cameras from Wide-Angle Monocular Video Recordings
Achary Sudheer, Girmaji Rohit, Adhiraj Anil Deshmukh, Vineet Gandhi
Winter Conference on Applications of Computer Vision, WACV, 2024
Core Rank : - Google Rank :109