IMG

Dissecting errors in machine learning for retrosynthesis: a granular metric framework and a transformer-based model for more informative predictions
Arihanth Srikar Tadanki, H Surya Prakash Rao, Deva Priyakumar U
Digital Discovery, DD, 2025
Core Rank : - Google Rank :20
A Preliminary Analysis of Automatic Word and Syllable Prominence Detection in Non-Native Speech With Text-to-Speech Prosody Embeddings
Anindita Mondal, Rangavajjala Sankara Bharadwaj, Mallela Jhansi, Anil Kumar Vuppala, Chiranjeevi Yarra
Technical Report, arXiv, 2025
Core Rank : - Google Rank :-
MAPWise: Evaluating Vision-Language Models for Advanced Map Queries
Srija Mukhopadhyay, Abhishek Rajgaria, Prerana Khatiwada, Manish Shrivastava, Dan Roth, Vivek Gupta
North American Association for Computational Linguistics, NAACL, 2025
Core Rank : A
EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues
Girmaji Rohit, Bhav Beri, Ramanathan Subramanian, Vineet Gandhi
International Conference on Intelligent User Interfaces, IUI, 2025
Core Rank : A Google Rank :52
MRI2Speech: Speech Synthesis from Articulatory Movements Recorded by Real-time MRI
Neilkumar Milankumar Shah, Ayan Kashyap, Shirish Karande, Vineet Gandhi
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
Advancing NAM-to-Speech Conversion with Novel Methods and the MultiNAM Dataset
Neilkumar Milankumar Shah, Shirish Karande, Vineet Gandhi
Technical Report, arXiv, 2025
Core Rank : - Google Rank :-
The Sound of Water: Inferring Physical Properties from Pouring Liquids
Piyush Bagad, Makarand Tapaswi, Cees G. M. Snoek, Andrew Zisserman
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
No Detail Left Behind: Revisiting Self-Retrieval for Fine-Grained Image Captioning
Manu Gaur, Darshan Singh S, Makarand Tapaswi
Transactions in Machine Learning Research, TMLR, 2025
Core Rank : - Google Rank :-
Minimalistic Video Saliency Prediction via Efficient Decoder & Spatio Temporal Action Cues
Girmaji Rohit, Siddharth Jain, Bhav Beri, Sarthak Bansal, Vineet Gandhi
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
A Multi-modal Approach to Dysarthria Detection and Severity Assessment Using Speech and Text Information
Anuprabha, Krishna Gurugubelli, Kesavaraj V, Anil Kumar Vuppala
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129