A Preliminary Analysis of Automatic Word and Syllable Prominence Detection in Non-Native Speech With Text-to-Speech Prosody Embeddings
Technical Report, arXiv, 2025
Core Rank : - Google Rank :-
MAPWise: Evaluating Vision-Language Models for Advanced Map Queries
North American Chapter of the Association for Computational Linguistics, NAACL, 2025
Core Rank : A
EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues
International Conference on Intelligent User Interfaces, IUI, 2025
Core Rank : A Google Rank :52
MRI2Speech: Speech Synthesis from Articulatory Movements Recorded by Real-time MRI
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
Advancing NAM-to-Speech Conversion with Novel Methods and the MultiNAM Dataset
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
Typical vs. Atypical Disfluency Classification: Introducing the IIITH-TISA Corpus and Temporal Context-Based Feature Representations
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
Enhancing Stutter Detection using Long-Term Average Spectrum Values
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
AdaptBot: Combining LLM with Knowledge Graphs and Human Input for Generic-to-Specific Task Decomposition and Knowledge Refinement
International Conference on Robotics and Automation, ICRA, 2025
Core Rank : A* Google Rank :122
The Sound of Water: Inferring Physical Properties from Pouring Liquids
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
No Detail Left Behind: Revisiting Self-Retrieval for Fine-Grained Image Captioning
Transactions on Machine Learning Research, TMLR, 2025
Core Rank : - Google Rank :-