IMG

EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues
International Conference on Intelligent User Interfaces, IUI, 2025
Core Rank : A Google Rank :52
MRI2Speech: Speech Synthesis from Articulatory Movements Recorded by Real-time MRI
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
Advancing NAM-to-Speech Conversion with Novel Methods and the MultiNAM Dataset
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
Typical vs. Atypical Disfluency Classification: Introducing the IIITH-TISA Corpus and Temporal Context-Based Feature Representations
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
Enhancing Stutter Detection using Long-Term Average Spectrum Values
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
AdaptBot: Combining LLM with Knowledge Graphs and Human Input for Generic-to-Specific Task Decomposition and Knowledge Refinement
International Conference on Robotics and Automation, ICRA, 2025
Core Rank : A* Google Rank :122
The Sound of Water: Inferring Physical Properties from Pouring Liquids
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
No Detail Left Behind: Revisiting Self-Retrieval for Fine-Grained Image Captioning
Transactions in Machine Learning Research, TMLR, 2025
Core Rank : - Google Rank :-
Minimalistic Video Saliency Prediction via Efficient Decoder & Spatio Temporal Action Cues
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
A Multi-modal Approach to Dysarthria Detection and Severity Assessment Using Speech and Text Information
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129