IMG

NAM-to-Speech Conversion with Multitask-Enhanced Autoregressive Models
Annual Conference of the International Speech Communication Association, INTERSPEECH, 2025
Core Rank : A Google Rank :111
Simplifying Knowledge Transfer in Pretrained Models
Transactions in Machine Learning Research, TMLR, 2025
Core Rank : - Google Rank :-
Investigating Mechanisms for In-Context Vision Language Binding
Computer Vision and Pattern Recognition Conference workshops, CVPR-W, 2025
Core Rank : - Google Rank :-
Pseudo-labelling meets Label Smoothing for Noisy Partial Label Learning
Computer Vision and Pattern Recognition Conference workshops, CVPR-W, 2025
Core Rank : - Google Rank :-
TIDE: Training Locally Interpretable Domain Generalization Models Enables Test-time Correction
Computer Vision and Pattern Recognition, CVPR, 2025
Core Rank : A* Google Rank :440
IdentifyMe: A Challenging Mention Resolution Benchmark for LLMs
North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL- HLT, 2025
Core Rank : A Google Rank :132
VELOCITI: Benchmarking Video-Language Compositional Reasoning with Strict Entailment
Computer Vision and Pattern Recognition, CVPR, 2025
Core Rank : A* Google Rank :440
Prompt-to-Correct: Automated Test-Time Pronunciation Correction with Voice Prompts
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues
International Conference on Intelligent User Interfaces, IUI, 2025
Core Rank : A Google Rank :52
MRI2Speech: Speech Synthesis from Articulatory Movements Recorded by Real-time MRI
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129