ParrotTTS: Text-to-Speech synthesis by exploiting self-supervised representations
K Saiteja, Neil Kumar Shah, Vishal Tambrahalli, Neha S, Vineet Gandhi
Conference of the European Chapter of the Association for Computational Linguistics (EACL), EACL, 2024
Core Rank : A Google Rank :56
Real Time GAZED: Online Shot Selection and Editing of Virtual Cameras from Wide-Angle Monocular Video Recordings
Achary Sudheer, Girmaji Rohit, Adhiraj Anil Deshmukh, Vineet Gandhi
Winter Conference on Applications of Computer Vision, WCACV, 2024
Core Rank : - Google Rank :109
Bringing Generalization to Deep Multi-View Pedestrian Detection
Vora Jeet Vipul, Swetanjal Murati Dutta, Kanishk Jain, Shyamgopal Karthik, Vineet Gandhi
Winter Conference on Applications of Computer Vision Workshops, WCACV-W, 2023
Core Rank : - Google Rank :-
Ground then Navigate: Language-guided Navigation in Dynamic Scenes
Kanishk Jain, Varun Chhangani, Amogh Tiwari, K Madhava Krishna, Vineet Gandhi
International Conference on Robotics and Automation, ICRA, 2023
Core Rank : A* Google Rank :122
Test-Time Amendment with a Coarse Classifier for Fine-Grained Classification
Kanishk Jain, Shyamgopal Karthik, Vineet Gandhi
Neural Information Processing Systems, NeurIPS, 2023
Core Rank : A* Google Rank :337
Instance-Level Semantic Maps for Vision Language Navigation
Laksh Nanwani, Anmol Agarwal, Kanishk Jain, Raghav Prabhakar, Aaron Anthony Monis, Krishna Murthy, Abdul Hafez, Vineet Gandhi, K Madhava Krishna
Technical Report, arXiv, 2023
Core Rank : - Google Rank :-
RobustL2S: Speaker-Specific Lip-to-Speech Synthesis exploiting Self-Supervised Representations
Neha Sahipjohn, Neil Shah, Vishal Tambrahalli, Vineet Gandhi
Technical Report, arXiv, 2023
Core Rank : - Google Rank :-
Assessing active speaker detection algorithms through the lens of automated editing
Girmaji Rohit, Achary Sudheer, Adhiraj Anil Deshmukh, Vineet Gandhi
Workshop on Intelligent Cinematography and Editing, WICED, 2023
Real Time GAZED: Online Shot Selection and Editing of Virtual Cameras from Wide-Angle Monocular Video Recordings
Achary Sudheer, Girmaji Rohit, Adhiraj Anil Deshmukh, Vineet Gandhi
Technical Report, arXiv, 2023
Core Rank : - Google Rank :-
RobustL2S: Speaker-Specific Lip-to-Speech Synthesis exploiting Self-Supervised Representations
Neha S, Neilkumar Milankumar Shah, Vishal Tambrahalli, Vineet Gandhi
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), APSIPA, 2023
Core Rank : - Google Rank :-