IIITH

Assessing active speaker detection algorithms through the lens of automated editing

Girmaji Rohit, Achary Sudheer, Adhiraj Anil Deshmukh, Vineet Gandhi

Workshop on Intelligent Cinematography and Editing, WICED, 2023

RobustL2S: Speaker-Specific Lip-to-Speech Synthesis exploiting Self-Supervised Representations

Neha Sahipjohn, Neil Shah, Vishal Tambrahalli, Vineet Gandhi

Technical Report, arXiv, 2023

Core Rank: -, Google Rank: -

Instance-Level Semantic Maps for Vision Language Navigation

Laksh Nanwani, Aditya Mathur, Anmol Agarwal, Kanishk Jain, Raghav Prabhakar, Aaron Anthony Monis, Krishna Murthy, Abdul Hafez, Vineet Gandhi, K Madhava Krishna

Technical Report, arXiv, 2023

Core Rank: -, Google Rank: -

Test-Time Amendment with a Coarse Classifier for Fine-Grained Classification

Kanishk Jain, Shyamgopal Karthik, Vineet Gandhi

Neural Information Processing Systems, NeurIPS, 2023

Core Rank: A*, Google Rank: 337

Ground then Navigate: Language-guided Navigation in Dynamic Scenes

Kanishk Jain, Varun Chhangani, Amogh Tiwari, K Madhava Krishna, Vineet Gandhi

International Conference on Robotics and Automation, ICRA, 2023

Core Rank: A*, Google Rank: 122

Bringing Generalization to Deep Multi-View Pedestrian Detection

Vora Jeet Vipul, Swetanjal Murati Dutta, Kanishk Jain, Shyamgopal Karthik, Vineet Gandhi

Winter Conference on Applications of Computer Vision Workshops, WACV-W, 2023

Core Rank: -, Google Rank: -

Framework to Computationally Analyse Kathakali Videos

Bulani Pratikkumar Sureshkumar Komal, S Jayachandran, Sarath S, Vineet Gandhi

Eurographics Workshop on Intelligent Cinematography and Editing, WICED, 2022

Core Rank: -, Google Rank: -

Cross-Domain Class-Contrastive Learning: Finding Lower Dimensional Representations for Improved Domain Generalization

Saransh Dave, Ritam Basu, Vineet Gandhi

Indian Conference on Computer Vision, Graphics and Image Processing, ICVGIP, 2022

Core Rank: -, Google Rank: -

Framework to Computationally Analyze Kathakali Videos

Bulani Pratikkumar Sureshkumar Komal, Jayachandran S, Sarath Sivaprasad, Vineet Gandhi

Eurographics Workshop on Intelligent Cinematography and Editing, WICED, 2022

Core Rank: -, Google Rank: -

Empathic Machines: Using Intermediate Features as Levers to Emulate Emotions in Text-To-Speech Systems

K Saiteja, Sarath S, Niranjan Pedanekar, Anil Nelakanti, Vineet Gandhi

Conference of the North American Chapter of the Association for Computational Linguistics, NAACL, 2022

Core Rank: A, Google Rank: 132
