IIITH

Freeze and Reveal: Exposing Modality Bias in Vision-Language Models

Recent advance in Natural language Processing, RANLP, 2025

Core Rank : - Google Rank :25

Abs PDF DOI bibTex

@inproceedings{bib_Free_2025, AUTHOR = {Hruday, Kavuri Vivek and Vysishtya, Karanam and Jahnavi, Venkamsetty Venkata and Kriti, Madumadukala and Balaji, Darur Lakshmipathi and Kumaraguru, Ponnurangam }, TITLE = {Freeze and Reveal: Exposing Modality Bias in Vision-Language Models}, BOOKTITLE = {Recent advance in Natural language Processing}. YEAR = {2025}}

Freeze and Reveal: Exposing Modality Bias in Vision-Language Models

Abstract

Vision-Language Models (VLMs) achieve impressive multimodal performance but often inherit gender biases from their training data. This bias might be coming from both the vision and text modalities. In this work, we dissect the contributions of vision and text backbones to these biases by applying targeted debiasing—Counterfactual Data Augmentation (CDA) and Task Vector methods. Inspired by data-efficient approaches in hate speech classification, we introduce a novel metric, Degree of Stereotypicality (DoS), and a corresponding debiasing method, Data Augmentation Using DoS (DAUDoS), to reduce bias with minimal computational cost. We curate a gender-annotated dataset and evaluate all methods on the VisoGender benchmark to quantify improvements and identify the dominant source of bias. Our results show that CDA reduces the gender gap by 6% and DAUDoS by 3% but using only one-third the data. Both methods also improve the model’s ability to correctly identify gender in images by 3%, with DAUDoS achieving this improvement using only almost one-third of training data. From our experiments, we observed that CLIP’s vision encoder is more biased whereas PaliGemma2’s text encoder is more biased. By identifying whether the bias stems more from the vision or text encoders, our work enables more targeted and effective bias mitigation strategies in future multi-modal systems. We release our code public at https://github.com/ vivekhruday05/VLM_bias

SPIRIT: Short-Term Prediction of Solar IRradIance for Transfer Learning Using Foundation Models

Association for the Advancement of Artificial Intelligence Workshop, AAAI-W, 2025

Core Rank : - Google Rank :-

Abs PDF bibTex

@inproceedings{bib_SPIR_2025, AUTHOR = {Mishra, Aditya and Ravindra, T and Iyengar, Srinivasan and Kalyanaraman, Shivkumar and Kumaraguru, Ponnurangam }, TITLE = {SPIRIT: Short-Term Prediction of Solar IRradIance for Transfer Learning Using Foundation Models}, BOOKTITLE = {Association for the Advancement of Artificial Intelligence Workshop}. YEAR = {2025}}

SPIRIT: Short-Term Prediction of Solar IRradIance for Transfer Learning Using Foundation Models

Abstract

Traditional solar forecasting models are based on several years of site-specific historical irradiance data, often spanning five or more years, which are unavailable for newer photovoltaic farms. As renewable energy is highly intermittent, building accurate solar irradiance forecasting systems that are data-efficient is essential for efficient grid management and enabling the ongoing proliferation of solar energy, which is crucial to achieve the United Nations' net zero goals. In this work, we propose SPIRIT, a novel framework leveraging foundation models for solar irradiance forecasting, making it applicable to newer solar installations. Our approach outperforms state-of-the-art models in zero-shot transfer learning by upto 70%, enabling effective performance at new locations without relying on any past data. Further improvements in performance are achieved through fine-tuning, as more location-specific data becomes available. These findings are supported by statistical significance, further validating our approach. By dramatically reducing the forecasting setup timeline, SPIRIT accelerates solar farm deployment in all potential global sites, most of which lack historical data, thereby democratizing access to clean energy and enabling participation in the renewable energy transition.

LABELING COPILOT: A Deep Research Agent for Automated Data Curation in Computer Vision

IEEE Transactions on Big Data, IEEE-TBD, 2025

Google Rank :24

Abs PDF DOI bibTex

@inproceedings{bib_LABE_2025, AUTHOR = {Ganguly, Debargha and Kumar, Sumit and Balappanawar, Ishwar B and Chen, Weicong and Kambhatla, Shashank and Iyengar, Srinivasan and Kalyanaraman, Shivkumar and Kumaraguru, Ponnurangam and Chaudhary, Vipin }, TITLE = {LABELING COPILOT: A Deep Research Agent for Automated Data Curation in Computer Vision}, BOOKTITLE = {IEEE Transactions on Big Data}. YEAR = {2025}}

LABELING COPILOT: A Deep Research Agent for Automated Data Curation in Computer Vision

Abstract

Curating high-quality, domain-specific datasets is a major bottleneck for deploying robust vision systems, requiring complex trade-offs between data quality, diversity, and cost when researching vast, unlabeled data lakes. We introduce Labeling Copilot, the first data curation deep research agent for computer vision. A central orchestrator agent, powered by a large multimodal language model, uses multi-step reasoning to execute specialized tools across three core capabilities: (1) Calibrated Discovery sources relevant, in-distribution data from large repositories; (2) Controllable Synthesis generates novel data for rare scenarios with robust filtering; and (3) Consensus Annotation produces accurate labels by orchestrating multiple foundation models via a novel consensus mechanism incorporating non-maximum suppression and voting. Our large-scale validation proves the effectiveness of Labeling Copilot's components. The Consensus Annotation module excels at object discovery: on the dense COCO dataset, it averages 14.2 candidate proposals per image-nearly double the 7.4 ground-truth objects-achieving a final annotation mAP of 37.1%. On the web-scale Open Images dataset, it navigated extreme class imbalance to discover 903 new bounding box categories, expanding its capability to over 1500 total. Concurrently, our Calibrated Discovery tool, tested at a 10-million sample scale, features an active learning strategy that is up to 40x more computationally efficient than alternatives with equivalent sample efficiency. These experiments validate that an agentic workflow with optimized, scalable tools provides a robust foundation for curating industrial-scale datasets.

Do LLMs Adhere to Label Definitions? Examining Their Receptivity to External Label Definitions

Conference on Empirical Methods in Natural Language Processing, EMNLP, 2025

Core Rank : A* Google Rank :193

Abs PDF DOI bibTex

@inproceedings{bib_Do_L_2025, AUTHOR = {Mohammadi, Ali and Hanuma, Vedula Bhaskara and Lamba, Hemank and Raff, Edward and Kumaraguru, Ponnurangam and Ferraro, Francis and Gaur, Manas }, TITLE = {Do LLMs Adhere to Label Definitions? Examining Their Receptivity to External Label Definitions}, BOOKTITLE = {Conference on Empirical Methods in Natural Language Processing}. YEAR = {2025}}

Do LLMs Adhere to Label Definitions? Examining Their Receptivity to External Label Definitions

Abstract

Do LLMs genuinely incorporate external definitions, or do they primarily rely on their parametric knowledge? To address these questions, we conduct controlleD experiments across multiple explanation benchmark datasets (general and domain-specific) and label definition conditions, including expert-curated, LLM- generated, perturbed, and swapped definitions. Our results reveal that while explicit label definitions can enhance accuracy and explainability, their integration into an LLM’s task-solvinG processes is neither guaranteed nor consistent, suggesting reliance on internalized representations in many cases. Models often default to their internal representations, particularly in general tasks, whereas domain-specific tasks benefit more from explicit definitions. These findings underscore the need for a deeper understanding of how LLMs process external knowledge alongside their pre-existing capabilities.

SEMMA: A Semantic Aware Knowledge Graph Foundation Model

Conference on Empirical Methods in Natural Language Processing, EMNLP, 2025

Core Rank : A* Google Rank :193

Abs PDF DOI bibTex

@inproceedings{bib_SEMM_2025, AUTHOR = {A, Arvindh and Kumar, Sumit and Nayyeri, Mojtaba and Xiong, Bo and Kumaraguru, Ponnurangam and Vergari, Antonio and Staab, Steffen }, TITLE = {SEMMA: A Semantic Aware Knowledge Graph Foundation Model}, BOOKTITLE = {Conference on Empirical Methods in Natural Language Processing}. YEAR = {2025}}

SEMMA: A Semantic Aware Knowledge Graph Foundation Model

Abstract

Knowledge Graph Foundation Models (KGFMs) have shown promise in enabling zero-shot reasoning over unseen graphs by learning transferable patterns. However, most existing KGFMs rely solely on graph structure, overlooking the rich semantic signals encoded in textual attributes. We introduce SEMMA, a dual-module KGFM that systematically integrates transferable textual semantics alongside structure. SEMMA leverages Large Language Models (LLMs) to enrich relation identifiers, generating semantic embeddings that subsequently form a textual relation graph, which is fused with the structural component. Across 54 diverse KGs, SEMMA outperforms purely structural baselines like ULTRA in fully inductive link prediction. Crucially, we show that in more challenging generalization settings, where the test-time relation vocabulary is entirely unseen, structural methods collapse while SEMMA is 2x more effective. Our findings demonstrate that textual semantics are critical for generalization in settings where structure alone fails, highlighting the need for foundation models that unify structural and linguistic signals in knowledge reasoning.

Enhancing AI Safety Through the Fusion of Low Rank Adapters

Research and Applications of Foundation Models for Data Mining and Affective Computing Workshop, RAFDA-W, 2025

Abs PDF DOI bibTex

@inproceedings{bib_Enha_2025, AUTHOR = {Swaroop, G Satya and Vipparla, Sreeram and Singh, Harpreet and Goel, Shashwat and Kumaraguru, Ponnurangam }, TITLE = {Enhancing AI Safety Through the Fusion of Low Rank Adapters}, BOOKTITLE = {Research and Applications of Foundation Models for Data Mining and Affective Computing Workshop}. YEAR = {2025}}

Enhancing AI Safety Through the Fusion of Low Rank Adapters

Abstract

Instruction fine-tuning of large language models (LLMs) is a powerful method for improving task-specific performance, but it can inadvertently lead to a phenomenon where models generate harmful responses when faced with malicious prompts. In this paper, we explore Low-Rank Adapter Fusion (LoRA) as a means to mitigate these risks while preserving the model’s ability to handle diverse instructions effectively. Through an extensive comparative analysis against established baselines using recognized benchmark datasets, we demonstrate a 42% reduction in the harmfulness rate by leveraging LoRA fusion between a task adapter and a safety adapter, the latter of which is specifically trained on our safety dataset. In addition, we made noteworthy observations related to exaggerated safety behavior, where the model rejects safe prompts that closely resemble unsafe ones.

TAMAS: A Dataset for Investigating Security Risks in Multi-Agent LLM Systems

International Conference on Machine Learning, ICML-W, 2025

Core Rank : - Google Rank :-

Abs PDF bibTex

@inproceedings{bib_TAMA_2025, AUTHOR = {Kishorkumar, Kavathekar Ishan and Ashok, Jain Hemang and Rathod, Ameya Sandesh and Kumaraguru, Ponnurangam and Ganu, Tanuja }, TITLE = {TAMAS: A Dataset for Investigating Security Risks in Multi-Agent LLM Systems}, BOOKTITLE = {International Conference on Machine Learning}. YEAR = {2025}}

TAMAS: A Dataset for Investigating Security Risks in Multi-Agent LLM Systems

Abstract

Large Language Models (LLMs) have demonstrated strong capabilities as autonomous agents through tool use, planning, and decision-making abilities, leading to their widespread adoption across diverse tasks. As task complexity grows, multi-agent LLM systems are increasingly used to collaboratively solve problems. However, safety and security of these multi-agent systems remains largely unexplored. Existing benchmarks and datasets predominantly focus on single-agent settings, failing to capture the unique vulnerabilities of multi-agent dynamics and co-ordination. To address this gap, we introduce textbf{T}hreats and textbf{A}ttacks in textbf{M}ulti-textbf{A}gent textbf{S}ystems (textbf{TAMAS}), a dataset designed to evaluate the robustness and security of multi-agent LLM systems. TAMAS includes five distinct scenarios comprising 250 adversarial instances across five attack types and 163 different normal and attack tools, along with 100 harmless tasks. We assess system performance across 5 backbone LLMs and 3 agent interaction configurations from Autogen framework, highlighting critical challenges and failure modes in current multi-agent deployments. Our findings show that multi-agent systems are highly vulnerable to adversarial attacks, with Impersonation reaching a 73% success rate and other attacks ranging from 27% to 67%, underscoring the need for stronger defenses.

Just KIDDIN: Knowledge Infusion and Distillation for Detection of INdecent Memes

Association for Computational Linguistics - Findings, ACL-F, 2025

Abs PDF DOI bibTex

@inproceedings{bib_Just_2025, AUTHOR = {Garg, Rahul and Ashok, Jain Hemang and Kumaraguru, Ponnurangam and Kursuncu, Ugur }, TITLE = {Just KIDDIN: Knowledge Infusion and Distillation for Detection of INdecent Memes}, BOOKTITLE = {Association for Computational Linguistics - Findings}. YEAR = {2025}}

Just KIDDIN: Knowledge Infusion and Distillation for Detection of INdecent Memes

Abstract

Toxicity identification in online multimodal environments remains a challenging task due to the complexity of contextual connections across modalities (e.g., textual and visual). In this paper, we propose a novel framework that integrates Knowledge Distillation (KD) from Large Visual Language Models (LVLMs) and knowledge infusion to enhance the performance of toxicity detection in hateful memes. Our approach extracts sub-knowledge graphs from ConceptNet, a large-scale commonsense Knowledge Graph (KG) to be infused within a compact VLM framework. The relational context between toxic phrases in captions and memes, as well as visual concepts in memes enhance the model's reasoning capabilities. Experimental results from our study on two hate speech benchmark datasets demonstrate superior performance over the state-of-the-art baselines across AU-ROC, F1, and Recall with improvements of 1.1%, 7%, and 35%, respectively. Given the contextual complexity of the toxicity detection task, our approach showcases the significance of learning from both explicit (i.e. KG) as well as implicit (i.e. LVLMs) contextual cues incorporated through a hybrid neurosymbolic approach. This is crucial for real-world applications where accurate and scalable recognition of toxic content is critical for creating safer online environments.

From Human Judgements to Predictive Models: Unravelling Acceptability in Code-Mixed Sentences

ACM Trasactions on Asian and Low Resource Language Information Processing, TALLIP, 2025

Core Rank : - Google Rank :31

Abs PDF DOI bibTex

@inproceedings{bib_From_2025, AUTHOR = {Prashant, Kodali and Goel, Anmol and Asapu, Likhith and Bonagiri, Vamshi Krishna and Govil, Anirudh and Choudhury, Monojit and Kumaraguru, Ponnurangam and Shrivastava, Manish }, TITLE = {From Human Judgements to Predictive Models: Unravelling Acceptability in Code-Mixed Sentences}, BOOKTITLE = {ACM Trasactions on Asian and Low Resource Language Information Processing}. YEAR = {2025}}

From Human Judgements to Predictive Models: Unravelling Acceptability in Code-Mixed Sentences

Abstract

Current computational approaches for analysing or generating code-mixed sentences do not explicitly model ``naturalness'' or ``acceptability'' of code-mixed sentences, but rely on training corpora to reflect distribution of acceptable code-mixed sentences. Modelling human judgement for the acceptability of code-mixed text can help in distinguishing natural code-mixed text and enable quality-controlled generation of code-mixed text. To this end, we construct Cline - a dataset containing human acceptability judgements for English-Hindi~(en-hi) code-mixed text. Cline is the largest of its kind with 16,642 sentences, consisting of samples sourced from two sources: synthetically generated code-mixed text and samples collected from online social media. Our analysis establishes that popular code-mixing metrics such as CMI, Number of Switch Points, Burstines, which are used to filter/curate/compare code-mixed corpora have low correlation with human acceptability judgements, underlining the necessity of our dataset. Experiments using Cline demonstrate that simple Multilayer Perceptron (MLP) models when trained solely using code-mixing metrics as features are outperformed by fine-tuned pre-trained Multilingual Large Language Models (MLLMs). Specifically, among Encoder models XLM-Roberta and Bernice outperform IndicBERT across different configurations. Among Encoder-Decoder models, mBART performs better than mT5, however Encoder-Decoder models are not able to outperform Encoder-only models. Decoder-only models perform the best when compared to all other MLLMS, with Llama 3.2 - 3B models outperforming similarly sized Qwen, Phi models. Comparison with zero and fewshot capabilitites of ChatGPT show that MLLMs fine-tuned on larger data outperform ChatGPT, providing scope for improvement in code-mixed tasks. Zero-shot transfer from En-Hi to En-Te acceptability judgments are better than random baselines.

A shot of Cognac to forget bad memories: Corrective Unlearning in GNNs

International Conference on Machine Learning, ICML, 2025

Core Rank : A* Google Rank :171

Abs PDF bibTex

@inproceedings{bib_A_sh_2025, AUTHOR = {Kolipaka, Varshita and Sinha, Akshit and Mishra, Debangan and Kumar, Sumit and A, Arvindh and Goel, Shashwat and Kumaraguru, Ponnurangam }, TITLE = {A shot of Cognac to forget bad memories: Corrective Unlearning in GNNs}, BOOKTITLE = {International Conference on Machine Learning}. YEAR = {2025}}

A shot of Cognac to forget bad memories: Corrective Unlearning in GNNs

Abstract

Graph Neural Networks (GNNs) are increasingly being used for a variety of ML applications on graph data. Because graph data does not follow the independently and identically distributed i.i.d. assumption, adversarial manipulations or incorrect data can propagate to other data points through message passing, which deteriorates the model's performance. To allow model developers to remove the adverse effects of manipulated entities from a trained GNN, we study the recently formulated problem of Corrective Unlearning. We find that current graph unlearning methods fail to unlearn the effect of manipulations even when the whole manipulated set is known. We introduce a new graph unlearning method,Cognac, which can unlearn the effect of the manipulation set even when only % of it is identified. It recovers most of the performance of a strong oracle with fully corrected training data, even beating retraining from scratch without the deletion set, and is 8x more efficient while also scaling to large datasets. We hope our work assists GNN developers in mitigating harmful effects caused by issues in real-world data, post-training.