IIITH

Long Tailed Entity Extraction of Model Names using Distant Supervision

European Conference on Information Retreival Workshops, ECIR-W, 2022

Core Rank : - Google Rank :-

Abs PDF DOI bibTex

@inproceedings{bib_Long_2022, AUTHOR = {Daw, Swayatta and Pudi, Vikram }, TITLE = {Long Tailed Entity Extraction of Model Names using Distant Supervision}, BOOKTITLE = {European Conference on Information Retreival Workshops}. YEAR = {2022}}

Long Tailed Entity Extraction of Model Names using Distant Supervision

Abstract

No Results Found

Extraction of Competing Models using Distant Supervision and Graph Ranking

Association for the Advancement of Artificial Intelligence Workshop, AAAI-W, 2022

Core Rank : - Google Rank :-

Abs PDF DOI bibTex

@inproceedings{bib_Extr_2022, AUTHOR = {Daw, Swayatta and Pudi, Vikram }, TITLE = {Extraction of Competing Models using Distant Supervision and Graph Ranking}, BOOKTITLE = {Association for the Advancement of Artificial Intelligence Workshop}. YEAR = {2022}}

Extraction of Competing Models using Distant Supervision and Graph Ranking

Abstract

We introduce the task of detection of competing model entities from scientific documents. We define competing models as those models that solve a particular task that is investigated in the target research document. The task is challenging due to the fact that contextual information is required from the entire target document to predict the model entities. Hence, traditional sequence labelling approaches fail in such settings. Furthermore, model entities themselves are long-tailed in nature, i.e, their prevalence in scientific literature is limited, along with a scarcity of labelled data for training supervised learning techniques. To address the above bottlenecks, we combine an Unsupervised Graph Ranking algorithm with a SciBERT-CRF based sequence labeller to predict the entities. We introduce a strong baseline using the above mentioned pipeline. Also, to address the label scarcity of long-tailed model entities, we use distant supervision leveraging an external Knowledge Base (KB) to generate synthetic training data. We address the problem of overfitting in small sized datasets for supervised NER baselines using a simple entity replacement technique. We introduce this model as part of a starting point for an end-to-end automated framework to extract relevant model names and link them with their respective cited papers from research documents. We believe this task will serve as an important starting point to map the research landscape of computer science in a scalable manner, needing minimal human intervention. The code and dataset is available in the given link : https://github.com/Swayatta/Competing-Models

Practice Makes a Solver Perfect: Data Augmentation for Math Word Problem Solvers

Conference of the North American Chapter of the Association for Computational Linguistics, NAACL, 2022

Core Rank : A Google Rank :132

Abs PDF bibTex

@inproceedings{bib_Prac_2022, AUTHOR = {Kumar, Vivek and Pudi, Vikram and Maheshwary, Rishabh }, TITLE = {Practice Makes a Solver Perfect: Data Augmentation for Math Word Problem Solvers}, BOOKTITLE = {Conference of the North American Chapter of the Association for Computational Linguistics}. YEAR = {2022}}

Practice Makes a Solver Perfect: Data Augmentation for Math Word Problem Solvers

Abstract

Existing Math Word Problem (MWP) solvers have achieved high accuracy on benchmark datasets. However, prior works have shown that such solvers do not generalize well and rely on superficial cues to achieve high performance. In this paper, we first conduct experiments to showcase that this behaviour is mainly associated with the limited size and diversity present in existing MWP datasets. Next, we propose several data augmentation techniques broadly categorized into Substitution and Paraphrasing based methods. By deploying these methods we increase the size of existing datasets by five folds. Extensive experiments on two benchmark datasets across three state-of-the-art MWP solvers show that proposed methods increase the generalization and robustness of existing solvers. On average, proposed methods significantly increase the state-of-the-art results by over five percentage points on benchmark datasets. Further, the solvers trained on the augmented dataset perform comparatively better on the challenge test set. We also show the effectiveness of proposed techniques through ablation studies and verify the quality of augmented samples through human evaluation.

Multilinguals at SemEval-2022 Task 11: Transformer Based Architecture for Complex NER

Technical Report, arXiv, 2022

Core Rank : - Google Rank :-

Abs PDF bibTex

@inproceedings{bib_Mult_2022, AUTHOR = {Pandey, Amit and Daw, Swayatta and Pudi, Vikram }, TITLE = {Multilinguals at SemEval-2022 Task 11: Transformer Based Architecture for Complex NER}, BOOKTITLE = {Technical Report}. YEAR = {2022}}

Multilinguals at SemEval-2022 Task 11: Transformer Based Architecture for Complex NER

Abstract

We investigate the task of complex NER for the English language. The task is non-trivial due to the semantic ambiguity of the textual structure and the rarity of occurrence of such entities in the prevalent literature. Using pretrained language models such as BERT, we obtain a competitive performance on this task. We qualitatively analyze the performance of multiple architectures for this task. All our models are able to outperform the baseline by a significant margin. Our best performing model beats the baseline F1-score by over 9%.

Cross-lingual Alignment of Knowledge Graph Triples with Sentences

International Conference on Natural Language Processing., ICON, 2021

Core Rank : - Google Rank :5

Abs PDF DOI bibTex

@inproceedings{bib_Cros_2021, AUTHOR = {Daw, Swayatta and Rajendra, Sagare Shivprasad and Abhishek, Tushar and Pudi, Vikram and Kalidindi, Vasudeva Varma }, TITLE = {Cross-lingual Alignment of Knowledge Graph Triples with Sentences}, BOOKTITLE = {International Conference on Natural Language Processing.}. YEAR = {2021}}

Cross-lingual Alignment of Knowledge Graph Triples with Sentences

Abstract

The pairing of natural language sentences with knowledge graph triples is essential for many downstream tasks like data-to-text generation, facts extraction from sentences (semantic parsing), knowledge graph completion, etc. Most existing methods solve these downstream tasks using neural-based end-to-end approaches that require a large amount of well-aligned training data, which is difficult and expensive to acquire. Recently various unsupervised techniques have been proposed to alleviate this alignment step by automatically pairing the structured data (knowledge graph triples) with textual data. However, these approaches are not well suited for low resource languages that provide two major challenges: (1) unavailability of pair of triples and native text with the same content distribution and (2) limited Natural language Processing (NLP) resources. In this paper, we address the unsupervised pairing of knowledge graph triples with sentences for low resource languages, selecting Hindi as the low resource language. We propose cross-lingual pairing of English triples with Hindi sentences to mitigate the unavailability of content overlap. We propose two novel approaches: NER-based filtering with Semantic Similarity and Key-phrase Extraction with Relevance Ranking. We use our best method to create a collection of 29224 well-aligned English triples and Hindi sentence pairs. Additionally, we have also curated 350 human-annotated golden test datasets for evaluation. We make the code and dataset publicly available.

Adversarial Examples for Evaluating Math Word Problem Solvers

Conference on Empirical Methods in Natural Language Processing, EMNLP, 2021

Core Rank : A* Google Rank :193

Abs PDF bibTex

@inproceedings{bib_Adve_2021, AUTHOR = {Kumar, Vivek and Maheshwary, Rishabh and Pudi, Vikram }, TITLE = {Adversarial Examples for Evaluating Math Word Problem Solvers}, BOOKTITLE = {Conference on Empirical Methods in Natural Language Processing}. YEAR = {2021}}

Adversarial Examples for Evaluating Math Word Problem Solvers

Abstract

Standard accuracy metrics have shown that Math Word Problem (MWP) solvers have achieved high performance on benchmark datasets. However, the extent to which existing MWP solvers truly understand language and its relation with numbers is still unclear. In this paper, we generate adversarial attacks to evaluate the robustness of state-of-the-art MWP solvers. We propose two methods Question Reordering and Sentence Paraphrasing to generate adversarial attacks. We conduct experiments across three neural MWP solvers over two benchmark datasets. On average, our attack method is able to reduce the accuracy of MWP solvers by over 40 percentage points on these datasets. Our results demonstrate that existing MWP solvers are sensitive to linguistic variations in the problem text. We verify the validity and quality of generated adversarial examples through human evaluation.

A Strong Baseline for Query Efficient Attacks in a Black Box Setting

Conference on Empirical Methods in Natural Language Processing, EMNLP, 2021

Core Rank : A* Google Rank :193

Abs PDF bibTex

@inproceedings{bib_A_St_2021, AUTHOR = {Maheshwary, Rishabh and MAHESHWARY, SAKET and Pudi, Vikram }, TITLE = {A Strong Baseline for Query Efficient Attacks in a Black Box Setting}, BOOKTITLE = {Conference on Empirical Methods in Natural Language Processing}. YEAR = {2021}}

A Strong Baseline for Query Efficient Attacks in a Black Box Setting

Abstract

Existing black box search methods have achieved high success rate in generating adversarial attacks against NLP models. However, such search methods are inefficient as they do not consider the amount of queries required to generate adversarial attacks. Also, prior attacks do not maintain a consistent search space while comparing different search methods. In this paper, we propose a query efficient attack strategy to generate plausible adversarial examples on text classification and entailment tasks. Our attack jointly leverages attention mechanism and locality sensitive hashing (LSH) to reduce the query count. We demonstrate the efficacy of our approach by comparing our attack with four baselines across three different search spaces. Further, we benchmark our results across the same search space used in prior attacks. In comparison to attacks proposed, on an average, we are able to reduce the query count by 75% across all datasets and target models. We also demonstrate that our attack achieves a higher success rate when compared to prior attacks in a limited query setting.

Generating natural language attacks in a hard label black box setting

American Association for Artificial Intelligence, AAAI, 2021

Core Rank : A* Google Rank :95

Abs PDF bibTex

@inproceedings{bib_Gene_2021, AUTHOR = {Maheshwary, Rishabh and MAHESHWARY, SAKET and Pudi, Vikram }, TITLE = {Generating natural language attacks in a hard label black box setting}, BOOKTITLE = {American Association for Artificial Intelligence}. YEAR = {2021}}

Generating natural language attacks in a hard label black box setting

Abstract

We study an important and challenging task of attacking natural language processing models in a hard label black box setting. We propose a decision-based attack strategy that crafts high quality adversarial examples on text classification and entailment tasks. Our proposed attack strategy leverages population-based optimization algorithm to craft plausible and semantically similar adversarial examples by observing only the top label predicted by the target model. At each iteration, the optimization procedure allow word replacements that maximizes the overall semantic similarity between the original and the adversarial text. Further, our approach does not rely on using substitute models or any kind of training data. We demonstrate the efficacy of our proposed approach through extensive experimentation and ablation studies on five state-of-the-art target models across seven benchmark datasets. In comparison to attacks proposed in prior literature, we are able to achieve a higher success rate with lower word perturbation percentage that too in a highly restricted setting.

Temporal Analysis of Scientific Literature to Find Grand Challenges and Saturated Problems

Bridging the Gap between Information Science, Information Retrieval and Data Science, BIRDS, 2020

Core Rank : - Google Rank :-

Abs PDF bibTex

@inproceedings{bib_Temp_2020, AUTHOR = {Agrawal, Kritika and Pudi, Vikram }, TITLE = {Temporal Analysis of Scientific Literature to Find Grand Challenges and Saturated Problems}, BOOKTITLE = {Bridging the Gap between Information Science, Information Retrieval and Data Science}. YEAR = {2020}}

Temporal Analysis of Scientific Literature to Find Grand Challenges and Saturated Problems

Abstract

As scientific communities grow and evolve, there is emergence of new techniques and decline of old ones. The tremendous amount of research publications available online aims to solve a lot of interesting problems. With time, some of the fields have been studied well and research problems solved to a great extent. However, there are few difficult research problems which are yet not solved completely and interests a lot of researchers. In this paper, we aim to find research fields which are saturated and research fields which need to be explored yet. We first extract research problems in a semi supervised manner using a proven bootstrap framework from scientific literature of the last fifty years. We show how a simple statistics based model on top of the research problems extracted can find the saturated fields and grand challenges in any domain of computer science.

Generating Natural Language Attacks in a Hard Label Black Box Setting

Technical Report, arXiv, 2020

Core Rank : - Google Rank :-

Abs PDF bibTex

@inproceedings{bib_Gene_2020, AUTHOR = {Maheshwary, Rishabh and MAHESHWARY, SAKET and Pudi, Vikram }, TITLE = {Generating Natural Language Attacks in a Hard Label Black Box Setting}, BOOKTITLE = {Technical Report}. YEAR = {2020}}