Abstract
In linguistics, discourse is a unit of language longer than a single sentence, referring to spoken or written language in social contexts. Critical Discourse Analysis (CDA) is an interdisciplinary research
area involving linguistics and social sciences which aims to explore the links between language use and social practice. It is a qualitative approach used to describe, interpret, and explain how discourses construct and maintain social interactions.
In this thesis, we introduce the concepts of CDA and apply them to focus mainly on the analysis of Event and Political Discourses. We also explore the concept of Political Bias in news and attempt to have a better understanding of it by identifying, categorising and quantifying political bias in news articles. Throughout the thesis, we detail the process of curating and developing relevant datasets wherever necessary. In the Event Discourse Analysis, we develop a dataset, Manovaad, which consists of events belonging to domains like sports, politics, entertainment, trade, etc., and their news coverage at different levels of reporting. Using a Bidirectional-Recurrent Neural Network (Bi-RNN), we calculate the subjectivity
of each news article and then understand trends in the variation of subjectivity across local, national, and international news reports for a given event. We also analyse how the focus of the news report related to an event shifts as time passes, and compare the trends of focus shift across different levels. This analysis about focus shifts and subjectivity in reporting helps understand how significance associated with an event in terms of features like focus, perspective, inclination, and details is dependent largely
on the closeness of the event with the person or the organisation reporting it. In Political Discourse Analysis, we begin with a comparative case study of the election speeches given by the winning and losing party leaders in the 2014 Andhra Pradesh elections. We compare and contrast their interpersonal speech choices, highlight interesting patterns at both word and sentence levels, and how they affect the audience. The next part of our work on Political Discourse deals with Political Bias in the news. We often observe that in newspapers and news websites, the reporters tend to emphasize more on particular viewpoints selectively and present biased information aligned with their personal political ideology. This can lead to widespread alteration of mass political opinion and impact the decision of the voters. Our work on political bias is two-part. Firstly, we propose a method to enhance the performance of the current best bias detection network by incorporating linguistic information in the form of presupposition, which significantly outperforms the state-of-the-art for elugu.However, Political news is a domain where the majority of the news tends to have some bias. As a result, just detection of bias is not sufficient. For a reader to have a complete understanding of bias in a news article, it is essential to have insights into the ways in which bias is propagated, and an idea about the magnitude of bias. This forms the next part of our work, where we design a categorisation schema for Political Bias and build an annotated public corpus - PoBiCo-21. Along with it, we propose a novel
Sentiment-Based Ranking Mechanism for the quantification of Political bias. The majority of the work in this thesis has been done on Telugu language data. We highlight the challenges of working with a low-resource language and demonstrate how we can combine linguistic and computational techniques such as Machine Learning and Deep Learning to achieve the desired performance.