Abstract
Body shaming, a criticism based on the body’s shape, size, or appearance, has become a dangerous act on social media. With a rise in the reporting of body shaming experiences on the web, automated monitoring of body shaming posts will help rescue individuals, especially adolescents, from the emotional anguish they experience. To the best of our knowledge, this is the first work on body shaming detection, and we contribute the dataset in which the posts are tagged as body shaming or non-body shaming. We use transformer-based language models to detect body shaming posts. Further, we leverage unlabeled data in a semi-supervised manner using the GAN-BERT model, as it was developed for tasks where labeled data is scarce and unlabeled data is abundant. The findings of the experiments reveal that the algorithm learns valuable knowledge from the unlabeled dataset and outperforms many deep learning and conventional machine learning baselines.