UPSI Digital Repository (UDRep)
Start | FAQ | About
Menu Icon

QR Code Link :

Type :article
Subject :L Education
ISSN :16130073
Main Author :Jada, Pawan Kalyan
Additional Authors :Kingston Palthamburaj
Title :Analyzing social media content for detection of offensive text
Place of Production :Tanjung Malim
Publisher :Fakulti Bahasa Dan Komunikasi
Year of Publication :2021
Notes :CEUR Workshop Proceedings
Corporate Name :Universiti Pendidikan Sultan Idris
HTTP Link :Click to view web link

Abstract : Universiti Pendidikan Sultan Idris
To tackle the conundrum of detecting offensive comments/posts which are considerably informal, unstructured, miswritten and code-mixed, we introduce two inventive methods in this research paper. Offensive comments/posts on the social media platforms, can affect an individual, a group or underage alike. In order to classify comments/posts in two popular Dravidian languages, Tamil and Malayalam, as a part of the HASOC - DravidianCodeMix FIRE 2021 shared task, we employ two Transformer-based prototypes which successfully stood in the top 8 for all the tasks. The codes for our approach can be viewed and utilized1 ? 2021 Copyright for this paper by its authors.

References

Chakravarthi, B. R., Kumaresan, P. K., Sakuntharaj, R., Madasamy, A. K., Thavareesan, S., Premjith, B., . . . Mandl, T. (2021). Overview of the HASOC-DravidianCodeMix shared task on offensive language detection in tamil and malayalam. Paper presented at the CEUR Workshop Proceedings, , 3159 589-602. Retrieved from www.scopus.com

Chakravarthi, B. R., Muralidaran, V., Priyadharshini, R., & McCrae, J. P. (2020). Corpus creation for sentiment analysis in code-mixed tamil-english text. Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU) and Collaboration and Computing for Under-Resourced Languages (CCURL), European Language Resources Association, , 202-210. Retrieved from www.scopus.com

Chakravarthi, B. R. (2020). HopeEDI: A multilingual hope speech detection dataset for equality, diversity, and inclusion. Proceedings of the Third Workshop on Computational Modeling of People’s Opinions, Personality, and Emotion’s in Social Media, , 41-53. Retrieved from www.scopus.com

Chakravarthi, B. R., & Muralidaran, V. (2021). Findings of the shared task on hope speech detection for equality, diversity, and inclusion. Proceedings of the First Workshop on Language Technology for Equality, Diversity and Inclusion, , 61-72. Retrieved from www.scopus.com

Conneau, A., Khandelwal, K., Goyal, N., Chaudhary, V., Wenzek, G., Guzmán, F., . . . Stoyanov, V. (2019). Unsupervised Cross-Lingual Representation Learning at Scale, Retrieved from www.scopus.com

Davidson, T., Warmsley, D., Macy, M., & Weber, I. (2017). Automated hate speech detection and the problem of offensive language. Automated Hate Speech Detection and the Problem of Offensive Language, Retrieved from www.scopus.com

Devlin, J., Chang, M. -., Lee, K., & Toutanova, K. (2018). BERT: Pre-training of deep bidirectional transformers for language understanding. BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding, Retrieved from www.scopus.com

Hande, A., Priyadharshini, R., Sampath, A., Thamburaj, K. P., Chandran, P., & Chakravarthi, B. R. (2021). Hope Speech Detection in Under-Resourced Kannada Language, Retrieved from www.scopus.com

Hande, A., Puranik, K., Priyadharshini, R., & Chakravarthi, B. R. (2021). Domain identification of scientific articles using transfer learning and ensembles doi:10.1007/978-3-030-75015-2_9 Retrieved from www.scopus.com

Hande, A., Puranik, K., Priyadharshini, R., Thavareesan, S., & Chakravarthi, B. R. (2021). Evaluating pretrained transformer-based models for COVID-19 fake news detection. Paper presented at the Proceedings - 5th International Conference on Computing Methodologies and Communication, ICCMC 2021, 766-772. doi:10.1109/ICCMC51019.2021.9418446 Retrieved from www.scopus.com

Hande, A., Priyadharshini, R., & Chakravarthi, B. R. (2020). KanCMD: Kannada CodeMixed dataset for sentiment analysis and offensive language detection. Proceedings of the Third Workshop on Computational Modeling of People’s Opinions, Personality, and Emotion’s in Social Media, , 54-63. Retrieved from www.scopus.com

Hassan, S., Samih, Y., Mubarak, H., Abdelali, A., Rashed, A., & Chowdhury, S. A. (2020). ALT submission for OSACT shared task on offensive language detection. Proceedings of the 4Th Workshop on Open-Source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection, , 61-65. Retrieved from www.scopus.com

Hearst, M. A. (1998). Support vector machines. IEEE Intelligent Systems, 13(4), 18-28. Retrieved from www.scopus.com

Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9(8), 1735-1780. doi:10.1162/neco.1997.9.8.1735

Jada, P. K., Reddy, D. S., Hande, A., Priyadharshini, R., Sakuntharaj, R., & Chakravarthi, B. R. (2021). IIITT at CASE 2021 task 1: Leveraging pretrained language models for multilingual protest detection. Paper presented at the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-Political Events from Text, CASE 2021 - Proceedings, 98-104. Retrieved from www.scopus.com

Liu, P., Li, W., & Zou, L. (2019). NULI at SemEval-2019 task 6: Transfer learning for offensive language detection using bidirectional transformers. Paper presented at the NAACL HLT 2019 - International Workshop on Semantic Evaluation, SemEval 2019, Proceedings of the 13th Workshop, 87-91. Retrieved from www.scopus.com

Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D., . . . Stoyanov, V. (2019). RoBERTa: A robustly optimized BERT pretraining approach. Roberta: A Robustly Optimized Bert Pretraining Approach, Retrieved from www.scopus.com

Puranik, K., Hande, A., Priyadharshini, R., Durairaj, T., Sampath, A., Thamburaj, K. P., & Chakravarthi, B. R. (2021). Attentive fine-tuning of transformers for translation of low-resourced languages @LoResMT 2021. Paper presented at the Proceedings of the 4th Workshop on Technologies for Machine Translation of Low-Resource Languages, LoResMT 2021, 134-143. Retrieved from www.scopus.com

Puranik, K., Hande, A., Priyadharshini, R., Thavareesan, S., & Chakravarthi, B. R. (2021). Iiitt@ Lt-Edi-eacl2021-Hope Speech Detection: There is always Hope in Transformers, Retrieved from www.scopus.com

Regmi, K., Naidoo, J., & Pilkington, P. (2010). Understanding the processes of translation and transliteration in qualitative research. International Journal of Qualitative Methods, 9(1), 16-26. Retrieved from www.scopus.com

Sanh, V., Debut, L., Chaumond, J., & Wolf, T. (2019). Distil-bert, a distilled version of bert: Smaller, faster, cheaper and lighter. DistilBERT, A Distilled Version of BERT: Smaller, Faster, Cheaper and Lighter, Retrieved from www.scopus.com


This material may be protected under Copyright Act which governs the making of photocopies or reproductions of copyrighted materials.
You may use the digitized material for private study, scholarship, or research.

Back to previous page

Installed and configured by Bahagian Automasi, Perpustakaan Tuanku Bainun, Universiti Pendidikan Sultan Idris
If you have enquiries, kindly contact us at pustakasys@upsi.edu.my or 016-3630263. Office hours only.