UPSI Digital Repository (UDRep)
|
|
|
Abstract : Universiti Pendidikan Sultan Idris |
To tackle the conundrum of detecting offensive comments/posts which are considerably informal, unstructured, miswritten and code-mixed, we introduce two inventive methods in this research paper. Offensive comments/posts on the social media platforms, can affect an individual, a group or underage alike. In order to classify comments/posts in two popular Dravidian languages, Tamil and Malayalam, as a part of the HASOC - DravidianCodeMix FIRE 2021 shared task, we employ two Transformer-based prototypes which successfully stood in the top 8 for all the tasks. The codes for our approach can be viewed and utilized1 ? 2021 Copyright for this paper by its authors. |
References |
Chakravarthi, B. R., Kumaresan, P. K., Sakuntharaj, R., Madasamy, A. K., Thavareesan, S., Premjith, B., . . . Mandl, T. (2021). Overview of the HASOC-DravidianCodeMix shared task on offensive language detection in tamil and malayalam. Paper presented at the CEUR Workshop Proceedings, , 3159 589-602. Retrieved from www.scopus.com Chakravarthi, B. R., Muralidaran, V., Priyadharshini, R., & McCrae, J. P. (2020). Corpus creation for sentiment analysis in code-mixed tamil-english text. Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU) and Collaboration and Computing for Under-Resourced Languages (CCURL), European Language Resources Association, , 202-210. Retrieved from www.scopus.com Chakravarthi, B. R. (2020). HopeEDI: A multilingual hope speech detection dataset for equality, diversity, and inclusion. Proceedings of the Third Workshop on Computational Modeling of People’s Opinions, Personality, and Emotion’s in Social Media, , 41-53. Retrieved from www.scopus.com Chakravarthi, B. R., & Muralidaran, V. (2021). Findings of the shared task on hope speech detection for equality, diversity, and inclusion. Proceedings of the First Workshop on Language Technology for Equality, Diversity and Inclusion, , 61-72. Retrieved from www.scopus.com Conneau, A., Khandelwal, K., Goyal, N., Chaudhary, V., Wenzek, G., Guzmán, F., . . . Stoyanov, V. (2019). Unsupervised Cross-Lingual Representation Learning at Scale, Retrieved from www.scopus.com Davidson, T., Warmsley, D., Macy, M., & Weber, I. (2017). Automated hate speech detection and the problem of offensive language. Automated Hate Speech Detection and the Problem of Offensive Language, Retrieved from www.scopus.com Devlin, J., Chang, M. -., Lee, K., & Toutanova, K. (2018). BERT: Pre-training of deep bidirectional transformers for language understanding. BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding, Retrieved from www.scopus.com Hande, A., Priyadharshini, R., Sampath, A., Thamburaj, K. P., Chandran, P., & Chakravarthi, B. R. (2021). Hope Speech Detection in Under-Resourced Kannada Language, Retrieved from www.scopus.com Hande, A., Puranik, K., Priyadharshini, R., & Chakravarthi, B. R. (2021). Domain identification of scientific articles using transfer learning and ensembles doi:10.1007/978-3-030-75015-2_9 Retrieved from www.scopus.com Hande, A., Puranik, K., Priyadharshini, R., Thavareesan, S., & Chakravarthi, B. R. (2021). Evaluating pretrained transformer-based models for COVID-19 fake news detection. Paper presented at the Proceedings - 5th International Conference on Computing Methodologies and Communication, ICCMC 2021, 766-772. doi:10.1109/ICCMC51019.2021.9418446 Retrieved from www.scopus.com Hande, A., Priyadharshini, R., & Chakravarthi, B. R. (2020). KanCMD: Kannada CodeMixed dataset for sentiment analysis and offensive language detection. Proceedings of the Third Workshop on Computational Modeling of People’s Opinions, Personality, and Emotion’s in Social Media, , 54-63. Retrieved from www.scopus.com Hassan, S., Samih, Y., Mubarak, H., Abdelali, A., Rashed, A., & Chowdhury, S. A. (2020). ALT submission for OSACT shared task on offensive language detection. Proceedings of the 4Th Workshop on Open-Source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection, , 61-65. Retrieved from www.scopus.com Hearst, M. A. (1998). Support vector machines. IEEE Intelligent Systems, 13(4), 18-28. Retrieved from www.scopus.com Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9(8), 1735-1780. doi:10.1162/neco.1997.9.8.1735 Jada, P. K., Reddy, D. S., Hande, A., Priyadharshini, R., Sakuntharaj, R., & Chakravarthi, B. R. (2021). IIITT at CASE 2021 task 1: Leveraging pretrained language models for multilingual protest detection. Paper presented at the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-Political Events from Text, CASE 2021 - Proceedings, 98-104. Retrieved from www.scopus.com Liu, P., Li, W., & Zou, L. (2019). NULI at SemEval-2019 task 6: Transfer learning for offensive language detection using bidirectional transformers. Paper presented at the NAACL HLT 2019 - International Workshop on Semantic Evaluation, SemEval 2019, Proceedings of the 13th Workshop, 87-91. Retrieved from www.scopus.com Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D., . . . Stoyanov, V. (2019). RoBERTa: A robustly optimized BERT pretraining approach. Roberta: A Robustly Optimized Bert Pretraining Approach, Retrieved from www.scopus.com Puranik, K., Hande, A., Priyadharshini, R., Durairaj, T., Sampath, A., Thamburaj, K. P., & Chakravarthi, B. R. (2021). Attentive fine-tuning of transformers for translation of low-resourced languages @LoResMT 2021. Paper presented at the Proceedings of the 4th Workshop on Technologies for Machine Translation of Low-Resource Languages, LoResMT 2021, 134-143. Retrieved from www.scopus.com Puranik, K., Hande, A., Priyadharshini, R., Thavareesan, S., & Chakravarthi, B. R. (2021). Iiitt@ Lt-Edi-eacl2021-Hope Speech Detection: There is always Hope in Transformers, Retrieved from www.scopus.com Regmi, K., Naidoo, J., & Pilkington, P. (2010). Understanding the processes of translation and transliteration in qualitative research. International Journal of Qualitative Methods, 9(1), 16-26. Retrieved from www.scopus.com Sanh, V., Debut, L., Chaumond, J., & Wolf, T. (2019). Distil-bert, a distilled version of bert: Smaller, faster, cheaper and lighter. DistilBERT, A Distilled Version of BERT: Smaller, Faster, Cheaper and Lighter, Retrieved from www.scopus.com |
This material may be protected under Copyright Act which governs the making of photocopies or reproductions of copyrighted materials. You may use the digitized material for private study, scholarship, or research. |