TY - GEN
T1 - Ice and Fire
T2 - 4th Workshop on Threat, Aggression and Cyberbullying, TRAC 2024
AU - Friðriksdóttir, Steinunn Rut
AU - Simonsen, Annika
AU - Ásmundsson, Atli Snær
AU - Friðjónsdóttir, Guðrún Lilja
AU - Ingason, Anton Karl
AU - Snæbjarnarson, Vésteinn
AU - Einarsson, Hafsteinn
N1 - Publisher Copyright: © 2024 ELRA Language Resource Association.
PY - 2024
Y1 - 2024
N2 - This study introduces "Ice and Fire," a Multi-Task Learning (MTL) dataset tailored for sentiment analysis in the Icelandic language. It encompasses a wide range of linguistic tasks, including sentiment and emotion detection, as well as the identification of toxicity, hate speech, encouragement, sympathy, sarcasm/irony, and trolling. With 261 fully annotated blog comments and 1,045 comments annotated in at least one task, this contribution marks a significant step forward in the field of Icelandic natural language processing. The dataset provides a comprehensive resource for understanding the nuances of online communication in Icelandic and an interface to expand the annotation effort. Despite the challenges inherent in subjective interpretation of text, our findings highlight the positive potential of this dataset to improve text analysis techniques and encourage more inclusive online discourse in Icelandic communities. With promising baseline performances, "Ice and Fire" sets the stage for future research to enhance automated text analysis and develop sophisticated language technologies, contributing to healthier online environments and advancing Icelandic language resources.
AB - This study introduces "Ice and Fire," a Multi-Task Learning (MTL) dataset tailored for sentiment analysis in the Icelandic language. It encompasses a wide range of linguistic tasks, including sentiment and emotion detection, as well as the identification of toxicity, hate speech, encouragement, sympathy, sarcasm/irony, and trolling. With 261 fully annotated blog comments and 1,045 comments annotated in at least one task, this contribution marks a significant step forward in the field of Icelandic natural language processing. The dataset provides a comprehensive resource for understanding the nuances of online communication in Icelandic and an interface to expand the annotation effort. Despite the challenges inherent in subjective interpretation of text, our findings highlight the positive potential of this dataset to improve text analysis techniques and encourage more inclusive online discourse in Icelandic communities. With promising baseline performances, "Ice and Fire" sets the stage for future research to enhance automated text analysis and develop sophisticated language technologies, contributing to healthier online environments and advancing Icelandic language resources.
KW - Icelandic Language Resources
KW - Multi-Task Learning
KW - Sentiment Analysis
UR - https://www.scopus.com/pages/publications/85195197382
M3 - Conference contribution
T3 - TRAC 2024: 4th Workshop on Threat, Aggression and Cyberbullying at LREC-COLING 2024 - Workshop Proceedings
SP - 73
EP - 84
BT - TRAC 2024
A2 - Kumar, Ritesh
A2 - Ojha, Atul Kr.
A2 - Malmasi, Shervin
A2 - Chakravarthi, Bharathi Raja
A2 - Lahiri, Bornini
A2 - Singh, Siddharth
A2 - Ratan, Shyam
PB - European Language Resources Association (ELRA)
Y2 - 20 May 2024
ER -