Conference – 4-6 December 2024 – Pisa
Rachele Sprugnoli and Arianna Redaelli present two papers at the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024).
1) “Is Sentence Splitting a Solved Task? Experiments to the Intersection Between NLP and Italian Linguistics”. Sentence splitting, that is, the segmentation of raw input text into sentences, is a fundamental step in text processing. Although it is considered a solved task for texts such as news articles and Wikipedia pages, system performance can vary greatly depending on the text genre. This paper evaluates eight sentence splitting tools that adopt different approaches (rule-based, supervised, semi-supervised, and unsupervised learning) on Italian 19th-century novels, a genre that has not received sufficient attention so far but that can be an interesting common ground between Natural Language Processing and Digital Humanities.
2) “Annotation and Detection of Emotion Polarity in I Promessi Sposi: Dataset and Experiments”. Emotions play a crucial role in literature and are studied in various disciplines, e.g. literary criticism, psychology, and anthropology, and more recently also with computational methods in NLP. However, studies in the Italian context are still limited. This work therefore aims to advance the state of the art in emotion analysis applied to historical texts by proposing a new dataset and describing the results of a set of emotion polarity detection experiments. The text analyzed is “I Promessi Sposi” in its final edition (published in 1840), one of the most important novels in the Italian literary and linguistic canon.