Rhetorical discourse signals: revisiting RST annotation in the CSTNews corpus
Abstract
Rhetorical Structure Theory (RST) is a discourse theory in which the coherence of a text can be characterized by a tree structure, where the discourse units are the leaves and the nodes represent the rhetorical relations between them. Although it is known that the identification of connectives that indicate these relations plays an important role in text processing, the absence of a prototypical discourse marker does not eliminate the possibility of their interpretation. In this paper, we describe the analysis of a sample from a corpus already annotated with RST, aiming to identify how these relations are signaled in the discourse. The results highlight the importance of investigating other flags in addition to DMs.
References
Cardoso, P.C.F.; Maziero, E.G.; Jorge, M.L.C.; Seno, E.M.R.; Di Felippo, A.; Rino, L.H.M.; Nunes, M.G.V.; Pardo, T.A.S. (2011) CSTNews - A Discourse-Annotated Corpus for Single and Multi-Document Summarization of News Texts in Brazilian Portuguese. In: Proceedings of the 3rd RST Brazilian Meeting, pp. 88-105. Cuiabá/MT, Brasil.
Das, D. e Taboada, M. (2018) RST Signalling Corpus: A corpus of signals of coherence relations. Language Resources and Evaluation, Vol 52, N. 1, pp. 149-184. [link].
Hirata-Vale, F. B. M. e Oliveira, T. P. (2014) Modelos e Métodos de Análise Funcionalista. In: GONÇALVES, A. V.; GÓIS, M. L. S. (Org.). Ciências da Linguagem: O Fazer Científico - Volume 2. Campinas: Mercado de Letras.
Mann, W. C. e Thompson, S. A. (1988) Rhetorical structure theory: Toward a functional theory of text organization. Text-interdisciplinary Journal for the Study of Discourse, Vol. 8, N.3, pp. 243–281.
Pardo, T. A. S. (2015) Métodos para análise discursiva automática. Tese (Doutorado em Ciências da Computação e Matemática Computacional). São Carlos: Universidade de São Paulo, 211p. [link].
Taboada, M. e Mann, W. C. (2006) Rhetorical Structure Theory: Looking back and moving ahead. Discourse Studies, Vol. 8, N. 3, pp. 423-459. [link].
Taboada, M. e Das, D.. (2013) Annotation upon annotation: Adding signalling information to a corpus of discourse relations. Dialogue & Discourse, V. 4, N. 2, pp. 249-281. https://doi.org/10.5087/dad.2013.211
