Analysis of Negation Annotation in Corpora Under the Universal Dependencies Guidelines
Abstract
This paper analyzes negation annotation in Portuguese based on the Universal Dependencies (UD) guidelines. By reviewing negative operators and exclusion prepositions in the UD Portuguese Bosque, Porttinari-base corpora, and the PortiLexicon-UD tool, we identify inconsistencies in morphological features and syntactic relations. We propose adjustments to improve annotation accuracy and consistency, supporting NLP tasks such as sentiment analysis, information extraction, and machine translation.References
Chapman, W. W., Bridewell, W., Hanbury, P., Cooper, G. F., and Buchanan, B. G. (2001). A simple algorithm for identifying negated findings and diseases in discharge summaries. Journal of Biomedical Informatics, 34(5):301–310.
Cyrino, S. M. L. (2024). More on negation in brazilian portuguese. Estudos linguísticos e literários.
de Moura Neves, M. H. (2000). Gramática de usos do português. Unesp.
Duran, M., Lopes, L., Nunes, M. d. G., and Pardo, T. (2023). The dawn of the porttinari multigenre treebank: Introducing its journalistic portion. In Anais do XIV Simpósio Brasileiro de Tecnologia da Informação e da Linguagem Humana (STIL 2023), pages 115–124, Porto Alegre, RS, Brasil. Sociedade Brasileira de Computação.
Goldin, I. and Chapman, W. W. (2003). Learning to detect negation with ‘not’ in medical texts. In Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2003) Workshop on Text Analysis and Search for Bioinformatics.
Jiménez-Zafra, S. M., Morante, R., Martín-Valdivia, M. T., and Ureña-López, L. A. (2020). Corpora annotated with negation: An overview. Computational Linguistics, 46(1):1–52.
Li, Y., Thomas, M. A., and Liu, D. (2021). From semantics to pragmatics: where is can lead in natural language processing (nlp) research. European Journal of Information Systems, 30(5):569–590.
Lopes, L., Duran, M. S., Fernandes, P., and Pardo, T. A. S. (2022a). Portilexicon-ud: a portuguese lexical resource according to universal dependencies model. In Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), Marseille, France.
Lopes, L., Duran, M. S., Nunes, M. d. G. V., and Pardo, T. A. S. (2022b). Corpora building process according to the universal dependencies model: an experiment for portuguese. [link].
Mioto, C. (1992). Negação sentencial no português brasileiro e teoria da gramática. Tese de doutorado em linguística, Universidade Estadual de Campinas (UNICAMP), Campinas, SP.
Mioto, C. (1998). Tipos de negação. Cadernos de estudos linguisticos, 34.
Moia, T. (2024). A distribuição dos adjuntos temporais negativos no português contemporâneo: negação, concordância negativa e construções de grau. Diacrítica, 38(1):226–253.
Mutalik, P. G., Deshpande, A., and Nadkarni, P. M. (2001). Use of general-purpose negation detection to augment concept indexing of medical documents: a quantitative study using the umls. Journal of the American Medical Informatics Association, 8(6):598–609.
Rademaker, A., Chalub, F., Real, L., Freitas, C., Bick, E., and Paiva, V. (2017). Universal dependencies for portuguese. In Proceedings of the Fourth International Conference on Dependency Linguistics (Depling 2017), pages 197–206, Pisa, Italy.
Santana, M. (2019). News of the brazilian newspaper.
Schwenter, S. A. (2016). Some issues in negation in portuguese. In The Handbook of Portuguese Linguistics, pages 425–440. Wiley-Blackwell.
Straka, M., Straková, J., and Gamba, F. (2024). ÚFAL LatinPipe at EvaLatin 2024: Morphosyntactic analysis of Latin. In Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING2024, pages 207–214, Torino, Itália. ELRA and ICCL.
Universal Dependencies (2021). Universal dependencies. [link]. Acesso em: 03 junho 2025.
Cyrino, S. M. L. (2024). More on negation in brazilian portuguese. Estudos linguísticos e literários.
de Moura Neves, M. H. (2000). Gramática de usos do português. Unesp.
Duran, M., Lopes, L., Nunes, M. d. G., and Pardo, T. (2023). The dawn of the porttinari multigenre treebank: Introducing its journalistic portion. In Anais do XIV Simpósio Brasileiro de Tecnologia da Informação e da Linguagem Humana (STIL 2023), pages 115–124, Porto Alegre, RS, Brasil. Sociedade Brasileira de Computação.
Goldin, I. and Chapman, W. W. (2003). Learning to detect negation with ‘not’ in medical texts. In Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2003) Workshop on Text Analysis and Search for Bioinformatics.
Jiménez-Zafra, S. M., Morante, R., Martín-Valdivia, M. T., and Ureña-López, L. A. (2020). Corpora annotated with negation: An overview. Computational Linguistics, 46(1):1–52.
Li, Y., Thomas, M. A., and Liu, D. (2021). From semantics to pragmatics: where is can lead in natural language processing (nlp) research. European Journal of Information Systems, 30(5):569–590.
Lopes, L., Duran, M. S., Fernandes, P., and Pardo, T. A. S. (2022a). Portilexicon-ud: a portuguese lexical resource according to universal dependencies model. In Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), Marseille, France.
Lopes, L., Duran, M. S., Nunes, M. d. G. V., and Pardo, T. A. S. (2022b). Corpora building process according to the universal dependencies model: an experiment for portuguese. [link].
Mioto, C. (1992). Negação sentencial no português brasileiro e teoria da gramática. Tese de doutorado em linguística, Universidade Estadual de Campinas (UNICAMP), Campinas, SP.
Mioto, C. (1998). Tipos de negação. Cadernos de estudos linguisticos, 34.
Moia, T. (2024). A distribuição dos adjuntos temporais negativos no português contemporâneo: negação, concordância negativa e construções de grau. Diacrítica, 38(1):226–253.
Mutalik, P. G., Deshpande, A., and Nadkarni, P. M. (2001). Use of general-purpose negation detection to augment concept indexing of medical documents: a quantitative study using the umls. Journal of the American Medical Informatics Association, 8(6):598–609.
Rademaker, A., Chalub, F., Real, L., Freitas, C., Bick, E., and Paiva, V. (2017). Universal dependencies for portuguese. In Proceedings of the Fourth International Conference on Dependency Linguistics (Depling 2017), pages 197–206, Pisa, Italy.
Santana, M. (2019). News of the brazilian newspaper.
Schwenter, S. A. (2016). Some issues in negation in portuguese. In The Handbook of Portuguese Linguistics, pages 425–440. Wiley-Blackwell.
Straka, M., Straková, J., and Gamba, F. (2024). ÚFAL LatinPipe at EvaLatin 2024: Morphosyntactic analysis of Latin. In Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING2024, pages 207–214, Torino, Itália. ELRA and ICCL.
Universal Dependencies (2021). Universal dependencies. [link]. Acesso em: 03 junho 2025.
Published
2025-09-29
How to Cite
MIRANDA JR., Isaac Souza de; VALE, Oto Araújo.
Analysis of Negation Annotation in Corpora Under the Universal Dependencies Guidelines. In: BRAZILIAN SYMPOSIUM IN INFORMATION AND HUMAN LANGUAGE TECHNOLOGY (STIL), 16. , 2025, Fortaleza/CE.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação,
2025
.
p. 627-634.
DOI: https://doi.org/10.5753/stil.2025.37865.
