Insights into the UD Tagset: Unveiling its Intricacies

Abstract


This opinion paper explores our inclination to draw on principles of syntactic analysis established in the grammars of our native language when using the Universal Dependencies tagset to assign dependency relation tags. Taking the Portuguese language as a case example, this study argues that a fine-grained comparison of concepts and terms used in traditional grammars of Brazilian Portuguese and those used by Universal Dependencies reveals gaps which lead to different interpretations and, ultimately, to a deviation from the envisaged universality of the dependency relations tagset.

Keywords: Universal Dependencies, dependency relations tagset, opinion paper, Portuguese syntactic analysis, corpus annotation

References

Andrews, Avery. (2007). The major functions of the noun phrase. In T. Shopen (Ed.), Language typology and syntactic description (pp. 62-154). Cambridge: Cambridge University Press.

Boland, Julie E.; Blodgett, Allison. (2006) Argument Status and PP-Attachment. Journal of Psycholinguistic Research, 35, pages 385–403. DOI 10.1007/s10936-006-9021-z

Bresnan Joan. (1982) The Mental Representation of Grammatical Relations. MIT Press, Cambridge, Massachusetts. https://doi.org/10.2307/414493

de Marneffe, Marie-Catherine; Manning, Christopher D.; Nivre, Joakim; Zeman, Daniel. (2021). Universal Dependencies. Computational Linguistics, 47(2):255–308. https://aclanthology.org/2021.cl-2.11

Nivre, Joakim; de Marneffe, Marie-Catherine; Ginter, Filip; Hajič, Jan; Manning, Christopher D.; Pyysalo, Sampo; Schuster, Sebastian; Tyers, Francis; Zeman, Daniel. (2020). Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection. In Proceedings of the Twelfth Language Resources and Evaluation Conference (LREC 2020), pages 4034–4043, Marseille, France. European Language Resources Association. https://aclanthology.org/2020.lrec-1.497

Thompson, S. A. (1997). Discourse motivations for the core-oblique distinction as a language universal. In Akio Kamio (editor), Directions in Functional Linguistics, 36, pages 59–82. John Benjamins. https://doi.org/10.1075/slcs.36.06tho
Published
2023-09-25
DURAN, Magali Sanches. Insights into the UD Tagset: Unveiling its Intricacies. In: BRAZILIAN SYMPOSIUM IN INFORMATION AND HUMAN LANGUAGE TECHNOLOGY (STIL), 14. , 2023, Belo Horizonte/MG. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2023 . p. 483-490. DOI: https://doi.org/10.5753/stil.2023.25489.