PFN-PT

a Framenet annotator for Portuguese

Authors

DOI:

https://doi.org/10.14393/DL52-v16n4a2022-7

Keywords:

Portuguese Framenet (PFN-PT), Semantic Role Labeling (SRL), Semantic Parsing, Constraint Grammar (CG), PALAVRAS parser

Abstract

This article presents PFN-PT, a robust system for the automatic semantic annotation of Portuguese, consisting of a new, parsing-oriented framenet and a rule-based frame- and role-tagger. The framenet provides almost 13,000 valency frames covering 7,300 verb lemmas with 10,700 senses. Frame and role tagging is achieved by iterated matching of syntactic structures and semantic noun types with slot-filler conditions in the framenet. We discuss design principles and present frame and role statistics. In an evaluation run on news data, the system achieved an overall F-score of 92.2% for frame senses.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

Author Biography

Eckhard Bick, University of Southern Denmark

PhD. University of Southern Denmark.

References

BAKER, C. F.; FILLMORE, J. C.; LOWE, J. B. The Berkeley FrameNet project. In: Proceedings of the COLING-ACL (Montreal, Canada). ACL, 1998. p. 86-90. DOI https://doi.org/10.3115/980451.980860

BICK, E. Automatic Semantic Role Annotation for Portuguese. In: Proceedings of TIL 2007 - 5th Workshop on Information and Human Language Technology / Anais do XXVII Congresso da SBC (Rio de Janeiro, July 5-6, 2007). Rio de Janeiro, 2007. p. 1713-1716.

BICK, E. A FrameNet for Danish. In: Proceedings of NODALIDA 2011 (May 11-13, Riga, Latvia). NEALT Proceedings Series, Vol 11. Tartu: Tartu University Library, 2011. p. 34-41.

BICK, E. PALAVRAS, a Constraint Grammar-based Parsing System for Portuguese. In: BERBER SARDINHA, T.; FERREIRA, T. de L. S. B. (ed.). Working with Portuguese Corpora. London/New York: Bloomsburry Academic, 2014. p 279-302.

BICK, E. Swedish-Danish Machine Translation in a Constraint Grammar Framework. In: PRZEPIÓRKOWSKI, A.; OGRODNICZUK, M. (ed.). Advances in Natural Language Processing, Proceedings of 9th International Conference on NLP (PolTAL 2014, Warsaw, Poland, September 17-19, 2014), Heidelberg: Springer, 2014. p. 216-227.

BICK, E.; DIDRIKSEN, T. CG-3 - Beyond Classical Constraint Grammar. In: MEGYESI, B. (ed.): Proceedings of NODALIDA 2015 (May 11-13, 2015, Vilnius, Lithuania). Linköping: LiU Electronic Press, 2015. p. 31-39

BICK, E. From Treebank to Propbank: A Semantic-Role and VerbNet Corpus for Danish. In: TIEDEMANN, J. (ed.): Proceedings of the 21st Nordic Conference on Computational Linguistics (NoDaLiDa 2017, Göteborg). NEALT Proceedings Series Vol. 29. Linköping University Electronic Press, 2017. p. 202-210.

BOAS, H. C. From theory to practice: Frame semantics and the design of FrameNet. In: LANGER, S.; SCHNORBUSCH, D. (ed.): Semantik im Lexikon. Tübingen: Gunter Narr Verlag, 2005. p. 129 - 159.

BÖHMOVÁ, A.; HAJIČ, J.; HAJIČOVÁ, E.; HLADKÁ, B. The Prague dependency treebank. In: Treebanks. Springer: Dordrecht, 2003. p. 103-127. DOI https://doi.org/10.1007/978-94-010-0201-1_7

BURCHARDT, A., ERK, K., FRANK, A., KOWALSKI, A; PADO, S.; PINKAL, M. Using FrameNet for the semantic analysis of German: Annotation, representation and automation. In: BOAS, H. C. (ed.): Multilingual FrameNets in Computational Lexicography: Methods and Applications. Mouton de Guyter, 2009. p. 209-244.

CAI, R.; LAPATA, M. Syntax-aware Semantic Role Labeling without Parsing. In: Transactions of the Association for Computational Linguistics, 7. ACL, 2019. p 343-356. DOI https://doi.org/10.1162/tacl_a_00272

DO, Q. N.; LEEUWENBERG, A.; HEYMAN, G.; MOENS, M. A Flexible and Easy-to-use Semantic Role Labeling Framework for Different Languages. In: Proceedings of COLING 2018 (Demos). ACL, 2018. p. 161-165

FELLBAUM, C. (ed.). WordNet: An Eletronic Lexical Database. In: Language, Speech and Communications. MIT Press: Cambridge, Massachusetts, 1998. DOI https://doi.org/10.7551/mitpress/7287.001.0001

FILLMORE, C. J. The case for case. In: BACH; HARMS (ed.). Universals in Linguistic Theory. New York: Holt, Rinehart, and Winston, 1968. p. 1-88.

FILLMORE, C. J.; BAKER, C. F. Frame semantics for text understanding. In: Proceedings of WordNet and Other Lexical Resources Workshop. NAACL, 2001

GILARDI, L.; BAKER, C. F. Learning to Align across Languages: Toward Multilingual FrameNet. In: International FrameNet Workshop 2018: Multilingual FrameNets and Constructions (Miyaki, Japan). 2018. p. 13-22

HARTMANN, S.; KUZNETSOV, I.; MARTÍN-VALDIVIA, M. T.; GUREVYCH, I. Out-of-domain framenet semantic role labeling. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics. Vol. 1. Long Papers. ACL, 2017. p. 471-482. DOI https://doi.org/10.18653/v1/E17-1045

HERMANN, K.M; DAS, D.; WESTON, J.; GANCHEV, K. Semantic Frame Identification with distributed Word Representations. In: Proceedings of the 52nd Annual Meeting of ACL (Baltimore, Mariland). ACL, 2014. P. 1448-1458. DOI https://doi.org/10.3115/v1/P14-1136

JOHNSON, C. R.; FILLMORE, C. J. The FrameNet tagset for frame-semantic and syntactic coding of predicate-argument structure. In: Proceedings of the 1st Meeting of the North American Chapter of the Association for Computational Linguistics (ANLP-NAACL 2000, Seattle WA). ACL, 2000. p. 56-62.

KIPPER, K.; KORHONEN, A; RYANT, N.; PALMER, M. A Large-Scale Extension of VerbNet with Novel Verb Classes. In: Proceedings of the 12th EURALEX International Congress (Turin, Italy, September 2006). 2006. p. 173-184

OHARA, K., FUJII, S.; OHORI, T.; SUZUKI, R.; SAITO, H.; ISHIZAKI, S. The Japanese FrameNet project: an introduction. In: Proceedings of the satellite workshop "Building lexical resources from semantically annotated corpora” (LREC 2004). ELRA, 2004. p. 9–11.

PALMER, M; GILDEA, D.; KINGSBURY, P. The Proposition Bank: An Annotated Corpus of Semantic Roles. Computational Linguistics, 31:1., p. 71-105, 2005. DOI https://doi.org/10.1162/0891201053630264

PALMER, A.; SPORLEDER, C. Evaluating FrameNet-style semantic parsing: the role of coverage gaps in FrameNet. In: Proceedings of the 23rd international conference on computational linguistics: posters. ACL, 2010. p. 928-936.

REHBEIN, I; RUPPENHOFER, J.; SPORLEDER, C.; PINKAL, M. Adding nominal spice to SALSA - frame-semantic annotation of German nouns and verbs. In: Proceedings of KONVENS 2012 (Vienna, Austria). 2014. p. 89-97.

RUPPENHOFER, J.; ELLSWORTH, M.; PETRUCK, M. R. L.; JOHNSON, C. R.; SCHEFFCZYK, J. FrameNet II: Extended Theory and Practice. 2010. Available at: http://framenet.icsi.berkeley.edu. (accessed 28. Oct. 2021).

SALOMÃO, M. FrameNet Brasil: um trabalho em progresso. Calidoscopio 7. p. 171-182, 2009. DOI https://doi.org/10.4013/cld.2009.73.01

SHI, L.; MIHALCEA, R. Open Text Semantic Parsing Using FrameNet and WordNet. In: Proceedings of HLT-NAACL 2004, Demonstration Papers. 2004. p. 19-22. DOI https://doi.org/10.3115/1614025.1614031

SUBIRATS, C.; PETRUCK, M. Surprise: Spanish FrameNet!. Estudios de Lingüística del Español, n. 31, 2010.

TAULÉ, M. et al. Mapping Syntactic Functions into Semantic Roles. In: Proceedings of

TLT2005. 2005. p. 185¬-194.

TORRENT, T.T.; ELLSWORTH, M.J. Behind the Labels: Criteria for defining analytical categories in FrameNet Brasil. Veredas 17(1), p. 44-65, 2013.

Published

2022-09-12

How to Cite

BICK, E. PFN-PT: a Framenet annotator for Portuguese. Domínios de Lingu@gem, Uberlândia, v. 16, n. 4, p. 1401–1435, 2022. DOI: 10.14393/DL52-v16n4a2022-7. Disponível em: https://seer.ufu.br/index.php/dominiosdelinguagem/article/view/63821. Acesso em: 25 nov. 2024.