Anotação semântica automática

um novo Framenet para o português

Autores

DOI:

https://doi.org/10.14393/DL52-v16n4a2022-7

Palavras-chave:

FrameNet Português, Etiquetagem semântica, Papéis semânticos, Gramática de restrições (CG), PALAVRAS

Resumo

Este artigo apresenta o PFN-PT, um sistema robusto para a anotação semântica automática de Português, consistindo numa nova framenet com foco em parsing, e um tagger para frames e papéis semânticos baseado em regras. A framenet contêm cerca de 13.000 padrões sintáticos cobrindo 7.300 lemas verbais com 10.700 sentidos. A etiquetagem é realizada por meio de um alinhamento iterativo de estruturas sintáticas e classe semântica de substantivos com as condições listadas no framenet para argumentos sintáticos. Discutimos princípios de desenho e apresentamos estatísticas de distribuição de categorias. Numa avaliação realizada com base em textos jornalísticos, o sistema alcançou 92,2% sentidos/frames corretos para verbos.

Downloads

Não há dados estatísticos.

Métricas

Carregando Métricas ...

Biografia do Autor

Eckhard Bick, University of Southern Denmark

PhD. University of Southern Denmark.

Referências

BAKER, C. F.; FILLMORE, J. C.; LOWE, J. B. The Berkeley FrameNet project. In: Proceedings of the COLING-ACL (Montreal, Canada). ACL, 1998. p. 86-90. DOI https://doi.org/10.3115/980451.980860

BICK, E. Automatic Semantic Role Annotation for Portuguese. In: Proceedings of TIL 2007 - 5th Workshop on Information and Human Language Technology / Anais do XXVII Congresso da SBC (Rio de Janeiro, July 5-6, 2007). Rio de Janeiro, 2007. p. 1713-1716.

BICK, E. A FrameNet for Danish. In: Proceedings of NODALIDA 2011 (May 11-13, Riga, Latvia). NEALT Proceedings Series, Vol 11. Tartu: Tartu University Library, 2011. p. 34-41.

BICK, E. PALAVRAS, a Constraint Grammar-based Parsing System for Portuguese. In: BERBER SARDINHA, T.; FERREIRA, T. de L. S. B. (ed.). Working with Portuguese Corpora. London/New York: Bloomsburry Academic, 2014. p 279-302.

BICK, E. Swedish-Danish Machine Translation in a Constraint Grammar Framework. In: PRZEPIÓRKOWSKI, A.; OGRODNICZUK, M. (ed.). Advances in Natural Language Processing, Proceedings of 9th International Conference on NLP (PolTAL 2014, Warsaw, Poland, September 17-19, 2014), Heidelberg: Springer, 2014. p. 216-227.

BICK, E.; DIDRIKSEN, T. CG-3 - Beyond Classical Constraint Grammar. In: MEGYESI, B. (ed.): Proceedings of NODALIDA 2015 (May 11-13, 2015, Vilnius, Lithuania). Linköping: LiU Electronic Press, 2015. p. 31-39

BICK, E. From Treebank to Propbank: A Semantic-Role and VerbNet Corpus for Danish. In: TIEDEMANN, J. (ed.): Proceedings of the 21st Nordic Conference on Computational Linguistics (NoDaLiDa 2017, Göteborg). NEALT Proceedings Series Vol. 29. Linköping University Electronic Press, 2017. p. 202-210.

BOAS, H. C. From theory to practice: Frame semantics and the design of FrameNet. In: LANGER, S.; SCHNORBUSCH, D. (ed.): Semantik im Lexikon. Tübingen: Gunter Narr Verlag, 2005. p. 129 - 159.

BÖHMOVÁ, A.; HAJIČ, J.; HAJIČOVÁ, E.; HLADKÁ, B. The Prague dependency treebank. In: Treebanks. Springer: Dordrecht, 2003. p. 103-127. DOI https://doi.org/10.1007/978-94-010-0201-1_7

BURCHARDT, A., ERK, K., FRANK, A., KOWALSKI, A; PADO, S.; PINKAL, M. Using FrameNet for the semantic analysis of German: Annotation, representation and automation. In: BOAS, H. C. (ed.): Multilingual FrameNets in Computational Lexicography: Methods and Applications. Mouton de Guyter, 2009. p. 209-244.

CAI, R.; LAPATA, M. Syntax-aware Semantic Role Labeling without Parsing. In: Transactions of the Association for Computational Linguistics, 7. ACL, 2019. p 343-356. DOI https://doi.org/10.1162/tacl_a_00272

DO, Q. N.; LEEUWENBERG, A.; HEYMAN, G.; MOENS, M. A Flexible and Easy-to-use Semantic Role Labeling Framework for Different Languages. In: Proceedings of COLING 2018 (Demos). ACL, 2018. p. 161-165

FELLBAUM, C. (ed.). WordNet: An Eletronic Lexical Database. In: Language, Speech and Communications. MIT Press: Cambridge, Massachusetts, 1998. DOI https://doi.org/10.7551/mitpress/7287.001.0001

FILLMORE, C. J. The case for case. In: BACH; HARMS (ed.). Universals in Linguistic Theory. New York: Holt, Rinehart, and Winston, 1968. p. 1-88.

FILLMORE, C. J.; BAKER, C. F. Frame semantics for text understanding. In: Proceedings of WordNet and Other Lexical Resources Workshop. NAACL, 2001

GILARDI, L.; BAKER, C. F. Learning to Align across Languages: Toward Multilingual FrameNet. In: International FrameNet Workshop 2018: Multilingual FrameNets and Constructions (Miyaki, Japan). 2018. p. 13-22

HARTMANN, S.; KUZNETSOV, I.; MARTÍN-VALDIVIA, M. T.; GUREVYCH, I. Out-of-domain framenet semantic role labeling. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics. Vol. 1. Long Papers. ACL, 2017. p. 471-482. DOI https://doi.org/10.18653/v1/E17-1045

HERMANN, K.M; DAS, D.; WESTON, J.; GANCHEV, K. Semantic Frame Identification with distributed Word Representations. In: Proceedings of the 52nd Annual Meeting of ACL (Baltimore, Mariland). ACL, 2014. P. 1448-1458. DOI https://doi.org/10.3115/v1/P14-1136

JOHNSON, C. R.; FILLMORE, C. J. The FrameNet tagset for frame-semantic and syntactic coding of predicate-argument structure. In: Proceedings of the 1st Meeting of the North American Chapter of the Association for Computational Linguistics (ANLP-NAACL 2000, Seattle WA). ACL, 2000. p. 56-62.

KIPPER, K.; KORHONEN, A; RYANT, N.; PALMER, M. A Large-Scale Extension of VerbNet with Novel Verb Classes. In: Proceedings of the 12th EURALEX International Congress (Turin, Italy, September 2006). 2006. p. 173-184

OHARA, K., FUJII, S.; OHORI, T.; SUZUKI, R.; SAITO, H.; ISHIZAKI, S. The Japanese FrameNet project: an introduction. In: Proceedings of the satellite workshop "Building lexical resources from semantically annotated corpora” (LREC 2004). ELRA, 2004. p. 9–11.

PALMER, M; GILDEA, D.; KINGSBURY, P. The Proposition Bank: An Annotated Corpus of Semantic Roles. Computational Linguistics, 31:1., p. 71-105, 2005. DOI https://doi.org/10.1162/0891201053630264

PALMER, A.; SPORLEDER, C. Evaluating FrameNet-style semantic parsing: the role of coverage gaps in FrameNet. In: Proceedings of the 23rd international conference on computational linguistics: posters. ACL, 2010. p. 928-936.

REHBEIN, I; RUPPENHOFER, J.; SPORLEDER, C.; PINKAL, M. Adding nominal spice to SALSA - frame-semantic annotation of German nouns and verbs. In: Proceedings of KONVENS 2012 (Vienna, Austria). 2014. p. 89-97.

RUPPENHOFER, J.; ELLSWORTH, M.; PETRUCK, M. R. L.; JOHNSON, C. R.; SCHEFFCZYK, J. FrameNet II: Extended Theory and Practice. 2010. Available at: http://framenet.icsi.berkeley.edu. (accessed 28. Oct. 2021).

SALOMÃO, M. FrameNet Brasil: um trabalho em progresso. Calidoscopio 7. p. 171-182, 2009. DOI https://doi.org/10.4013/cld.2009.73.01

SHI, L.; MIHALCEA, R. Open Text Semantic Parsing Using FrameNet and WordNet. In: Proceedings of HLT-NAACL 2004, Demonstration Papers. 2004. p. 19-22. DOI https://doi.org/10.3115/1614025.1614031

SUBIRATS, C.; PETRUCK, M. Surprise: Spanish FrameNet!. Estudios de Lingüística del Español, n. 31, 2010.

TAULÉ, M. et al. Mapping Syntactic Functions into Semantic Roles. In: Proceedings of

TLT2005. 2005. p. 185¬-194.

TORRENT, T.T.; ELLSWORTH, M.J. Behind the Labels: Criteria for defining analytical categories in FrameNet Brasil. Veredas 17(1), p. 44-65, 2013.

Publicado

12.09.2022

Como Citar

BICK, E. Anotação semântica automática: um novo Framenet para o português . Domínios de Lingu@gem, Uberlândia, v. 16, n. 4, p. 1401–1435, 2022. DOI: 10.14393/DL52-v16n4a2022-7. Disponível em: https://seer.ufu.br/index.php/dominiosdelinguagem/article/view/63821. Acesso em: 12 nov. 2024.