Low-resource AMR-to-Text Generation: A Study on Brazilian Portuguese

Marco Antonio Sobrevilla Cabezudo, Thiago Alexandre Salgueiro Pardo

Resumen


This work presents a study of how varied strategies for tackling lowresource AMR-to-text generation for three approaches are helpful in Brazilian Portuguese. Specifically, we explore the helpfulness of additional translated corpus, different granularity levels in input representation, and three preprocessing steps. Results show that translation is useful. However, it must be used in each approach differently. In addition, finer-grained representations as characters and subwords improve the performance and reduce the bias on the development set, and preprocessing steps are helpful in different contexts, being delexicalisation and preordering the most important ones.

Texto completo:

PDF