Spelling Normalisation of Basque Historical Texts

Ainara Estarrona, Izaskun Etxeberria, Ander Soraluze, Manuel Padilla


This paper presents a computational method for normalising the spelling of Basque historical texts, together with its evaluation in a real scenario, so that the texts can be analysed using standard Natural Language Processing (NLP) tools. This normalisation work is part of a more general ongoing project called Basque in the Making (BIM): A Historical Look at a European Language Isolate, whose main objective is the systematic and diachronic study of a number of grammatical features of the Basque language.
