fonemas is a Python library of methods and functions for phonologic and phonetic ranscription of Spanish words.
This library was originally intended to analyse only pohonological features relevant to verse scansion. It has expanded its functionality ever since to become a fully featured phonological and phonetic analyser with IPA and SAMPA support.
silabeador is a Python library of methods and functions for syllabic division and prosodic stress detecting for Spanish. Automatic verse scansion required a syllable separator tolerant to non-Spanish consonant clusters and unusual and non-Spanish diacritics. Other libraries available take for granted that the words are well-constructed according to the Spanish grammar. This is not always the case in particular text types, as in ü as regularly used or as a metric diacritic (e.g., Si-güen-za vs crü-el).
It achieves 99.81 % accuracy when tested against the corpus EDFU without exceptions and 98.51 % when applying exceptions such as the implicit hiatus in verbs in -uar (a-cen-tu-ar) or -uir (re-hu-ir), or words some nouns (a-rri-e-ro).
txt2tei is a Python script to encode Spanish Siglo de Oro plays as XML-TEI files. It takes a minimally annotated tabular TXT file resembling the printed layout with a reduced set of simple tags. The script speeds up the process of encoding TEI files by automating their annotation, requiring just an (almost) visual edition of the sourceTXT files.