A set of files prepared for the 2016 Vienna ACDH Neo-Latin workshop.
Select 20 documents containing letters from CroALa, a collection of Croatian Neo-Latin texts. Tokenize the texts. Use the Perseus Morphological Service to retrieve lemmata and morphological information. Add unambiguously identified lemmata to tokenized words.
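The last two steps can be sketched as follows. This is a minimal Python illustration, not the workshop's actual XQuery/XSL code. `SAMPLE_ANALYSES` is a hand-made, hypothetical stand-in for the results a morphological service might return, and the function names and the lowercasing of tokens are assumptions made for the example.

```python
import re

# Hypothetical stand-in for morphological-service results:
# each word form maps to the set of lemmata proposed for it.
SAMPLE_ANALYSES = {
    "epistolam": {"epistola"},
    "tuam": {"tuus"},
    "legi": {"lego", "lex"},  # ambiguous: verb form or dative of "lex"
}


def tokenize(text):
    """Split text into lowercase word tokens, dropping punctuation and digits."""
    return re.findall(r"[^\W\d_]+", text.lower())


def lemmatize_unambiguous(tokens, analyses):
    """Pair each token with a lemma only when exactly one lemma is proposed."""
    result = []
    for token in tokens:
        lemmata = analyses.get(token, set())
        # Unknown and ambiguous forms are left without a lemma (None).
        lemma = next(iter(lemmata)) if len(lemmata) == 1 else None
        result.append((token, lemma))
    return result


tokens = tokenize("Epistolam tuam legi.")
annotated = lemmatize_unambiguous(tokens, SAMPLE_ANALYSES)
print(annotated)
```

In this sketch "legi" stays without a lemma because two candidates are proposed for it, which mirrors the rule that only unambiguously identified lemmata are added.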
How do I get set up?
- The detailed description: NeoLatin@ACDH
- The XML files (the tokenized ones are in tokenized)
- XQueries and XSL
- An XML database such as BaseX
- The ability to run XSL transformations (e.g. with the oXygen XML Editor)