Dear all,
I am glad to announce the release of Opera Latina Adnotata (v0.2.0), a multilayer Latin corpus consisting of 736 texts and 17M+ tokens searchable by:
1. word form 2. lemma 3. morphology (POS and morphological features) 4. syntax (dependency syntax following the AGDT annotation scheme) 5. CTS URN for work, author, and edition 6. CTS structure (e.g., "book," "section," etc.) 7. author name 8. work title 9. (experimental) IPA transcription of word forms (the "Classical Latin" one)
The data is hosted on Zenodo [1] and can be queried online through ANNIS [2]. More information can be found in the associated repository [3].
Best regards, Giuseppe Celano
----- [1] https://zenodo.org/records/15183688 [2] https://annis.varro.informatik.uni-leipzig.de/ola020#_q=bGVtbWE9InByYWVzYWdp... [3] https://github.com/OperaLatinaAdnotata/OLA