Opera Latina Adnotata (v0.2.0) - Corpora

10 Apr 2025


      Dear all,
I am glad to announce the release of Opera Latina Adnotata (v0.2.0), a  
multilayer Latin corpus consisting of 736 texts and 17M+ tokens  
searchable by:
1. word form
2. lemma
3. morphology (POS and morphological features)
4. syntax (dependency syntax following the AGDT annotation scheme)
5. CTS URN for work, author, and edition
6. CTS structure (e.g., "book," "section," etc.)
7. author name
8. work title
9. (experimental) IPA transcription of word forms (the "Classical Latin" one)
The data is hosted on Zenodo [1] and can be queried online through  
ANNIS [2]. More information can be found in the associated repository  
[3].
Best regards,
Giuseppe Celano
-----
[1] https://zenodo.org/records/15183688
[2]  
https://annis.varro.informatik.uni-leipzig.de/ola020#_q=bGVtbWE9InByYWVzYWdp...
[3] https://github.com/OperaLatinaAdnotata/OLA
-- 
Universität Leipzig
Institute of Computer Science
Augustusplatz 10
04109 Leipzig
Deutschland
celano@informatik.uni-leipzig.de