Dear all,
I am pleased to announce the release of Opera Latina Adnotata (v0.2.0), a
multilayer Latin corpus of 736 texts and more than 17 million tokens,
searchable by:
1. word form
2. lemma
3. morphology (POS and morphological features)
4. syntax (dependency syntax following the AGDT annotation scheme)
5. CTS URN for work, author, and edition
6. CTS structure (e.g., "book," "section," etc.)
7. author name
8. work title
9. IPA transcription of word forms, "Classical Latin" pronunciation (experimental)
The data is hosted on Zenodo [1] and can be queried online through
ANNIS [2]. More information can be found in the associated repository
[3].
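
For anyone who wants to build such links programmatically: the ANNIS link
in [2] carries its query in the URL fragment. The following is a minimal
sketch, not part of the OLA release. It assumes that the underscore-prefixed
parameters (_q, _c) are unpadded URL-safe Base64, which is consistent with
the link in [2] but has not been checked against the ANNIS documentation.
The helper names (b64_decode, b64_encode, decode_fragment, build_link) are
hypothetical, and "augurium" is only an illustrative lemma.

```python
import base64
from urllib.parse import urlsplit, parse_qsl

# The link given in [2], split over several lines for readability.
ANNIS_LINK = (
    "https://annis.varro.informatik.uni-leipzig.de/ola020"
    "#_q=bGVtbWE9InByYWVzYWdpdW0i&ql=aql"
    "&_c=b2xhX3YwLjIuMF8yLG9sYV92MC4yLjBfMQ&cl=5&cr=5&s=0&l=10"
)


def b64_decode(value: str) -> str:
    """Decode unpadded URL-safe Base64 (assumed encoding of _q and _c)."""
    padded = value + "=" * (-len(value) % 4)
    return base64.urlsafe_b64decode(padded).decode("utf-8")


def b64_encode(text: str) -> str:
    """Encode text as URL-safe Base64 and strip the trailing padding."""
    return base64.urlsafe_b64encode(text.encode("utf-8")).decode("ascii").rstrip("=")


def decode_fragment(url: str) -> dict:
    """Return the fragment parameters, decoding the underscore-prefixed ones."""
    params = parse_qsl(urlsplit(url).fragment)
    return {
        key.lstrip("_"): b64_decode(value) if key.startswith("_") else value
        for key, value in params
    }


def build_link(aql: str, template: str = ANNIS_LINK) -> str:
    """Swap a new AQL query into the template link, keeping the other parameters."""
    base, _, fragment = template.partition("#")
    params = [
        (key, b64_encode(aql) if key == "_q" else value)
        for key, value in parse_qsl(fragment)
    ]
    return base + "#" + "&".join(f"{key}={value}" for key, value in params)


if __name__ == "__main__":
    decoded = decode_fragment(ANNIS_LINK)
    print(decoded["q"])  # lemma="praesagium"
    print(decoded["c"])  # ola_v0.2.0_2,ola_v0.2.0_1
    print(build_link('lemma="augurium"'))
```

Decoding the link in [2] this way shows that it runs the AQL query
lemma="praesagium" over the two corpus parts ola_v0.2.0_2 and ola_v0.2.0_1.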
Best regards,
Giuseppe Celano
-----
[1] https://zenodo.org/records/15183688
[2] https://annis.varro.informatik.uni-leipzig.de/ola020#_q=bGVtbWE9InByYWVzYWdpdW0i&ql=aql&_c=b2xhX3YwLjIuMF8yLG9sYV92MC4yLjBfMQ&cl=5&cr=5&s=0&l=10
[3] https://github.com/OperaLatinaAdnotata/OLA
--
Universität Leipzig
Institute of Computer Science
Augustusplatz 10
04109 Leipzig
Deutschland
[email protected]
_______________________________________________
Corpora mailing list -- [email protected]