Is there any established xml or other markup language for novels and short stories?

I'm particularly interested in marking up dialogue with the name of the character who is speaking, and then in tools that allow extracting the dialogue of each character (e.g. to analyse and contrast the vocabulary each uses).

If so, following on from that, are there open-source ML models that try to identify the speaker to add this markup, and existing training data?

Thanks,
Darren

_______________________________________________
Corpora mailing list -- [email protected]
https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/
To unsubscribe send an email to [email protected]

Reply via email to