Hi all,

Currently there are two HTML formats, "html" and "html-noent". This is
the difference between them:

$ echo å | apertium -f html-noent nob-nno
å

$ echo å | apertium -f html       nob-nno
&aring;

i.e. the one named "html" replaces some (but not all!) non-ASCII
characters with XML entities (e.g. &aring; for å).
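
To make "some but not all" concrete, here is a minimal Python sketch of that
kind of replacement. It is not apertium's actual code: the function name
to_entities is hypothetical, and using the HTML 4 named-entity table as the
set of replaceable characters is an assumption made only for illustration.

```python
from html.entities import codepoint2name  # HTML 4 named entities, keyed by code point


def to_entities(text: str) -> str:
    """Replace non-ASCII characters that have a named entity; leave the rest untouched."""
    out = []
    for ch in text:
        name = codepoint2name.get(ord(ch))
        if ord(ch) > 127 and name is not None:
            out.append(f"&{name};")   # e.g. å becomes &aring;
        else:
            out.append(ch)            # ASCII, or non-ASCII with no named entity
    return "".join(out)


print(to_entities("å"))
print(to_entities("ŋ"))  # no HTML 4 named entity, so it passes through as-is
```

With a table like this, å would be rewritten but ŋ would not, which is the
sort of inconsistency that can surprise new users.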

I believe the "html" behaviour is more unexpected/surprising to new
users, and it would make sense to make the "html-noent" behaviour the
default, while renaming the other one to something like "html-ent", so
that we could get:

$ echo å | apertium -f html     nob-nno
å

$ echo å | apertium -f html-ent nob-nno
&aring;

(and keeping "html-noent" around as an alias to avoid breakage)

But first: Does anyone have hard-to-change scripts or programs that
depend on the current behaviour (while still needing up-to-date apertium
versions)?

_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
