Here is a status report on the development of the Encoded Text Module, just for information.
- Today I uploaded the changes needed for the After_I and After_Soft_Dotted context checks in the special casing algorithms, closing flyspray task #16. I also uploaded some bugfixes.
- Windows host encoding support is in the NEXT state in flyspray, so anyone willing to contribute can take it (http://www.gnupdf.org/flyspray/index.php?do=details&task_id=15&project=2).
- I am still stuck with the tests for pdf_text_filter, but the most important ones are already implemented (not yet committed).
- While checking the After_Soft_Dotted context condition, I found that Unicode Word Boundary Rule #4 is not implemented. It involves grapheme cluster boundary checking, which is not trivial, so I added a new task in flyspray (http://www.gnupdf.org/flyspray/index.php?do=details&task_id=31&project=2). I am working on it now.
In any case, I think the module can be used as is, without problems, by the filesystem module and/or others.
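
For anyone curious about what the After_Soft_Dotted check involves, here is a rough sketch of the condition as I understand it from the Unicode casing context rules: a Soft_Dotted character precedes the current one, with no intervening character of combining class 0 or 230. It is written in Python only for illustration and is not the module's code; the name after_soft_dotted and the SOFT_DOTTED_SUBSET table are made up for this example, and the table is only a small part of the real Soft_Dotted property.

```python
import unicodedata

# Hypothetical, partial subset of the Soft_Dotted property, for illustration
# only. The full list lives in the Unicode Character Database (PropList.txt).
SOFT_DOTTED_SUBSET = {
    0x0069,  # LATIN SMALL LETTER I
    0x006A,  # LATIN SMALL LETTER J
    0x012F,  # LATIN SMALL LETTER I WITH OGONEK
    0x0456,  # CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
    0x0458,  # CYRILLIC SMALL LETTER JE
}


def after_soft_dotted(text, index):
    """Return True if text[index] is in the After_Soft_Dotted context.

    Walk backwards from the character just before `index`. A Soft_Dotted
    character ends the search with success; any other character whose
    combining class is 0 or 230 (Above) ends it with failure.
    """
    for ch in reversed(text[:index]):
        # Soft_Dotted characters have combining class 0 themselves,
        # so they must be tested before the class check below.
        if ord(ch) in SOFT_DOTTED_SUBSET:
            return True
        ccc = unicodedata.combining(ch)
        if ccc == 0 or ccc == 230:
            return False
    # Reached the start of the text without finding a Soft_Dotted character.
    return False
```

For example, after_soft_dotted("i\u0307", 1) is true, and so is after_soft_dotted("i\u0328\u0307", 2), since COMBINING OGONEK has class 202. With an intervening COMBINING ACUTE ACCENT (class 230), after_soft_dotted("i\u0301\u0307", 2) is false.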
-Aleksander
