https://bugs.documentfoundation.org/show_bug.cgi?id=60145
--- Comment #8 from Mike Kaganski <[email protected]> ---
A code pointer: SwASCIIParser::ReadChars() in sw/source/filter/ascii/parasc.cxx
does the autodetection of the encoding (of a 4 KiB buffer) using
SwIoSystem::IsDetectableText. The latter only checks for BOM. I suppose we
should not change that, but change the following processing (in case when
currentCharSet == RTL_TEXTENCODING_DONTKNOW).
In that case, we should possibly try treating the file as UTF-8, with options
that strictly detect invalid sequences, and in case of failure, restart with
RTL_TEXTENCODING_ASCII_US (or maybe user/working locale?).
--
You are receiving this mail because:
You are the assignee for the bug.
_______________________________________________
Libreoffice-bugs mailing list
[email protected]
https://lists.freedesktop.org/mailman/listinfo/libreoffice-bugs