https://bugs.documentfoundation.org/show_bug.cgi?id=147088

Stephan Bergmann <sberg...@redhat.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |er...@redhat.com

Christian Lohmaier <cl...@documentfoundation.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |cl...@documentfoundation.or
                   |                            |g

--- Comment #7 from Stephan Bergmann <sberg...@redhat.com> ---
So the test.fods (attachment 177937) contains

> <table:table-cell table:formula="of:=UNICHAR(65535)" 
> office:value-type="string" office:string-value="" calcext:value-type="string">

with an empty office:string-value="", while the test.xlsx (attachment 177938)
xl/worksheets/sheet1.xml stream contains

> <c r="A1" s="0" t="str"><f aca="false">_xlfn.UNICHAR(65535)</f><v>�</v></c>

with an (UTF-8 encoded) U+FFFF.

Eike, do you know whether saving to .[f]ods has some code that explicitly
filters out non-characters, whereas saving to .xslx presumably implicitly
relied on the sal/rtl/textenc code converting to UTF-8 to filter out
non-characters (and which it no longer does since
<https://git.libreoffice.org/core/+/cd563e7b807fe038ebefb228e70bc587c040d17d%5E%21>
"Do not exclude Unicode noncharacters from rtl_convertUnicodeToText")?

--- Comment #8 from Stephan Bergmann <sberg...@redhat.com> ---
(In reply to Stephan Bergmann from comment #7)
> Eike, do you know whether saving to .[f]ods has some code that explicitly
> filters out non-characters, whereas saving to .xslx presumably implicitly
> relied on the sal/rtl/textenc code converting to UTF-8 to filter out
> non-characters (and which it no longer does since
> <https://git.libreoffice.org/core/+/
> cd563e7b807fe038ebefb228e70bc587c040d17d%5E%21> "Do not exclude Unicode
> noncharacters from rtl_convertUnicodeToText")?

[I assume Bugzilla failed to send out emails for comment 7 due to the verbatim
U+FFFF contained in that comment, which, it claimed in its web UI, it couldn't
convert to UTF-8; phh]

--- Comment #9 from Eike Rathke <er...@redhat.com> ---
sax/source/expatwrap/saxwriter.cxx SaxWriterHelper::convertToXML() does such
thing with IsInvalidChar()

--- Comment #10 from Eike Rathke <er...@redhat.com> ---
Fwiw, opening the attached .fods in Calc for me cell A1 is not empty but
contains the expected =UNICHAR(65535) formula expression.

(and yes, the literal 0xffff glyph in comment 7 kicks Bugzilla and its mailing
into the abyss for every comment added).

--- Comment #11 from Christian Lohmaier <cl...@documentfoundation.org> ---
Note: the U+FFFF in the original line from comment#7 has been replaced by a
U+FFFD to not trip up bugzilla

-- 
You are receiving this mail because:
You are the assignee for the bug.

Reply via email to