https://bugs.documentfoundation.org/show_bug.cgi?id=172089

            Bug ID: 172089
           Summary: Tab character gets corrupted when imported via CSV
           Product: LibreOffice
           Version: 25.8.6.2 release
          Hardware: All
                OS: All
            Status: UNCONFIRMED
          Severity: normal
          Priority: medium
         Component: Calc
          Assignee: [email protected]
          Reporter: [email protected]

Created attachment 207071
  --> https://bugs.documentfoundation.org/attachment.cgi?id=207071&action=edit
Test file to import

Today I came across an interesting character corruption bug in LibreOffice
Calc.

I am attaching a CSV file where the 2nd cell has 2 text lines, and the second
line has a tab character.

I imported that file with LibreOffice Calc. The file has an UTF-8 BOM (the
first 3 bytes are 'special'), so LibreOffice Calc recognised that the character
encoding is UTF-8. That does not really matter, because the contents are plain
ASCII.

I selected "Separated by", and then "Comma", and the file was imported
correctly.

Then I marked both cells, copied them to the clipboard, and then I pasted them
into Emacs and into Pluma (a standard text editor). Both of them showed that
the tab character had been replaced with ASCII hex code 01. That ASCII control
character is called SOH (Start of Heading). That was unexpected.

I am running Ubuntu MATE 22.04.5 LTS, which is relatively old, but LibreOffice
is newer, from this Ubuntu PPA:

https://launchpad.net/~libreoffice/+archive/ubuntu/libreoffice-still

All other applications seem able to copy and paste several text lines with
embedded tab characters without any problems.

-- 
You are receiving this mail because:
You are the assignee for the bug.

Reply via email to