https://bugs.documentfoundation.org/show_bug.cgi?id=172089
Bug ID: 172089
Summary: Tab character gets corrupted when imported via CSV
Product: LibreOffice
Version: 25.8.6.2 release
Hardware: All
OS: All
Status: UNCONFIRMED
Severity: normal
Priority: medium
Component: Calc
Assignee: [email protected]
Reporter: [email protected]
Created attachment 207071
--> https://bugs.documentfoundation.org/attachment.cgi?id=207071&action=edit
Test file to import
Today I came across an interesting character corruption bug in LibreOffice
Calc.
I am attaching a CSV file where the 2nd cell has 2 text lines, and the second
line has a tab character.
I imported that file with LibreOffice Calc. The file has a UTF-8 BOM (the
first 3 bytes are EF BB BF), so LibreOffice Calc recognised the character
encoding as UTF-8. That does not really matter here, because the rest of the
contents are plain ASCII.
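For anyone who wants to reproduce this without the attachment, a comparable
file can probably be generated with the sketch below. It is only an
approximation of the attached test file: the cell texts, the file name
tab-test.csv and the helper names build_test_csv and read_cells are
assumptions made for illustration. What it shares with the attachment is the
UTF-8 BOM, two cells, and a second cell with two text lines where the second
line contains a tab character.

```python
import csv
import io

UTF8_BOM = b"\xef\xbb\xbf"  # the three BOM bytes at the start of the file


def build_test_csv() -> bytes:
    """Return a one-row CSV: BOM, then two cells; the second cell has
    two text lines and a tab character on its second line."""
    buf = io.StringIO(newline="")
    writer = csv.writer(buf, lineterminator="\n")
    # Hypothetical cell contents, not copied from the attachment.
    writer.writerow(["first cell", "line one\nline\ttwo"])
    return UTF8_BOM + buf.getvalue().encode("ascii")


def read_cells(data: bytes) -> list[str]:
    """Parse the bytes back, dropping the BOM, and return the first row."""
    text = data.decode("utf-8-sig")
    return next(csv.reader(io.StringIO(text, newline="")))


if __name__ == "__main__":
    data = build_test_csv()
    with open("tab-test.csv", "wb") as f:  # hypothetical file name
        f.write(data)
    print(read_cells(data))
```

Reading the bytes back with Python's own csv module confirms that the tab
character is present in the file before LibreOffice Calc touches it.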
I selected "Separated by", and then "Comma", and the file was imported
correctly.
I then selected both cells, copied them to the clipboard, and pasted them
into Emacs and into Pluma (a standard text editor). Both editors showed that
the tab character had been replaced with the ASCII control character 0x01,
which is called SOH (Start of Heading). That was unexpected.
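The replacement is easy to miss in an editor, so a small check of the pasted
text may help when confirming the bug. The sketch below is not part of
LibreOffice; the function name find_control_chars is an assumption made for
illustration. It reads text from standard input (for example, the clipboard
contents saved to a file) and lists every ASCII control character other than
tab, line feed and carriage return.

```python
import sys

# Control characters that are expected in ordinary multi-line text.
ALLOWED = {"\t", "\n", "\r"}


def find_control_chars(text):
    """Return (offset, code point) pairs for unexpected control characters."""
    return [
        (i, ord(ch))
        for i, ch in enumerate(text)
        if ord(ch) < 0x20 and ch not in ALLOWED
    ]


if __name__ == "__main__":
    pasted = sys.stdin.read()
    for offset, code in find_control_chars(pasted):
        print(f"offset {offset}: U+{code:04X}")
```

With the corruption described above, the SOH character would be reported as
U+0001, while text with an intact tab character produces no output.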
I am running Ubuntu MATE 22.04.5 LTS, which is relatively old, but LibreOffice
is newer, from this Ubuntu PPA:
https://launchpad.net/~libreoffice/+archive/ubuntu/libreoffice-still
All other applications seem able to copy and paste several text lines with
embedded tab characters without any problems.