https://bugzilla.wikimedia.org/show_bug.cgi?id=54289

       Web browser: ---
            Bug ID: 54289
           Summary: CsvResultPrinter needs UTF-8 byte order mark in order
                    for Excel to properly recognize UTF-8 encoding
           Product: MediaWiki extensions
           Version: master
          Hardware: All
                OS: All
            Status: NEW
          Severity: enhancement
          Priority: Unprioritized
         Component: Semantic MediaWiki
          Assignee: [email protected]
          Reporter: [email protected]
                CC: [email protected], [email protected],
                    [email protected], [email protected]
    Classification: Unclassified
   Mobile Platform: ---

Created attachment 13314
  --> https://bugzilla.wikimedia.org/attachment.cgi?id=13314&action=edit
Patch showing how to add the UTF-8 byte order mark to the output of the
CsvResultPrinter

The problem can be duplicated by having an inline query which returns results
containing non-latin characters.  If you specify the output format to be csv
with a delimiter of a semicolon, once you click on the link for the results,
Excel will open it directly.  While the file is encoded as UTF-8, Excel can't
figure that out.  Reimporting the file into Excel with the correct encoding is
a non-intuitive multi-step process that is a bit of a hassle.  

Appending the UTF-8 byte order mark (see
http://roosmaa.net/importing-utf-8-csvs-in-excel/) to the output of the
CsvResultPrinter makes everything work like it should.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to