[Wikidata-bugs] [Maniphest] [Commented On] T165228: Use UTF-8 for files with query results from the WDQS
abian added a comment.

None of the provided formats (verbose or non-verbose JSON file, verbose or non-verbose TSV file, CSV file) produces correct output. This problem doesn't seem to depend on the web browser or on the operating system. You can also use this query for testing. You should be able to download the results and read all the characters using UTF-8 encoding, without any spurious line breaks.

TASK DETAIL
https://phabricator.wikimedia.org/T165228

EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: abian
Cc: Mbch331, Smalyshev, VIGNERON, Lea_Lacroix_WMDE, Lucas_Werkmeister_WMDE, Jonas, Ash_Crow, abian, Aklapper, GoranSMilovanovic, QZanden, EBjune, merbst, Avner, debt, Gehel, FloNight, Xmlizer, Izno, jkroll, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T165228: Use UTF-8 for files with query results from the WDQS
Mbch331 added a comment.

I can reproduce it with the link Ash_Crow shared. I ran that query and chose Download -> CSV in the menu. I've attached the resulting file. When I open the file in Notepad++ it already looks strange, with line breaks within values. I'm using Firefox 53.0.2 (64-bit) on Windows 10. Doing the same in Chrome 58.0.3029.110 (64-bit) on the same computer gives the same result: values with line breaks in them.

F8135195: query.csv
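The "line breaks within values" symptom described above can be checked mechanically rather than by eye. The following is a minimal sketch, not part of the original report: the function name `fields_with_line_breaks` is hypothetical, and it assumes the downloaded file has already been decoded to text. It only finds breaks inside properly quoted fields; an unquoted break would instead show up as a row with too few columns.

```python
import csv
import io
from typing import List, Tuple


def fields_with_line_breaks(csv_text: str) -> List[Tuple[int, int]]:
    """Return (row, column) positions, 0-indexed, of values containing a line break.

    Hypothetical helper for illustration only.
    """
    hits = []
    # newline="" keeps line endings untranslated so the csv module sees them as-is.
    reader = csv.reader(io.StringIO(csv_text, newline=""))
    for row_idx, row in enumerate(reader):
        for col_idx, value in enumerate(row):
            if "\n" in value or "\r" in value:
                hits.append((row_idx, col_idx))
    return hits


if __name__ == "__main__":
    sample = 'item,label\r\nQ1,"foo\nbar"\r\nQ2,baz\r\n'
    print(fields_with_line_breaks(sample))
```

Running this on the text of a downloaded query.csv would list which cells contain a break, which may help narrow down whether the breaks come from the data or from the export.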
[Wikidata-bugs] [Maniphest] [Commented On] T165228: Use UTF-8 for files with query results from the WDQS
VIGNERON added a comment.

@Smalyshev in my case, I tried several queries on Ubuntu and Windows, with Chrome and Firefox, and with every option (JSON, TSV, CSV, verbose or not), always opening the file with LibreOffice Calc (version: 5.1.6.2). The problem is always the same. The original query (on Wikidata:Bistro) was this one (instance of family name with writing system Latin script).
[Wikidata-bugs] [Maniphest] [Commented On] T165228: Use UTF-8 for files with query results from the WDQS
Smalyshev added a comment.

@abian could you please add: the query that you were running, the browser (type & version) that you were using, and which of the download options you chose?
[Wikidata-bugs] [Maniphest] [Commented On] T165228: Use UTF-8 for files with query results from the WDQS
VIGNERON added a comment.

I've tried several encodings, including all the ISO-8859 ones (from ISO-8859-1 to ISO-8859-15), but none seems to match the encoding used...
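The trial-and-error over encodings described above can be sketched as a small script. This is an illustration under assumptions, not something from the thread: the helper name `try_encodings` is hypothetical, and the candidate list is just UTF-8 plus the ISO-8859 parts for which Python ships a codec (there is no `iso8859_12`). Note that ISO-8859-1 decodes any byte sequence without error, so a successful decode alone does not prove a match; the decoded text still has to be inspected.

```python
from typing import Dict, Iterable, Optional

# Python provides no iso8859_12 codec, so that part is skipped.
ISO_8859_PARTS = [n for n in range(1, 16) if n != 12]
CANDIDATES = ["utf-8"] + [f"iso8859_{n}" for n in ISO_8859_PARTS]


def try_encodings(
    data: bytes, candidates: Iterable[str] = CANDIDATES
) -> Dict[str, Optional[str]]:
    """Decode `data` with each candidate; map encoding name to text, or None on failure.

    Hypothetical helper for illustration only.
    """
    results: Dict[str, Optional[str]] = {}
    for name in candidates:
        try:
            results[name] = data.decode(name)
        except UnicodeDecodeError:
            results[name] = None
    return results


if __name__ == "__main__":
    raw = "é".encode("utf-8")  # stand-in for bytes read from a downloaded file
    for name, text in try_encodings(raw).items():
        print(name, repr(text))
```

For real use, `raw` would be the bytes of the downloaded file, read in binary mode.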