[Wikidata-bugs] [Maniphest] [Commented On] T165228: Use UTF-8 for files with query results from the WDQS

2017-05-20 Thread abian
abian added a comment.
None of the provided formats (verbose or non-verbose JSON file, verbose or non-verbose TSV file, CSV file) is correct. This problem doesn't seem to depend on the web browser or on the operating system.

You can also use this query for testing. You should be able to download and read all the characters from the results using UTF-8 encoding and without any line break.

TASK DETAIL
https://phabricator.wikimedia.org/T165228

EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: abian
Cc: Mbch331, Smalyshev, VIGNERON, Lea_Lacroix_WMDE, Lucas_Werkmeister_WMDE, Jonas, Ash_Crow, abian, Aklapper, GoranSMilovanovic, QZanden, EBjune, merbst, Avner, debt, Gehel, FloNight, Xmlizer, Izno, jkroll, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T165228: Use UTF-8 for files with query results from the WDQS

2017-05-20 Thread Mbch331
Mbch331 added a comment.
I can reproduce it with the link Ash_Crow shared. I ran that query and chose Download -> CSV in the menu. I've attached the resulting file. When I open the file in Notepad++, it already looks strange, with line breaks within values.
I'm using Firefox 53.0.2 (64-bit) on Windows 10. Doing the same in Chrome 58.0.3029.110 (64-bit) on the same computer gives the same result: values with line breaks in them.
F8135195: query.csv
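
The symptom described above can also be checked without relying on a particular editor. The following is only a rough sketch under stated assumptions: the function name `inspect_csv_bytes` and the sample data are hypothetical, and the download is assumed to be an ordinary comma-separated file. It reports whether the raw bytes decode as UTF-8 and how many fields contain a line break.

```python
import csv
import io


def inspect_csv_bytes(data: bytes) -> dict:
    """Report whether data is valid UTF-8 and how many fields contain line breaks."""
    try:
        text = data.decode("utf-8")
    except UnicodeDecodeError:
        # The bytes are not valid UTF-8, so the fields cannot be inspected.
        return {"valid_utf8": False, "fields_with_line_breaks": None}

    # newline="" disables newline translation so the csv module sees the
    # line breaks inside quoted fields exactly as they are in the data.
    reader = csv.reader(io.StringIO(text, newline=""))
    broken = sum(
        1
        for row in reader
        for field in row
        if "\n" in field or "\r" in field
    )
    return {"valid_utf8": True, "fields_with_line_breaks": broken}


if __name__ == "__main__":
    # Hypothetical sample: the last row has a line break inside a quoted value.
    sample = 'item,label\nQ1,"Müller"\nQ2,"Mül\nler"\n'.encode("utf-8")
    print(inspect_csv_bytes(sample))
```

To run it against a real download, the file would be read in binary mode (for example `open("query.csv", "rb").read()`) so that no decoding happens before the check.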


[Wikidata-bugs] [Maniphest] [Commented On] T165228: Use UTF-8 for files with query results from the WDQS

2017-05-20 Thread VIGNERON
VIGNERON added a comment.
@Smalyshev in my case, I tried several queries on Ubuntu and Windows, with Chrome and Firefox, and with every option (JSON, TSV, CSV, verbose or not), always opening the file with LibreOffice Calc (version 5.1.6.2). The problem is always the same.

The original query (on Wikidata:Bistro) was this one (instance of family name with writing system Latin script).


[Wikidata-bugs] [Maniphest] [Commented On] T165228: Use UTF-8 for files with query results from the WDQS

2017-05-20 Thread Smalyshev
Smalyshev added a comment.
@abian could you please add:

- the query that you were running
- the browser (type and version) that you were using
- which of the download options you chose


[Wikidata-bugs] [Maniphest] [Commented On] T165228: Use UTF-8 for files with query results from the WDQS

2017-05-20 Thread VIGNERON
VIGNERON added a comment.
I've tried several encodings, including all the ISO-8859 variants (from ISO-8859-1 to ISO-8859-15), but none seems to match the encoding used...
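
Trying encodings one at a time in a spreadsheet program is slow, and the same check can be scripted. The following is a minimal sketch, not a diagnosis of this bug: the helper name `matching_encodings`, the candidate list, and the sample bytes are all assumptions made for illustration. It decodes the raw bytes with each candidate encoding and keeps the ones that reproduce a string known to be in the results.

```python
from typing import Iterable, List


def matching_encodings(data: bytes, expected: str, candidates: Iterable[str]) -> List[str]:
    """Return the candidate encodings under which data decodes to text containing expected."""
    matches = []
    for name in candidates:
        try:
            decoded = data.decode(name)
        except (UnicodeDecodeError, LookupError):
            # Either the bytes are invalid in this encoding or Python
            # does not know the encoding name; skip it in both cases.
            continue
        if expected in decoded:
            matches.append(name)
    return matches


if __name__ == "__main__":
    # Hypothetical sample: a label that is known to appear in the results.
    raw = "Müller".encode("utf-8")
    candidates = ["utf-8", "iso-8859-1", "iso-8859-15", "cp1252"]
    print(matching_encodings(raw, "Müller", candidates))
```

An empty list would mean that none of the candidates recovers the expected text from the bytes as they are stored in the file.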