afs commented on issue #1259:
URL: https://github.com/apache/jena/issues/1259#issuecomment-1100607544
Hi @LorenzBuehmann,
I vaguely (it was a very long time ago!) recall this coming up before. A
difference now is that the only site this affects is likely to be Wikidata (and
even then, only for now).
Here is an MVCE:
```java
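import org.apache.jena.sparql.exec.RowSet;
import org.apache.jena.sparql.exec.RowSetOps;
import org.apache.jena.sparql.exec.http.QueryExecHTTP;
import org.apache.jena.sparql.exec.http.QuerySendMode;

public class Issue1259 {
    public static void main(String... args) {
        // Query containing the literal character U+00E7.
        String qs = "SELECT ?x { BIND('Curaçao' As ?x) }";
        // Same query with the character written as a SPARQL codepoint escape.
        String qsx = "SELECT ?x { BIND('Cura\\u00E7ao' As ?x) }";
        RowSet rowSet = QueryExecHTTP
            .service("https://query.wikidata.org/sparql")
            //.sendMode(QuerySendMode.asPostForm)
            //.sendMode(QuerySendMode.asPost)
            .sendMode(QuerySendMode.asGetAlways)
            .queryString(qs) // use qsx here to test the escaped form
            .select();
        RowSetOps.out(rowSet);
    }
}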
```
After checking, the corruption happens on the receiving side of the request, and `qsx`
works in all three cases.
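
Since the escaped form `qsx` works in every send mode, one possible client-side workaround is to escape non-ASCII characters before sending. The following is only a sketch: `SparqlEscapes` and `escapeNonAscii` are hypothetical names, not part of Jena, and it relies on SPARQL codepoint escapes (`\uXXXX` and `\UXXXXXXXX`) being processed before the query is parsed.

```java
class SparqlEscapes {
    /** Replace every non-ASCII code point with a SPARQL codepoint escape. */
    static String escapeNonAscii(String query) {
        StringBuilder sb = new StringBuilder(query.length());
        query.codePoints().forEach(cp -> {
            if (cp < 0x80) {
                sb.appendCodePoint(cp);                  // plain ASCII: copy unchanged
            } else if (cp <= 0xFFFF) {
                sb.append(String.format("\\u%04X", cp)); // BMP: 4 hex digits
            } else {
                sb.append(String.format("\\U%08X", cp)); // beyond BMP: 8 hex digits
            }
        });
        return sb.toString();
    }

    public static void main(String[] args) {
        // Builds the same string as qsx from the plain query text.
        System.out.println(escapeNonAscii("SELECT ?x { BIND('Cura\u00E7ao' As ?x) }"));
    }
}
```

The result can then be passed to `.queryString(...)` in place of `qs`, so that only ASCII bytes go over the wire.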
The three send modes give three different results:
* `asGetAlways` works.
* `asPost` is corrupted in a way that looks like UTF-8 read as ISO-8859-?.
* `asPostForm` shows a different corruption; I'm not sure what it is, and that one
might be Jena.
I don't know why ISO-8859 would be used if their servers run Linux (where the system
default would typically be UTF-8). That hints it is a choice in the Blazegraph code.
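
For reference, the `asPost` symptom can be reproduced locally without any server. This is a minimal sketch of the suspected failure mode, assuming the receiver decodes the UTF-8 request body as ISO-8859-1; the class and method names are made up for illustration, and it is not a confirmed description of what Blazegraph does.

```java
import java.nio.charset.StandardCharsets;

class Utf8AsLatin1Demo {
    /** Encode as UTF-8, then (wrongly) decode the bytes as ISO-8859-1. */
    static String misdecode(String s) {
        byte[] utf8 = s.getBytes(StandardCharsets.UTF_8);
        return new String(utf8, StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        // U+00E7 is the two bytes C3 A7 in UTF-8; read as ISO-8859-1,
        // each byte becomes its own character (U+00C3 and U+00A7).
        System.out.println(misdecode("Cura\u00E7ao"));
    }
}
```

If the corrupted value returned by Wikidata has two characters where the single U+00E7 should be, that would be consistent with this kind of mis-decoding.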