On 28/01/15 10:31, Lorenz Bühmann wrote:
Hello Andy,

first of all, thanks for the answer. I added answers to your comments
inline below.


Comments inline and at the end ...

...

This is a warning - the parser emits the data and continues ...

(I'm somewhat tempted to turn the NF tests off - while strictly
correct, few people worry or understand NF - feedback welcome).

From my point of view the warnings are quite confusing, although I
usually tend to ignore this kind of warning.

Very true.

In Unicode you can write the same thing in different ways, especially with accented characters. You can have a single code point for the letter with the accent, or the code point for the letter without the accent followed by a combining code point meaning "put this accent (modifier) on the character before".
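
To make the two spellings concrete, here is a minimal sketch using the JDK's java.text.Normalizer. The class name NormalizationDemo and the choice of "é" as the example character are only illustrative assumptions; this is not how Jena implements its NF check.

```java
import java.text.Normalizer;

public class NormalizationDemo {
    // "e with acute" as one precomposed code point (U+00E9).
    static final String PRECOMPOSED = "\u00E9";
    // Plain "e" (U+0065) followed by COMBINING ACUTE ACCENT (U+0301).
    static final String DECOMPOSED = "e\u0301";

    // True if the string is already in Normalization Form C.
    static boolean isNfc(String s) {
        return Normalizer.isNormalized(s, Normalizer.Form.NFC);
    }

    // Converts the string to Normalization Form C.
    static String toNfc(String s) {
        return Normalizer.normalize(s, Normalizer.Form.NFC);
    }

    public static void main(String[] args) {
        // The two spellings render the same but are different strings.
        System.out.println(PRECOMPOSED.equals(DECOMPOSED));        // false
        System.out.println(isNfc(PRECOMPOSED));                    // true
        System.out.println(isNfc(DECOMPOSED));                     // false
        // After normalizing to NFC they compare equal.
        System.out.println(toNfc(DECOMPOSED).equals(PRECOMPOSED)); // true
    }
}
```

A warning about NF means the string is in a form like the decomposed one: still valid text, just not in the normalized spelling.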



and now we have a real error.

What's line 513? (You can get the response by using curl or wget).
Well, from what I can see line 513 contains

ns56:Лауреати_премії_«Еммі» ,

so I guess the char « is unknown for some reason.

Yes.
« is not legal in a prefix name.

11:48:30,584 ErrorHandlerFactory$ErrorLogger - [line: 513, col: 24]
Unknown char: «(171;0x00AB)

The actual error comes from looking for a new Turtle token and not
finding a start-of-token marker like < or " or a digit.  So it assumes a
prefix name (which does not start with an identifying character).
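
As a rough illustration of why « (U+00AB) is rejected, the sketch below checks a local name against the PN_CHARS character ranges of the Turtle grammar. It is a simplification under stated assumptions: it ignores the position-specific rules for the first and last character, and it accepts '%' and '\' without validating the escape that follows. The class and method names are hypothetical, and this is not Jena's tokenizer code.

```java
public class PrefixedNameCheck {
    // Local name from line 513 of the response, written with Unicode
    // escapes so the source file stays ASCII.
    static final String LINE_513_LOCAL_NAME =
        "\u041B\u0430\u0443\u0440\u0435\u0430\u0442\u0438_"
        + "\u043F\u0440\u0435\u043C\u0456\u0457_"
        + "\u00AB\u0415\u043C\u043C\u0456\u00BB";

    // Inclusive code point ranges of PN_CHARS_BASE in the Turtle grammar.
    private static final int[][] PN_CHARS_BASE = {
        {'A', 'Z'}, {'a', 'z'}, {0x00C0, 0x00D6}, {0x00D8, 0x00F6},
        {0x00F8, 0x02FF}, {0x0370, 0x037D}, {0x037F, 0x1FFF},
        {0x200C, 0x200D}, {0x2070, 0x218F}, {0x2C00, 0x2FEF},
        {0x3001, 0xD7FF}, {0xF900, 0xFDCF}, {0xFDF0, 0xFFFD},
        {0x10000, 0xEFFFF}
    };

    // PN_CHARS: PN_CHARS_BASE plus '_', '-', digits and a few extras.
    static boolean isPnChars(int cp) {
        for (int[] range : PN_CHARS_BASE) {
            if (cp >= range[0] && cp <= range[1]) return true;
        }
        return cp == '_' || cp == '-' || (cp >= '0' && cp <= '9')
            || cp == 0x00B7 || (cp >= 0x0300 && cp <= 0x036F)
            || (cp >= 0x203F && cp <= 0x2040);
    }

    // Returns the char index of the first code point that cannot appear
    // in a local name under this simplified check, or -1 if none is found.
    static int firstIllegalIndex(String localName) {
        for (int i = 0; i < localName.length(); ) {
            int cp = localName.codePointAt(i);
            boolean ok = isPnChars(cp) || cp == ':' || cp == '.'
                || cp == '%' || cp == '\\';
            if (!ok) return i;
            i += Character.charCount(cp);
        }
        return -1;
    }

    public static void main(String[] args) {
        int i = firstIllegalIndex(LINE_513_LOCAL_NAME);
        int cp = LINE_513_LOCAL_NAME.codePointAt(i);
        // prints "index 16: U+00AB"
        System.out.println(String.format("index %d: U+%04X", i, cp));
    }
}
```

The Cyrillic letters fall inside the allowed ranges; only the guillemets fall outside, which matches the "Unknown char" reported in the log.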

It might be badly written data (some unescaped significant character
earlier in the triple).  It's a structural problem with the data sent back.
Ok, so the DBpedia endpoint aka Virtuoso seems to return some
structurally illegal data. Probably I'll have to file an issue or at
least ask on their mailing list.

Yes.

This is not a data problem. The other end (DBpedia) should not send illegal Turtle ever.



(Hmm - the stack trace does not seem to quite agree with the current
codebase.  What version are you running?)
I used JENA ARQ 2.11.2, but now updated to

JENA ARQ 2.12.1
JENA Core 2.12.1
JENA IRI 1.1.1

The stacktrace seems to be the same as before:


Thanks. 2.11.2 should be OK - I didn't know the code had moved about that much so I suspected a much older version.

        Andy
