On Mon, Oct 27, 2014 at 11:57 AM, Stefano Bocconi <[email protected]> wrote:
> Dear list,
>
> Jena does not parse some DBpedia entities (such as
> http://dbpedia.org/page/Pachakutik_Plurinational_Unity_Movement_–_New_Country),
> I am not sure who is not following the specs.
>
> I have the following 2 errors:
>
> jena.riot - [line: 15, col: 38] Unknown char: –(8211;0x2013)
> jena.riot - [line: 20, col: 20] Unknown char: º(186;0x00BA)
>
> From [1] it seems that at least the first character causing a riot error is
> legal.
>
> [163s] PN_CHARS_BASE ::= [A-Z] | [a-z] | [#x00C0-#x00D6] |
> [#x00D8-#x00F6] | [#x00F8-#x02FF] | [#x0370-#x037D] | [#x037F-#x1FFF] |
> [#x200C-#x200D] | [#x2070-#x218F] | [#x2C00-#x2FEF] | [#x3001-#xD7FF] |
> [#xF900-#xFDCF] | [#xFDF0-#xFFFD] | [#x10000-#xEFFFF]

Assuming those ranges are ordered, doesn't 0x2013 fall in between
[#x200C-#x200D] | [#x2070-#x218F]?

//JT

--
Joshua Taylor, http://www.cs.rpi.edu/~tayloj/
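
P.S. For anyone who wants to check the ranges mechanically, here is a minimal sketch in Python. The table is copied from the PN_CHARS_BASE production quoted above; the helper name `in_pn_chars_base` is made up for illustration and is not part of Jena or any RDF library. It only tests that one production, so it says nothing about the other rules of the grammar.

```python
# Inclusive code point ranges, copied from the quoted PN_CHARS_BASE production.
PN_CHARS_BASE = [
    (0x0041, 0x005A),    # [A-Z]
    (0x0061, 0x007A),    # [a-z]
    (0x00C0, 0x00D6),
    (0x00D8, 0x00F6),
    (0x00F8, 0x02FF),
    (0x0370, 0x037D),
    (0x037F, 0x1FFF),
    (0x200C, 0x200D),
    (0x2070, 0x218F),
    (0x2C00, 0x2FEF),
    (0x3001, 0xD7FF),
    (0xF900, 0xFDCF),
    (0xFDF0, 0xFFFD),
    (0x10000, 0xEFFFF),
]


def in_pn_chars_base(ch: str) -> bool:
    """Return True if the single character ch lies in one of the ranges above.

    Hypothetical helper for illustration only.
    """
    cp = ord(ch)
    return any(lo <= cp <= hi for lo, hi in PN_CHARS_BASE)


# The two characters reported by jena.riot in the original message.
for ch in ("\u2013", "\u00ba"):
    print(f"U+{ord(ch):04X} in PN_CHARS_BASE: {in_pn_chars_base(ch)}")
```

Both characters come out as not matching: 0x2013 sits in the gap between 0x200D and 0x2070, and 0x00BA is below 0x00C0.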
