Hi Patrick, Thank you.
Just to specify what I mean by broken IRI support. I know IRIs work in Virtuoso quite good, better than in most other RDF Stores and it's just the RDF/XML serializer that has a small encoding bug, but RDF/XML seems to be the default serialization for SPARQL answers. People just use a common RDF framework, try to query the endpoint and get garbled results, after which they complain about the endpoint not working right. I know you can specify another serialization format like N3 or Turtle or use a small hack and get the right encoding, but I found that out the hard way as most people who try to query any Internationalized DBpedia endpoint will do. Kind Regards, Alexandru On 10/19/2011 05:08 PM, Patrick van Kleef wrote: > Hi Alexandru, > >> It would be quite nice to get an answer about this issue from someone at >> OpenLink since it seems that they do read this mailing list and this is >> a known issue. >> BTW I need to correct the title of this mail. The issue is not with the >> DBpedia VAD, it is with Virtuoso itself since the SPARQL endpoint >> returns the same garbled results. So at this time the Virtuoso IRI >> handling is broken at least when using SPARQL . > > I have passed on your observation to the Virtuoso development team and > i am awaiting an answer. > > > Patrick > --- > OpenLink Software > >> On 10/18/2011 09:29 AM, Dimitris Kontokostas wrote: >>> Hi Alexandru, >>> >>> This is a known issue and we reported it to virtuoso ~9 months ago. >>> Unfortunatelly we use debian packages for our installation which >>> usually are a little behind from the latest releases, so we can't say >>> if it is fixed >>> >>> But, IRIs cannot be 100% serialized in RDF/XML. >>> So even if Virtuoso fixes the encoding, the rdf might still be invalid >>> >>> Regards, >>> Dimitris >>> >>> On Mon, Oct 17, 2011 at 6:42 PM, Alexandru >>> Todor<[email protected]> wrote: >>>> Hi, >>>> >>>> I've recieved a mail a couple of weeks ago from some users of the >>>> German >>>> DBpedia a few weeks ago who where reporting that they weren't getting >>>> any results when querying the endpoint for URIs that contained German >>>> umlauts(or any other utf8 characters). I reported the issue to the >>>> Jena >>>> mailing list and they fixed it, but in the process we also >>>> discovered a >>>> bug with Virtuoso. >>>> >>>> There is a problem with the IRI encoding in the DBpedia >>>> Internationalization VAD. Namely when querying the SPARQL endpoint the >>>> encoding of the IRIs in RDF/XML is garbled. The issue can be found in >>>> both Greek and German endpoints. >>>> >>>> For example: http://de.dbpedia.org/data/Berlin-Dahlem.rdf , in the >>>> first >>>> XML lines yo you will notice things linke >>>> http://de.dbpedia.org/resource/Königin-Luise-Stiftung instead of >>>> http://de.dbpedia.org/resource/Königin-Luise-Stiftung or >>>> http://de.dbpedia.org/resource/Gernot_Michael_Müller instead of >>>> http://de.dbpedia.org/resource/Gernot_Michael_Müller. You will notice >>>> simmilar issues if you look at this resource from the Greek DBpedia: >>>> http://el.dbpedia.org/data/Αλέξανδρος_ο_Μέγας.rdf . >>>> >>>> This problems is that when querying the Internationalization Endpoints >>>> not only with Jena but with any other SPARQL client, the user is going >>>> to getting garbled IRIs if they contain UTF8 characters. >>>> >>>> >>>> Kind Regards, >>>> Alexandru Todor >>>> >>>> >>>> ------------------------------------------------------------------------------ >>>> >>>> >>>> All the data continuously generated in your IT infrastructure >>>> contains a >>>> definitive record of customers, application performance, security >>>> threats, fraudulent activity and more. Splunk takes this data and >>>> makes >>>> sense of it. Business sense. IT sense. Common sense. >>>> http://p.sf.net/sfu/splunk-d2d-oct >>>> _______________________________________________ >>>> Dbpedia-discussion mailing list >>>> [email protected] >>>> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion >>>> >>> >>> >> >> >> ------------------------------------------------------------------------------ >> >> >> All the data continuously generated in your IT infrastructure contains a >> definitive record of customers, application performance, security >> threats, fraudulent activity and more. Splunk takes this data and makes >> sense of it. Business sense. IT sense. Common sense. >> http://p.sf.net/sfu/splunk-d2d-oct >> _______________________________________________ >> Dbpedia-discussion mailing list >> [email protected] >> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion > ------------------------------------------------------------------------------ All the data continuously generated in your IT infrastructure contains a definitive record of customers, application performance, security threats, fraudulent activity and more. Splunk takes this data and makes sense of it. Business sense. IT sense. Common sense. http://p.sf.net/sfu/splunk-d2d-oct _______________________________________________ Dbpedia-discussion mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
