2018-01-20 0:15 GMT+01:00 Andy Seaborne <[email protected]>:

> Hi,
>
> Minimal, example file?

<?xml version="1.0" encoding="UTF-8"?>
<rdf:RDF xmlns:foaf="http://xmlns.com/foaf/0.1/" >
<foaf:Organization rdf:about=" https://www.communecter.org/#organization.detail.id.5898612440bb4e7d28cfc81a" >
  <foaf:homepage rdf:resource="http://[email protected]"/>
</foaf:Organization>
</rdf:RDF>

> You can try resetting the IRI resolver rules: jena-iri is quite
> configurable even if its capabilities are conveniently exposed:
>
> In released versions, the IRIFactory is
>
> IRIFactory.iriImplementation()
>

I'm puzzled because there

> but note that JENA-1462 change this to
>
> IRIResolver.iriFactory()
>
> which is the one used for Turtle et al.
>
> (RDF/XML is slightly different to the RIOT languages of Turtle and
> N-triples etc.)
>
> RDFParserBuilder itself does not have a "set resolver" method (maybe it
> should?) so this is a global change.
>

Thanks for the clear explanations.
This is indeed flexible, but a bit time-consuming.
Ideally, a new option on the riot command line would say: recover and continue after errors.
Shall I open an issue?

> That all said,
>
> Passing the input through a text processing stage (perl, sed ...) is
> probably the better way - fix up the errors.
>

Sure, but I'm at the end of the data flow: a crowdsourcing site gathers data of variable quality, then a developer converts several such sites into a single XML format, and then I apply XSLT to produce RDF.
So it gets curated upstream, and I report everything I find.
And bad IRIs do not prevent the RDF from being loaded into TDB.

> Andy
>
>
> On 19/01/18 15:12, Jean-Marc Vanel wrote:
>
>> Hi
>>
>> I have a 5 Mb RDF/XML file with bad URL's , generated by an XSLT.
>> URI's like :
>> http://htcinemalecratere.com&#8206;
>>
>> I want to get all the messages , but this stops at the first error :
>>
>> riotcmd.riot --output=rdfxml --verbose --debug --nocheck file.rdf
>>
>> with such message:
>>
>> org.apache.jena.shared.BadURIException: Only well-formed absolute URIrefs
>> can be included in RDF/XML output:
>> <http://anais-pontault@googlegroups.com>
>> Code: 58/PROHIBITED_COMPONENT_PRESENT in USER: A component that is
>> prohibited by the scheme is present.
>>
>> And this outputs no message :
>> riotcmd.riot file.rdf
>>
>>

-- 
Jean-Marc Vanel
http://www.semantic-forms.cc:9111/display?displayuri=http://jmvanel.free.fr/jmv.rdf%23me#subject
Déductions SARL - Consulting, services, training,
Rule-based programming, Semantic Web
+33 (0)6 89 16 29 52
Twitter: @jmvanel , @jmvanel_fr ; chat: irc://irc.freenode.net#eulergui
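
A minimal sketch of the "text processing stage" suggested in the thread, written in Python rather than perl or sed. It is not part of Jena or riot: the names `IRI_ATTR`, `check_iri` and `find_suspect_iris` are hypothetical, and the three checks (surrounding whitespace, invisible characters such as U+200E, a user-info part in an http(s) IRI) are assumptions drawn only from the examples quoted above. It lists every suspicious `rdf:about` / `rdf:resource` value instead of stopping at the first one, and it only sees double-quoted values that sit on a single line.

```python
import html
import re
from urllib.parse import urlsplit

# Double-quoted rdf:about / rdf:resource values that fit on one line.
IRI_ATTR = re.compile(r'rdf:(?:about|resource)\s*=\s*"([^"]*)"')


def check_iri(raw):
    """Return a list of problems found in one raw attribute value."""
    problems = []
    value = html.unescape(raw)  # resolve references such as &#8206;
    if value != value.strip():
        problems.append("leading or trailing whitespace")
    value = value.strip()
    # Spaces, control characters and zero-width / directional marks.
    bad = [c for c in value
           if ord(c) < 0x21 or ord(c) == 0x7F or 0x200B <= ord(c) <= 0x200F]
    if bad:
        codes = ", ".join(f"U+{ord(c):04X}" for c in bad)
        problems.append("control or invisible characters: " + codes)
    try:
        parts = urlsplit(value)
    except ValueError:
        problems.append("not parseable as a URL")
        return problems
    if parts.scheme in ("http", "https") and "@" in parts.netloc:
        problems.append("user-info part in an http(s) IRI")
    return problems


def find_suspect_iris(text):
    """Yield (line_number, raw_value, problems) for each suspicious IRI."""
    for lineno, line in enumerate(text.splitlines(), start=1):
        for match in IRI_ATTR.finditer(line):
            problems = check_iri(match.group(1))
            if problems:
                yield lineno, match.group(1), problems
```

Usage would be along the lines of `for hit in find_suspect_iris(open("file.rdf", encoding="utf-8").read()): print(hit)`, run on the XSLT output before it is handed to riot.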
