2018-01-20 0:15 GMT+01:00 Andy Seaborne <[email protected]>:

> Hi,
>
> Minimal, example file?
>

<?xml version="1.0" encoding="UTF-8"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:foaf="http://xmlns.com/foaf/0.1/">
    <foaf:Organization
        rdf:about="https://www.communecter.org/#organization.detail.id.5898612440bb4e7d28cfc81a">
      <foaf:homepage rdf:resource="http://[email protected]"/>
    </foaf:Organization>
</rdf:RDF>


> You can try resetting the IRI resolver rules: jena-iri is quite
> configurable even if its capabilities are not conveniently exposed:
>
> In released versions, the IRIFactory is
>
> IRIFactory.iriImplementation()
>
I'm puzzled because there

>
> but note that JENA-1462 changes this to
>
> IRIResolver.iriFactory()
>
> which is the one used for Turtle et al.
>
> (RDF/XML is slightly different to the RIOT languages of Turtle and
> N-triples etc.)
>
> RDFParserBuilder itself does not have a "set resolver" method (maybe it
> should?) so this is a global change.
>

Thanks for the clear explanations.
This is indeed flexible, but a bit time-consuming.
Ideally, a new option on the riot command line would say: recover and
continue after errors. Would you like me to open an issue?


> That all said,
>
> Passing the input through a text processing stage (perl, sed ...) is
> probably the better way - fix up the errors.
>

Sure, but I'm at the end of the data flow: a crowdsourcing site gathers
data of variable quality, then a developer converts several such sites into
a single XML format, and then I apply XSLT to produce RDF. So the curation
happens upstream, and I report everything I find. Also, bad IRIs do not
prevent the RDF from being loaded into TDB.
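
A pre-filter of the kind suggested above could at least list every suspect
IRI in one pass instead of stopping at the first one. What follows is only a
minimal Python sketch under stated assumptions: the names `iri_problems` and
`report_bad_iris` are hypothetical, the checks are simple heuristics (control
or formatting characters such as U+200E, a user part in an http(s) IRI, a
missing scheme) and are not the jena-iri rule set, and the input is assumed
to be well-formed XML.

```python
import unicodedata
import xml.etree.ElementTree as ET
from urllib.parse import urlsplit

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
# ElementTree exposes namespaced attributes as "{namespace}localname".
IRI_ATTRS = {f"{{{RDF_NS}}}about", f"{{{RDF_NS}}}resource"}


def iri_problems(iri):
    """Return a list of heuristic complaints about one IRI string.

    Assumption: these checks only approximate what an IRI checker
    reports; they are not Jena's rules.
    """
    problems = []
    # Whitespace, control (Cc) and format (Cf) characters, e.g. U+200E.
    bad = sorted({f"U+{ord(c):04X}" for c in iri
                  if c.isspace() or unicodedata.category(c) in ("Cc", "Cf")})
    if bad:
        problems.append("suspicious characters: " + ", ".join(bad))
    try:
        parts = urlsplit(iri)
    except ValueError as exc:
        problems.append(f"unparseable: {exc}")
        return problems
    if not parts.scheme:
        problems.append("not absolute (no scheme)")
    elif parts.scheme in ("http", "https") and "@" in parts.netloc:
        problems.append("user part in an http(s) IRI")
    return problems


def report_bad_iris(source):
    """Scan an RDF/XML file (path or binary file object).

    Every finding is collected, so one bad IRI does not hide the others.
    Returns a list of (iri, problem) pairs in document order.
    """
    findings = []
    # Attributes are already available on "start" events.
    for _event, elem in ET.iterparse(source, events=("start",)):
        for name, value in elem.attrib.items():
            if name in IRI_ATTRS:
                for problem in iri_problems(value):
                    findings.append((value, problem))
    return findings
```

Calling `report_bad_iris("file.rdf")` and printing each pair would give a
list that can be passed upstream; the reported IRIs could then be fixed at
the source or in a sed/perl stage before running riot.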


>     Andy
>
>
> On 19/01/18 15:12, Jean-Marc Vanel wrote:
>
>> Hi
>>
>> I have a 5 MB RDF/XML file with bad URLs, generated by an XSLT.
>> URIs like:
>> http://htcinemalecratere.com&amp;#8206;
>>
>> I want to get all the messages, but this stops at the first error:
>>
>> riotcmd.riot --output=rdfxml --verbose --debug --nocheck file.rdf
>>
>> with this message:
>>
>> org.apache.jena.shared.BadURIException: Only well-formed absolute URIrefs
>> can be included in RDF/XML output: <http://anais-pontault@googlegroups.com>
>> Code: 58/PROHIBITED_COMPONENT_PRESENT in USER: A component that is
>> prohibited by the scheme is present.
>>
>> And this outputs no message:
>> riotcmd.riot file.rdf
>>
>>


-- 
Jean-Marc Vanel
http://www.semantic-forms.cc:9111/display?displayuri=http://jmvanel.free.fr/jmv.rdf%23me#subject
Déductions SARL - Consulting, services, training,
Rule-based programming, Semantic Web
+33 (0)6 89 16 29 52
Twitter: @jmvanel , @jmvanel_fr ; chat: irc://irc.freenode.net#eulergui
