Spaces in URIs are particularly problematic; even if you can get them
into the data, using the data will likely break.
When ingesting data from somewhere else, it is good to check it before
loading, then fix as needed before loading.
riot --check file....
Andy
http://lov.okfn.org/lov.nq.gz is only 749810 quads. tdbloader2 is
overkill. Use tdbloader. tdbloader2 is an advantage for much larger
data (100 million+ and even then it is not always faster)
On 07/04/17 13:17, Martynas Jusevičius wrote:
This question comes up regurarly: http://markmail.org/message/seqiw74hhdx2u64j
On Fri, Apr 7, 2017 at 2:10 PM, Laura Morales <[email protected]> wrote:
I'm trying to import the LOV dump [1] into Fuseki using tdbloader2. Unfortunately some
quads are "broken" in the sense that they're not well-formed. For example this
one
ERROR [line: 203556, col: 152] Bad character in IRI (space):
<http://securitytoolbox.appspot.com/MASO#Objectif[space]...>
org.apache.jena.riot.RiotException: [line: 203556, col: 152] Bad character in IRI
(space): <http://securitytoolbox.appspot.com/MASO#Objectif[space]...>
Is there an option to tell tdbloader2 to simply ignore these nquads (or show a
warning) and keep going instead of raising an exception and halting?
-----------------
[1] http://lov.okfn.org/lov.nq.gz