Michael

N-Triples and N-Quads are explicitly defined as one triple (or quad) per
line, so CR or LF characters in an RDF term will always be invalid in these
formats.
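
If you want to screen IRIs before loading, here is a minimal sketch in
Python. It assumes the IRIREF production from the RDF 1.1 N-Triples grammar;
the function name is_valid_iriref is made up for illustration and is not part
of Jena. Note that under that grammar the characters |{}\^`" are also not
allowed inside an IRI, and the only escapes permitted are \uXXXX and
\UXXXXXXXX.

```python
import re

# IRIREF production from the RDF 1.1 N-Triples grammar:
#   '<' ([^#x00-#x20<>"{}|^`\] | UCHAR)* '>'
# UCHAR is the only escape form permitted inside an IRI.
UCHAR = r'\\u[0-9A-Fa-f]{4}|\\U[0-9A-Fa-f]{8}'
IRIREF = re.compile(r'<(?:[^\x00-\x20<>"{}|^`\\]|' + UCHAR + r')*>')


def is_valid_iriref(token: str) -> bool:
    """Return True if token (including the angle brackets) matches IRIREF."""
    return IRIREF.fullmatch(token) is not None
```

This only checks a single IRI token that has already been extracted; it does
not parse whole N-Triples lines, and it says nothing about what tdbloader2
itself accepts.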

The on-disk TDB format has been stable since TDB 0.9.0, so it can be used
with any Jena tooling that supports TDB 0.9.0 or higher (though we
recommend you always use the latest version wherever possible, since our
support model for past versions is "please upgrade").

Note that 2.13.0 does provide for some low-level customisation of the
storage, which requires the store always to be read with the same setup it
was created with.  However, if you are using the stock command line tools,
this won't affect you.

Rob

On 31/03/2015 10:06, "Michael Brunnbauer" <[email protected]> wrote:

>
>hi all,
>
>tdbloader2 will not accept IRIs with CR or LF like this one from the
>Wikidata RDF dump:
>
> 
><http://freital.de/index.phtml?La=1&object=tx|530.4535.1&NavID=530.81&sub=0\n>
>
>But it will happily accept IRIs with |{}\\^`"
>
>I guess there is no chance that the Semantic Web community agrees on what a
>valid N-Triples/N-Quads file looks like?
>
>Also, tdbloader2 seems to be gradually slowed down from 100k triples/s to
>< 1000 triples/s on a normal disk drive by random access after ca. 10
>million triples. Is this unavoidable? I made this change to tdbloader2 but I
>think it is not relevant during the data phase:
>
>-    SORT_ARGS="--buffer-size=50%"
>+    SORT_ARGS="--buffer-size=2048M"
>
>I have tried with Jena 2.13.0 and 2.11.1.
>
>Can a TDB generated with Jena 2.13.0 be used with Fuseki 1.1.1?
>
>Regards,
>
>Michael Brunnbauer
>
>-- 
>++  Michael Brunnbauer
>++  netEstate GmbH
>++  Geisenhausener Straße 11a
>++  81379 München
>++  Tel +49 89 32 19 77 80
>++  Fax +49 89 32 19 77 89
>++  E-Mail [email protected]
>++  http://www.netestate.de/
>++
>++  Sitz: München, HRB Nr.142452 (Handelsregister B München)
>++  USt-IdNr. DE221033342
>++  Geschäftsführer: Michael Brunnbauer, Franz Brunnbauer
>++  Prokurist: Dipl. Kfm. (Univ.) Markus Hendel



