Michael,

N-Triples and N-Quads are explicitly defined as one triple (or quad) per line, so CR or LF characters in an RDF term will always be invalid in these formats.

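As a rough illustration of that rule (a sketch only, not Jena's parser code), the check below follows the IRIREF production of the RDF 1.1 N-Triples grammar: no characters in the range U+0000 to U+0020, none of <>"{}|^`\, and a backslash only as the start of a \uXXXX or \UXXXXXXXX escape. The function name is_valid_iriref is made up for this example, and the check says nothing about whether the IRI is otherwise well formed.

```python
import re

# IRIREF from the N-Triples grammar:
#   '<' ([^#x00-#x20<>"{}|^`\] | UCHAR)* '>'
# where UCHAR is \uXXXX or \UXXXXXXXX (hex digits).
IRIREF = re.compile(
    r'<(?:[^\x00-\x20<>"{}|^`\\]|\\u[0-9A-Fa-f]{4}|\\U[0-9A-Fa-f]{8})*>'
)


def is_valid_iriref(token: str) -> bool:
    """Return True if token (angle brackets included) matches IRIREF.

    Hypothetical helper for illustration; it checks the grammar only.
    """
    return IRIREF.fullmatch(token) is not None


if __name__ == "__main__":
    # A plain IRI is accepted.
    print(is_valid_iriref("<http://example.org/a>"))      # True
    # A raw line feed inside the IRI is rejected.
    print(is_valid_iriref("<http://example.org/a\n>"))    # False
    # The two-character escape backslash + n is rejected as well:
    # only \u and \U escapes are allowed inside an IRI.
    print(is_valid_iriref("<http://example.org/a\\n>"))   # False
    # Characters such as | are also outside the grammar.
    print(is_valid_iriref("<http://example.org/a|b>"))    # False
```

By this grammar the characters |{}\^`" are also not allowed inside an IRI, so a loader that accepts them is being more lenient than the specification.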
The on-disk TDB format has been stable since TDB 0.9.0 and so can be used with any Jena tooling that supports TDB 0.9.0 or higher (though we recommend you always use the latest versions wherever possible, since our support model for past versions is "please upgrade").

Note that 2.13.0 does provide for some low level customisation of the storage, which requires the store to be always read with the same setup as it was created. However, if you are using the stock command line tools this won't affect you.

Rob

On 31/03/2015 10:06, "Michael Brunnbauer" <[email protected]> wrote:

>
>hi all,
>
>tdbloader2 will not accept IRIs with CR or LF like this one from the Wikidata
>RDF dump:
>
><http://freital.de/index.phtml?La=1&object=tx|530.4535.1&NavID=530.81&sub=0\n>
>
>But it will happily accept IRIs with |{}\\^`"
>
>I guess there is no chance that the Semantic Web community agrees on how a
>valid ntriples/nquads file looks like?
>
>Also, tdbloader2 seems to be gradually slowed down from 100k triples/s to
>< 1000 triples/s on a normal disk drive by random access after ca. 10 million
>triples. Is this unavoidable? I made this change to tdbloader2 but I think it
>is not relevant during the data phase:
>
>- SORT_ARGS="--buffer-size=50%"
>+ SORT_ARGS="--buffer-size=2048M"
>
>I have tried with Jena 2.13.0 and 2.11.1.
>
>Can a TDB generated with Jena 2.13.0 be used with Fuseki 1.1.1?
>
>Regards,
>
>Michael Brunnbauer
>
>--
>++ Michael Brunnbauer
>++ netEstate GmbH
>++ Geisenhausener Straße 11a
>++ 81379 München
>++ Tel +49 89 32 19 77 80
>++ Fax +49 89 32 19 77 89
>++ E-Mail [email protected]
>++ http://www.netestate.de/
>++
>++ Sitz: München, HRB Nr.142452 (Handelsregister B München)
>++ USt-IdNr. DE221033342
>++ Geschäftsführer: Michael Brunnbauer, Franz Brunnbauer
>++ Prokurist: Dipl. Kfm. (Univ.) Markus Hendel
