On Friday, January 17, 2014 08:22:53 PM Andrey Siver wrote: > Dears All, > > I'm newby for Jena. > > I would like to upload a large .ttl files (the YAGO2s files from here: > http://www.mpi-inf.mpg.de/yago-naga/yago/downloads.html), but got > errors (and many-many warnings...) like this: > > org.apache.jena.riot.RiotException: [line: 958109, col: 78] Broken IRI > (bad character: '<'): wikicategory_Sports_clubs_established > > Also the resulting database become broken for some queries. > > So, I would like to upload the data by statements via Jena API and try > to fix the data "on fly". > > Are there any examples how to do it?
Why don't you check the file for problems before trying to upload it? You can use the Jena command-line tools to read the file (and throw the statements away) and then you can fix the errors. Assuming you're using a text editor that can deal with a million-line file. Chris
