Re: Cleaning triples with Riot

2012-09-03 Thread Andy Seaborne
On 03/09/12 12:41, Michael Brunnbauer wrote: Hello Andy, On Tue, Aug 28, 2012 at 04:53:55PM +0100, Andy Seaborne wrote: Can anyone tell me how to rewrite this portion of code so that the parser will throw an exception for the invalid integer ? No need - TDB ought to handle that (invalid valu

Re: Cleaning triples with Riot

2012-09-03 Thread Michael Brunnbauer
Hello Andy, On Tue, Aug 28, 2012 at 04:53:55PM +0100, Andy Seaborne wrote: > >Can anyone tell me how to rewrite this portion of code so that the parser > >will throw an exception for the invalid integer ? > > No need - TDB ought to handle that (invalid values aren't supposed to be > errors - th

Re: Cleaning triples with Riot

2012-08-28 Thread Andy Seaborne
On 28/08/12 16:19, Michael Brunnbauer wrote: ... My current problem is that the tool will not remove this invalid graph from dbpedia 3.8 with an out of range integer: "60955609

Re: Cleaning triples with Riot

2012-08-28 Thread Michael Brunnbauer
hi all find attached the sourcecode for a (sloppy written) tool to clean compressed nquad dumps for tdbloader2. The tool assumes that the named graphs are not scattered in the dump and processes one named graph at a time. If there is an exception, the corresponding named graph is not written to s

Re: Cleaning triples with Riot

2012-08-27 Thread Frank Lee
: Friday, August 24, 2012 5:03 AM Subject: Re: Cleaning triples with Riot On 21/08/12 16:55, Leigh Dodds wrote: > Hi, > > I'm doing some testing of TDB for a client. They have data in an older > RDB database which accepted triples that TDB now rejects. What's being rejected?

Re: Cleaning triples with Riot

2012-08-24 Thread Andy Seaborne
On 21/08/12 16:55, Leigh Dodds wrote: Hi, I'm doing some testing of TDB for a client. They have data in an older RDB database which accepted triples that TDB now rejects. What's being rejected? If it's syntax, then text processing n-triples is usually necessary. Is there a way I can run a d

Re: Cleaning triples with Riot

2012-08-21 Thread Stefan Scheffler
Hey Leigh, I had this problem some months ago. At this time there where no mechanism to "exclude" invalid triples. So i changed the Jena-Api and included my own parsing proccess. But it was quite complicated and it just works for my data. I think there are two not so complicated ways. *first:

Cleaning triples with Riot

2012-08-21 Thread Leigh Dodds
Hi, I'm doing some testing of TDB for a client. They have data in an older RDB database which accepted triples that TDB now rejects. Is there a way I can run a data dump through riot to clean it (i.e. leaving only acceptable triples) or getting TDB to reject triples but continue to load the rest?