Damian Steer wrote:
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
On 26/10/11 15:44, Dave Reynolds wrote:
I have a large N-Triples or N-Quads file, what's the best way
(i.e. the more strict the better for me) to validate the data in
it, before ingestion?
Use Eyeball?
Dave
The riot command line in ARQ has options for checking:
riot --validate <file>
or
riot --check=true <file> | more_processing
<http://openjena.org/wiki/RIOT>
Damian
Hi Damian
I should have RTFM ;-)
riot --validate data.nt
WARN [line: 1, col: 20] Lexical form '6.0' not valid for datatype
http://www.w3.org/2001/XMLSchema#int
WARN [line: 2, col: 20] Lexical form '6.0' not valid for datatype
http://www.w3.org/2001/XMLSchema#integer
riot --check=true data.nt
WARN [line: 1, col: 20] Lexical form '6.0' not valid for datatype
http://www.w3.org/2001/XMLSchema#int
<foo:bar1> <foo:p> "6.0"^^<http://www.w3.org/2001/XMLSchema#int> .
WARN [line: 2, col: 20] Lexical form '6.0' not valid for datatype
http://www.w3.org/2001/XMLSchema#integer
<foo:bar2> <foo:p> "6.0"^^<http://www.w3.org/2001/XMLSchema#integer> .
<foo:bar3> <foo:p> "6"^^<http://www.w3.org/2001/XMLSchema#int> .
<foo:bar4> <foo:p> "6"^^<http://www.w3.org/2001/XMLSchema#integer> .
Thanks,
Paolo
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.11 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
iEYEARECAAYFAk6oHn8ACgkQAyLCB+mTtylGQQCgn65gJocIz8kcYmJIx6YWFE4U
LJEAnR++yM8HRxcIXMFOwtbQowGDUdIW
=GZPa
-----END PGP SIGNATURE-----