I stumbled upon this on Andy Seaborne's Twitter. Might be useful to stomp out a bug or two. It's a log of the results of parsing the DBpedia dumps with a pretty strict N-Triples parser: http://www.openjena.org/~afs/DBPedia35-parse-log-2010-04-15.txt
~25k errors, which is not too bad for a 100M+ dataset. Best, Richard ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Dbpedia-discussion mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
