Hi,
I have been working with a few select files from the v3.8 dump of DBpedia,
and have noticed duplicate entries in one of the files, *images_en.nt*.
The entire file is 1.4 GiB, and contains 7 370 587 lines.
I came across one statement that appears in this file 10 times:
<http://upload.wikimedia.org/wikipedia/commons/3/32/CentralMichiganChippewas.png>
<http://purl.org/dc/elements/1.1/rights>
<http://en.wikipedia.org/wiki/File:CentralMichiganChippewas.png> .
This one statement is present on lines:
2997045
3588625
5294480
5424560
5798660
5910525
6009955
6516790
6894525
7338075
Can someone tell me why this is? There may be other instances, but I've
only come across this one, and I wanted to check with the
community-at-large to see if this is known and/or intentional.
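In case it helps anyone check for further instances, here is a minimal
sketch of how the file could be scanned for repeated statements. It
assumes one statement per line and treats two lines as duplicates only
when they are identical after trimming whitespace (no RDF-level
normalisation). The names find_duplicate_lines and report are made up
for illustration and are not part of any DBpedia tooling.

```python
import hashlib
from collections import defaultdict


def find_duplicate_lines(lines):
    """Return {statement: [1-based line numbers]} for repeated statements."""
    first_seen = {}                 # SHA-1 digest -> line number of first occurrence
    duplicates = defaultdict(list)  # statement text -> every line number it appears on
    for number, line in enumerate(lines, start=1):
        statement = line.strip()
        if not statement or statement.startswith("#"):
            continue  # skip blank lines and N-Triples comment lines
        # Store a fixed-size digest rather than the full text of every line,
        # so memory stays modest on a multi-million-line file.
        digest = hashlib.sha1(statement.encode("utf-8")).digest()
        if digest in first_seen:
            if statement not in duplicates:
                duplicates[statement].append(first_seen[digest])
            duplicates[statement].append(number)
        else:
            first_seen[digest] = number
    return dict(duplicates)


def report(path):
    """Print count, line numbers, and text of each repeated statement in a file."""
    with open(path, encoding="utf-8") as handle:
        for statement, numbers in find_duplicate_lines(handle).items():
            print(len(numbers), numbers, statement)
```

Calling report("images_en.nt") would then list each repeated statement
together with the line numbers it occurs on, in the same form as the
list above.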
Thanks,
- A
_______________________________________________
Dbpedia-discussion mailing list
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion