[DBpedia-developers] compression algorithm used for dump files

2016-09-16 Thread Jörn Hees
Hi, as i mentioned at the DBpedia meetup yesterday, i'd like to discuss the motivation to use bz2 as compression algorithm for the dump files. bz2 might have the advantage that it's well known, but apart from that it's outdated. Other compression algorithms (for example xz) compress and

[DBpedia-developers] dump file checksums

2016-09-16 Thread Jörn Hees
Hi, as i mentioned at the DBpedia meetup yesterday, it would be great if there were checksum files for the dump files (for example in each of the folders). My use-case is mostly to be able to quickly check if i have the current version of files. It happened to me a couple of times already that

[DBpedia-developers] DBpedia endpoint reproducibility

2016-09-16 Thread Jörn Hees
Hi, for some evaluations i'd like to reproduce the public DBpedia endpoint as closely as possible (SPARQL + lookup with content negotiation, e.g. http://dbpedia.org/resource/Kaiserslautern). What exactly is loaded how on the online version? So for example: - What was the starting state of the