I love this change, thank you!
On Wed, Apr 4, 2018 at 4:33 PM Ariel Glenn WMF wrote:
> Those of you that rely on the abstracts dumps will have noticed that the
> content for wikidata is pretty much useless. It doesn't look like a
> summary of the page because main
http://dumps.wikimedia.org/wikidatawiki/20150207/
http://dumps.wikimedia.org/wikidatawiki/20150204/
What's wrong?
--
Amir
___
Xmldatadumps-l mailing list
Xmldatadumps-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
Hello,
Wikidata dumps (e.g this http://dumps.wikimedia.org/wikidatawiki/20140612/)
have an annoying plus one named Yahoo abstracts, It has more than 16 GBs
(mainly because it's not zipped) and because content of Wikidata pages are
saved in term of numbers and codes instead of wikitext (e.g. this