Thanks for the quick fix! I'll verify it as well on the next run.

I discovered this while building a link graph directly from the
pages-articles dump and finding more broken links (links whose target
articles are missing) than expected.
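
A minimal sketch of this kind of check, assuming the multistream index
lists one page per line as offset:page_id:title and is bz2-compressed.
The function names and the file path in the usage note are hypothetical,
and this is not the script actually used for the link graph.

```python
import bz2


def parse_index_line(line):
    """Split one index line into (offset, page_id, title)."""
    # Titles can contain colons themselves, so split at most twice.
    offset, page_id, title = line.rstrip("\n").split(":", 2)
    return int(offset), int(page_id), title


def load_index_titles(lines):
    """Collect the set of page titles from an iterable of index lines."""
    return {parse_index_line(line)[2] for line in lines if line.strip()}


def missing_targets(link_targets, index_titles):
    """Return the link targets that have no entry in the index, sorted."""
    return sorted(set(link_targets) - index_titles)


def load_titles_from_file(path):
    """Read titles from a bz2-compressed index file (path is an assumption)."""
    with bz2.open(path, "rt", encoding="utf-8") as f:
        return load_index_titles(f)
```

With a hypothetical index path, missing_targets(["Phosphor", "TARDIS"],
load_titles_from_file("enwiki-index.txt.bz2")) would list whichever of
those titles the index lacks. Title normalization (underscores, first-letter
case) is not handled here.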

On Tue, Feb 27, 2018 at 4:10 AM, Ariel Glenn WMF <ar...@wikimedia.org>
wrote:

> It turns out that this happens for exactly 27 pages, those at the end of
> each enwiki-20180220-stub-articlesXX.xml.gz file.  Tracking here:
> https://phabricator.wikimedia.org/T188388
>
> Ariel
>
> On Tue, Feb 27, 2018 at 10:45 AM, Ryan Hitchman <hitchm...@gmail.com>
> wrote:
>
>> Multiple pages are missing from the enwiki pages-articles-multistream
>> dumps from 20180201 and 20180220.
>>
>> Page id 88444: "Phosphor" doesn't appear in the index or in the data
>> stream. This also happens for TARDIS, Psalm 132, and many others.
>>
>> Why would the dump be partial?
>>
>> _______________________________________________
>> Xmldatadumps-l mailing list
>> Xmldatadumps-l@lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
>>
>>
>
