[Xmldatadumps-l] New web server for dumps/datasets, OLD ONE GOING AWAY

2018-04-04 Thread Ariel Glenn WMF
Folks,

As you'll have seen from the previous email, we are now using a new, beefier
webserver for your dataset downloading needs. And the old server is going
away on TUESDAY April 10th.

This means that if you are using 'dataset1001.wikimedia.org' or the IP
address itself in your scripts, you MUST change it before Tuesday, or it
will stop working.
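
For anyone unsure whether their scripts still mention the old host, here is a minimal sketch of a check in Python. The hostname string comes from the message above. The function names, the `*.sh` file pattern, and the idea of scanning a directory tree are illustrative assumptions only. The IP address is not given here, so the sketch does not look for it.

```python
from pathlib import Path

# Hostname named in the announcement; the IP address is not stated there,
# so this sketch only searches for the name.
OLD_HOST = "dataset1001.wikimedia.org"


def find_old_host(text, needle=OLD_HOST):
    """Return 1-based line numbers of lines that mention the old hostname."""
    return [
        number
        for number, line in enumerate(text.splitlines(), start=1)
        if needle in line
    ]


def scan_tree(root, pattern="*.sh"):
    """Map each matching file under root to the line numbers that mention the host.

    The default pattern is a hypothetical choice; adjust it to your scripts.
    """
    hits = {}
    for path in Path(root).rglob(pattern):
        if not path.is_file():
            continue
        lines = find_old_host(path.read_text(errors="replace"))
        if lines:
            hits[str(path)] = lines
    return hits
```

Calling `scan_tree(".")` from the directory that holds your download scripts would list each file and the lines that need attention.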

There will be no further reminders.

Thanks!

Ariel
___
Xmldatadumps-l mailing list
Xmldatadumps-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l


[Xmldatadumps-l] Change for abstracts dumps, primarily for wikidata

2018-04-04 Thread Ariel Glenn WMF
Those of you that rely on the abstracts dumps will have noticed that the
content for wikidata is pretty much useless.  It doesn't look like a
summary of the page because main namespace articles on wikidata aren't
paragraphs of text. And there's really no useful summary to be generated,
even if we were clever.

We have instead decided to produce abstracts output only for pages in the
main namespace that consist of text. For pages that are of type
wikidata-item, json and so on, the <abstract> tag will contain the
attribute 'not-applicable' set to the empty string. This impacts a very few
pages on other wikis; for the full list and for more information on this
change, see  https://phabricator.wikimedia.org/T178047
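
For consumers of the abstracts dumps, the sketch below shows one way to skip the entries marked this way. It assumes the tag in question is `<abstract>`, that each page is wrapped in a `<doc>` element with a `<title>` child, and that the attribute appears literally as `not-applicable=""`. The sample data and the function name are made up for illustration; check them against a real dump file before relying on this.

```python
import io
import xml.etree.ElementTree as ET


def iter_text_abstracts(source):
    """Yield (title, abstract_text) for <doc> entries that carry a real abstract.

    Entries whose <abstract> element has a 'not-applicable' attribute
    (even with an empty value) are skipped.
    """
    for _event, elem in ET.iterparse(source, events=("end",)):
        if elem.tag != "doc":
            continue
        abstract = elem.find("abstract")
        if abstract is not None and "not-applicable" not in abstract.attrib:
            yield elem.findtext("title", default=""), (abstract.text or "")
        # Drop the children of the processed <doc> to keep memory use low.
        elem.clear()


# Hypothetical sample in the assumed layout, not taken from a real dump.
SAMPLE = b"""<feed>
<doc><title>Wikidata: Q42</title><abstract not-applicable="" /></doc>
<doc><title>Wikidata: Help:Contents</title><abstract>Some help text.</abstract></doc>
</feed>"""

results = list(iter_text_abstracts(io.BytesIO(SAMPLE)))
```

The same function accepts a file name or an open binary file, so it can be pointed at a decompressed dump file instead of the in-memory sample.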

We hope this change will be merged in a week or so; it won't take effect
for wikidata until the next dumps run on April 20th, since the wikidata
abstracts are already in progress.

If you have any questions, don't hesitate to ask.

Ariel


Re: [Xmldatadumps-l] [Wikitech-l] Change for abstracts dumps, primarily for wikidata

2018-04-04 Thread Amir Ladsgroup
I love this change, thank you!

On Wed, Apr 4, 2018 at 4:33 PM Ariel Glenn WMF wrote:

> Those of you that rely on the abstracts dumps will have noticed that the
> content for wikidata is pretty much useless.  It doesn't look like a
> summary of the page because main namespace articles on wikidata aren't
> paragraphs of text. And there's really no useful summary to be generated,
> even if we were clever.
>
> We have instead decided to produce abstracts output only for pages in the
> main namespace that consist of text. For pages that are of type
> wikidata-item, json and so on, the <abstract> tag will contain the
> attribute 'not-applicable' set to the empty string. This impacts a very few
> pages on other wikis; for the full list and for more information on this
> change, see  https://phabricator.wikimedia.org/T178047
>
> We hope this change will be merged in a week or so; it won't take effect
> for wikidata until the next dumps run on April 20th, since the wikidata
> abstracts are already in progress.
>
> If you have any questions, don't hesitate to ask.
>
> Ariel
> ___
> Wikitech-l mailing list
> wikitec...@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Xmldatadumps-l] [Dumps] Dumps web service migration on 2018-04-04

2018-04-04 Thread Madhumitha Viswanathan
Reminder: This is happening in a few minutes.

On Sun, Apr 1, 2018 at 10:12 PM, Madhumitha Viswanathan <
mviswanat...@wikimedia.org> wrote:

> Hello dumps.wikimedia.org users,
>
> The servers that host the dumps.wikimedia.org site are being replaced
> with shiny new hardware! The web service migration is set to happen at
> 14:30 UTC on Wednesday, April 4 2018. If you are trying to connect to
> dumps.wikimedia.org around the migration window, you might experience a
> short downtime. The switchover should ideally only take a few minutes, and
> we'll keep you posted once it's all done, or if anything changes!
>
> As always, please feel free to reach out to us with any questions or
> concerns.
>
> Best,
>
> Madhumitha Viswanathan & Ariel Glenn
> Wikimedia Foundation
>



-- 
Madhumitha Viswanathan
Operations Engineer, Cloud Services