Thanks for pointing me to https://phabricator.wikimedia.org/T149021! That
seems to answer the question I forgot to ask: whether the mediawiki_history
table includes the creation of deleted pages. It looks like it does, so I'll
reuse the query and findings from that task. Always great to find
shortcuts like that, thanks again!


Cheers,
Morten


On 14 August 2017 at 08:00, Nuria Ruiz <[email protected]> wrote:

> >Would there happen to be a dataset of that available somewhere?
>
> The data is available on the public Labs replicas, but the SQL is complicated
> to write and likely to time out due to the volume of data it has to comb
> through. The data is also available in the Hadoop Data Lake, which is not
> public yet (we plan to make it so). This data has already been used to
> gather such stats. See:
> https://phabricator.wikimedia.org/T149021
>
> On Sun, Aug 13, 2017 at 10:10 AM, Morten Wang <[email protected]> wrote:
>
>> Hello everyone,
>>
>> I'm currently working on gathering data for the Autoconfirmed article
>> creation trial project[1]. One of the measures we're interested in is the
>> number of new articles, both surviving and deleted, that are created per
>> day. I know that recent data is logged through EventBus, but if possible
>> I would also like to have historic stats on this (e.g. going back a
>> handful of years). Would there happen to be a dataset of that available
>> somewhere?
>>
>>
>> References:
>> 1: https://meta.wikimedia.org/wiki/Research:Autoconfirmed_article_creation_trial
>>
>> Cheers,
>> Morten
>>
>> _______________________________________________
>> Analytics mailing list
>> [email protected]
>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>
>>
>
