Thanks Christian. I do not believe that we need to backfill the TSVs that
are filled from the udp2log stream.

Oliver -- GLEE uses the geo-edit data.

-Toby

On Mon, Dec 1, 2014 at 4:57 AM, Oliver Keyes <[email protected]> wrote:

> Thanks, Christian! :)
>
> What do we use geowiki for, out of interest?
>
> On 1 December 2014 at 06:59, Christian Aistleitner <
> [email protected]> wrote:
>
>> Hi,
>>
>> On Sun, Nov 30, 2014 at 12:34:27AM -0800, Ori Livneh wrote:
>> > See message below about a network outage currently affecting multiple
>> > servers in eqiad.
>>
>> The network is up again, and the affected machines are again reachable.
>>
>> TL;DR:
>>
>> +------------------------------+-----------+--------------------------------+
>> | Dataset                      | Affected? | Will be backfilled?
>>   |
>>
>> +------------------------------+-----------+--------------------------------+
>> | Analytics slave databases    | no        | ---
>>   |
>> | Analytics cluster            | no        | ---
>>   |
>> | Pagecounts-all-sites         | no        | ---
>>   |
>> | Pagecounts-raw               | yes       | yes, not completed yet
>>    |
>> | TSVs                         | yes       | no
>>    |
>> | EventLogging database        | yes       | yes, done
>>   |
>> | EventLogging graphite graphs | yes       | no
>>    |
>> | geowiki                      | yes       | yes, done
>>   |
>> | Wikipedia Zero graphs        | yes       | Excluded 2014-11-30 from
>> plots |
>>
>> +------------------------------+-----------+--------------------------------+
>>
>> I'll track updates on
>>
>>   https://phabricator.wikimedia.org/T76334
>>
>> Best regards,
>> Christian
>>
>>
>>
>> * Pagecounts-raw
>>
>>   pagecounts-20141130-040000.gz
>>   pagecounts-20141130-050000.gz
>>   pagecounts-20141130-060000.gz
>>   pagecounts-20141130-070000.gz
>>   pagecounts-20141130-080000.gz
>>   pagecounts-20141130-090000.gz
>>   pagecounts-20141130-100000.gz
>>   pagecounts-20141130-110000.gz
>>   projectcounts-20141130-040000
>>   projectcounts-20141130-050000
>>   projectcounts-20141130-060000
>>   projectcounts-20141130-070000
>>   projectcounts-20141130-080000
>>   projectcounts-20141130-090000
>>   projectcounts-20141130-100000
>>   projectcounts-20141130-110000
>>
>> are bad.
>>
>> We'll backfill them from pagecounts-all-sites.
>>
>> If you're still using pagecounts-raw, please consider switching to
>> pagecounts-all-sites:
>>
>>   https://wikitech.wikimedia.org/wiki/Analytics/Pagecounts-all-sites
>>
>>
>>
>> * TSVs:
>>
>> All udp2log streams are affected. Calling out only the most prominent
>> ones:
>> ** sampled-1000 TSVs
>> ** mobile-sampled-100 TSVs
>> ** zero TSVs
>> ** edits TSVs
>>
>> They are all missing data between 2014-11-30T03:50 and
>> 2014-11-30T10:13.
>>
>> Properly backfilling them from the cluster would be possible, but this
>> would need serious data massaging. If not one says the data for
>> 2014-11-30 is badly needed, I would not backfill the TSVs.
>>
>>
>>
>> * EventLogging:
>>
>> ** Database is up and running and backfilled.
>> No artifacts are expected.
>>
>> ** EventLogging stats on graphite
>> Note that this item is only about the graphs in graphite. The data in
>> the database (see above) is ok!)
>>
>> The overall counts are ok and should not show artifacts.
>> The per schema counts are basically blank for 2014-11-30.  Backfilling
>> them would be really time consuming, and the historic parts of those
>> graphs do not seem to be used anyways. So I suggest to not backfill
>> here.
>>
>>
>>
>> * geowiki:
>>
>> Data for the affected period has been backfilled.
>>
>>
>>
>> * Wikipedia Zero graphs:
>>
>> 2014-11-30 has been added to the list of dates that will not show up
>> in the Wikipedia Zero plots.
>>
>>
>>
>>
>> --
>> ---- quelltextlich e.U. ---- \\ ---- Christian Aistleitner ----
>>                            Companies' registry: 360296y in Linz
>> Christian Aistleitner
>> Kefermarkterstrasze 6a/3     Email:  [email protected]
>> 4293 Gutau, Austria          Phone:          +43 7946 / 20 5 81
>>                              Fax:            +43 7946 / 20 5 81
>>                              Homepage: http://quelltextlich.at/
>> ---------------------------------------------------------------
>>
>> _______________________________________________
>> Analytics mailing list
>> [email protected]
>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>
>>
>
>
> --
> Oliver Keyes
> Research Analyst
> Wikimedia Foundation
>
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>
>
_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

Reply via email to