This is awesome. Roughly, by eye, it looks like automata are about 2% of
ZRR overall and 5% of ZRR for fulltext search, which was around 15% before
the holidays (and lower over the holidays—during The Time of Unreliable
User Behavior).

Is there a write-up for this project? I know it had to be a ton of work,
and I'm curious about the details (possibly more so than most).

Do you think you got most of them? Or was the result high-precision but not
exhaustive?

Thanks for working on this!

—Trey

Trey Jones
Software Engineer, Discovery
Wikimedia Foundation

On Mon, Jan 4, 2016 at 1:29 PM, Oliver Keyes <[email protected]> wrote:

> Hey all,
>
> After several weeks of work to switch all the scripts over and
> backfill, all the Discovery dashboards now have the ability to filter
> crawlers and automated software out from graphs where that is
> relevant. You should notice a simple checkbox on, for example, the
> Zero Results Rate data or Wikidata Query Service traffic.
>
> While a bit of backfilling is still waiting on the servers syncing up,
> this work is essentially complete, and provides another way to look at
> data on how people are using search (and who those people are). It was
> a heck of a lot of work, by both myself and Mikhail, but it's
> hopefully valuable :).
>
> For Discovery Analytics,
>
> --
> Oliver Keyes
> Count Logula
> Wikimedia Foundation
>
> _______________________________________________
> discovery mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/discovery
>