+1. Could we look at the pageIDs rather than titles? Is that being passed through uniformly yet?
On 23 January 2016 at 13:08, Toby Negrin <[email protected]> wrote: > Thanks Dan -- I'm just concerned that we might be missing something (like > the central notice banners back in the day) with a fairly large magnitude. > > -Toby > > On Sat, Jan 23, 2016 at 3:36 AM, Dan Andreescu <[email protected]> > wrote: >> >> Yes and no, it kind of depends whether we want to lose data. We've been >> talking about better ways to say "Unknown" but /wiki/Unknown is a page too >> :) We're just not focusing on this level of detail yet, bigger fish to fry, >> caveat emptor, etc. >> >> >> On Saturday, January 23, 2016, Toby Negrin <[email protected]> wrote: >>> >>> Is that a bug in the ETL? >>> >>> On Friday, January 22, 2016, Oliver Keyes <[email protected]> wrote: >>>> >>>> Actually - is Hadoop's "nothing was provided in this field!" making it >>>> doubly confusing :/ >>>> >>>> On 22 January 2016 at 22:06, Dan Garry <[email protected]> wrote: >>>> > On 22 January 2016 at 15:17, Ryan Kaldari <[email protected]> >>>> > wrote: >>>> >> >>>> >> Any idea why the most popular article in India is "-"? >>>> > >>>> > >>>> > That specific article often sees a lot of traffic. This is normally >>>> > caused >>>> > by a bot, spider, or other automaton. Unfortunately, by definition no >>>> > method >>>> > of detecting automated traffic is perfect, so things like this often >>>> > slip >>>> > through. >>>> > >>>> > Dan >>>> > >>>> > -- >>>> > Dan Garry >>>> > Lead Product Manager, Discovery >>>> > Wikimedia Foundation >>>> > >>>> > _______________________________________________ >>>> > Analytics mailing list >>>> > [email protected] >>>> > https://lists.wikimedia.org/mailman/listinfo/analytics >>>> > >>>> >>>> >>>> >>>> -- >>>> Oliver Keyes >>>> Count Logula >>>> Wikimedia Foundation >>>> >>>> _______________________________________________ >>>> Analytics mailing list >>>> [email protected] >>>> https://lists.wikimedia.org/mailman/listinfo/analytics >> >> >> _______________________________________________ >> Analytics mailing list >> [email protected] >> https://lists.wikimedia.org/mailman/listinfo/analytics >> > > > _______________________________________________ > Analytics mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/analytics > -- Oliver Keyes Count Logula Wikimedia Foundation _______________________________________________ Analytics mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/analytics
