Page IDs are still not coming in uniformly, but in this case there should be enough of them to figure out what "-" is, maybe. That's a good idea.

On Saturday, January 23, 2016, Oliver Keyes <[email protected]> wrote:

> +1. Could we look at the pageIDs rather than titles? Is that being
> passed through uniformly yet?
>
> On 23 January 2016 at 13:08, Toby Negrin <[email protected]> wrote:
> > Thanks Dan -- I'm just concerned that we might be missing something (like
> > the central notice banners back in the day) with a fairly large magnitude.
> >
> > -Toby
> >
> > On Sat, Jan 23, 2016 at 3:36 AM, Dan Andreescu <[email protected]> wrote:
> >>
> >> Yes and no, it kind of depends whether we want to lose data. We've been
> >> talking about better ways to say "Unknown" but /wiki/Unknown is a page too
> >> :) We're just not focusing on this level of detail yet, bigger fish to fry,
> >> caveat emptor, etc.
> >>
> >> On Saturday, January 23, 2016, Toby Negrin <[email protected]> wrote:
> >>>
> >>> Is that a bug in the ETL?
> >>>
> >>> On Friday, January 22, 2016, Oliver Keyes <[email protected]> wrote:
> >>>>
> >>>> Actually - is Hadoop's "nothing was provided in this field!" making it
> >>>> doubly confusing :/
> >>>>
> >>>> On 22 January 2016 at 22:06, Dan Garry <[email protected]> wrote:
> >>>> > On 22 January 2016 at 15:17, Ryan Kaldari <[email protected]> wrote:
> >>>> >>
> >>>> >> Any idea why the most popular article in India is "-"?
> >>>> >
> >>>> > That specific article often sees a lot of traffic. This is normally caused
> >>>> > by a bot, spider, or other automaton. Unfortunately, by definition no method
> >>>> > of detecting automated traffic is perfect, so things like this often slip
> >>>> > through.
> >>>> >
> >>>> > Dan
> >>>> >
> >>>> > --
> >>>> > Dan Garry
> >>>> > Lead Product Manager, Discovery
> >>>> > Wikimedia Foundation
> >>>>
> >>>> --
> >>>> Oliver Keyes
> >>>> Count Logula
> >>>> Wikimedia Foundation
>
> --
> Oliver Keyes
> Count Logula
> Wikimedia Foundation

_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics
