Gotcha. So, for transparency...what are we calculating? Country? City? :D

On 23 February 2015 at 13:59, Joseph Allemandou
<[email protected]> wrote:
> As per the IRC discussion, we won't recompute historical data, but start
> computing new values from the deploy time onward.
> A new "version" field, and associated documentation will also be provided,
> allowing to follow changes along time.
> Thanks for your inputs !
> Best
>
>
> On Mon, Feb 23, 2015 at 4:58 PM, Oliver Keyes <[email protected]> wrote:
>>
>> I think it should be fine-ish; it depends what we're calculating. When
>> you say "geocoded information", what do you mean? Country? City? I
>> wouldn't expect country to move about a lot in 60 days (which is the
>> range of our data): I would expect city to.
>>
>> What's the status on getting an oozie job or similar to compute going
>> forward? To me that's more of a priority than historical data.
>>
>> On 23 February 2015 at 10:53, Joseph Allemandou
>> <[email protected]> wrote:
>> > Hi,
>> >
>> > As part of my first assignment, I'll recompute our historical webrequest
>> > dataset, adding client_ip and geocoded information.
>> >
>> > While it seems correct to compute historical client_ip based on the
>> > existing
>> > ip and the x_forwarded_for, the use of the current state of the geocoded
>> > maxmind library to compute historical data is more error-prone.
>> >
>> > I can either compute it anyway, knowing that there'll be some errors, or
>> > put
>> > null values for data older than a given point in time.
>> >
>> > I'll launch the script to recompute the data as soon as max(a consensus
>> > is
>> > find on this matter, operations gives me the right to run the script) :)
>> >
>> > Thanks
>> > --
>> > Joseph Allemandou
>> > Data Engineer @ Wikimedia Foundation
>> > IRC: joal
>> >
>> > _______________________________________________
>> > Analytics mailing list
>> > [email protected]
>> > https://lists.wikimedia.org/mailman/listinfo/analytics
>> >
>>
>>
>>
>> --
>> Oliver Keyes
>> Research Analyst
>> Wikimedia Foundation
>>
>> _______________________________________________
>> Analytics mailing list
>> [email protected]
>> https://lists.wikimedia.org/mailman/listinfo/analytics
>
>
>
>
> --
> Joseph Allemandou
> Data Engineer @ Wikimedia Foundation
> IRC: joal
>
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>



-- 
Oliver Keyes
Research Analyst
Wikimedia Foundation

_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

Reply via email to