If I remember correctly, Chris had the maxmind db on github with a script
that updates it and commits the changes, making it possible to "play back time"
and get the state of the db as it was when that data was calculated.
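
A rough sketch of what that "play back" lookup could look like, assuming the
db sits in a git checkout with one commit per update. I have not checked how
the repo is actually laid out, so the function names and the repo path here
are hypothetical. Dates older than the first commit come back as None, which
lines up with the "put null values" option:

```python
import bisect
import subprocess
from datetime import datetime


def commit_as_of(repo_dir, when):
    """Ask git for the last commit made at or before `when`.

    `repo_dir` is a hypothetical local checkout of the db repo.
    Returns the commit hash, or None if no commit is that old.
    """
    stamp = when.strftime("%Y-%m-%d %H:%M:%S")
    result = subprocess.run(
        ["git", "-C", repo_dir, "rev-list", "-1", "--before=" + stamp, "HEAD"],
        check=True, capture_output=True, text=True,
    )
    return result.stdout.strip() or None


def snapshot_for(snapshots, when):
    """Same lookup without git: `snapshots` is a list of (date, commit)
    pairs sorted by date; return the newest commit not later than `when`,
    or None if `when` predates the first snapshot."""
    dates = [date for date, _ in snapshots]
    index = bisect.bisect_right(dates, when)
    return snapshots[index - 1][1] if index else None
```

With a commit hash in hand, `git show <hash>:<path to db file>` would then
give the db file as it was on that date.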

I think Dan has that script & cron running in his homedir. If we could
productionize this, or at least document it on wikitech, that would be great.

Thanks,

Nuria



On Mon, Feb 23, 2015 at 7:53 AM, Joseph Allemandou <
[email protected]> wrote:

> Hi,
>
> As part of my first assignment, I'll recompute our historical webrequest
> dataset, adding client_ip and geocoded information.
>
> While it seems correct to compute historical client_ip based on the
> existing ip and the x_forwarded_for, the use of the current state of the
> geocoded maxmind library to compute historical data is more error-prone.
>
> I can either compute it anyway, knowing that there'll be some errors, or
> put null values for data older than a given point in time.
>
> I'll launch the script to recompute the data as soon as max(a consensus is
> found on this matter, operations gives me the right to run the script) :)
>
> Thanks
> --
> *Joseph Allemandou*
> Data Engineer @ Wikimedia Foundation
> IRC: joal
>
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>
>
