Oliver, 

You might want to check our documentation on Wikitech regarding identity 
reconstruction. I think it covers your point #1.


https://wikitech.wikimedia.org/wiki/Analytics/Data/Preventing_identity_reconstruction

Nuria



> On Jan 2, 2016, at 10:00 AM, Oliver Keyes <[email protected]> wrote:
> 
> Hey y'all
> 
> I'm working on a piece of research (largely recreational) on the old
> problem of fingerprinting users with minimal information - namely the
> combination of a user agent and an IP address. Basically I'm looking
> to put together a piece of work showing:
> 
> 1. How sub-standard it is;
> 2. How fast it decays;
> 3. How the sub-standardness varies by (platform|location)
> 
> This would be pretty doable with internal data; basically I'd need a
> schema with IP, user agent and a per-user UUID that's got a decent
> (>=24 hours) expiry time. My question: does anyone know of a table
> with recent data that meets these requirements? And, if not, anyone
> with EventLogging experience interested in working on the problem with
> me?
> 
> -- 
> Oliver Keyes
> Count Logula
> Wikimedia Foundation
> 
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics