Hi all,

In order to setup a process to build and maintain the DDR, the first 
requirement is to identify a way to allow the uploading of http logs from 
contributors, in order to analyze the user agents. We should not receive just 
user agent lists, but full http logs, for these reasons:
- we need the timestamp for each user agent, in order to identify the frequency 
of each user agent (we will not be able to process each user agent, but 
hopefully enough to cover 99% of the http requests;
- we need the source IP address, in order to have a geographical MAP of 
distribution of the Devices, for two reasons: (1) analyzing IP addresses (using 
a geographical DB, like GeoIP) we would have a map of uncovered regions, so we 
would be able to improve the global coverage chasing contributions from 
specific regions. (2) the same device can have different user agents depending 
on the region where is has been commercialized and using IPs we can improve 
analysis. 

At the same time, uploading such information without a clear policy about how 
that data is handled could imply privacy issues, so we must keep the upload 
area private and guarantee that the information is used consistently with the 
objectives of DeviceMAP project, and not for other purposes. 

Do you agree on this?

Anyone from the Apache infrastructure team can help to identify what is the 
technical solution to satisfy these requirements?  

Cheers,
Stefano.

Reply via email to