On 9/2/14 6:31 AM, Hanno Schlichting wrote:
As mentioned last week both OpenCellID and MLS have gone forward and created a 
new data format to publish cell network data.

This is very exciting news! :) I look forward to seeing what people do with this data.


You can find our full list of cell networks and hourly differential updates at 
https://location.services.mozilla.com/downloads. You can find the OpenCellID 
data in the same format under http://opencellid.org/downloads/. These changes 
went live yesterday / today for the two projects.

Why hourly diffs? Daily updates seems adequate. Are the hourly updates diffs from the most recent full update or the previous hourly update?

Why are the updates merely gzipped? xz compression has much better results:

MLS-full.csv     135M
MLS-full.csv.gz   35M
MLS-full.csv.bz2  32M
MLS-full.csv.xz   26M


Both of our projects will work on including the others data set in the next 
weeks. OpenCellID is also planning to do a major data cleanup in the remainder 
of the year to much more strictly validate their community contributed data. On 
the MLS side we also have some work to do to better distinguish GSM and UMTS 
networks.

A quick comparison of the two data sets has shown a fairly low overlap. 
OpenCellID has collected 6.2 million networks to date and MLS has collected 
about 1.8 million. Only 0.6 million of those networks are shared, so we end up 
with a combined data set of 7.4 million cells. Various commercial vendors claim 
to have a data set of about 46 million networks, so we still have a ways to go, 
but we are stronger together.

I'm surprised there is so little overlap, but that's good news for both projects. Are most of the OpenCellID networks in Europe? I look forward to seeing MLS' update coverage map. :)


chris
_______________________________________________
dev-geolocation mailing list
[email protected]
https://lists.mozilla.org/listinfo/dev-geolocation

Reply via email to