On 9/2/14 6:31 AM, Hanno Schlichting wrote:
As mentioned last week both OpenCellID and MLS have gone forward and created a new data format to publish cell network data.
This is very exciting news! :) I look forward to seeing what people do with this data.
You can find our full list of cell networks and hourly differential updates at https://location.services.mozilla.com/downloads. You can find the OpenCellID data in the same format under http://opencellid.org/downloads/. These changes went live yesterday / today for the two projects.
Why hourly diffs? Daily updates seem adequate. Are the hourly updates diffs from the most recent full update or from the previous hourly update?
Why are the updates merely gzipped? xz compression has much better results:

MLS-full.csv      135M
MLS-full.csv.gz    35M
MLS-full.csv.bz2   32M
MLS-full.csv.xz    26M
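For anyone who wants to repeat this kind of comparison, here is a minimal sketch using only Python's standard library. The `compressed_sizes` helper and the synthetic CSV rows are assumptions made for illustration. They are not the real MLS export, so the ratios will differ from the figures above.

```python
import bz2
import gzip
import lzma


def compressed_sizes(data):
    """Return the size in bytes of data, raw and under each stdlib codec."""
    return {
        "raw": len(data),
        "gz": len(gzip.compress(data)),
        "bz2": len(bz2.compress(data)),
        "xz": len(lzma.compress(data, format=lzma.FORMAT_XZ)),
    }


# Synthetic, made-up rows that only loosely resemble a cell export.
sample = "".join(
    f"GSM,262,2,{i % 500},{i},13.{i % 1000:03d},52.{i % 977:03d}\n"
    for i in range(20000)
).encode("ascii")

sizes = compressed_sizes(sample)
for name, size in sizes.items():
    # Print each size and its share of the uncompressed size.
    print(f"{name:>4}: {size:>8} bytes ({size / sizes['raw']:.1%} of raw)")
```

To try it on a real download, read the file with `open(path, "rb").read()` and pass the bytes to `compressed_sizes`.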
Both of our projects will work on including the other's data set in the coming weeks. OpenCellID is also planning to do a major data cleanup in the remainder of the year to much more strictly validate their community-contributed data. On the MLS side we also have some work to do to better distinguish GSM and UMTS networks. A quick comparison of the two data sets has shown a fairly low overlap. OpenCellID has collected 6.2 million networks to date and MLS has collected about 1.8 million. Only 0.6 million of those networks are shared, so we end up with a combined data set of 7.4 million cells. Various commercial vendors claim to have a data set of about 46 million networks, so we still have a ways to go, but we are stronger together.
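The combined figure follows from counting the shared networks only once. A small sketch of that arithmetic, using the rounded numbers quoted above expressed in units of 100,000 so the sums stay exact (the `combined_count` helper is a hypothetical name used only for illustration):

```python
def combined_count(a, b, shared):
    """Size of the union of two sets, given each size and their overlap."""
    return a + b - shared


# Rounded figures from the message, in units of 100,000 networks.
opencellid = 62  # 6.2 million
mls = 18         # 1.8 million
shared = 6       # 0.6 million

combined = combined_count(opencellid, mls, shared)
print(f"combined: {combined / 10:.1f} million")
print(f"share of MLS networks also in OpenCellID: {shared / mls:.0%}")
print(f"share of OpenCellID networks also in MLS: {shared / opencellid:.0%}")
```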
I'm surprised there is so little overlap, but that's good news for both projects. Are most of the OpenCellID networks in Europe? I look forward to seeing MLS's updated coverage map. :)
chris
_______________________________________________
dev-geolocation mailing list
https://lists.mozilla.org/listinfo/dev-geolocation
