On Tue, Jul 21, 2009 at 3:38 PM, Frederik Ramm<[email protected]> wrote: > Hi, > > Andy Allan wrote: >> >> Tiger node tags make up 85.43% of all node tags and take up: > > [...] > > I just did a little test, prepared an .osc document that removed the node > tags from about 1000 nodes: > > http://www.openstreetmap.org/browse/changeset/1894387 > > It came out at roughly 10 node changes per second. I count 177m nodes with > TIGER tags, which means that the whole process would take about 200 days on > one API thread. It might be slightly faster if you upload larger or smaller > chunks - would have to do some experimenting to find the sweet spot. Time > could also be saved by running it one the LAN (on dev), but again probably > not a lot.
Ah, good stuff. I was assuming the best way to do it was a script on dev doing 50,000 nodes at a time (i.e. max diff upload) when it came to hitting the sweet-spot, but I'm interested in your experiments. Like you, I'm a bit concerned about the side effects of creating x billion changesets, but for replication, verification etc it's nice to behave nicely (i.e. through the API). But that just gives everyone an added incentive to stop other people doing ill-constructive bulk imports in the first place! The sooner we can clear this from TIGER the fewer people will use it as an example to aim for. Cheers, Andy _______________________________________________ dev mailing list [email protected] http://lists.openstreetmap.org/listinfo/dev

