On Tue, Jul 21, 2009 at 3:38 PM, Frederik Ramm<[email protected]> wrote:
> Hi,
>
> Andy Allan wrote:
>>
>> Tiger node tags make up 85.43% of all node tags and take up:
>
> [...]
>
> I just did a little test, prepared an .osc document that removed the node
> tags from about 1000 nodes:
>
> http://www.openstreetmap.org/browse/changeset/1894387
>
> It came out at roughly 10 node changes per second. I count 177m nodes with
> TIGER tags, which means that the whole process would take about 200 days on
> one API thread. It might be slightly faster if you upload larger or smaller
> chunks - would have to do some experimenting to find the sweet spot. Time
> could also be saved by running it on the LAN (on dev), but again probably
> not a lot.

Ah, good stuff. I was assuming the best way to do it was a script on
dev doing 50,000 nodes at a time (i.e. max diff upload) when it came
to hitting the sweet spot, but I'm interested in your experiments.
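
For a rough sense of scale, here is a back-of-envelope sketch of the figures quoted above. It assumes the measured rate of about 10 node changes per second holds constant over the whole run, which it may well not, and the function names are only illustrative:

```python
SECONDS_PER_DAY = 86_400

def days_needed(total_nodes, nodes_per_second):
    """Days for one API thread to process total_nodes at a constant rate."""
    return total_nodes / nodes_per_second / SECONDS_PER_DAY

def uploads_needed(total_nodes, chunk_size):
    """Number of diff uploads if each upload carries at most chunk_size nodes."""
    # Integer ceiling division, so a partial final chunk still counts as one upload.
    return -(-total_nodes // chunk_size)

tiger_nodes = 177_000_000   # nodes with TIGER tags, as counted above
rate = 10                   # node changes per second, from the ~1000-node test
max_diff = 50_000           # nodes per diff upload, the maximum mentioned above

print(round(days_needed(tiger_nodes, rate)))   # 205
print(uploads_needed(tiger_nodes, max_diff))   # 3540
```

So at that rate it's the "about 200 days" figure, spread over roughly 3,500 maximum-size uploads; smaller chunks only multiply the upload count.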

Like you, I'm a bit concerned about the side effects of creating x
billion changesets, but for replication, verification etc it's nice to
behave nicely (i.e. through the API). But that just gives everyone an
added incentive to stop other people doing badly constructed bulk
imports in the first place! The sooner we can clear this from TIGER
the fewer people will use it as an example to aim for.

Cheers,
Andy

_______________________________________________
dev mailing list
[email protected]
http://lists.openstreetmap.org/listinfo/dev
