Re: [OSM-dev] Changeset files (was Removing Minutely and Hourly Changesets)

2009-12-21 Thread Jon Burgess
On Mon, 2009-12-21 at 01:08 -0500, Anthony wrote: Cool. If anyone familiar with the planet dumper tool is listening... In http://svn.openstreetmap.org/applications/utils/planet.osm/C/output_osm.c } else if ((*in = 0) (*in 32)) { escape_tmp[len] = '?'; len++;

Re: [OSM-dev] Changeset files (was Removing Minutely and Hourly Changesets)

2009-12-21 Thread Ævar Arnfjörð Bjarmason
On Mon, Dec 21, 2009 at 11:44, Jon Burgess jburgess...@googlemail.com wrote: On Mon, 2009-12-21 at 01:08 -0500, Anthony wrote: Cool.  If anyone familiar with the planet dumper tool is listening... In http://svn.openstreetmap.org/applications/utils/planet.osm/C/output_osm.c } else if ((*in =

Re: [OSM-dev] Changeset files (was Removing Minutely and Hourly Changesets)

2009-12-21 Thread Anthony
On Mon, Dec 21, 2009 at 6:44 AM, Jon Burgess jburgess...@googlemail.comwrote: On Mon, 2009-12-21 at 01:08 -0500, Anthony wrote: Cool. If anyone familiar with the planet dumper tool is listening... In http://svn.openstreetmap.org/applications/utils/planet.osm/C/output_osm.c } else

Re: [OSM-dev] Changeset files (was Removing Minutely and Hourly Changesets)

2009-12-21 Thread Anthony
On Mon, Dec 21, 2009 at 7:06 AM, Ævar Arnfjörð Bjarmason ava...@gmail.comwrote: Would anything break if these values were just cleaned up in the database so we don't have to deal with them in other tools in the future? I can't see it breaking unless the cleanup causes a unique key violation

Re: [OSM-dev] Changeset files (was Removing Minutely and Hourly Changesets)

2009-12-21 Thread Lars Francke
Ah yes.  Hmm.  That said, most of the characters actually in the database are carriage returns, which along with tabs and line feeds (also in the db) are valid in XML.  Other characters are present - for instance ASCII 3 in http://www.openstreetmap.org/browse/changeset/1325382 - those will be

Re: [OSM-dev] Changeset files (was Removing Minutely and Hourly Changesets)

2009-12-21 Thread Brett Henderson
On Tue, Dec 22, 2009 at 1:56 AM, Lars Francke lars.fran...@gmail.comwrote: Ah yes. Hmm. That said, most of the characters actually in the database are carriage returns, which along with tabs and line feeds (also in the db) are valid in XML. Other characters are present - for instance

[OSM-dev] Changeset files (was Removing Minutely and Hourly Changesets)

2009-12-20 Thread Anthony
Hi Brett, Do you also maintain the changesets files (e.g. changesets-091216.osm.bz2)? Now that the full history dump is out I can confirm that there are a relatively large number of corrupted key/values in that file. For example, see http://www.openstreetmap.org/browse/changeset/2498325 and

Re: [OSM-dev] Changeset files (was Removing Minutely and Hourly Changesets)

2009-12-20 Thread Brett Henderson
Hi Anthony, No, that's not me. I only look after the osmosis changesets. Hmm, the word changeset has become overloaded ... by osmosis changesets I mean the diff/delta files. The planet and changesets-xx.osm.bz2 files are created using the planet dumper tool. I believe Jon and Grant look after

Re: [OSM-dev] Changeset files (was Removing Minutely and Hourly Changesets)

2009-12-20 Thread Anthony
Cool. If anyone familiar with the planet dumper tool is listening... In http://svn.openstreetmap.org/applications/utils/planet.osm/C/output_osm.c } else if ((*in = 0) (*in 32)) { escape_tmp[len] = '?'; len++; should be something like } else if ((*in 0) (*in 32)) {