Re: [Talk-us] We have less than a week left to remap!

2012-03-22 Thread James Mast

The 27th? Damn.  I thought it was going to go read-only on the 1st...  At least 
that's what I've been telling people who I've been contacting to see if they 
will accept the new CT (which is still being somewhat successful geting at 
least 2-3 people a day to accept). Ugh. - James
  Date: Wed, 21 Mar 2012 21:59:07 -0500
 From: toby.mur...@gmail.com
 To: talk-us@openstreetmap.org
 Subject: [Talk-us] We have less than a week left to remap!
 
 I don't think this has been reposted to talk-us yet. According to the
 latest license change rebuild plan, the database will enter read-only
 mode on March 27th for the license rebuild:
 http://www.osmfoundation.org/wiki/License/Rebuild_Plan
 
 So that gives us a matter of days to do any more remapping. It seems
 like the US hasn't seen a concerted effort to remap dirty data like
 some other countries have. So here is my attempt to get that started.
 I'm not sure if some people still don't know about the license change
 of if people think someone else will take care of the important
 things... But *everything* is on the chopping block!
 
 Some highlights:
 
 1) State/county borders. I've actually cleaned up a good chunk of
 these and hope to finish in the next few days. I may have to reimport
 some of the remaining ways. Anyone have thoughts on the best source
 for this information?
 
 2) The interstate system. I've taken care of about 4 or 5 states here
 in the midwest but there is a large swath from Illinois down to South
 Carolina that will become unroutable on April 1st. This will set OSM
 back immensely in the eyes of anyone actually trying to use our data.
 
 3) Specific cities: Los Angeles, Ausin, Seattle. Most major cities
 have some unclean blobs in them but these are particularly nasty. User
 blars has not responded to repeated requests for contact and I have
 reason to believe he is intentionally ignoring anything OSM related.
 His contributions are beyond huge in the LA area and really along a
 lot of the west coast. Seattle is mostly affected by user Sunny who
 has proven to be unreachable despite my best cyberstalking attempts.
 Austin, TX is also largely affected by one prolific user but it has
 actually gotten some attention so it isn't entirely doomed.
 
 4) Shorelines. These were imported from a public domain source by an
 anonymous user. I'm tempted to largely mark this with odbl=clean and
 call it a day. Thoughts?
 
 
 In light of this, I would strongly encourage everyone reading this
 list to suspend your current mapping efforts and concentrate entirely
 on license remapping for the next few days. It's not the most fun work
 but it needs to get done.
 
 Tools to use:
 
 JOSM license change plugin. Select everything in an area and hit the
 License Check button. Start deleting red things and replacing them
 from clean sources:
 http://wiki.openstreetmap.org/wiki/JOSM/Plugins/LicenseChange
 
 Potlatch2 I think now has the license change layer enabled by default.
 If not, go into the options and make sure Show license status is
 checked. Again... remove anything with a red halo and replace.
 
 Badmap. Dirty interstates are clearly visible:
 http://cleanmap.poole.ch/?zoom=5lat=41.12823lon=-99.48465layers=00B0
 
 OSM Inspector. Good for drilling down and finding individual objects to remap:
 http://tools.geofabrik.de/osmi/?view=wtfelon=-100.24219lat=40.17384zoom=4
 
 TIGER 2011 road name tiles. Used as a background imagery layer while
 editing. Good to pull street names while remapping residential areas.
 Just be sure to expand the abbreviations while you're at it.
 http://wiki.openstreetmap.org/wiki/TIGER_2011
 
 I have also written two blog posts on my own remapping efforts. The
 first focuses on interstates where there is existing clean data to
 work with. The biggest trick here is making sure you don't break the
 route relations. But honestly I wouldn't worry about it too much. As
 long as the ways are intact, routing will continue to work. We can
 come back later and clean up the route relations after the license
 change.
 http://ksmapper.blogspot.com/2012/01/license-change-mapping.html
 
 The second post is quite a bit more technical and I don't expect many
 others to follow it really. It was useful in that area of LA but it
 involves some traditional GIS tools, data analysis and a little python
 coding. For this type of work, most people would be better off using
 the TIGER 2011 road name tiles listed above.
 http://ksmapper.blogspot.com/2012/03/remapping-using-tiger-2011.html
 
 Honestly, it is kind of late to be sending out this email... I
 probably should have done it a couple of weeks ago but hopefully we
 can still make a meaningful dent in the dirty data. At least this
 whole process is almost over...
 
 Toby
 
 ___
 Talk-us mailing list
 Talk-us@openstreetmap.org
 http://lists.openstreetmap.org/listinfo/talk-us
  

Re: [Talk-us] We have less than a week left to remap!

2012-03-22 Thread Toby Murray
On Thu, Mar 22, 2012 at 1:00 AM, James Mast rickmastfa...@hotmail.com wrote:
 The 27th? Damn.  I thought it was going to go read-only on the 1st...  At
 least that's what I've been telling people who I've been contacting to see
 if they will accept the new CT (which is still being somewhat successful
 geting at least 2-3 people a day to accept). Ugh.

Yeah, that was my first reaction too. And several others have
expressed concerns about the tight schedule for a couple of different
reasons. So there is still a chance that it will be pushed back a
little. But either way... not much time left.

Toby

___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] We have less than a week left to remap!

2012-03-22 Thread stevea

I don't think this has been reposted to talk-us yet. According to the
latest license change rebuild plan, the database will enter read-only
mode on March 27th for the license rebuild:
http://www.osmfoundation.org/wiki/License/Rebuild_Plan


It has been stated that it is a GOAL to pull the plug on April 1st. 
The Sword of Damocles wielded by the Powers That Be in OSM are under 
no hard-and-fast obligation to do so, they are just trying to reach a 
stated goal.


If (as it is looking more and more to be true) that significant areas 
will disappear from the map, making routing and other important uses 
of these data seriously impaired, it is incumbent upon the Powers 
That Be to declare failure at reaching the stated goals.  Yes, this 
is disappointing, but it is true.  The only meaningful conclusion 
that can be reached is to give the OSM community more time and push 
back the April 1st (or March 27th) deadline.  (One month?  Three 
months?)


Is this an eleventh hour request to do so?  Well, yes!  But every 
word of the above is true.


The OSM community simply must have more time to complete these tasks. 
It is as simple as that.  Don't cripple an extremely useful and 
becoming-more-mainstream and rather popular tool with arbitrary and 
capricious dates on a calendar.  Plan and assess, yes, that has 
already happened, but now is the time to RE-assess, and reach the 
solemn conclusion that flipping the switch on significant data in 
barely a week is ill-advised at best, and will be a crippling body 
blow to the project at worst.  Perhaps one from which the project 
will not fully recover.


Nobody asked me to say it, but I'm asking for three more months.  We 
can complete needed tasks by then.


SteveA
California

___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] [Imports] Uploads to City of Salisbury, MD

2012-03-22 Thread Josh Doe
Importing data correctly is hard, especially the first time. I'm glad
you've come to the list. I'd be curious to know if you didn't notice
the warnings on the wiki to contact the community first; I've gone
ahead and added the warning banner to the ogr2osm, shp2osm,
shp-to-osm, and shape2osm pages in case people find those before the
other import pages.

On Wed, Mar 21, 2012 at 10:50 PM, Paul Norman penor...@mac.com wrote:
 If there are duplicated ways and nodes, perhaps reverting is the best
 option?

Unfortunately I'd agree that reverting these changesets will be the
easiest and best course of action. Trust me that you'll spend a long
time cleaning up dupe nodes and ways. Some of the buildings have up to
four dupe nodes and 3 dupe ways, all from different changesets.

I've done some reversion before with the JOSM plugin, so if you'd like
I can help do this. I'd like to help you do another successful import,
so please consider sharing the original GIS files and your ogr2osm
translation files.

I very much appreciate your interest in OSM and your effort, so please
do not let this dissuade you from continuing!

-Josh

___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] [Imports] Uploads to City of Salisbury, MD

2012-03-22 Thread Josh Doe
On Thu, Mar 22, 2012 at 6:40 AM, Josh Doe j...@joshdoe.com wrote:
 On Wed, Mar 21, 2012 at 10:50 PM, Paul Norman penor...@mac.com wrote:
 If there are duplicated ways and nodes, perhaps reverting is the best
 option?

 Unfortunately I'd agree that reverting these changesets will be the
 easiest and best course of action. Trust me that you'll spend a long
 time cleaning up dupe nodes and ways. Some of the buildings have up to
 four dupe nodes and 3 dupe ways, all from different changesets.

This is what I can tell from the first few building changesets (not an
extensive investigation):
10882159: 493 dangling nodes (REVERT)
10884039: 3517 dangling nodes (REVERT)
10885267: 1 building node (OK)
10891857: 5 dangling nodes (REVERT)
10891870: 8000 dangling nodes (REVERT)
10893364: building nodes and ways (OK?)

-Josh

___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] [Imports] Uploads to City of Salisbury, MD

2012-03-22 Thread Marc Zoss
I briefly downloaded all sby:bldgtype-tagged ways and relation of Maryland 
through the overpass-api. Then removed the ones having only a sby:bldgtype tag, 
run the validator and deleted the duplicated nodes and ways. 
This would result in a changeset to remove the roughly 71'000 duplicates nodes 
and ways. 

If the area was edited since the import and reverting gets tricky, this might 
be the option to go, at least the result looks ok at the first glance.

Please also note that the conversion step seems to add a building=yes tag on on 
inner ring of building polygons () which is certainly bad tagging, despite the 
correct rendering (52 occurrences, so could be fixed manually).

M 


On 22.03.2012, at 12:02, Josh Doe wrote:

 On Thu, Mar 22, 2012 at 6:40 AM, Josh Doe j...@joshdoe.com wrote:
 On Wed, Mar 21, 2012 at 10:50 PM, Paul Norman penor...@mac.com wrote:
 If there are duplicated ways and nodes, perhaps reverting is the best
 option?
 
 Unfortunately I'd agree that reverting these changesets will be the
 easiest and best course of action. Trust me that you'll spend a long
 time cleaning up dupe nodes and ways. Some of the buildings have up to
 four dupe nodes and 3 dupe ways, all from different changesets.
 
 This is what I can tell from the first few building changesets (not an
 extensive investigation):
 10882159: 493 dangling nodes (REVERT)
 10884039: 3517 dangling nodes (REVERT)
 10885267: 1 building node (OK)
 10891857: 5 dangling nodes (REVERT)
 10891870: 8000 dangling nodes (REVERT)
 10893364: building nodes and ways (OK?)
 
 -Josh
 
 ___
 Imports mailing list
 impo...@openstreetmap.org
 http://lists.openstreetmap.org/listinfo/imports


___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] [Imports] Uploads to City of Salisbury, MD

2012-03-22 Thread Josh Doe
On Thu, Mar 22, 2012 at 8:04 AM, Marc Zoss marcz...@gmail.com wrote:
 I briefly downloaded all sby:bldgtype-tagged ways and relation of Maryland 
 through the overpass-api. Then removed the ones having only a sby:bldgtype 
 tag, run the validator and deleted the duplicated nodes and ways.
 This would result in a changeset to remove the roughly 71'000 duplicates 
 nodes and ways.

 If the area was edited since the import and reverting gets tricky, this might 
 be the option to go, at least the result looks ok at the first glance.

 Please also note that the conversion step seems to add a building=yes tag on 
 on inner ring of building polygons () which is certainly bad tagging, despite 
 the correct rendering (52 occurrences, so could be fixed manually).

Thanks for doing that, as that was the next step I was going to try. I
posted some regarding the changesets here:
http://wiki.openstreetmap.org/wiki/User_talk:Nick_SPW#Salisbury.2C_Maryland_import

I think perhaps we should revert a subset of the changesets, such as
the dangling nodes, and then use your method to handle the rest.

-Josh

___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] [Imports] Uploads to City of Salisbury, MD

2012-03-22 Thread Nick Chamberlain
Josh and Marc,

Thank you!  I apologize that I'm unable to speak the OSM language as
well as everyone, I'm working on it :)  I posted on the Salisbury,
Maryland Import page that Josh created to give more detail about my
uploads.

I didn't really think that I created so many duplicates, because I did a
lot of things in JOSM before I actually chose to upload.  One thing I
know for sure is that I didn't I upload until I was actually able to - I
was getting a proxy error and the uploads were timing out when I
attempted to upload the entire batch.  I assumed that these attempts
were unsuccessful, which I might be wrong about and might have resulted
in duplication.

I assumed that my successful attempts started, maybe @ 10901673, when I
realized I needed to break the original shapefile up tabularly into
percentiles and upload 10 segments of the building footprint dataset,
one after the other.  These were all definitely successful, and were
only done once per percentile.

Josh, where are you finding the list of changesets in the format you
posted?  I can only figure out how to list them in my editor profile
with my comments.

If you believe that the method you mention that removes the 71,000 nodes
is the best approach, please feel free to do so.  I will also gladly
manually fix the inner ring tagging issue as the data gets fixed.
Please let me know what I can do to help.  I am also willing to share
the .osm files and/or shapefiles if that will help.  Thanks.

- Nick

-Original Message-
From: joshthephysic...@gmail.com [mailto:joshthephysic...@gmail.com] On
Behalf Of Josh Doe
Sent: Thursday, March 22, 2012 8:51 AM
To: Marc Zoss
Cc: impo...@openstreetmap.org; talk-us@openstreetmap.org; Nick
Chamberlain
Subject: Re: [Imports] [Talk-us] Uploads to City of Salisbury, MD

On Thu, Mar 22, 2012 at 8:04 AM, Marc Zoss marcz...@gmail.com wrote:
 I briefly downloaded all sby:bldgtype-tagged ways and relation of
Maryland through the overpass-api. Then removed the ones having only a
sby:bldgtype tag, run the validator and deleted the duplicated nodes and
ways.
 This would result in a changeset to remove the roughly 71'000
duplicates nodes and ways.

 If the area was edited since the import and reverting gets tricky,
this might be the option to go, at least the result looks ok at the
first glance.

 Please also note that the conversion step seems to add a building=yes
tag on on inner ring of building polygons () which is certainly bad
tagging, despite the correct rendering (52 occurrences, so could be
fixed manually).

Thanks for doing that, as that was the next step I was going to try. I
posted some regarding the changesets here:
http://wiki.openstreetmap.org/wiki/User_talk:Nick_SPW#Salisbury.2C_Maryl
and_import

I think perhaps we should revert a subset of the changesets, such as the
dangling nodes, and then use your method to handle the rest.

-Josh

___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] [Imports] Uploads to City of Salisbury, MD

2012-03-22 Thread Marc Zoss
Nick and Josh

thanks for the clarification on your upload strategy. With previous large 
uploads I have experience the same behaviour resulting in massive dupes. So I 
guess it is not a conversion issue.

If you want me to commit the remove duplicates changeset, I can do so. But you 
will have to go through the data subsequently and check if the issues are 
resolved and no new ones emerged.

M

On 22.03.2012, at 14:12, Nick Chamberlain wrote:

 Josh and Marc,
 
 Thank you!  I apologize that I'm unable to speak the OSM language as
 well as everyone, I'm working on it :)  I posted on the Salisbury,
 Maryland Import page that Josh created to give more detail about my
 uploads.
 
 I didn't really think that I created so many duplicates, because I did a
 lot of things in JOSM before I actually chose to upload.  One thing I
 know for sure is that I didn't I upload until I was actually able to - I
 was getting a proxy error and the uploads were timing out when I
 attempted to upload the entire batch.  I assumed that these attempts
 were unsuccessful, which I might be wrong about and might have resulted
 in duplication.
 
 I assumed that my successful attempts started, maybe @ 10901673, when I
 realized I needed to break the original shapefile up tabularly into
 percentiles and upload 10 segments of the building footprint dataset,
 one after the other.  These were all definitely successful, and were
 only done once per percentile.
 
 Josh, where are you finding the list of changesets in the format you
 posted?  I can only figure out how to list them in my editor profile
 with my comments.
 
 If you believe that the method you mention that removes the 71,000 nodes
 is the best approach, please feel free to do so.  I will also gladly
 manually fix the inner ring tagging issue as the data gets fixed.
 Please let me know what I can do to help.  I am also willing to share
 the .osm files and/or shapefiles if that will help.  Thanks.
 
 - Nick
 
 -Original Message-
 From: joshthephysic...@gmail.com [mailto:joshthephysic...@gmail.com] On
 Behalf Of Josh Doe
 Sent: Thursday, March 22, 2012 8:51 AM
 To: Marc Zoss
 Cc: impo...@openstreetmap.org; talk-us@openstreetmap.org; Nick
 Chamberlain
 Subject: Re: [Imports] [Talk-us] Uploads to City of Salisbury, MD
 
 On Thu, Mar 22, 2012 at 8:04 AM, Marc Zoss marcz...@gmail.com wrote:
 I briefly downloaded all sby:bldgtype-tagged ways and relation of
 Maryland through the overpass-api. Then removed the ones having only a
 sby:bldgtype tag, run the validator and deleted the duplicated nodes and
 ways.
 This would result in a changeset to remove the roughly 71'000
 duplicates nodes and ways.
 
 If the area was edited since the import and reverting gets tricky,
 this might be the option to go, at least the result looks ok at the
 first glance.
 
 Please also note that the conversion step seems to add a building=yes
 tag on on inner ring of building polygons () which is certainly bad
 tagging, despite the correct rendering (52 occurrences, so could be
 fixed manually).
 
 Thanks for doing that, as that was the next step I was going to try. I
 posted some regarding the changesets here:
 http://wiki.openstreetmap.org/wiki/User_talk:Nick_SPW#Salisbury.2C_Maryl
 and_import
 
 I think perhaps we should revert a subset of the changesets, such as the
 dangling nodes, and then use your method to handle the rest.
 
 -Josh


___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] [Imports] Uploads to City of Salisbury, MD

2012-03-22 Thread Mike N

On 3/22/2012 9:12 AM, Nick Chamberlain wrote:

I
was getting a proxy error and the uploads were timing out when I
attempted to upload the entire batch.


This is a common problem with uploading large changesets with JOSM, 
where upload failures result in a partial upload.  We're hesitant to put 
automatic failed changeset upload recovery and resolution into JOSM for 
fear that someone will create a mass conversion of shapefiles and upload 
it all without even trying to tune the data to OSM standards.g


___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


[Talk-us] Duplicates in data uploads (using JOSM) -- was: Re: [Imports] Uploads to City of Salisbury, MD

2012-03-22 Thread Jaakko Helleranta.com
With previous large uploads I have experience the same behaviour resulting
in massive dupes. So I guess it is not a conversion issue.

I don't have experience with conversions nor (mass) imports -- but I _have_
had massive dupes problems a number of times when uploading larger
amounts of data with JOSM over a bad connection. The problem has always
been related to the combination of large uploads and bad connections where
(if I understand right) the JOSM data upload connection gets a hick-up at
some point and isn't able to finish the job -- and doesn't leave a note for
itself where it was left of. Then, because of reasons I don't _exactly_
understand there's duplication of data on the next upload(s (attempts)).

My vague understanding is that this is due to at least the fact that JOSM
uploads nodes first and only after that the information about ways (i.e.
which nodes belong to which ways). And then when it hasn't gotten or
confirmation for succesful uploads (or it hasn't recorded that to it's data
file(?)) it considers the uploaded nodes to still be new at next upload(s
(attempts)).

I feel that duplication sometimes happens also to partial uploads where the
ways have uploaded, too, resulting in duplicate uploaded ways but I haven't
documented this well enough to say this solidly.

If you have a bad connection / feel that this may be your problem it is a
good idea to tweak the JOSM Advanced upload settings (Upload  Advanced
tab: Upload data in chunks of objects. Chunk size: , where  is
your number of objects per chunk. I use 200 in with my Haitian connection.

Cheers,
-Jaakko
http://osm.org/user/jaakkoh
--
jaa...@helleranta.com * Skype: jhelleranta * Mobile: +509-37-269154  *
http://go.hel.cc/MyProfile



On Thu, Mar 22, 2012 at 8:28 AM, Marc Zoss marcz...@gmail.com wrote:

 Nick and Josh

 thanks for the clarification on your upload strategy. With previous large
 uploads I have experience the same behaviour resulting in massive dupes. So
 I guess it is not a conversion issue.

 If you want me to commit the remove duplicates changeset, I can do so. But
 you will have to go through the data subsequently and check if the issues
 are resolved and no new ones emerged.

 M

 On 22.03.2012, at 14:12, Nick Chamberlain wrote:

  Josh and Marc,
 
  Thank you!  I apologize that I'm unable to speak the OSM language as
  well as everyone, I'm working on it :)  I posted on the Salisbury,
  Maryland Import page that Josh created to give more detail about my
  uploads.
 
  I didn't really think that I created so many duplicates, because I did a
  lot of things in JOSM before I actually chose to upload.  One thing I
  know for sure is that I didn't I upload until I was actually able to - I
  was getting a proxy error and the uploads were timing out when I
  attempted to upload the entire batch.  I assumed that these attempts
  were unsuccessful, which I might be wrong about and might have resulted
  in duplication.
 
  I assumed that my successful attempts started, maybe @ 10901673, when I
  realized I needed to break the original shapefile up tabularly into
  percentiles and upload 10 segments of the building footprint dataset,
  one after the other.  These were all definitely successful, and were
  only done once per percentile.
 
  Josh, where are you finding the list of changesets in the format you
  posted?  I can only figure out how to list them in my editor profile
  with my comments.
 
  If you believe that the method you mention that removes the 71,000 nodes
  is the best approach, please feel free to do so.  I will also gladly
  manually fix the inner ring tagging issue as the data gets fixed.
  Please let me know what I can do to help.  I am also willing to share
  the .osm files and/or shapefiles if that will help.  Thanks.
 
  - Nick
 
  -Original Message-
  From: joshthephysic...@gmail.com [mailto:joshthephysic...@gmail.com] On
  Behalf Of Josh Doe
  Sent: Thursday, March 22, 2012 8:51 AM
  To: Marc Zoss
  Cc: impo...@openstreetmap.org; talk-us@openstreetmap.org; Nick
  Chamberlain
  Subject: Re: [Imports] [Talk-us] Uploads to City of Salisbury, MD
 
  On Thu, Mar 22, 2012 at 8:04 AM, Marc Zoss marcz...@gmail.com wrote:
  I briefly downloaded all sby:bldgtype-tagged ways and relation of
  Maryland through the overpass-api. Then removed the ones having only a
  sby:bldgtype tag, run the validator and deleted the duplicated nodes and
  ways.
  This would result in a changeset to remove the roughly 71'000
  duplicates nodes and ways.
 
  If the area was edited since the import and reverting gets tricky,
  this might be the option to go, at least the result looks ok at the
  first glance.
 
  Please also note that the conversion step seems to add a building=yes
  tag on on inner ring of building polygons () which is certainly bad
  tagging, despite the correct rendering (52 occurrences, so could be
  fixed manually).
 
  Thanks for doing that, as that was the next step I was going to try. I
  

Re: [Talk-us] [Imports] Uploads to City of Salisbury, MD

2012-03-22 Thread Nick Chamberlain
Mark, if you could commit the remove duplicates changeset, that'd be
great.  I will do my best to check if the issues are resolved, and will
gladly accept any guidance on the best ways to do so.  Thanks.

- Nick

-Original Message-
From: Marc Zoss [mailto:marcz...@gmail.com] 
Sent: Thursday, March 22, 2012 9:28 AM
To: Nick Chamberlain; Josh Doe
Cc: impo...@openstreetmap.org; talk-us@openstreetmap.org
Subject: Re: [Imports] [Talk-us] Uploads to City of Salisbury, MD

Nick and Josh

thanks for the clarification on your upload strategy. With previous
large uploads I have experience the same behaviour resulting in massive
dupes. So I guess it is not a conversion issue.

If you want me to commit the remove duplicates changeset, I can do so.
But you will have to go through the data subsequently and check if the
issues are resolved and no new ones emerged.

M

On 22.03.2012, at 14:12, Nick Chamberlain wrote:

 Josh and Marc,
 
 Thank you!  I apologize that I'm unable to speak the OSM language as 
 well as everyone, I'm working on it :)  I posted on the Salisbury, 
 Maryland Import page that Josh created to give more detail about my 
 uploads.
 
 I didn't really think that I created so many duplicates, because I did

 a lot of things in JOSM before I actually chose to upload.  One thing 
 I know for sure is that I didn't I upload until I was actually able to

 - I was getting a proxy error and the uploads were timing out when I 
 attempted to upload the entire batch.  I assumed that these attempts 
 were unsuccessful, which I might be wrong about and might have 
 resulted in duplication.
 
 I assumed that my successful attempts started, maybe @ 10901673, when 
 I realized I needed to break the original shapefile up tabularly into 
 percentiles and upload 10 segments of the building footprint dataset, 
 one after the other.  These were all definitely successful, and were 
 only done once per percentile.
 
 Josh, where are you finding the list of changesets in the format you 
 posted?  I can only figure out how to list them in my editor profile 
 with my comments.
 
 If you believe that the method you mention that removes the 71,000 
 nodes is the best approach, please feel free to do so.  I will also 
 gladly manually fix the inner ring tagging issue as the data gets
fixed.
 Please let me know what I can do to help.  I am also willing to share 
 the .osm files and/or shapefiles if that will help.  Thanks.
 
 - Nick
 
 -Original Message-
 From: joshthephysic...@gmail.com [mailto:joshthephysic...@gmail.com] 
 On Behalf Of Josh Doe
 Sent: Thursday, March 22, 2012 8:51 AM
 To: Marc Zoss
 Cc: impo...@openstreetmap.org; talk-us@openstreetmap.org; Nick 
 Chamberlain
 Subject: Re: [Imports] [Talk-us] Uploads to City of Salisbury, MD
 
 On Thu, Mar 22, 2012 at 8:04 AM, Marc Zoss marcz...@gmail.com wrote:
 I briefly downloaded all sby:bldgtype-tagged ways and relation of
 Maryland through the overpass-api. Then removed the ones having only a

 sby:bldgtype tag, run the validator and deleted the duplicated nodes 
 and ways.
 This would result in a changeset to remove the roughly 71'000
 duplicates nodes and ways.
 
 If the area was edited since the import and reverting gets tricky,
 this might be the option to go, at least the result looks ok at the 
 first glance.
 
 Please also note that the conversion step seems to add a building=yes
 tag on on inner ring of building polygons () which is certainly bad 
 tagging, despite the correct rendering (52 occurrences, so could be 
 fixed manually).
 
 Thanks for doing that, as that was the next step I was going to try. I

 posted some regarding the changesets here:
 http://wiki.openstreetmap.org/wiki/User_talk:Nick_SPW#Salisbury.2C_Mar
 yl
 and_import
 
 I think perhaps we should revert a subset of the changesets, such as 
 the dangling nodes, and then use your method to handle the rest.
 
 -Josh


___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] Duplicates in data uploads (using JOSM) -- was: Re: [Imports] Uploads to City of Salisbury, MD

2012-03-22 Thread Nick Chamberlain
Jaakko,

 

Thank you for the explanation.  I will tweak my chunk sizes further next
time.  I did so before, but they were still fairly large and took a few
hours per upload.  Reducing them might take longer, but if that fixes
duplication I will do that.  Thanks.

 

- Nick

 

From: Jaakko Helleranta.com [mailto:jaa...@helleranta.com] 
Sent: Thursday, March 22, 2012 10:47 AM
To: Marc Zoss
Cc: Nick Chamberlain; Josh Doe; impo...@openstreetmap.org;
talk-us@openstreetmap.org
Subject: Duplicates in data uploads (using JOSM) -- was: Re: [Imports]
[Talk-us] Uploads to City of Salisbury, MD

 

With previous large uploads I have experience the same behaviour
resulting in massive dupes. So I guess it is not a conversion issue.


 

I don't have experience with conversions nor (mass) imports -- but I
_have_ had massive dupes problems a number of times when uploading
larger amounts of data with JOSM over a bad connection. The problem has
always been related to the combination of large uploads and bad
connections where (if I understand right) the JOSM data upload
connection gets a hick-up at some point and isn't able to finish the job
-- and doesn't leave a note for itself where it was left of. Then,
because of reasons I don't _exactly_ understand there's duplication of
data on the next upload(s (attempts)). 

 

My vague understanding is that this is due to at least the fact that
JOSM uploads nodes first and only after that the information about ways
(i.e. which nodes belong to which ways). And then when it hasn't gotten
or confirmation for succesful uploads (or it hasn't recorded that to
it's data file(?)) it considers the uploaded nodes to still be new at
next upload(s (attempts)).

 

I feel that duplication sometimes happens also to partial uploads where
the ways have uploaded, too, resulting in duplicate uploaded ways but I
haven't documented this well enough to say this solidly.

 

If you have a bad connection / feel that this may be your problem it is
a good idea to tweak the JOSM Advanced upload settings (Upload 
Advanced tab: Upload data in chunks of objects. Chunk size: ,
where  is your number of objects per chunk. I use 200 in with my
Haitian connection.

 

Cheers,

-Jaakko

http://osm.org/user/jaakkoh

--

jaa...@helleranta.com * Skype: jhelleranta * Mobile: +509-37-269154  *
http://go.hel.cc/MyProfile





On Thu, Mar 22, 2012 at 8:28 AM, Marc Zoss marcz...@gmail.com wrote:

Nick and Josh

thanks for the clarification on your upload strategy. With previous
large uploads I have experience the same behaviour resulting in massive
dupes. So I guess it is not a conversion issue.

If you want me to commit the remove duplicates changeset, I can do so.
But you will have to go through the data subsequently and check if the
issues are resolved and no new ones emerged.

M

On 22.03.2012, at 14:12, Nick Chamberlain wrote:

 Josh and Marc,

 Thank you!  I apologize that I'm unable to speak the OSM language as
 well as everyone, I'm working on it :)  I posted on the Salisbury,
 Maryland Import page that Josh created to give more detail about my
 uploads.

 I didn't really think that I created so many duplicates, because I did
a
 lot of things in JOSM before I actually chose to upload.  One thing I
 know for sure is that I didn't I upload until I was actually able to -
I
 was getting a proxy error and the uploads were timing out when I
 attempted to upload the entire batch.  I assumed that these attempts
 were unsuccessful, which I might be wrong about and might have
resulted
 in duplication.

 I assumed that my successful attempts started, maybe @ 10901673, when
I
 realized I needed to break the original shapefile up tabularly into
 percentiles and upload 10 segments of the building footprint dataset,
 one after the other.  These were all definitely successful, and were
 only done once per percentile.

 Josh, where are you finding the list of changesets in the format you
 posted?  I can only figure out how to list them in my editor profile
 with my comments.

 If you believe that the method you mention that removes the 71,000
nodes
 is the best approach, please feel free to do so.  I will also gladly
 manually fix the inner ring tagging issue as the data gets fixed.
 Please let me know what I can do to help.  I am also willing to share
 the .osm files and/or shapefiles if that will help.  Thanks.

 - Nick

 -Original Message-
 From: joshthephysic...@gmail.com [mailto:joshthephysic...@gmail.com]
On
 Behalf Of Josh Doe
 Sent: Thursday, March 22, 2012 8:51 AM
 To: Marc Zoss
 Cc: impo...@openstreetmap.org; talk-us@openstreetmap.org; Nick
 Chamberlain
 Subject: Re: [Imports] [Talk-us] Uploads to City of Salisbury, MD

 On Thu, Mar 22, 2012 at 8:04 AM, Marc Zoss marcz...@gmail.com wrote:
 I briefly downloaded all sby:bldgtype-tagged ways and relation of
 Maryland through the overpass-api. Then removed the ones having only a
 sby:bldgtype tag, run the validator and deleted the duplicated nodes
and
 ways.

Re: [Talk-us] [Imports] Uploads to City of Salisbury, MD

2012-03-22 Thread Josh Doe
On Thu, Mar 22, 2012 at 10:52 AM, Nick Chamberlain
nchamberl...@ci.salisbury.md.us wrote:
 Mark, if you could commit the remove duplicates changeset, that'd be
 great.  I will do my best to check if the issues are resolved, and will
 gladly accept any guidance on the best ways to do so.  Thanks.

I'm reverting a few of the changesets as we speak, so if Mark could
hold off a few minutes, I'll update you all as I go. Since this only
concerns the US, future messages will only be sent on the talk-us@
list.

-Josh

___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] [Imports] Uploads to City of Salisbury, MD

2012-03-22 Thread Josh Doe
On Thu, Mar 22, 2012 at 8:49 PM, Josh Doe j...@joshdoe.com wrote:
 On Thu, Mar 22, 2012 at 10:52 AM, Nick Chamberlain
 nchamberl...@ci.salisbury.md.us wrote:
 Mark, if you could commit the remove duplicates changeset, that'd be
 great.  I will do my best to check if the issues are resolved, and will
 gladly accept any guidance on the best ways to do so.  Thanks.

 I'm reverting a few of the changesets as we speak, so if Mark could
 hold off a few minutes, I'll update you all as I go. Since this only
 concerns the US, future messages will only be sent on the talk-us@
 list.

I've reverted a few of them (see [0]), but when I checked #10901301
(2 nodes), it couldn't revert cleanly, since some nodes are
danglers, while others are used for ways in subsequent uploads. I'll
stop for now, but maybe Mark can take a look at things now?

-Josh

[0]: 
http://wiki.openstreetmap.org/wiki/User_talk:Nick_SPW#Salisbury.2C_Maryland_import

___
Talk-us mailing list
Talk-us@openstreetmap.org
http://lists.openstreetmap.org/listinfo/talk-us