Dear fellow-mappers,

In The Netherlands we went through this process 4 years ago. This means we can talk from experience.

After the preparation period, we started the real import process around march 2014 with a group of some 30 volunteers. At the end of 2014, the initial import was finished. At the time, we decided to import the building id's, but not the address id's. The address id's we not available in the WFS data source we used and we reasoned that the addresses can be identified by the address tags.

Since then, we are in the process of maintaining the imported buildings and address. This concerns around 11.000 new buildings and the same number of new addresses every month. The tooling we use in this process depends highly on the building id's. Not having have the address id's seriously complicates the maintenance of addresses, Even though the combination of postcode and house number are unique in The Netherlands. Being official government id's, the building and address id's in The Netherlands get more and more official applications day by day. They are already required in notarial acts and used by the chamber of commerce. Leaving them out would not be an option in my opinion.

We have no issue with mappers being afraid to touch buildings because of the building id's. Most people know where to find the id, or file an import request on the forum for a building or a block of buildings. Some complex buildings like Utrecht Central station keep being mapped manually because the official data is too complex. The main issue we have is with addresses being deleted, because people delete a node instead of the poi tags. This is nothing new, but it is detected more often because of the continuous comparison between OSM and official building and address data.

Gertjan Idema


On 01/11/2018 15:39, Pieter Vander Vennet wrote:
Hello everyone,
Original Poster here.

Thanks for your remarks. A lot of peopleactivein the Belgian community are following this discussionwith interest and some bewilderment.Like the first message, this is a reply that has been written by several memberstogether. We did have a hard time filtering the actual discussion about our import from the thread. We would kindly invite everyone to take the discussion of other imports and the more philosophical points  to a more appropriate place (e.g. the talk-list) and stick to the topic of the Flemish buildings import.

We have been working on this for two years and don't mind working on it for a few more months, before running the actual integration. Weare, in the first place, looking for advice and guidance so as to further improve our methods - together with witha go-ahead once all issues are resolved.

In case this was not clear before:this is _not_ a flat import where all the data is dumped into OSM automatically. This is an effort of the Belgian community, where mappers select a subset of the data they choose to integrate piecemeal, usually one street at a time. The data is loaded in JOSM and checked against the aerial imagery involving the mapper's common sense. As an example, this is a screenshot of the tool in action: https://matrix.org/_matrix/media/v1/download/matrix.org/WmemnfTQZOSfoKoIlByFBHmv. You can see that the tool reuses tags from OSM and is offering the mapper the choicewhich geometry should be used(by not saving the import). In combination with aerial imagery, this should cause little to no problemsin regard with the original OSM-data.

Mateusz and Frederik, your points regarding documentation are being addressed as we speak.

# External IDs

The most discussed point seems to be the IDs we wanted to include.
We believe theIDswill significantly ease updates to our buildings when the GRB is updated,and make the whole process more robust. Theyprovide a way to update and crossreference the data now and in the future.Exactly by keeping theseIDs, updating data down the road will be smoother and prevent later changes from OSM contributors to be overwritten. As an added benefit, keeping IDsmakes the import tool-agnostic. It will also make it much easier to flag errors in the source data.

We aren't afraid that people will refrain from editing source-tagged objects: new users (using theiD editor) will probably not notice those tags in the first place; advanced users will know about them. And intermediate users will probably look them up. Merging and splitting are rather rare operations –once the geometry is accurate, they should becomeunnecessary operations. Finally, as the data set is available for free under an open license, anyone can verify the data independently (though noton the ground, of course).

We will address the worries about the external IDs on a point by point basis below:

From *Frederik Ramm:*

  * /If you delete a building////that has such an ID, how will you
    ensure i//t//isn't brought in again////through a later "update
    import"? Etc.//"/


There will not be "an update import". The tool is built for continuous use. It will improve the geometry of existing buildings and already looks at the source tags. The tool is builtfor heavy mappers who will be monitored by our closely knit community. Importing and updating will be done street by streetor block by blockbasis. We believe we cantrustthese mappers to analyze situations like this on a case by case basis. In this case, whether of not the deleted building had an ID does not matter much.


These points, respectively by Frederik Ramm, Christoph Hormann and Mateusz Konieczny are quite similar. *We'll answer them together.*

  * /The idea of having an "audit trail" for every single geometry by
    way of an Id for that individual geometry is interesting, but I
    think that it is totally sufficient if a changeset carries the
    information that this changeset has been imported from XYZ data
    source at time stamp T; everything else can be researched down the
    line if the need should/
  * / For this purpose it is completely unnecessary to bother the OSM 
    community with external IDs.  If you want to check if the data has
    been unchanged since you added it then do exactly that - check if
    there are  any newer versions of the objects that have originally
    been added in  the import./

  * /* What is the point of adding tags like source:geometry:entity
    given that like any other tags they may be edited once added to OSM?/


It is not, and has never been, the intention to fully automate this. As we are trying to make clear, this info is needed and used all the time, and not at some vague "maybe we do an update sometime". We are not using the IDs to make a direct (full database) comparison to see what exists in one and not in the other on a gigantic scale. We are not looking for an 'auto update OSM with all new buildings added to GRB'. We use the IDs, on a SINGLE BUILDING comparison basis only to see what's changed (geometry touch-ups, or entirely replacing a building).

"THEY WON'T BE STABLE ANYWAY"
If the external references are edited, the tool will flag the building as needing an update. The only thing a (deliberate or accidental) unneeded change of the ID would do, is alert the tool's users that something doesn't add up. Either the building has been replaced in reality (and thus having a new UIDN), and you can improve the geometry of the new building. If no apparent change to the physical building can be traced, the UIDN can be restored.  The amount of 'false positives' due to unintended edits of the IDs is expect not to come remotely close to the useful flaggings. Without the tags, it's hard to tell which buildings have been imported: you would need complicated spatial heuristics because we don't blindly copy buildings, we improve them through other sources. Once mapped, the geometry changing would happen more often than the tags changing, so we'd have a lot more false positives.


"WHY NOT JUST CREATE A DATABASE OF LINKS EXTERNALLY?"
In theory it would be possible to have the tool keep a register of which OSM ID maps to which ID in the GRB or to evaluate changesets to get similar info. Some issues with this approach:

  * Because we aren't doing an automatic import but manually add the
    buildings through JOSM, this would require us to copy the OSM ID
    of each building manually, or rely on heuristics.

  * We don't want to centralize the link between OSM and GRB. There
    isn't one single person doing the import, it's something several
    people in the communitywork on. Anyone could host the tool if its
    current maintainer disappears.

  * Pushing the analysis of what exactly has happened towards a
    changeset is an unnecessary burden on a mapper. The changeset will
    not just contain "here be new buildings", but also "in this case,
    we just used part of the geometry of the GRB building, but we left
    part of the building geometry intact because that was better in
    OSM". After having analysed the changeset, the mapper would still
    have to look up in an import database what exactly the
    relationship between the OSM and GRB objects was at the time of
    import.

  * An external database providing the link between OSM and GRB
    objects would be outdated after a day and almost impossible to
    update with changes on the OSM side



"WHY NOT JUST COMPARE GEOMETRIES?"
Doing a geometrical analysis to analyze differences would be impractical, not just because this would be computationally heavy, but also because it would lead to too much false positives. For example because of tiny changes, but also because we do not blindly use source geometries since we first address overnoding.



I hope that all confusion is cleared now and that we can move this issue forward.

With Regards,
The Belgian Mappers,
Pieter Vander Vennet

_______________________________________________
Imports mailing list
[email protected]
https://lists.openstreetmap.org/listinfo/imports
_______________________________________________
Imports mailing list
[email protected]
https://lists.openstreetmap.org/listinfo/imports

Reply via email to