Hi Christoph,
Thank you for you feedback. Yes, dangers of automated classification are well
understood, past failures are studied, and these dangers will be addressed
during this import. Careful visual inspection of imported data is needed,
comparison against available satellite imagery, existing OSM information, and,
when possible, local knowledge. If for any parts of the country concerns about
correctness of new data arise, these parts will be excluded from import.
In the Import Plan it is stated that this project is meant to import areas only
corresponding to forests (with occasional secondary tags) and farmlands. A very
small set of classes with no deep sub-classification. No man-made areas or
features such as industrial zones, residential areas, roads etc., no water
areas (rivers, lakes etc.) — such information present in the source dataset is
thrown away.
I tried really hard to find mis-classified places but so far I could not find
it. This source even classifies cutlines under power lines, northern mountain
vasts, and logging areas as "scrub", the same way they are typically tagged in
OSM, and I must add it is often hard for humans to recognize "scrub" on
satellite imagery. So far I've managed to look though a tiny subset of
resulting data (about 4 of total 290 counties) and again I do not exclude a
possibility that things will go wrong at some locations. But now, apart from
unavoidable presence of muffled digital noise making it through smoothing
filters, I dare to say I wouldn't be able to manually tag forests and farmlands
more accurately than this import has.
>this is inherently incompatible with the tagging system and the way how things
>are mapped in OSM.
>you'd import the whole classification system and philosophy behind it.
I am sorry, but I cannot see how these points apply in this case. Pixels with
values 111-128 map to "landuse=forest" (with a few variations reflected as
secondary tags), pixel 3 corresponds to farmland, pixel 42 — to grass. That's
it. Resulting data layers tagging choice looks and feels the same way I would
have tagged it manually using imagery.
> like natural=wood vs. natural=scrub, natural=scree vs. natural=bare_rock,
> landuse=commercial vs. landuse=industrial or leisure=park vs. leisure=garden
> being indistinguishable based on spectral characteristics alone.
I agree with you on that. However, given that man-made features (apart from
farmlands which are easily identifiable) are not target for this import, the
only relevant issue is to tell apart wood, scrub, scree etc. However, so far I
see good accuracy in detecting scrub vs wood, at the same level of (in)accuracy
a human looking at imagery would have.
>But this has to be done specifically for OSM and for the tagging and mapping
>standards we have and it has to be controlled by the mapper.
We do consider this data source to be done for everyone who can make good use
of it, in particular specifically for us OSM-mappers. The Import Plan in
details describes in several places all stages at which human mappers control
imported data before, during and after the import.
Let's be realistic — it is unlikely that we will be able to manually map
forests in reasonable time in a country sized 1500×500 km by tracing imagery by
hand. We already allow machines to do many repetitious tasks about the OSM
dataset for us, why not let them help us with land cover?
>Воскресенье, 14 апреля 2019, 22:59 +03:00 от Christoph Hormann <
>[email protected] >:
>
>On Sunday 14 April 2019, Grigory Rechistov via Imports wrote:
>> Dear OSM community,
>> This is an announcement and request for comments, suggestions, and
>> approval of land cover information data import for the territory of
>> Sweden.
>>
>> [...]
>Had a quick look over your description.
>
>My general recommendation would be not to import any landcover data in
>OSM generated through automated classification of satellite imagery.
>There are multiple reasons for this, the most significant probably is
>that this is inherently incompatible with the tagging system and the
>way how things are mapped in OSM.
>
>Part of this stems from the very idea of closed landcover classification
>systems as not being meant to positively identify certain areas but
>assigning essentially the least unlikely of a set of potential classes
>to every point on the surface.
>
>Another part of this is due to many of the tagging distinctions we have
>in OSM - like natural=wood vs. natural=scrub, natural=scree vs.
>natural=bare_rock, landuse=commercial vs. landuse=industrial or
>leisure=park vs. leisure=garden being indistinguishable based on
>spectral characteristics alone.
>
>Therefore you'd not just be importing data into the existing OSM tagging
>system, you'd import the whole classification system and philosophy
>behind it.
>
>Don't get me wrong - i am not opposed to using automated analysis of
>open data satellite imagery for more efficient mapping in OSM. This
>has a lot of potential. But this has to be done specifically for OSM
>and for the tagging and mapping standards we have and it has to be
>controlled by the mapper. There is no shortcut to this by importing
>data sets created for a completely different purpose with completely
>different goals.
>
>--
>Christoph Hormann
>http://www.imagico.de/
С наилучшими пожеланиями,
Григорий Речистов.
Med vänliga hälsningar,
Grigory Rechistov
With best regards,
Grigory Rechistov
_______________________________________________
Imports mailing list
[email protected]
https://lists.openstreetmap.org/listinfo/imports