Re: [OSM-dev] New OSM Test Data Repository

2014-02-24 Thread Sven Geggus
Jochen Topf joc...@remote.org wrote:

 It also depends on what you are testing. If you are testing the OSM XML 
 parser,
 those cases are valid data that the OSM parser must understand. If you are
 testing the renderer, it might be different.

For the renderer it would be nice to have a model village featuring all
objects in the standard mapnik style.

Apart from this I once set up a testfile for verification of correct
rendering of riverwidths. Background: I learnt from a cartographer, that
waterbodies should always be rendered in their real width on the ground.

Probably also of interest to this database.

Regards

Sven

-- 
Trotz der zunehmenden Verbreitung von Linux erfreut sich der Bär,
und - dank Knut - insbesondere der Eisbär, deutlich größerer
Beliebtheit als der Pinguin. (Gefunden bei http://telepolis.de/)
/me is giggls@ircnet, http://sven.gegg.us/ on the Web

___
dev mailing list
dev@openstreetmap.org
https://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] New OSM Test Data Repository

2014-02-24 Thread Sven Geggus
Jochen Topf joc...@remote.org wrote:
 I have started to work on a common OSM test data repository that can be
 used to test all sorts of software that works with OSM data.

Hm, you seem to user real osm id-numbers.

Don't you think we should define a private id-range for this purpose?

900 and above might be a reasonable range.

I think about importing such data into postgis for rendering tests and it
might be bad in some cases if this would kill existing data.

Sven

-- 
Those who do not understand Unix are condemned to reinvent it, poorly
(Henry Spencer)

/me is giggls@ircnet, http://sven.gegg.us/ on the Web

___
dev mailing list
dev@openstreetmap.org
https://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] New OSM Test Data Repository

2014-02-24 Thread Jochen Topf
On Mo, Feb 24, 2014 at 04:08:09 +, Sven Geggus wrote:
 Jochen Topf joc...@remote.org wrote:
  I have started to work on a common OSM test data repository that can be
  used to test all sorts of software that works with OSM data.
 
 Hm, you seem to user real osm id-numbers.
 
 Don't you think we should define a private id-range for this purpose?
 
 900 and above might be a reasonable range.
 
 I think about importing such data into postgis for rendering tests and it
 might be bad in some cases if this would kill existing data.

Creating test cases etc. is bad enough without having to count zeros in
huge numbers.

Jochen
-- 
Jochen Topf  joc...@remote.org  http://www.jochentopf.com/  +49-721-388298

___
dev mailing list
dev@openstreetmap.org
https://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] New OSM Test Data Repository

2014-02-24 Thread Jochen Topf
On Mo, Feb 24, 2014 at 03:55:37 +, Sven Geggus wrote:
 Jochen Topf joc...@remote.org wrote:
 
  It also depends on what you are testing. If you are testing the OSM XML 
  parser,
  those cases are valid data that the OSM parser must understand. If you are
  testing the renderer, it might be different.
 
 For the renderer it would be nice to have a model village featuring all
 objects in the standard mapnik style.

 Apart from this I once set up a testfile for verification of correct
 rendering of riverwidths. Background: I learnt from a cartographer, that
 waterbodies should always be rendered in their real width on the ground.
 
 Probably also of interest to this database.

For me that's out of scope. I am trying to test software not rendering style
files. But if somebody else wants to tackle this, we can talk about whether it
makes sense to put all of this in the same repository or how we can organize it
best.

Jochen
-- 
Jochen Topf  joc...@remote.org  http://www.jochentopf.com/  +49-721-388298

___
dev mailing list
dev@openstreetmap.org
https://lists.openstreetmap.org/listinfo/dev


[OSM-dev] New OSM Test Data Repository

2014-02-21 Thread Jochen Topf
I have started to work on a common OSM test data repository that can be
used to test all sorts of software that works with OSM data. See my blog
post at http://blog.jochentopf.com/2014-02-21-osm-test-data-repository.html
for details.

Jochen
-- 
Jochen Topf  joc...@remote.org  http://www.jochentopf.com/  +49-721-388298

___
dev mailing list
dev@openstreetmap.org
https://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] New OSM Test Data Repository

2014-02-21 Thread Christoph Hormann
On Friday 21 February 2014, Jochen Topf wrote:
 I have started to work on a common OSM test data repository that can
 be used to test all sorts of software that works with OSM data. See
 my blog post at
 http://blog.jochentopf.com/2014-02-21-osm-test-data-repository.html
 for details.

Nice.

Of course I could not resist creating another test:

http://www.imagico.de/files/data.osm

where the result would be 'valid but invalid in plate carree or web 
mercator'.

'invalid but valid in plate carree and web mercator' is also possible of 
course, just like all other combinations. ;-)

In case you wonder about the coordinates - that's (50 900) and 
(55 905) in UTM zone 47.  And of course this does not work near 
the equator.

-- 
Christoph Hormann
http://www.imagico.de/

___
dev mailing list
dev@openstreetmap.org
https://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] New OSM Test Data Repository

2014-02-21 Thread Jochen Topf
On Fr, Feb 21, 2014 at 08:05:34 +0100, Christoph Hormann wrote:
 On Friday 21 February 2014, Jochen Topf wrote:
  I have started to work on a common OSM test data repository that can
  be used to test all sorts of software that works with OSM data. See
  my blog post at
  http://blog.jochentopf.com/2014-02-21-osm-test-data-repository.html
  for details.
 
 Nice.
 
 Of course I could not resist creating another test:
 
 http://www.imagico.de/files/data.osm
 
 where the result would be 'valid but invalid in plate carree or web 
 mercator'.
 
 'invalid but valid in plate carree and web mercator' is also possible of 
 course, just like all other combinations. ;-)
 
 In case you wonder about the coordinates - that's (50 900) and 
 (55 905) in UTM zone 47.  And of course this does not work near 
 the equator.

I am not sure yet what exactly the valid and invalid mean and whether
we need different outcomes. For the moment valid means more something
like: The data looks good, could exist in a real database, and everybody
would agree on how to interpret it. While invalid is everything else.

Say, for instance, that you have a way that contains the same node twice
directly next to each other. Do you remove one occurence in the linestring
geometry or not? You don't see the difference usually, so in almost all
cases it does not matter. But there could be a difference. And it doesn't
happen often in the data. But strictly speaking people shouldn't add data
like that. So is that already invalid data?

It also depends on what you are testing. If you are testing the OSM XML parser,
those cases are valid data that the OSM parser must understand. If you are
testing the renderer, it might be different. I expect different programs
to use the same data.osm files from the repository, but decide for themselves
which test case is valid and which isn't in their context.

I haven't figured out all the details yet, for the moment I am trying to
get the ball rolling, get started writing test data and tests programs that
use them. We'll figure out the details along the way.

Jochen
-- 
Jochen Topf  joc...@remote.org  http://www.jochentopf.com/  +49-721-388298

___
dev mailing list
dev@openstreetmap.org
https://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] New OSM Test Data Repository

2014-02-21 Thread Christoph Hormann
On Friday 21 February 2014, you wrote:

 I am not sure yet what exactly the valid and invalid mean and
 whether we need different outcomes. For the moment valid means more
 something like: The data looks good, could exist in a real database,
 and everybody would agree on how to interpret it. While invalid is
 everything else.

I suppose there are many cases where it would make sense to allow an 
assessment somewhere between 'fully valid no matter how hard you look 
at it' and 'invalid beyond any consistent interpretation of the data'.  
It might make sense to say robust programs should be able to deal with 
certain data while data producing progams should avoid generating such 
data.

 I haven't figured out all the details yet, for the moment I am trying
 to get the ball rolling, get started writing test data and tests
 programs that use them. We'll figure out the details along the way.

It will be interesting to see results for multipolygons in various 
programs.  I have long suspected the JOSM validator is fairly 'loose' 
in this regard (which is a bad thing - validation in the editor should 
ideally be the strictest check of all) but never got around to pinning 
this down.

-- 
Christoph Hormann
http://www.imagico.de/

___
dev mailing list
dev@openstreetmap.org
https://lists.openstreetmap.org/listinfo/dev


Re: [OSM-dev] New OSM Test Data Repository

2014-02-21 Thread Martin Raifer

I am not sure yet what exactly the valid and invalid mean and
whether we need different outcomes. For the moment valid means more
something like: The data looks good, could exist in a real database,
and everybody would agree on how to interpret it. While invalid is
everything else.


I suppose there are many cases where it would make sense to allow an
assessment somewhere between 'fully valid no matter how hard you look
at it' and 'invalid beyond any consistent interpretation of the data'.
It might make sense to say robust programs should be able to deal with
certain data while data producing progams should avoid generating such
data.


Yes. I'd add that any good consumer/production OSM-data-processor should  
(be able to) gracefully handle slightly invalid data. For example it  
should not segfault, abort or skip a linestring just because the way  
contains a node twice. The same tests could also be used against  
OSM-quality-assurance tools which of course should be able to detect the  
respective glitches.


___
dev mailing list
dev@openstreetmap.org
https://lists.openstreetmap.org/listinfo/dev