Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-12-20 Thread Christopher Schmidt
On Wed, Nov 21, 2007 at 11:52:06PM -0700, Robert Bray wrote:
 Chris,
 
 I agree and will see what I can do to make it happen. If we want wider 
 adoption it may also be beneficial to see some kind of C/C++ access library 
 created for the format. In the past I always felt the FDO Provider was that 
 library, but the masses seem to be telling me I am wrong about that :)

/me sends a '30 days no news' nag, continues to wait patiently.

Regards,
-- 
Christopher Schmidt
Web Developer
___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-22 Thread Robert Bray

Chris,

I agree and will see what I can do to make it happen. If we want wider 
adoption it may also be beneficial to see some kind of C/C++ access library 
created for the format. In the past I always felt the FDO Provider was that 
library, but the masses seem to be telling me I am wrong about that :)


Bob
- Original Message - 
From: Christopher Schmidt [EMAIL PROTECTED]

To: OSGeo Discussions discuss@lists.osgeo.org
Sent: Wednesday, November 21, 2007 9:04 PM
Subject: Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new,open 
data format




On Tue, Nov 13, 2007 at 11:18:42PM -0800, Robert Bray wrote:

Is it an open format?  ABSOLUTELY (we just never wrote a spec, but I am
willing to get it done)

All this said, I'd really like to understand everyones requirements for
this new format. If SDF fits thats great, if not thats ok too. We are
always happy to contribute what we can to the community.


I have an interest in seeing wider implementation of SDF. I have a user
who is interested in using FeatureServer against SDF data -- but I don't
have an environment in which I can build/use the FDO Python bindings
(being a mac user), so I can't help the user out.

It seems more likely that an alternative implementation to FDO's could be
created if documentation on the format was available. Obviously, there's
no guarenteee (and unfortunately, I'm not a very good coder, so I can't
offer much), but I'd at least like to be able to investigate how
difficult SDF might be to implement: not just for my own personal
reasons, but because implementation-difficulty can affect uptake of a
format as much as the technical benefits can (if not more).

so in response to 'I am willing to get it done', I would strongly
encourage you to do so!

Regards,
--
Christopher Schmidt
Web Developer
___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss




___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-22 Thread Mateusz Loskot
Robert Bray wrote:
 Chris,
 
 I agree and will see what I can do to make it happen. If we want wider
 adoption it may also be beneficial to see some kind of C/C++ access
 library created for the format. In the past I always felt the FDO
 Provider was that library, but the masses seem to be telling me I am
 wrong about that :)

Bob,

Sounds like a very good idea to setup a libsdf project.
This way, it could be used by FDO, OGR and others.

Cheers
-- 
Mateusz Loskot
http://mateusz.loskot.net
___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


RE: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-22 Thread Traian Stanev
How about an OGR driver for SDF? No need to invent a new API when one already 
exists.

Traian


From: [EMAIL PROTECTED] [EMAIL PROTECTED] On Behalf Of Mateusz Loskot [EMAIL 
PROTECTED]
Sent: Thursday, November 22, 2007 2:36 PM
To: OSGeo Discussions
Subject: Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new,open
data format

Robert Bray wrote:
 Chris,

 I agree and will see what I can do to make it happen. If we want wider
 adoption it may also be beneficial to see some kind of C/C++ access
 library created for the format. In the past I always felt the FDO
 Provider was that library, but the masses seem to be telling me I am
 wrong about that :)

Bob,

Sounds like a very good idea to setup a libsdf project.
This way, it could be used by FDO, OGR and others.

Cheers
--
Mateusz Loskot
http://mateusz.loskot.net
___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss
___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-22 Thread Christopher Schmidt
On Thu, Nov 22, 2007 at 05:17:46PM -0800, Traian Stanev wrote:
 How about an OGR driver for SDF? No need to invent a new API when one already 
 exists.

I think that was exactly Bob's point: There is already an FDO driver for
SDF. If OGR is sufficient, I'm not entirely sure why FDO wouldn't be by
the same argument.

I'm with Mat, in feeling that it's not the path to the widest usage in
only FDO *or* OGR, and that a reference implementation outside of either
of those would result in possibly higher OEM-style integration into non
OGR/FDO products, and that a single shared implementation is the best
way for all developers to share a single effort when moving forward. 

Regards,
-- 
Christopher Schmidt
Web Developer
___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-22 Thread Mateusz Loskot
Traian Stanev wrote:
 How about an OGR driver for SDF? No need to invent a new API
 when one already exists.

Traian,

AFAIKU, that's the idea that triggered the debate
but it would be good if there is non-API specific library for SDF.
The the same library could be used by OGR, FDO, etc.

Cheers
-- 
Mateusz Loskot
http://mateusz.loskot.net
___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


RE: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-22 Thread Traian Stanev

I think many people have it backwards. SDF is basically a reference 
implementation of the FDO API (or a significant subset thereof). It's not an 
independent file format that happens to have an FDO driver. Any C++ API for SDF 
would likely have to use FDO data structures and types underneath to get SDF 
data out of the file. This is because FDO data structures are directly 
serialized into SDF binary tables.

Traian


From: [EMAIL PROTECTED] [EMAIL PROTECTED] On Behalf Of Christopher Schmidt 
[EMAIL PROTECTED]
Sent: Thursday, November 22, 2007 9:23 PM
To: OSGeo Discussions
Subject: Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new,open 
data format

On Thu, Nov 22, 2007 at 05:17:46PM -0800, Traian Stanev wrote:
 How about an OGR driver for SDF? No need to invent a new API when one already 
 exists.

I think that was exactly Bob's point: There is already an FDO driver for
SDF. If OGR is sufficient, I'm not entirely sure why FDO wouldn't be by
the same argument.

I'm with Mat, in feeling that it's not the path to the widest usage in
only FDO *or* OGR, and that a reference implementation outside of either
of those would result in possibly higher OEM-style integration into non
OGR/FDO products, and that a single shared implementation is the best
way for all developers to share a single effort when moving forward.

Regards,
--
Christopher Schmidt
Web Developer
___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss
___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-21 Thread Christopher Schmidt
On Tue, Nov 13, 2007 at 11:18:42PM -0800, Robert Bray wrote:
 Is it an open format?  ABSOLUTELY (we just never wrote a spec, but I am 
 willing to get it done)
 
 All this said, I'd really like to understand everyones requirements for 
 this new format. If SDF fits thats great, if not thats ok too. We are 
 always happy to contribute what we can to the community.

I have an interest in seeing wider implementation of SDF. I have a user
who is interested in using FeatureServer against SDF data -- but I don't
have an environment in which I can build/use the FDO Python bindings
(being a mac user), so I can't help the user out.

It seems more likely that an alternative implementation to FDO's could be 
created if documentation on the format was available. Obviously, there's
no guarenteee (and unfortunately, I'm not a very good coder, so I can't
offer much), but I'd at least like to be able to investigate how
difficult SDF might be to implement: not just for my own personal
reasons, but because implementation-difficulty can affect uptake of a
format as much as the technical benefits can (if not more).

so in response to 'I am willing to get it done', I would strongly
encourage you to do so!

Regards,
-- 
Christopher Schmidt
Web Developer
___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


RE: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-19 Thread Andy Robinson
From: [EMAIL PROTECTED] [mailto:discuss-
[EMAIL PROTECTED] On Behalf Of Jo Walsh
Sent: 18 November 2007 6:26 AM
To: OSGeo Discussions
Subject: Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new,open
data format

dear Steve, all,
On Tue, Nov 13, 2007 at 05:24:55PM +, Steve Coast wrote:
 Real artists ship. For everyone else there's standards wanking.

As the origins of the word 'yardstick' suggest, size is relative,
and standards and wanking have always been intimately connected.
http://www.etymonline.com/index.php?search=yardstick

 this: We should have got a committee to design a standard, then we
 could think about a committee to design an ontology... and choose a

As http://wiki.openstreetmap.org/index.php/Map_Features states,
there is benefit in agreeing a recommended set of features and
corresponding tags. This page is a beautiful work and has many
interesting properties. But it's an ontology designed by a committee,
the work of core contributors needing more structure to achieve their
aims, than an completely open key-value tag system.


I'll point out that the original map features list was created by me and not
by committee. It was put together so that those with no experience of what
might be mapped would have some idea of where to start, remember that OSM is
not a project driven by the traditional cartography or GIS industries, for
most people joining the project it all needs to be learnt from the ground
up. Map Features was never intended to be a standard and hopefully it never
will be. Since its introduction in March 06 there have been plenty who have
advocated standardising it and a process of voting on proposed new tags to
join the Map features list still goes on by a minority, however in practice
if a tag is proposed and mappers find it useful they simply use it and
that's the way it should be. Voting only sees a small handful of responses
anyway. Principally it's a good way for discouraging poor tag design rather
than setting a standard.

I'm impressed to see how many different language communities have
worked to translate this page and explain its contents.
Yet, italians, swedes and spaniards alike must use english-language
key and value 'tags' in order to have their work show up on the map.
Many annotations made before the core recommended feature set was
fixed have been lost to perception; though it's technically possible
to use a different system, adapt the rendering and annotation clients,
then that work would be lost to re-use. A bit more structure would
afford a lot more future adaptation and translation.

Map features was always intended to be an English orientated tagging list. I
always envisaged that other languages would develop their own tags and use
those for their local areas. I also expected (and still expect) to see other
tagging formats for other peculiar uses. Map features never included tagging
ideas for transport routing for instance. These haven't really happened much
yet, perhaps through a lack of confidence in breaking with tradition more
than anything else. The immediate gratification of rendered mapping (which
uses the longest established and most prolifically used tags) is also a big
driver. Hopefully though with time a lot more language specific tagging will
turn up in the data set.


OSM is a brilliant project, borne of real need and social momentum.
Meanwhile some corners of the standards industry really have gone off
the rails and appear to be acting against common sense and user benefit.
About the most depressing thing i heard at FOSS4G was, in the middle
of an interesting talk, we were going to implement cool feature X,
but we're *waiting for the standard*.

As i think you've pointed out in the past, standards like the core
Map Features ought to emerge from common practise, from comparing
different things that are shipped and running and being used.
Standards that *do* work, like WMS and RSS, will get picked up.
The core difference in approach seems to be a question of process.
As Bitner pointed out, sometimes it makes sense to slow down a bit in
order to let others catch up, so we can all go faster.


Bitner may have a point but I'm not so sure it holds true for OSM, which has
demonstrated that you just need to facilitate the creativity for great
progress to be made. Standards have not been the driver for making it a
success in the short time to date and each week we see something new and
exciting within the project without any reference to an agreed standard or
committee decision. If we stop and wait for everyone to catch up we will
surely stifle the cutting edge creativity from which the project benefits
immensely. Like all the exploration of the past there will always be a those
forging ahead, setting the path of the project, and opening up new
territories. Without them there will be nowhere for the masses to settle and
make use of this great data OSM is creating.

Cheers

Andy

Andy Robinson

Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-17 Thread Jo Walsh
dear Steve, all,
On Tue, Nov 13, 2007 at 05:24:55PM +, Steve Coast wrote:
 Real artists ship. For everyone else there's standards wanking.

As the origins of the word 'yardstick' suggest, size is relative,
and standards and wanking have always been intimately connected.
http://www.etymonline.com/index.php?search=yardstick
 
 this: We should have got a committee to design a standard, then we  
 could think about a committee to design an ontology... and choose a  

As http://wiki.openstreetmap.org/index.php/Map_Features states, 
there is benefit in agreeing a recommended set of features and
corresponding tags. This page is a beautiful work and has many
interesting properties. But it's an ontology designed by a committee,
the work of core contributors needing more structure to achieve their
aims, than an completely open key-value tag system.

I'm impressed to see how many different language communities have
worked to translate this page and explain its contents. 
Yet, italians, swedes and spaniards alike must use english-language 
key and value 'tags' in order to have their work show up on the map.
Many annotations made before the core recommended feature set was
fixed have been lost to perception; though it's technically possible
to use a different system, adapt the rendering and annotation clients,
then that work would be lost to re-use. A bit more structure would
afford a lot more future adaptation and translation.

OSM is a brilliant project, borne of real need and social momentum.
Meanwhile some corners of the standards industry really have gone off
the rails and appear to be acting against common sense and user benefit.
About the most depressing thing i heard at FOSS4G was, in the middle
of an interesting talk, we were going to implement cool feature X,
but we're *waiting for the standard*. 

As i think you've pointed out in the past, standards like the core
Map Features ought to emerge from common practise, from comparing
different things that are shipped and running and being used. 
Standards that *do* work, like WMS and RSS, will get picked up.
The core difference in approach seems to be a question of process. 
As Bitner pointed out, sometimes it makes sense to slow down a bit in
order to let others catch up, so we can all go faster.


jo
--
 
___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-14 Thread Robert Bray

These items are also supported in SDF today:

- SRS is a must, complete projection parameters (stored as WKT).
- full set of attibute types (c, int2, int4, int8, float, double,
 txt (i18l), date and timestamps (with gmt precise offset), blobs, ...
- indexing (unique and non-unique) on key attributes

The only exception is that I do not believe SDF supports the blob data type. 
There is also support for range constraints.


Bob

- Original Message - 
From: Paul Spencer [EMAIL PROTECTED]

To: OSGeo Discussions discuss@lists.osgeo.org
Sent: Wednesday, November 14, 2007 5:24 AM
Subject: Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new,open 
data format





On 14-Nov-07, at 7:20 AM, Christopher Schmidt wrote:


- optional coloring and styles, break values, rendering and
 scale limits, persistent joins or relates, color ramp, ...


are things which are provided by SLD and the like, which means that  you
really want SDF + WMC -- I don't think that this is SDF's job, and I
don't think that it should be the job of any geodata format.


mapinfo files come with embedded styling so there is at least one  format 
out there that combines the two.  KML arguably does the same  thing.


I think it would be worth exploring the merits of providing the  ability 
to embed styling information.  I'm not sure if it is useful to  place 
styling 'hints' in metadata, assuming that the format supports  arbitrary 
metadata.


Cheers

Paul

+-+
|Paul Spencer  [EMAIL PROTECTED]|
+-+
|Chief Technology Officer |
|DM Solutions Group Inchttp://www.dmsolutions.ca/ |
+-+





___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss




___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-14 Thread P Kishor
On 11/14/07, Christopher Schmidt [EMAIL PROTECTED] wrote:
 On Wed, Nov 14, 2007 at 04:12:21AM -0600, [EMAIL PROTECTED] wrote:
  I would just love to click open and see something nice, specially if
  someone has already taken the time to make it beautiful.
 
  Think of it as the output of a word processor instead of an editor.
  Excel vs. VisiCalc; and the list may go on forever.
 
  All this is to say: not only the data needs a better container, the maps
  as well.

 A data format is not a 'container' for maps, it's a container for data.
 The container for your maps should be something like WMC (though I
 don't know if it makes sense off the web): grouping together datasources
 and SLD and the like. The first half of the things you identified:

  - optional space for metadata,
  - optional thumnails (in 2 sizes: thumb and browse)
  - optional embeded symbols (truetype/svg glyphs) and prefered layout
  - optional coloring and styles, break values, rendering and
scale limits, persistent joins or relates, color ramp, ...

 are things which are provided by SLD and the like, which means that you
 really want SDF + WMC -- I don't think that this is SDF's job, and I
 don't think that it should be the job of any geodata format.

From a pure data approach, I agree that how it looks does not belong
in what it is. Data should be completely separate from the
presentation.

There is a part of me, though, that believes that embedding some
default styling in the data container would be very useful. Apple's
Quicklook is a great technology that allows one to quickly look at
the data before deciding what to do with it. So, at there is some
precedence for this.

This is how it could work -- if any styling is provided from outside,
the internal styling would be ignored. However, if nothing is provided
from outside, then the default styling as determined by the creator
would be used.

Being able to click on the data package and see a preview would be fantastic.

Built in containers for metadata and SRS would be absolutely essential.

Manolo, could you please add these to the wiki page that David set up,
else I will add it but not until tomorrow or so.


 Regards,
 --
 Christopher Schmidt
 Web Developer
 ___
 Discuss mailing list
 Discuss@lists.osgeo.org
 http://lists.osgeo.org/mailman/listinfo/discuss



-- 
Puneet Kishor
http://punkish.eidesis.org/
Nelson Institute for Environmental Studies
http://www.nelson.wisc.edu/
Open Source Geospatial Foundation (OSGeo)
http://www.osgeo.org/
Summer 2007 ST Policy Fellow, The National Academies
http://www.nas.edu/
___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-14 Thread P Kishor
disclaimer: I speak from a position of complete ignorance about SDF,
so everything I might say could be wrong.

On 11/14/07, Robert Bray [EMAIL PROTECTED] wrote:
 A little clarification on SDF, since it comes up once or twice in this
 thread. Jason's earlier descriptions of it's capabilities are pretty good.
 It supports multiple Feature Classes / Tables per file, strongly typed
 properties, multiple geometry properties per class, with seperate R-Trees
 for each geometry property. All of this is stored in a single file that is
 heavily optimized for spatial reads. The SDF FDO Provider suppports a
 multiple reader / single writer model. The geometries themselves include
 simple features + circular arcs in 2D, 2D with Z, 2D with M, or 2D with Z 
 M. The R-Trees are currently 2D.


SDF does sound the cat's meow.

 Is it an open format?  ABSOLUTELY (we just never wrote a spec, but I am
 willing to get it done)


What is the licensing of this format? What is the plan for licensing
of this format? Is the format going to be put out in public domain
ever? This is an important discussion, but it probably doesn't belong
in this thread. I bring it up here just to make sure it is not
forgotten.

 Another little known fact is that in the process of creating SDF we
 (Autodesk) literally wrote the code three times. The first time we built SDF
 on BerkleyDB, a great open source project but it has some fairly significant
 license fees for using it in a proprietary product. The second time we wrote
 it on SQLite, however the performance penalaty of the Relational layer was
 significant (read orders of magnitude). The third time we chose to strip
 away the SQLite relational layer and built directly on the SQLite Backend
 components (B-Tree, Pager, and OS Interface). In the end the third
 implementation actually turned out to be faster than BDB and is the one we
 use today.


By not having the relational layer we lose a lot in terms of attribute
data handling, or is that still somehow preserved in SDF?

Keep in mind, SQLite has gone through major revisions, some of them
bringing huge speed bumps in the process. That said, different folks
have different evaluation critieria -- I believe it was Steve W who
said that pretty much all he cared about was speed. For him it seems
SDF might work very well. For me, I don't mind sacrificing some speed
for ease and flexibility of related data storage, querying, and
retrieval. How can we achieve both?

 All this said, I'd really like to understand everyones requirements for this
 new format. If SDF fits thats great, if not thats ok too. We are always
 happy to contribute what we can to the community.

Thanks Bob. As you can see, we are still articulating the
requirements, and while we might be accused of requirement wanking,
design by committee might be better than no design at all.

I find two problems with Shapefiles -- one, that it is not in public
domain (I am not even sure of what licensing there is on it), and
while ESRI is not likely to pull a Unisys on us, it just is
philosophically better to free if possible. The second, more major
problem is Shapefile's antiquated data technology. DBF is a royal pain
in the derierre with its 10 char column name, no relational tables, no
indexing of data constraints. The geometry part of Shapefiles seems to
be pretty good and adequate; it is the data part that is the problem.


 Bob

 - Original Message -
 From: Michael P. Gerlek [EMAIL PROTECTED]
 To: [EMAIL PROTECTED]; OSGeo Discussions discuss@lists.osgeo.org;
 [EMAIL PROTECTED]
 Sent: Tuesday, November 13, 2007 12:23 PM
 Subject: RE: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new,open
 data format


   Regarding the suggestion that MapServer takes on this new format as
  the
   primary format:  I think this is way beyond the scope of what OSGeo
  should
   be doing.
 
  I agree with bitnerd.  If the MapServer team thinks this is a valuable
  and worthwhile format, they will adopt it at some point.  It would not
  be unreasonable for them to step back and see how thing progresses
  before deciding to spend their valuable ergs on it.  The burden is on
  the OpenShape people to show the idea is worthwhile and meritorious.
 
  (My two cents on the standards question: OSGeo is not a standards
  organization, but / however / on the other hand / nonetheless one of the
  reasons OSGeo exists is to foster such collaborations.  If some people
  want to try and develop something new like this, I'm all in favor of
  OSGeo offering mailing list and wiki space to help out.  Declaring this
  to be a standard effort, however, is probably premature in any case --
  more useful at this point to see the idea sketched out further, see
  who's interested, etc.)
 
  -mpg
  ___
  Discuss mailing list
  Discuss@lists.osgeo.org
  http://lists.osgeo.org/mailman/listinfo/discuss
 


 ___
 Discuss mailing list

Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-14 Thread Frank Warmerdam

P Kishor wrote:

I find two problems with Shapefiles -- one, that it is not in public
domain (I am not even sure of what licensing there is on it), and
while ESRI is not likely to pull a Unisys on us, it just is
philosophically better to free if possible. 


Puneet,

I think this is a red herring.  A file format is not normally considered
copyrightable.  Generally speaking the only way to control a file format
is through patents (for instance on compression methods or some fancy
spatial indexing) and through trademarks on the format name.  There is
no apparent such issues with shapefiles.  While the Shapefile format may
not have been developed by an open process, nor is it (as far as I know)
a dejure standard, but it is as open as any format.

I think we need to focus on the technical failings of existing formats,
and ensure a new format is widely usable (no strings attached).

There were also questions about SDF format.  As I see it, one failing of the
SDF format is that there isn't a published specification (that I'm aware of)
and it is a complex enough format that it would be challenging to build a
complete distinct library for the format (in Java for instance).  Also, the
format is really an evolving format.  But, I still think SDF is a good
candidate.

I think, so far, there isn't a strong enough motivation in the community
to develop a new format that does everything I want.  So (I think) a
green field design is unlikely to get past the standards wanking
stage that SteveC always goes out of his way to deride.

Best regards,
--
---+--
I set the clouds in motion - turn up   | Frank Warmerdam, [EMAIL PROTECTED]
light and sound - activate the windows | http://pobox.com/~warmerdam
and watch the world go round - Rush| President OSGeo, http://osgeo.org

___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-14 Thread P Kishor
On 11/14/07, David William Bitner [EMAIL PROTECTED] wrote:


 
 
 
  I never (I think I never did) argued that Shapefile is not open. I
  argued that it is not Free. I could be wrong.
 
 
  
 Here's the open published specification:
 http://www.esri.com/library/whitepapers/pdfs/shapefile.pdf

yes, I am aware of this, and have used it on and off over the past decade.


 Do what you will with it.  I don't know in this case what you imply by Free.
  Might ESRI make changes in the future that they don't publish?  Maybe, but
 at this version of shapefiles -- that are pervasive throughout the industry,
 I would doubt they would forgo backwards compatibility and this version of
 the specification is out there and free and open as far as I can tell.


ok, so there may or may not be an issue. I did bring it up, and so I
risk the discussion focusing on this aspect too much. Let's forget
about this aspect for now.

Shall we focus on the technical limitations of Shapefiles in order to
keep moving forward?

Many thanks,

Puneet.
___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


RE: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-14 Thread Michael P. Gerlek
 I find two problems with Shapefiles -- one, that it is not in public
 domain (I am not even sure of what licensing there is on it), and
 while ESRI is not likely to pull a Unisys on us, it just is
 philosophically better to free if possible. 

I don't see this as an issue at all -- legally speaking I don't think
ESRI has any grounds to attack anyone for using or implementing the
Shapefile format.  (and from a market standpoint too, it'd be a bad move
for them...)

 The second, more major
 problem is Shapefile's antiquated data technology. DBF is a royal pain
 in the derierre with its 10 char column name, no relational tables, no
 indexing of data constraints. The geometry part of Shapefiles seems to
 be pretty good and adequate; it is the data part that is the problem.

This, though, is more interesting.  There are some things in the
shapefile format that limit me today, and so I'm interested in
alternatives.

-mpg
___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-14 Thread Frank Warmerdam

Landon Blake wrote:

P.S. - This is probably a crazy idea, but has anyone ever considered
talking to ESRI about cooperating on an update Shapefile spec?


Landon,

I believe ESRI sees the file based geodatabase as filling roughly the
role that the Shapefile played in the ArcView 3.x days.  Of course it isn't
clear that it will be very open and interoperable with other software packages.

I can't imagine a upgrade to shapefiles that would satisfy all our
requirements that has anything but the name in common with the existing
shapefiles.  So why drag ESRI into it?

Best regards,
--
---+--
I set the clouds in motion - turn up   | Frank Warmerdam, [EMAIL PROTECTED]
light and sound - activate the windows | http://pobox.com/~warmerdam
and watch the world go round - Rush| President OSGeo, http://osgeo.org

___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


RE: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-14 Thread Landon Blake
Puneet wrote: Shall we focus on the technical limitations of Shapefiles
in order to keep moving forward?

I was going to add a couple of limitations in addition to the
limitations of attribute data brought on by use of a DBF file as a
storage container:

[1] No way to store simple topology.
[2] No way to store features with different schema and/or different
geometry representations in the same file.
[3] No way to store labeling or annotation.

I'm not sure if you would want to do all of this with a new file format,
but I know its limitations of the Shapefile format that I have run into
in the past.

Landon

P.S. - This is probably a crazy idea, but has anyone ever considered
talking to ESRI about cooperating on an update Shapefile spec?

-Original Message-
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of P Kishor
Sent: Wednesday, November 14, 2007 9:06 AM
To: [EMAIL PROTECTED]
Cc: OSGeo Discussions
Subject: Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new,open
data format

On 11/14/07, David William Bitner [EMAIL PROTECTED] wrote:


 
 
 
  I never (I think I never did) argued that Shapefile is not open. I
  argued that it is not Free. I could be wrong.
 
 
  
 Here's the open published specification:
 http://www.esri.com/library/whitepapers/pdfs/shapefile.pdf

yes, I am aware of this, and have used it on and off over the past
decade.


 Do what you will with it.  I don't know in this case what you imply by
Free.
  Might ESRI make changes in the future that they don't publish?
Maybe, but
 at this version of shapefiles -- that are pervasive throughout the
industry,
 I would doubt they would forgo backwards compatibility and this
version of
 the specification is out there and free and open as far as I can tell.


ok, so there may or may not be an issue. I did bring it up, and so I
risk the discussion focusing on this aspect too much. Let's forget
about this aspect for now.

Shall we focus on the technical limitations of Shapefiles in order to
keep moving forward?

Many thanks,

Puneet.
___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


Warning:
Information provided via electronic media is not guaranteed against defects 
including translation and transmission errors. If the reader is not the 
intended recipient, you are hereby notified that any dissemination, 
distribution or copying of this communication is strictly prohibited. If you 
have received this information in error, please notify the sender immediately.
___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-14 Thread Lucena, Ivan

Frank,

I was watching this PyTables video 
[http://www.carabos.com/videos/pytables-1-intro] and one thought came to 
my mind: HDF5 can easily be used to store and retrieve vector, raster 
and attribute tables. We would need to standardize a schema tough.


Best regards,

Ivan

PS. I am not that Ivan on that video.

Frank Warmerdam wrote:

Landon Blake wrote:

P.S. - This is probably a crazy idea, but has anyone ever considered
talking to ESRI about cooperating on an update Shapefile spec?


Landon,

I believe ESRI sees the file based geodatabase as filling roughly the
role that the Shapefile played in the ArcView 3.x days.  Of course it isn't
clear that it will be very open and interoperable with other software 
packages.


I can't imagine a upgrade to shapefiles that would satisfy all our
requirements that has anything but the name in common with the existing
shapefiles.  So why drag ESRI into it?

Best regards,

___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


RE: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-13 Thread Landon Blake
Puneet,

You wrote: Should be easy to transition to. By building the new format
on the
structure of the Shapefile format, and *in fact*, calling it open
shapefiles or some such thing, we indicate from its name that the
transition is not that revolutionary but is evolutionary. This,
hopefully, will bring some name-familiarity, and make the transition
less scary.

I really think you are going to run into problems using the Shapefile
as part of the trademark or name for any product not sold by ESRI. I
strongly recommend against this move. Let people adopt the
implementation of your idea for its merits, not for name recognition
that comes from another product line.

You wrote: ANSI standard C is still
that magic common denominator that compiles and works predictably on
most number of systems. I have a lot against Java, but those who love
Java should definitely work on tools for accessing and working with
this new format as it would only make the format more widely used and
adopted.

It sounds to me like you are really describing a tool. File formats are
written in a binary encoding or text, not in a programming language. If
you are designing a tool you can choose the programming language of your
choice, but be aware that this will limit the developers that adopt the
tool. This will be the case no matter what language you choose to use,
whether it is C, Java, or something else. 

If, in contrast, you are creating a file format, then programming
languages shouldn't really matter. Binary and text data can be accessed
by almost all programming languages.

I think you need to decide if you want a tool or a data format. It
sounds like you are shooting more for a spatial database written in the
C programming language that uses some form of the ESRI Shapefile as its
underlying data storage mechanism. To me that is a tool or piece of
software, not a format. But maybe I don't completely understand your
goal.

Landon


-Original Message-
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of P Kishor
Sent: Tuesday, November 13, 2007 8:35 AM
To: OSGeo Discussions
Subject: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new,open
data format

Thanks everyone, for responding. Here is my groundwork.

The new format --

- Should be fast. SQLite is plenty fast, and anything that simply
extends the Shapefile format to inject relational capabilities
should be pretty fast. It should definitely be faster than a
geodatabase format (such as PostGIS/ArcSDE) and perhaps even faster
than Shapefiles especially while accessing attribute data. DBF is
sequential, and searching for textual information is particularly
expensive. SQLite has been tuned to excellence. I have been working
with it for a few years now, and it really is an amazing product,
development community, support, and capabilities. That it is in public
domain makes for a transfat-free icing on the cake.

- Should be unencumbered by licenses and copyrights. Ideally, the new
format could also be put back into public domain. We want to remove
all encumbrances to encourage rapid and wide adoption.

- Should be a single file. Well, some like multiple files and some
like single files. We can achieve both objectives by using a
tar-gzipped packaging such as Apple tends to use for much of its stuff
(for example, its Pages wordprocessor uses a tgzipped xml file along
with other resources for icons and pictures and stuff). Or, if speed
is going to be affected because of gzipping and gunzipping, just a
package format (I have no idea if this is a Unix thing or a Mac OS
thing -- we, in the Mac world, call them packages... they appear like
files in the Finder, and like directories in the shell).

- Should be easy to transition to. By building the new format on the
structure of the Shapefile format, and *in fact*, calling it open
shapefiles or some such thing, we indicate from its name that the
transition is not that revolutionary but is evolutionary. This,
hopefully, will bring some name-familiarity, and make the transition
less scary.

- Frank mentions SQLite's lack of datatypes as an issue -- I guess
that is a matter of preference. I personally quite like that freedom
as it gives me, the application developer, complete control over what
goes where. SQLite actually does have now a few datatypes that it
respects, but doesn't complain about. Since all users will be
accessing the data via an application, as long as the application is
well defined, it should be fine.

- SQLite excels at one thing that it has been entrusted to do --
retrieve data that it has been entrusted with at extremely fast
speeds, and maintain ACID data integrity in case of a programmatic
catastrophe. The transactions themselves are worth their price of
admission, which, happily, happens to be zero.

- Langdon mentions Java support -- well, yes, use/work on SQLite JDBC.
I have been using it for a few days now and find it to be a pretty
competent conduit. Extend it, spatialize it. ANSI standard C

RE: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-13 Thread Jason Birch
Landon wrote:
---
I really think you are going to run into problems using the Shapefile
as part of the trademark or name for any product not sold by ESRI. I
strongly recommend against this move.
---

I'm not a lawyer, but I really doubt that shapefile is unique enough to
be subject to trademark.  Initial searches of TESS don't show up
anything either.

That said, I don't think that shapefile should be used anywhere in an
open format name _because_ it's too generic and will cause confusion.

As my initial blog entry mentioned, I had (have?) pretty high hopes for
SDF.  It's extremely fast (native rtree), contains multiple tables in a
single file, supports complex geometries (arcs, aggregates, etc),
includes native schema definition, etc...  However... while some initial
marketing materials marked SDF as an open format, I have been unable to
obtain any clarification from Autodesk on the file specification or
openness of SDF.  As well, there are some limitations; in the quest for
speed, SDF is only using the SQLite table access mechanisms, not the
entire relational layer (or something like that).  

For reference, most of the interesting parts of the SDF Provider,
including the Rtree implementation, are available here:

http://tinyurl.com/2b5en2

Jason
___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-13 Thread P Kishor
 on tools for accessing and working with
   this new format as it would only make the format more widely used
   and
   adopted.
  
   It sounds to me like you are really describing a tool. File
   formats are
   written in a binary encoding or text, not in a programming
   language. If
   you are designing a tool you can choose the programming language
   of your
   choice, but be aware that this will limit the developers that
   adopt the
   tool. This will be the case no matter what language you choose to
   use,
   whether it is C, Java, or something else.
  
   If, in contrast, you are creating a file format, then programming
   languages shouldn't really matter. Binary and text data can be
   accessed
   by almost all programming languages.
  
   I think you need to decide if you want a tool or a data format. It
   sounds like you are shooting more for a spatial database written
   in the
   C programming language that uses some form of the ESRI Shapefile
   as its
   underlying data storage mechanism. To me that is a tool or piece of
   software, not a format. But maybe I don't completely understand your
   goal.
  
  
   well, I am, frankly confused.
  
   I was quite convinced I wasn't describing a tool but was describing
   a format. Of course, to describe the format, I positioned it on the
   format (the SQLite-compatible format) used and popularized by a
   tool (SQLite, the library, which happens to be written in C). In my
   mind, having the data format based on SQLite *format* for its
   relational attribute handling was the real winner. In that sense,
   perhaps I conflated the format and the tool. I am not well versed in
   these things to I am probably already walking on thin ice, but that
   shouldn't stop others.
  
   So, forget that I mentioned C and Java... let's just concentrate on a
   way of laying out data on a disk that is not too dissimilar from how
   Shapefile data are laid out, except that we utilize the
   SQLite-compatible binary format for relational data handling, so that
   SQLite-enabled spatial tools can access this new format.
  
   And, put this format into public domain.
  
  
  
   -Original Message-
   From: [EMAIL PROTECTED]
   [mailto:[EMAIL PROTECTED] On Behalf Of P Kishor
   Sent: Tuesday, November 13, 2007 8:35 AM
   To: OSGeo Discussions
   Subject: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new,open
   data format
  
   Thanks everyone, for responding. Here is my groundwork.
  
   The new format --
  
   - Should be fast. SQLite is plenty fast, and anything that simply
   extends the Shapefile format to inject relational capabilities
   should be pretty fast. It should definitely be faster than a
   geodatabase format (such as PostGIS/ArcSDE) and perhaps even faster
   than Shapefiles especially while accessing attribute data. DBF is
   sequential, and searching for textual information is particularly
   expensive. SQLite has been tuned to excellence. I have been working
   with it for a few years now, and it really is an amazing product,
   development community, support, and capabilities. That it is in
   public
   domain makes for a transfat-free icing on the cake.
  
   - Should be unencumbered by licenses and copyrights. Ideally, the
   new
   format could also be put back into public domain. We want to remove
   all encumbrances to encourage rapid and wide adoption.
  
   - Should be a single file. Well, some like multiple files and some
   like single files. We can achieve both objectives by using a
   tar-gzipped packaging such as Apple tends to use for much of its
   stuff
   (for example, its Pages wordprocessor uses a tgzipped xml file along
   with other resources for icons and pictures and stuff). Or, if speed
   is going to be affected because of gzipping and gunzipping, just a
   package format (I have no idea if this is a Unix thing or a Mac OS
   thing -- we, in the Mac world, call them packages... they appear
   like
   files in the Finder, and like directories in the shell).
  
   - Should be easy to transition to. By building the new format on the
   structure of the Shapefile format, and *in fact*, calling it open
   shapefiles or some such thing, we indicate from its name that the
   transition is not that revolutionary but is evolutionary. This,
   hopefully, will bring some name-familiarity, and make the transition
   less scary.
  
   - Frank mentions SQLite's lack of datatypes as an issue -- I guess
   that is a matter of preference. I personally quite like that freedom
   as it gives me, the application developer, complete control over
   what
   goes where. SQLite actually does have now a few datatypes that it
   respects, but doesn't complain about. Since all users will be
   accessing the data via an application, as long as the application is
   well defined, it should be fine.
  
   - SQLite excels at one thing that it has been entrusted to do --
   retrieve data that it has been entrusted with at extremely fast

Re: [OSGeo-Discuss] Re: idea for an OSGeo project -- a new, open data format

2007-11-13 Thread David William Bitner
 of the trademark or name for any product not sold by ESRI.
 I
strongly recommend against this move. Let people adopt the
implementation of your idea for its merits, not for name
 recognition
that comes from another product line.
   
Good enough point to keep in mind, but not to get hung up over
 enough
to entangle us. Suggestions for names of the data format can be a
project in itself. open spatial data format or its variations
 could
be chosen. Still, point taken.
   
   
You wrote: ANSI standard C is still
that magic common denominator that compiles and works predictably
 on
most number of systems. I have a lot against Java, but those who
love
Java should definitely work on tools for accessing and working
 with
this new format as it would only make the format more widely used
and
adopted.
   
It sounds to me like you are really describing a tool. File
formats are
written in a binary encoding or text, not in a programming
language. If
you are designing a tool you can choose the programming language
of your
choice, but be aware that this will limit the developers that
adopt the
tool. This will be the case no matter what language you choose to
use,
whether it is C, Java, or something else.
   
If, in contrast, you are creating a file format, then programming
languages shouldn't really matter. Binary and text data can be
accessed
by almost all programming languages.
   
I think you need to decide if you want a tool or a data format. It
sounds like you are shooting more for a spatial database written
in the
C programming language that uses some form of the ESRI Shapefile
as its
underlying data storage mechanism. To me that is a tool or piece
 of
software, not a format. But maybe I don't completely understand
 your
goal.
   
   
well, I am, frankly confused.
   
I was quite convinced I wasn't describing a tool but was
 describing
a format. Of course, to describe the format, I positioned it on
 the
format (the SQLite-compatible format) used and popularized by a
tool (SQLite, the library, which happens to be written in C). In
 my
mind, having the data format based on SQLite *format* for its
relational attribute handling was the real winner. In that sense,
perhaps I conflated the format and the tool. I am not well versed
 in
these things to I am probably already walking on thin ice, but that
shouldn't stop others.
   
So, forget that I mentioned C and Java... let's just concentrate on
 a
way of laying out data on a disk that is not too dissimilar from
 how
Shapefile data are laid out, except that we utilize the
SQLite-compatible binary format for relational data handling, so
 that
SQLite-enabled spatial tools can access this new format.
   
And, put this format into public domain.
   
   
   
-Original Message-
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of P Kishor
Sent: Tuesday, November 13, 2007 8:35 AM
To: OSGeo Discussions
Subject: [OSGeo-Discuss] Re: idea for an OSGeo project -- a
 new,open
data format
   
Thanks everyone, for responding. Here is my groundwork.
   
The new format --
   
- Should be fast. SQLite is plenty fast, and anything that simply
extends the Shapefile format to inject relational capabilities
should be pretty fast. It should definitely be faster than a
geodatabase format (such as PostGIS/ArcSDE) and perhaps even
 faster
than Shapefiles especially while accessing attribute data. DBF is
sequential, and searching for textual information is particularly
expensive. SQLite has been tuned to excellence. I have been
 working
with it for a few years now, and it really is an amazing product,
development community, support, and capabilities. That it is in
public
domain makes for a transfat-free icing on the cake.
   
- Should be unencumbered by licenses and copyrights. Ideally, the
new
format could also be put back into public domain. We want to
 remove
all encumbrances to encourage rapid and wide adoption.
   
- Should be a single file. Well, some like multiple files and some
like single files. We can achieve both objectives by using a
tar-gzipped packaging such as Apple tends to use for much of its
stuff
(for example, its Pages wordprocessor uses a tgzipped xml file
 along
with other resources for icons and pictures and stuff). Or, if
 speed
is going to be affected because of gzipping and gunzipping, just a
package format (I have no idea if this is a Unix thing or a Mac OS
thing -- we, in the Mac world, call them packages... they appear
like
files in the Finder, and like directories in the shell).
   
- Should be easy to transition to. By building the new format on
 the
structure of the Shapefile format, and *in fact