On Monday, May 4, 2015 at 3:32:39 PM UTC+3, Thomas Mueller wrote:
>
> Hi,
>
> > But already for many years  the spatial support was provided by a 
> combination of geodb + hatbox libraries and integration in GeoTools  world
>
> Yes. However, those don't use the built-in R tree. Do they use an external 
> R tree?
>

Hatbox provides R-Tree.  It is based on H2 infrastructure (some auxiliary 
table is created where nodes are stored).  What is "built-in" R-Tree would 
mean?

>
> > All these was done on BLOB type where a geometry WKB is stored. 
>
> A small BLOB is stored inline, so it might not be that bad.
>
> >  any access of BLOB value makes a copy of it
>
Yes, that is what I meant.  For certain scenarios it has significant 
performance issues anyway as I experienced. Millions of temporary LOB 
entries when you have just dozens of thousands of spatial records and some 
not very optimized spatial query.

>
> Access is making a copy of the reference of a large BLOB.
>
> > Isn't it  2Gb is a limit for binary types?
>
> I reality the problem is the memory usage (heap memory).
>

That  should not be a problem.  Typically in GIS application the biggest 
result  sets extracted from the database are not hold or cached long time 
but rather used to render spatial features and immediately any references 
released in JVM. Whether it's BLOB or BINARY anyway it's loaded to memory 
to parse Geometry from WKB.  May be with VARBINARY a  bit more data is kept 
during short period  of time in memory than would be with BLOBs.  May be I 
would consider an approach in ValueGeometry  that bytes are kept just until 
geometry is requested, then lazily Geometry is parsed and bytes are 
released. So that at any point of time whether bytes are hold or Geometry 
as an object. From bytes to Geometry, from Geometry to bytes when necessary.

In SELECT scenarious bytes are needed until Geometry object is created and 
then it is used outside of result set or locally during command execution. 
I am not sure how relevant in scope of the whole database infrastructure 
does this sound. 
Am I right that until local result set data structure is fully composed it 
is not returned to caller?  Then if result set is huge then all bytes are 
anyway kept in memory until result set is delivered and the client starts 
to request Geometry objects when bytes would be cleaned...


Vitali.


> Regards,
> Thomas
>
>
>
> On Sun, May 3, 2015 at 9:51 PM, Vitali <[email protected] <javascript:>> 
> wrote:
>
>> Hello.
>>
>> I would like to share some   observations.  Recently H2 got a Geometry 
>> type, logic around it seems is growing, also some extra tiers like H2GIS 
>> are under development.  All together this seems as a future of spatial 
>> support in H2.  But already for many years  the spatial support was 
>> provided by a combination of geodb + hatbox libraries and integration in 
>> GeoTools  world (as  H2 data store  interface for storing/managing spatial 
>> features with geometries).
>> All these was done on BLOB type where a geometry WKB is stored. 
>>
>> BLOB became completely useless as a type for handling WKB of geometries. 
>> Because of  this change that any access of BLOB value makes a copy of it. 
>> HATBOX and GEODB libs based  on JTS library  provide functions to work with 
>> WKB. But any call of these functions makes a read of BLOB value which makes 
>> a copy in memory.  Some spatial conflation operations being not-optimized 
>> (having polynomial complexity with applying spatial predicates between any 
>> combination of input geometries from 2 tables e.g.)  now have a 
>> catastrophic performance and memory consumption.  Cases where  old H2 just 
>> worked 10 secods performing some kind of spatial operation between 2 layers 
>> (tables) now runs 2 hours , 3Gb of database file (instead of 400Mb 
>> normally) and outofmemory error finally. And long cleanings of temporary 
>> LOB storage on app start, app close, transaction commit after such 
>> operations.
>>
>> I understand real reasons of this BLOB  copying approach.  But the 
>> conclusion is that BLOB is not a right type for geometries. In typical GIS 
>> (like UDIG) thousands of records are extracted every second for multiple 
>> layers during rendering and other types of requests need geometries. Now 
>> BLOB became inefficient.
>>
>> Alltogether very likely I will do refactoring of geodb, hatbox and 
>> GeoTools to work with GEOMETRY type which is basically VARBINARY kind of 
>> which means WKB is just read to memory. But it is what usually is needed to 
>> GIS app - to get a geometry almost every time when data is read. Also 
>> because  JTS geometry is lazily cached in ValueGeometry various logic in H2 
>> (like custom spatial functions call multiple times) gets benefits.  I think 
>> H2GIS toolkit more or less uses this approach already.
>>
>> The only concern is that are there any limitations for cases like "lake 
>> boundary" that consists from hundreds of  thousands of vertices.. Isn't it  
>> 2Gb is a limit for binary types? Then it's fine..  But how do older 
>> PageStore and modern MVStore handle this type? Any performance issues?
>>
>> Vitali.
>>
>>
>>  -- 
>> You received this message because you are subscribed to the Google Groups 
>> "H2 Database" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to [email protected] <javascript:>.
>> To post to this group, send email to [email protected] 
>> <javascript:>.
>> Visit this group at http://groups.google.com/group/h2-database.
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>

-- 
You received this message because you are subscribed to the Google Groups "H2 
Database" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/h2-database.
For more options, visit https://groups.google.com/d/optout.

Reply via email to