Re: [JPP-Devel] Memory issue with writing to PostGIS and with reading too

Michaël Michaud Thu, 03 Nov 2016 16:41:43 -0700

Hi Jukka,

Wow, this is a huge table. How much memory do you have on your JVM ?

Indeed, PostGISDataSource already uses batches for bulk uploads (whichexplains why you get yourdata into your database) but not for queries. Uploads are immediatelyfollowed by a new query (to

get the database feature id). That's where you get the OOM Exception.
(in StackTrace you can see reloadDataFromDataStore, then executeQuery)


At the end of the loading phase, I think we can have as much 3 datasets :
- the old FeatureCollection

- the new FeatureCollection nearly completed (if we remove the oldcollection before getting the new one, we take the risk to loose all)- the raw ResultSet from the database which is not yet entirelyconverted to the new FeatureCollection

I think the only place where we can save a bit of memory by usingbatches (as you suggested) is in the transformationfrom ResutSet to the new FeatureCollection (but we still will have tohandle more than two times the dataset size).


Michaël


Le 03/11/2016 à 17:05, Rahkonen Jukka (MML) a écrit :

Hi,
I tried to write a large layer with 91 million vertices into PostGISwith the Save to PostGIS (new) driver. It took a few hours and endedwith Out of memory error:
java.lang.OutOfMemoryError: Java heap space

at com.vividsolutions.jts.io.WKBReader.hexToBytes(WKBReader.java:70)
atcom.vividsolutions.jump.datastore.spatialdatabases.SpatialDatabasesValueConverterFactory$WKBGeometryValueConverter.getValue(SpatialDatabasesValueConverterFactory.java:96)
atcom.vividsolutions.jump.datastore.spatialdatabases.SpatialDatabasesResultSetConverter.getFeature(SpatialDatabasesResultSetConverter.java:52)
atcom.vividsolutions.jump.datastore.spatialdatabases.SpatialDatabasesFeatureInputStream.getFeature(SpatialDatabasesFeatureInputStream.java:99)
atcom.vividsolutions.jump.datastore.spatialdatabases.SpatialDatabasesFeatureInputStream.readNext(SpatialDatabasesFeatureInputStream.java:95)
atcom.vividsolutions.jump.io.BaseFeatureInputStream.hasNext(BaseFeatureInputStream.java:31)
atorg.openjump.core.ui.plugin.datastore.postgis2.PostGISDataStoreDataSource.createFeatureCollection(PostGISDataStoreDataSource.java:93)
atorg.openjump.core.ui.plugin.datastore.WritableDataStoreDataSource.access$000(WritableDataStoreDataSource.java:38)
atorg.openjump.core.ui.plugin.datastore.WritableDataStoreDataSource$1.executeQuery(WritableDataStoreDataSource.java:160)
atorg.openjump.core.ui.plugin.datastore.WritableDataStoreDataSource$1.executeQuery(WritableDataStoreDataSource.java:171)
atorg.openjump.core.ui.plugin.datastore.WritableDataStoreDataSource.reloadDataFromDataStore(WritableDataStoreDataSource.java:543)
atorg.openjump.core.ui.plugin.datastore.WritableDataStoreDataSource.access$500(WritableDataStoreDataSource.java:38)
atorg.openjump.core.ui.plugin.datastore.WritableDataStoreDataSource$1.executeUpdate(WritableDataStoreDataSource.java:228)
atcom.vividsolutions.jump.workbench.datasource.AbstractSaveDatasetAsPlugIn.run(AbstractSaveDatasetAsPlugIn.java:28)
atcom.vividsolutions.jump.workbench.ui.task.TaskMonitorManager$TaskWrapper.run(TaskMonitorManager.java:152)
at java.lang.Thread.run(Unknown Source)
However, there is a new table in the database and it may even containall the data. I could not check that because OpenJUMP runs also out ofmemory when I try to read the table with Run Datastore Query. I canload the same data from JML so it feels like OpenJUMP puts all thedata from the SQL query to memory and converts it then to OpenJUMPfeautures which requires perhaps two times more memory. I admit thatthis dataset is quite big but I wonder if reading data from PostGIScould be made more memory-savvy perhaps by reading data in chunks withrepeated paged requests “limit 10000 offset xxx”.
-Jukka Rahkonen-



------------------------------------------------------------------------------
Developer Access Program for Intel Xeon Phi Processors
Access to Intel Xeon Phi processor-based developer platforms.
With one year of Intel Parallel Studio XE.
Training and support from Colfax.
Order your platform today. http://sdm.link/xeonphi


_______________________________________________
Jump-pilot-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/jump-pilot-devel

------------------------------------------------------------------------------
Developer Access Program for Intel Xeon Phi Processors
Access to Intel Xeon Phi processor-based developer platforms.
With one year of Intel Parallel Studio XE.
Training and support from Colfax.
Order your platform today. http://sdm.link/xeonphi

_______________________________________________
Jump-pilot-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/jump-pilot-devel

Re: [JPP-Devel] Memory issue with writing to PostGIS and with reading too

Reply via email to