Re: [OSGeo-Discuss] any spatial databases for high performance geo-computing

2014-05-22 Thread Andrew Ross

Hi Shuai, Everyone

It's worth a look at a few projects @ LocationTech too. There's a nice 
community growing around these.


GeoTrellis http://www.locationtech.org/projects/technology.geotrellis, 
Apache v2 License, Intro Article 
http://www.eclipse.org/community/eclipse_newsletter/2014/march/article4.php, 
Video http://youtu.be/aS8BAmu9daU, Discussion list 
https://www.locationtech.org/mailman/listinfo/geotrellis-dev


GeoMesa http://www.locationtech.org/projects/technology.geomesa, 
Apache v2 License, Intro Article 
http://www.eclipse.org/community/eclipse_newsletter/2014/march/article3.php, 
Video http://youtu.be/JsQiOuGGWds, Discussion list 
https://www.locationtech.org/mailman/listinfo/geomesa-users


GeoJinni http://www.locationtech.org/proposals/geojinni (formerly 
Spatial Hadoop), Video 
http://www.youtube.com/watch?v=A3Q2XlGvYcUfeature=share, discussion 
list coming soon


Glad to help connect anyone who might be interested.

Kind regards,

Andrew

On 19/05/14 18:04, Zhang, Shuai wrote:

Hi All,

sorry for asking, but what do you think is a good choice of spatial database 
for high performance geo-computing?

In some high performance computing scenarios, data size tends to be huge, and a 
bunch of computer clusters work together with high throughput and tense 
computation. sometimes we use parallel filesystems like lustre, gfs, hdfs to 
handle specific problems but what if a spatial database?

I explored some of postgresql cluster solutions, such as streaming replication, 
pgpool, slony and so on. I think most of them are designed for failover, and 
they might not be able to stand up with the huge data size and high performance 
demands. the case is quite alike in oracle and db2 spatial, i think.

so any suggestions for projects aiming to build a distributed and parallel 
spatial database running on a cluster?

Thanks,
shuai
___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss




___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss

Re: [OSGeo-Discuss] any spatial databases for high performance geo-computing

2014-05-20 Thread Peter Baumann
you might look at rasdaman running on top of PostgreSQL (www.rasdaman.org). 
Supports n-D arrays + an SQL-style array query language.

-Peter



On 05/20/2014 12:04 AM, Zhang, Shuai wrote:

Hi All,

sorry for asking, but what do you think is a good choice of spatial database 
for high performance geo-computing?

In some high performance computing scenarios, data size tends to be huge, and a 
bunch of computer clusters work together with high throughput and tense 
computation. sometimes we use parallel filesystems like lustre, gfs, hdfs to 
handle specific problems but what if a spatial database?

I explored some of postgresql cluster solutions, such as streaming replication, 
pgpool, slony and so on. I think most of them are designed for failover, and 
they might not be able to stand up with the huge data size and high performance 
demands. the case is quite alike in oracle and db2 spatial, i think.

so any suggestions for projects aiming to build a distributed and parallel 
spatial database running on a cluster?

Thanks,
shuai
___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


--
Dr. Peter Baumann
 - Professor of Computer Science, Jacobs University Bremen
   www.faculty.jacobs-university.de/pbaumann
   mail: p.baum...@jacobs-university.de
   tel: +49-421-200-3178, fax: +49-421-200-493178
 - Executive Director, rasdaman GmbH Bremen (HRB 26793)
   www.rasdaman.com, mail: baum...@rasdaman.com
   tel: 0800-rasdaman, fax: 0800-rasdafax, mobile: +49-173-5837882
Si forte in alienas manus oberraverit hec peregrina epistola incertis ventis 
dimissa, sed Deo commendata, precamur ut ei reddatur cui soli destinata, nec preripiat 
quisquam non sibi parata. (mail disclaimer, AD 1083)


___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


[OSGeo-Discuss] any spatial databases for high performance geo-computing

2014-05-19 Thread Zhang, Shuai
Hi All,

sorry for asking, but what do you think is a good choice of spatial database 
for high performance geo-computing?

In some high performance computing scenarios, data size tends to be huge, and a 
bunch of computer clusters work together with high throughput and tense 
computation. sometimes we use parallel filesystems like lustre, gfs, hdfs to 
handle specific problems but what if a spatial database?

I explored some of postgresql cluster solutions, such as streaming replication, 
pgpool, slony and so on. I think most of them are designed for failover, and 
they might not be able to stand up with the huge data size and high performance 
demands. the case is quite alike in oracle and db2 spatial, i think.

so any suggestions for projects aiming to build a distributed and parallel 
spatial database running on a cluster?

Thanks,
shuai
___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


Re: [OSGeo-Discuss] any spatial databases for high performance geo-computing

2014-05-19 Thread Alex Mandel
On 05/19/2014 03:04 PM, Zhang, Shuai wrote:
 Hi All,
 
 sorry for asking, but what do you think is a good choice of spatial database 
 for high performance geo-computing?
 
 In some high performance computing scenarios, data size tends to be huge, and 
 a bunch of computer clusters work together with high throughput and tense 
 computation. sometimes we use parallel filesystems like lustre, gfs, hdfs to 
 handle specific problems but what if a spatial database?
 
 I explored some of postgresql cluster solutions, such as streaming 
 replication, pgpool, slony and so on. I think most of them are designed for 
 failover, and they might not be able to stand up with the huge data size and 
 high performance demands. the case is quite alike in oracle and db2 spatial, 
 i think.
 
 so any suggestions for projects aiming to build a distributed and parallel 
 spatial database running on a cluster?
 
 Thanks,
 shuai

I haven't used but have a seen a few Hadoop implementations. If you do
research on Sharding that's the kind of db where the data is split
across nodes not redundantly.

http://www.nathankerr.com/projects/parallel-gis-processing/alternative_approaches_to_parallel_gis_processing.html

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3814183/

Then of course depending on your needs there are plenty of MPI
compatible libraries in various languages.

Thanks,
Alex

___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss


Re: [OSGeo-Discuss] any spatial databases for high performance geo-computing

2014-05-19 Thread Brent Wood
Tools like Hadoop, Neteeza  Teradata support very large databases  are fully 
spatially aware, generally through custom re-implementations of FOSS tools like 
GDAL.

http://www10.giscafe.com/nbc/articles/view_article.php?articleid=590803
http://www.teradata.com.au/products-and-services/teradata-geospatial

or in the FOSS arena: 

http://spatialhadoop.cs.umn.edu/

Brent Wood



 
 

On 05/19/2014 03:04 PM, Zhang, Shuai wrote:
 Hi All,
 
 sorry for asking, but what do you think is a good choice of spatial database 
 for high performance geo-computing?
 
 In some high performance computing scenarios, data size tends to be huge, and 
 a bunch of computer clusters work together with high throughput and tense 
 computation. sometimes we use parallel filesystems like lustre, gfs, hdfs to 
 handle specific problems but what if a spatial database?
 
 I explored some of postgresql cluster solutions, such as streaming 
 replication, pgpool, slony and so on. I think most of them are designed for 
 failover, and they might not be able to stand up with the huge data size and 
 high performance demands. the case is quite alike in oracle and db2 spatial, 
 i think.
 
 so any suggestions for projects aiming to build a distributed and parallel 
 spatial database running on a cluster?
 
 Thanks,
 shuai___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss

Re: [OSGeo-Discuss] any spatial databases for high performance geo-computing

2014-05-19 Thread Schuyler Erle
Two options for distributed spatial databasing, based on Solr's spatial types:

  ElasticSearch: 
http://www.elasticsearchtutorial.com/spatial-search-tutorial.html

  Riak: 
http://www.christopherbiscardi.com/2014/02/07/geospatial-indexing-with-riak-search-2-0-yokozunasolr/

I've had great success with ElasticSearch and spatial search, but not in a 
clustered configuration. Riak is designed from the ground up to be a 
distributed database, but its Solr support is quite new and still in beta. 
Neither one support particularly complex geometric operations AFAIK, so YMMV. 
Good luck!

SDE

On May 19, 2014, at 3:04 PM, Zhang, Shuai sh...@illinois.edu wrote:

 Hi All,
 
 sorry for asking, but what do you think is a good choice of spatial database 
 for high performance geo-computing?
 
 In some high performance computing scenarios, data size tends to be huge, and 
 a bunch of computer clusters work together with high throughput and tense 
 computation. sometimes we use parallel filesystems like lustre, gfs, hdfs to 
 handle specific problems but what if a spatial database?
 
 I explored some of postgresql cluster solutions, such as streaming 
 replication, pgpool, slony and so on. I think most of them are designed for 
 failover, and they might not be able to stand up with the huge data size and 
 high performance demands. the case is quite alike in oracle and db2 spatial, 
 i think.
 
 so any suggestions for projects aiming to build a distributed and parallel 
 spatial database running on a cluster?
 
 Thanks,
 shuai
 ___
 Discuss mailing list
 Discuss@lists.osgeo.org
 http://lists.osgeo.org/mailman/listinfo/discuss

___
Discuss mailing list
Discuss@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/discuss