Re: [OSM-dev] OSM with Hadoop

2015-02-18 Thread Martin Koppenhoefer
2015-02-18 11:46 GMT+01:00 Imre Samu : > "WE JUST IMPORTED THE WORLD. AND ITS HISTORY. IN 47 MINUTES." [ 30 > novembre 2013 ] yes, but before we get too enthusiastic, they supposedly did it on this machine: "Our main database here is a 8 nodes (384 GB RAM, 96 CPU cores, 96 hard drives and 96

Re: [OSM-dev] OSM with Hadoop

2015-02-18 Thread Imre Samu
>An obvious evaluation target for me would be >http://en.wikipedia.org/wiki/Greenplum from the news: Pivotal open sources its Hadoop and *Greenplum tech*, and then some ( February 17, 2015 ) "Pivotal announced today groundbreaking product enhancements to Pivotal Big Data Suite, including plan

Re: [OSM-dev] OSM with Hadoop

2015-02-12 Thread Stefan Keller
Hi Stephen, Though being biased(*) I'm observing alternatives to PostgreSQL all the time. An obvious evaluation target for me would be http://en.wikipedia.org/wiki/Greenplum And on a higher level I would reflect about different architectures, e.g. like applying this architecture design pattern ht

[OSM-dev] OSM with Hadoop

2015-01-25 Thread Stephen Knox
Just wanted to say thanks to everyone for their thoughts and opinions on my proposal - it seems to have generated some interesting discussion at least! I hope I can produce something that will be interesting for the community and look at it from an OSM standpoint as well as a "look at all this coo

Re: [OSM-dev] OSM with Hadoop

2015-01-14 Thread Frederik Ramm
Hi, On 01/14/2015 11:16 AM, Paweł Paprota wrote: > Main central server with a single multi-TB-sized database > somehow screams single point of failure to me... ... until you learn that there's actually an active replication, based on technology that has been tried and proven. I'm not saying brus

Re: [OSM-dev] OSM with Hadoop

2015-01-14 Thread Paweł Paprota
Playing devil's advocate for a minute, honestly I'm not sure I would be as content with the current OSM data storage and processing architecture. Main central server with a single multi-TB-sized database somehow screams single point of failure to me... Add to that the growth rate (which is pretty c

Re: [OSM-dev] OSM with Hadoop

2015-01-12 Thread Frederik Ramm
Stephen, previous discussions of combining NoSQL *or* massively parallel storage with OSM were often less driven by the approach "let's investigate solid future storage models for OSM" but rather by "hey there's a cool new technology I'd like to play with and I'm sure it can somehow work with O

Re: [OSM-dev] OSM with Hadoop

2015-01-12 Thread Markus Mayr
A very, very interesting and necessary project! Hadoop seems to be the mature choice. I have done some experiments based on this CouchDB scripts: http://wiki.openstreetmap.org/wiki/OSMCouch (there exists also an older modified imposm which can export directly to CouchDB) While this kind of storage

Re: [OSM-dev] OSM with Hadoop

2015-01-09 Thread Paweł Paprota
Hi Stephen, I read your document - it sounds like an interesting project! Even more so that, as you can see, not much has been done in this area so far. Pretty much every approach to OSM data storage/handling I've seen is Postgres + some variation of a database schema optimized towards a specific

Re: [OSM-dev] OSM with Hadoop

2015-01-08 Thread Pieren
On Fri, Jan 2, 2015 at 7:38 PM, Stephen Knox wrote: > So firstly I am wondering if I am missing any previous posts / research on > the topic? Here are my findings about hadoop quoted in MLs: https://lists.openstreetmap.org/pipermail/dev/2009-August/016554.html https://lists.openstreetmap.org/pip

[OSM-dev] OSM with Hadoop

2015-01-02 Thread Stephen Knox
Hi all, I am preparing for a GIS MSc thesis at Kingston University (London) and since I am very interested in OSM (https://www.openstreetmap.org/user/stev) and big data technologies, I have chosen to base my dissertation on Hadoop and OSM. I have searched the forum archives for previous reference