Re: [Assimilation] Thinking about adding Postgres to the Assimilation project

Welch, Bill Fri, 09 Feb 2018 13:27:55 -0800

Just intuition here, but embedding postrges inside Assimilation seems heavy for 
just JSON. Of course, you already embed neo4j so you know how to handle having 
a dependency on a large subsystem.


Then, there's mongodb vs postgres vs ...:
https://www.sisense.com/blog/postgres-vs-mongodb-for-storing-json-data/
https://www.quora.com/Which-is-better-storing-json-objects-in-json-files-in-Redis-or-MongoDB-RethinkDB

On 9/2/18, 10:05, "Assimilation on behalf of Alan Robertson" 
<assimilation-boun...@lists.community.tummy.com on behalf of al...@unix.sh> 
wrote:

    EXTERNAL EMAIL – Use caution with any links or file attachments.
    
    This data is generated by a variety of shell scripts that do discovery -  
potentially dozens of them - and each is different. Some of the most critical 
data is decomposed to attributes - but not most of it.
    
    -- 
      Alan Robertson
      al...@unix.sh
    
    On Fri, Feb 9, 2018, at 2:58 AM, Michael Hunger wrote:
    > I think this is ok. 
    > I wished we had full document support yet.  
    > 
    > I know that pg has really good jsonb support, so go for it. 
    > 
    > Did you ever try to destrucure the data into properties? Not sure how 
    > deeply nested it is? And leave off all that are just defaults 
    > 
    > Von meinem iPhone gesendet
    > 
    > > Am 09.02.2018 um 04:10 schrieb Alan Robertson <al...@unix.sh>:
    > > 
    > > Hi,
    > > 
    > > There is one set of data that when I insert it into Neo4j - it's 
really, really slow. It's discovery data - which is JSON - and sometimes very 
large - a few megabytes. Many of them are smallish, but having items a few 
kilobytes is common, and dozens of kilobytes is also common, and some few 
things are in the megabyte+ range. [Because of compression, I can send up to 3 
megabytes of this JSON over UDP].
    > > 
    > > There are a few things I can do with Neo4j to make inserting it faster, 
but I don't think a lot -- and when I get done, the data is very hard to query 
against (it involves regexes against unindexed data, and is a performance 
nightmare).
    > > 
    > > Postgres has JSON support, and it has real transactions and a 
reputation for being very solid. I did some benchmarking and it is a a couple 
of orders of magnitude faster than Neo4j with both of them untuned. In 
addition, Postgres JSON (jsonb) can have indexes over the JSON information - 
greatly improving the query capabilities over what Neo4j can do for this same 
data.
    > > 
    > > I'm not thinking about doing anything except moving this one class of 
data to Postgres. This particular class of data is also idempotent, which has 
advantages when you have multiple databases involved...
    > > 
    > > Since this particular type of data is its own object in the Python, 
having it be in Postgres wouldn't likely be horrible to implement.
    > > 
    > > If I'm going to do this in the next year or two, it makes sense to 
couple it with the rest of the backwards-incompatible changes I'm already 
putting into release 2.
    > > 
    > > Does anyone think this is a show-stopper to use two databases?
    > > 
    > > -- 
    > >  Alan Robertson
    > >  al...@unix.sh
    _______________________________________________
    Assimilation mailing list - Discovery-Driven Monitoring
    Assimilation@lists.community.tummy.com
    http://lists.community.tummy.com/cgi-bin/mailman/listinfo/assimilation
    http://assimmon.org/
    

Notice:  This e-mail message, together with any attachments, contains
information of Merck & Co., Inc. (2000 Galloping Hill Road, Kenilworth,
New Jersey, USA 07033), and/or its affiliates Direct contact information
for affiliates is available at 
http://www.merck.com/contact/contacts.html) that may be confidential,
proprietary copyrighted and/or legally privileged. It is intended solely
for the use of the individual or entity named on this message. If you are
not the intended recipient, and have received this message in error,
please notify us immediately by reply e-mail and then delete it from 
your system.
_______________________________________________
Assimilation mailing list - Discovery-Driven Monitoring
Assimilation@lists.community.tummy.com
http://lists.community.tummy.com/cgi-bin/mailman/listinfo/assimilation
http://assimmon.org/

Re: [Assimilation] Thinking about adding Postgres to the Assimilation project

Reply via email to