[GitHub] metron issue #940: METRON-1460: Create a complementary non-split-join enrich...

2018-03-02 Thread cestella
Github user cestella commented on the issue: https://github.com/apache/metron/pull/940 Just FYI, as part of the performance experimentation in the lab here, we found that one major impediment to scale was the guava cache in this topology when the size of the cache becomes non-trivial

[GitHub] metron pull request #947: METRON-1467: Replace guava caches in places where ...

2018-03-02 Thread cestella
GitHub user cestella opened a pull request: https://github.com/apache/metron/pull/947 METRON-1467: Replace guava caches in places where the keyspace might be large ## Contributor Comments Based on the performance tuning exercise as part of METRON-1460, guava has difficulties

[GitHub] metron issue #946: METRON-1465:Support for Elasticsearch X-pack

2018-03-02 Thread wardbekker
Github user wardbekker commented on the issue: https://github.com/apache/metron/pull/946 Steps to test: - build centos vagrant dev build box - vagrant ssh - sudo /usr/share/elasticsearch/bin/elasticsearch-plugin install x-pack - sudo

[GitHub] metron issue #940: METRON-1460: Create a complementary non-split-join enrich...

2018-03-02 Thread cestella
Github user cestella commented on the issue: https://github.com/apache/metron/pull/940 @arunmahadevan Thanks for chiming in Arun. I would say that most of the enrichment work is I/O bound and we try to avoid it whenever possible with a a time-evicted LRU cache in front of the

[GitHub] metron issue #940: METRON-1460: Create a complementary non-split-join enrich...

2018-03-02 Thread arunmahadevan
Github user arunmahadevan commented on the issue: https://github.com/apache/metron/pull/940 Managing threadpools within a bolt isn't fundamentally wrong, we have see some use cases where this is done. However, we have been putting efforts to reduce the overall number of threads

[GitHub] metron issue #940: METRON-1460: Create a complementary non-split-join enrich...

2018-03-02 Thread cestella
Github user cestella commented on the issue: https://github.com/apache/metron/pull/940 @ottobackwards I haven't sent an email to the storm team, but I did run the PR past a storm committer that I know and asked his opinion prior to submitting the PR. The general answer was something

[GitHub] metron issue #940: METRON-1460: Create a complementary non-split-join enrich...

2018-03-02 Thread ottobackwards
Github user ottobackwards commented on the issue: https://github.com/apache/metron/pull/940 have we thought to send a mail to the storm dev list and ask if anyone has done this? potential issues? ---

[GitHub] metron issue #940: METRON-1460: Create a complementary non-split-join enrich...

2018-03-02 Thread ottobackwards
Github user ottobackwards commented on the issue: https://github.com/apache/metron/pull/940 If we integrated storm with yarn this would also be a problem, as our resource management may be at odds with yarn's. I think? What would be nice is if storm could manage the pool and

[GitHub] metron issue #940: METRON-1460: Create a complementary non-split-join enrich...

2018-03-02 Thread cestella
Github user cestella commented on the issue: https://github.com/apache/metron/pull/940 @mraliagha It's definitely a tradeoff. This is why this is as a complement to the original split/join topology. Keep in mind, also, that this architecture enables use-cases that the other would