Re: [VOTE] Move Apache Metron to the Apache Attic and Dissolve PMC

2020-11-16 Thread Ryan Merriman
+1

> On Nov 16, 2020, at 5:19 PM, David Lyle  wrote:
> 
> +1
> 
>> On Mon, Nov 16, 2020 at 3:10 PM Michael Miklavcic <
>> michael.miklav...@gmail.com> wrote:
>> 
>> +1
>> 
>>> On Mon, Nov 16, 2020 at 7:01 AM Justin Leet  wrote:
>>> 
>>> Hi all,
>>> 
>>> This is a vote thread to retire Metron to the Attic, and dissolve the
>> PMC.
>>> This follows a discussion thread on the dev list ([DISCUSS] Retire Metron
>>> to the Attic
>>> <https://lists.apache.org/thread.html/reb31f643fac20d3ad09521fd702b19922412b7a4e8e08062968268c5%40%3Cdev.metron.apache.org%3E>).
>>> More details can be found in that discussion, but the most relevant link
>> is
>>> the specific process at Moving a project to the Attic.
>>> 
>>> As noted in the process page, this is a PMC vote. As usual, feel
>> encouraged
>>> to contribute non-binding votes.
>>> 
>>> The vote will run 72 hours, until Nov 19th at 9:00 am EST.
>>> 
>>> Thank you,
>>> Justin
>>> 
>> 


Re: Discuss: Time to update bundled SOLR support?

2019-11-13 Thread Ryan Merriman
Here's the Jira in case anyone wants to see the changes involved:
https://issues.apache.org/jira/browse/METRON-2225.

On Wed, Nov 13, 2019 at 8:48 AM Michael Miklavcic <
michael.miklav...@gmail.com> wrote:

> That is correct - this was/is handled in the feature branch.
>
> On Wed, Nov 13, 2019 at 7:35 AM Justin Leet  wrote:
>
> > Someone working more on the feature branch can correct me if I'm wrong,
> but
> > I believe that's occurring as part of the general "Upgrade HDP version"
> > branch, since that involves a lot of major upgrades to more supported
> > versions of basically all the components. Specifically, it looks like it
> > occurs here
> > <https://github.com/apache/metron/commit/ad71c046977f1b3ab1aa58fea38b846a1e37>.
> >
> > Having said that, especially for feature branches, having more eyes on it
> > is generally pretty helpful if you're interested in hopping in to catch
> any
> > issues or opportunities.  I believe the people most involved are Mike
> > Miklavcic, Nick Allen, and Ryan Merriman, so they may have some more
> > input or have seen some opportunities if you're looking to contribute
> > something there.
> >
> > On Wed, Nov 13, 2019 at 2:46 AM Dale Richardson 
> > wrote:
> >
> > > Metron currently ships with Solr 6.6.2 support (I think that matches
> > > HDP Search 3).
> > > Hortonworks HDP Search 4 and 5 are based on Apache Solr 7.4.
> > > Cloudera Search is based on Apache Solr 7.4 as well.
> > > Lucidworks Solr is based on 7.x, soon to be 8.x (if not already) - they
> > > always seem to push the bleeding edge with SOLR in production.
> > >
> > > Does anybody have any issues if I upgrade the bundled SOLR support to
> > > Solr 7.4?  I'm hoping most people who use Metron/Solr in production use
> > > a supported distribution, and thus we should try to keep up with the
> > > supported Solr versions as much as possible.
> > >
> > > Regards,
> > > Dale.
> > >
> >
>


Re: [DISCUSS] Parser Aggregation in Management UI

2019-06-11 Thread Ryan Merriman
"We are planning to add the changes to the latest PR as additional commits
to avoid impacting the PR sequence. We will refer to the source PR in the
commit message of the fix. We will also add a link in the comment section of
the source PR pointing to the fixing commit, to keep them connected."

I don't think this is going to work.  If changes are requested and applied
in a different PR, how would you test the original PR?  What would you do
if there were merge conflicts introduced between the current and final PR?
What's the point of even having any PRs before the final one if they won't
get committed or changed?  I don't see any way to avoid having to propagate
changes through all subsequent PRs.

On Wed, May 29, 2019 at 7:03 AM Tibor Meller  wrote:

> Hi all,
>
> *We still need some volunteer reviewers for this feature.* Each individual
> PR is under 1000 lines of changes except one, and that is due to an
> autogenerated package-lock.json file.
> Just a heads up: parser aggregation is turned on by default for the bro,
> snort, and yaf parsers on full dev. Without this changeset, full dev is
> broken.
>
> https://lists.apache.org/thread.html/beeb4cfddfca7958a22ab926f72f52f46a33c42edce714112df9a2da@%3Cdev.metron.apache.org%3E
>
>
>
> On Fri, May 24, 2019 at 3:20 PM Tibor Meller 
> wrote:
>
> > Please find below the list of the PRs we opened for Parser Aggregation.
> > With Shane and Tamas, we tried to provide as much information as possible
> > to make the reviewing process easier.
> > Please keep in mind that these PRs are not against master but against a
> > Parser Aggregation feature branch.
> > If you would like to read more about the process we followed with these
> > PRs, please read the previous three messages in this thread.
> >
> > PR#1 METRON-2114: [UI] Moving components to sensor parser module
> > 
> > PR#2 METRON-2116: [UI] Removing redundant AppConfigService
> > 
> > PR#3 METRON-2117: [UI] Aligning models to grouping feature
> > 
> > PR#4 METRON-2115: [UI] Aligning UI to the parser aggregation AP
> > 
> > PR#5 METRON-2122: [UI] Fixing early app config access issue
> > 
> > PR#6 METRON-2124: [UI] Move status information and start/stop to the
> > Aggregate level 
> > PR#7 METRON-2125: [UI] Making changes visible in the parser list by
> > marking changed items 
> > PR#8 METRON-2131: Add NgRx and related dependencies
> > 
> > PR#9 METRON-2133: Add NgRx effects to communicate with the server
> > 
> > PR#10 METRON-2134: Add NgRx reducers to perform parser and group changes
> > in the store 
> > PR#11 METRON-2135: Add NgRx actions to trigger state changes
> > 
> > PR#12 METRON-2136: Add parser aggregation sidebar
> > 
> > PR#13 METRON-2137: Implement drag and drop mechanism and wire NgRx
> > 
> > PR#14 METRON-2138: Code clean up
> > 
> > PR#15 METRON-2139: Refactoring sensor-parser-config.component and wire
> > NgRx 
> >
> > Thanks,
> > Tibor
> >
> >
> > On Thu, May 23, 2019 at 11:45 AM Tibor Meller 
> > wrote:
> >
> >> Yes, I am expecting that some change requests will arise during the
> >> review. We are planning to add the changes to the latest PR as
> >> additional commits to avoid impacting the PR sequence. We will refer to
> >> the source PR in the commit message of the fix. We will also add a link
> >> in the comment section of the source PR pointing to the fixing commit,
> >> to keep them connected.
> >>
> >> On Wed, May 22, 2019 at 5:49 PM Michael Miklavcic <
> >> michael.miklav...@gmail.com> wrote:
> >>
> >>> Tibor, that sounds reasonable to me. If PR #1 ends up requiring code
> >>> changes, will you guys just percolate those up through the remaining
> >>> PRs in order, or just the final PR? I'm wondering how this works in
> >>> reference to your last point in #5 about rebasing.
> >>>
> >>> On Wed, May 22, 2019 at 8:47 AM Tibor Meller 
> >>> wrote:
> >>>
> >>> > I would like to quickly describe *our approach to breaking down the
> >>> > Parser Aggregation PR into smaller chunks*
> >>> >
> >>> > *1. we squashed the commits in the original development branch*
> >>> > - when we started to open smaller PRs from the commits in the original
> >>> > branch, we found ourselves opening PRs out of historical states of the
> >>> > code instead of the final one
> >>> > - none of those intermediate states of development are worth reviewing
> >>> > (or make sense to review) (initial 

Re: [DISCUSS] Shaded jar classifiers

2019-06-03 Thread Ryan Merriman
Thanks Casey.  We definitely need more testing.  At this point I've just
done some light smoke testing with full dev to ensure nothing obvious is
broken (should cover ES though).  I imagine we'll need to test Solr as you
suggest and also test all our scripts with an actual use case to ensure we
haven't introduced runtime classpath issues.  We have also noticed some
regressions while using Stellar in the enrichment topology so it might be a
good time to create a test suite for that.  I believe Mike Miklavcic is
currently working on that.

On Mon, Jun 3, 2019 at 11:02 AM Casey Stella  wrote:

> This looks good to me, honestly.  Anything that makes the build more
> understandable and makes classpath issues easier to find is a good idea IMO.
>
> Just curious, did you test that PR in both solr and ES (you added an
> exclude in the ES portion of the code) and did you spin it up in full-dev
> (to ensure ambari doesn't have any dependencies on the jar names)?
>
> Other than that, I'm +1 to the effort!
>
> On Mon, Jun 3, 2019 at 8:55 AM Ryan Merriman  wrote:
>
> > I recently opened a PR <https://github.com/apache/metron/pull/1436> that
> > has potential to significantly change (for the better in my opinion) the
> > way our Maven build process works.  I want to highlight this and get any
> > feedback on potential issues that may come with this change.
> >
> > I frequently run into the classpath version issues (especially with the
> > recent module reorganization work) and find them extremely challenging to
> > troubleshoot.  I believe we have found the root cause (from the PR
> > description):
> >
> > "When a module that uses the shaded plugin without a classifier is added
> to
> > another module as a dependency:
> >
> > 1. Any Maven excludes added to that dependency are ignored
> > 2. The Maven dependency:tree tool does not accurately report the
> transitive
> > dependencies pulled in by that dependency"
> >
> > After making this change, a number of classpath version problems popped
> up
> > as expected.  However they are now easy to track down and resolve.
> >
> > Does anyone have any concerns with making this change?  Are there things
> > I'm not thinking of?
> >
>


Re: [DISCUSS] Metron RPM spec file changelog

2019-05-31 Thread Ryan Merriman
I vote we get rid of them.  It's easy enough to look through the commit
history to see what changed and when.  If there is a need to explain a
change I think an inline comment would be more appropriate.

On Thu, May 30, 2019 at 3:52 PM Michael Miklavcic <
michael.miklav...@gmail.com> wrote:

> During a recent PR review, I discovered that we have missed updating the
> changelog for our metron.spec file many, many times -
>
> https://github.com/apache/metron/blob/master/metron-deployment/packaging/docker/rpm-docker/SPECS/metron.spec#L728
>
> Are we getting any value out of this feature, and do we want to keep it
> around? It seems like it's an extra point of work, as well as potential
> confusion, now that we have a number of missing entries. I started walking
> through the Git commit history and out of 81 changes, we started missing
> changes as early as the 9th change to this file. I logged a Jira,
> https://issues.apache.org/jira/browse/METRON-2144, for fixing the issue,
> but might it be better to simply jettison this feature?
>
> Mike
>
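For context, the %changelog section being discussed follows the standard RPM spec-file convention of a dated header line per release followed by bullet entries. A minimal illustrative fragment (the entries here are invented for illustration; the real file is linked above):

```
%changelog
* Thu May 30 2019 Apache Metron <dev@metron.apache.org> - 0.7.1-1
- METRON-2144: example entry describing what changed in the spec file
* Tue Dec 11 2018 Apache Metron <dev@metron.apache.org> - 0.7.0-1
- Example entry for the prior release
```

Every edit to the spec file is supposed to add an entry like these, which is exactly the manual step that was being missed.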


Re: [VOTE] Metron Release Candidate 0.7.1-RC1

2019-04-29 Thread Ryan Merriman
I am working on the backend change mentioned above (
https://issues.apache.org/jira/browse/METRON-2034) and should have a PR up
today.

On Mon, Apr 29, 2019 at 1:16 PM Tamás Fodor  wrote:

> As Justin pointed out, we've already implemented the frontend related part
> of the aggregation in https://github.com/apache/metron/pull/1360. Since
> it's a very big changeset, we would like to double check it again before we
> merge it back to master. Also, we're waiting for a small change in the
> backend code to fully cover everything related to parser aggregation. Once
> we have introduced this small patch and fully tested it manually we can
> solve the issue with the aggregated sensors on the management UI.
>
> On Sun, Apr 28, 2019 at 8:30 PM Nick Allen  wrote:
>
> > I agree with Justin.  My +1 stands.
> >
> > Considering that this is a known gap, we have already released with this
> > gap, and we have a backlog of numerous improvements that should be
> released
> > to the community, I am not in favor of delaying the release.  Metron
> > provides a wide variety of functionality at varying levels of maturity.
> > This is to be expected.  If we expect perfection, we will never get a
> > release out.
> >
> >
> > On Sat, Apr 27, 2019 at 6:12 PM Justin Leet 
> wrote:
> >
> > > Mike is correct, that is because of the combination of full dev
> > > restrictions and the lack of support in the configuration UI for parser
> > > aggregation.  This was introduced in
> > > https://github.com/apache/metron/pull/1207 and also was true of the
> last
> > > release. Currently, parser aggregation is an advanced/manual feature
> > whose
> > > (bare minimum) configuration can be done via Ambari, out of
> convenience.
> > >
> > > I haven't looked into it, but
> https://github.com/apache/metron/pull/1360
> > > is
> > > likely the work for this (and need additional work before merging).
> > >
> > > I'm personally letting my binding +1 stand, although I would support
> > either
> > > ensuring we get that PR cleaned up and in and/or additional
> documentation
> > > regarding the current limitations of this feature.
> > >
> > >
> > > On Sat, Apr 27, 2019 at 2:38 PM Anand Subramanian <
> > > asubraman...@hortonworks.com> wrote:
> > >
> > > > I can confirm that the Mgmt UI shows the sensor status correctly
> > > > when the sensors run as single topologies.
> > > >
> > > > -Anand
> > > >
> > > > On 4/27/19, 11:37 PM, "Michael Miklavcic" <
> > michael.miklav...@gmail.com>
> > > > wrote:
> > > >
> > > > I believe that is because of parser aggregation. The UI does not
> > > > support it currently. IIRC there was a PR to change the bro, snort,
> > > > and yaf sensors to aggregated because full dev didn't have enough
> > > > resources. The upshot is that the UI still works for single sensors,
> > > > but the feature for enabling aggregated sensors has not yet been
> > > > completed.
> > > >
> > > > On Sat, Apr 27, 2019, 11:33 AM Otto Fowler <
> > ottobackwa...@gmail.com>
> > > > wrote:
> > > >
> > > > > -1
> > > > >
> > > > > Ran the script and ran full dev, all good.
> > > > > In the configuration ui, the status of the sensors is not
> > correct.
> > > > It
> > > > > does not show any running, but they are running in storm and
> the
> > > > data was
> > > > > moved correctly.
> > > > >
> > > > >
> > > > > On April 26, 2019 at 09:58:02, Otto Fowler (
> > > ottobackwa...@gmail.com)
> > > > > wrote:
> > > > >
> > > > > Curious Anand,
> > > > > are your steps for bringing up an open stack cluster something
> we
> > > > could
> > > > > script like the AWS stuff?
> > > > >
> > > > >
> > > > > On April 26, 2019 at 09:35:29, Anand Subramanian (
> > > > > asubraman...@hortonworks.com) wrote:
> > > > >
> > > > > +1 (non-binding)
> > > > >
> > > > > * Built RPMs and mpacks.
> > > > > * Brought up Metron stack on 12-node CentOS 7 openstack
> cluster.
> > > > > * Ran sensor-stubs and validated events in the Alerts UI for
> the
> > > > default
> > > > > sensors.
> > > > > * Management UI, Alerts UI and Swagger UI sanity check
> > > > >
> > > > > Regards,
> > > > > Anand
> > > > >
> > > > > On 4/26/19, 5:18 AM, "Nick Allen"  wrote:
> > > > >
> > > > > +1 Verified release with all documented steps and ran up Full
> > Dev.
> > > > >
> > > > > On Thu, Apr 25, 2019 at 6:10 PM Michael Miklavcic <
> > > > > michael.miklav...@gmail.com> wrote:
> > > > >
> > > > > > Ok cool, just finished the validation and updated the steps
> in
> > > the
> > > > doc to
> > > > > > reflect the current code base.
> > > > > >
> > > > > > On Thu, Apr 25, 2019 at 3:45 PM Nick Allen <
> n...@nickallen.org
> > >
> > > > wrote:
> > > > > >
> > > > > > > No voting required. Those are just docs. Whoever is willing
> > to
> > > > correct
> > > > > > > and has access, should be 

Re: [DISCUSS] Next Release

2019-04-05 Thread Ryan Merriman
Jon is correct.   I am actively working on this and hope to have it
completed soon.   I realize it will hold up the release so it's a priority
for me.

On Sat, Mar 30, 2019 at 6:09 PM zeo...@gmail.com  wrote:

> Isn't the documentation already in progress?
>
> https://github.com/apache/metron/pull/1330#issuecomment-466453372
>
> If not I would still consider it important to complete prior to a release
> and I agree with Justin's comments in
>
>
> https://lists.apache.org/thread.html/50b89b919bd8bef3f7fcdef167cbd7e489fa74a1e2da3e4fddb08b13@
> 
>
> Jon Zeolla
>
> On Thu, Mar 28, 2019, 2:16 PM Michael Miklavcic <
> michael.miklav...@gmail.com>
> wrote:
>
> > Jon and Ryan - this was a convo/negotiation between you two at the time.
> > Any thoughts?
> >
> > On Thu, Mar 28, 2019 at 9:08 AM Nick Allen  wrote:
> >
> > > Is anyone volunteering to take this on?  Would be nice to get a release
> > > out.
> > >
> > > On Thu, Mar 14, 2019, 4:53 PM zeo...@gmail.com 
> wrote:
> > >
> > > > We should likely get METRON-2014 in, based on
> > > >
> > > >
> > >
> >
> https://lists.apache.org/thread.html/13bd0ed5606ad4f3427f24a8e759d6bcb61ace76d4afcc9f48310a00@%3Cdev.metron.apache.org%3E
> > > >
> > > > On Thu, Mar 14, 2019 at 4:24 PM Michael Miklavcic <
> > > > michael.miklav...@gmail.com> wrote:
> > > >
> > > > > Ticket is now done and merged. I'm also good on 0.7.1.
> > > > >
> > > > > On Thu, Mar 14, 2019 at 2:18 PM Justin Leet  >
> > > > wrote:
> > > > >
> > > > > > I'm in favor doing a release, pending the ticket Mike pointed out
> > > (and
> > > > > > anything else someone comes up with).
> > > > > >
> > > > > > To the best of my knowledge, I think 0.7.1 is sufficient, but if
> > > > someone
> > > > > > comes up with something, it's not hard to pivot.
> > > > > >
> > > > > > On Wed, Mar 13, 2019, 13:08 Michael Miklavcic <
> > > > > michael.miklav...@gmail.com
> > > > > > >
> > > > > > wrote:
> > > > > >
> > > > > > > I'd like to see this fixed for the next release.
> > > > > > > https://issues.apache.org/jira/browse/METRON-2036. Even though
> > > it's
> > > > a
> > > > > > > non-prod issue, this is a core part of our
> > > infrastructure/development
> > > > > > > lifecycle that is currently broken and fits with our previous
> > > > > agreements
> > > > > > of
> > > > > > > holding a release until all intermittent test failures are
> > > addressed.
> > > > > > >
> > > > > > > On Wed, Mar 13, 2019 at 11:33 AM Nick Allen <
> n...@nickallen.org>
> > > > > wrote:
> > > > > > >
> > > > > > > > I would like to open a discussion in regards to the next
> > release.
> > > > Our
> > > > > > > last
> > > > > > > > 0.7.0 release was on Dec 11th.
> > > > > > > >
> > > > > > > > I believe we have a significant number of bug fixes and
> > > performance
> > > > > > > > improvements that would make a worthy point release; 0.7.1.
> > > > > Although,
> > > > > > we
> > > > > > > > should review the change log and see if there are any
> breaking
> > > > > changes
> > > > > > > that
> > > > > > > > would require a bump to the minor version.
> > > > > > > >
> > > > > > > > Thoughts?
> > > > > > > >
> > > > > > > > $ git log --format=%B
> tags/apache-metron_0.7.0-release..HEAD |
> > > > grep
> > > > > > > > METRON-
> > > > > > > > Merge remote-tracking branch 'apache/master' into METRON-2035
> > > > > > > > METRON-2035 Allow User to Configure Role Names for Access
> > Control
> > > > > > > > METRON-2030 SensorParserGroupControllerIntegrationTest
> > > intermittent
> > > > > > > errors
> > > > > > > > (merrimanr via mmiklavc) closes apache/metron#1352
> > > > > > > > METRON-2031 [UI] Turning off initial search request and
> polling
> > > by
> > > > > > > default
> > > > > > > > on Alerts UI (tiborm via mmiklavc) closes apache/metron#1353
> > > > > > > > METRON-2012 Unable to Execute Stellar Functions Against HBase
> > in
> > > > the
> > > > > > REPL
> > > > > > > > (nickwallen) closes apache/metron#1345
> > > > > > > > METRON-1971 Short timeout value in Cypress may cause build
> > > failures
> > > > > > > > (sardell) closes apache/metron#1323
> > > > > > > > METRON-1940 Check if not and install Elastic search
> templates /
> > > > Solr
> > > > > > > > collections when indexing server is restarted (MohanDV)
> closes
> > > > > > > > apache/metron#1305
> > > > > > > > METRON-2019 Improve Metron REST Logging (merrimanr) closes
> > > > > > > > apache/metron#1347
> > > > > > > > METRON-2016 Parser aggregate groups should be persisted and
> > > > available
> > > > > > > > through REST (merrimanr) closes apache/metron#1346
> > > > > > > > METRON-1987 Upgrade Alert UI to stable Bootstrap 4 (sardell)
> > > closes
> > > > > > > > apache/metron#1336
> > > > > > > > METRON-1968 Messages are lost when a parser produces multiple
> > > > > messages
> > > > > > > and
> > > > > > > > batch size is greater than 1 (merrimanr) closes
> > > apache/metron#1330
> > > > > > > > METRON-1778 Out-of-order timestamps may delay flush in Storm
> > > > Profiler
> > > > > > > 

[DISCUSS] Upgrading HBase and Kafka support

2019-03-08 Thread Ryan Merriman
I have been researching the effort involved to upgrade to HDP 3.  Along the
way I've found a couple challenging issues that we will need to solve, both
involving our integration testing strategy.

The first issue is Kafka.  We are moving from 0.10.0 to 2.0.0 and there
have been significant changes to the API.  This creates an issue in the
KafkaComponent class, which we use as an in-memory Kafka server in
integration tests.  Most of the classes that were previously used have gone
away, and to the best of my knowledge, were not supported as public APIs.
I also don't see any publicly documented APIs to replace them.

The second issue is HBase.  We are moving from 1.1.2 to 2.0.2, so another
significant change.  This creates an issue in the MockHTable class
because the HTableInterface class has changed to Table, essentially
requiring that MockHTable be rewritten to conform to the new interface.
It's my opinion that this class is complicated and difficult to maintain as
it is anyway.

These 2 issues have the potential to add a significant amount of work to
upgrading Metron to HDP 3.  I want to take a step back and review our
options before we move forward.  Here are some initial thoughts I had on
how to approach this.  For HBase:

   1. Update MockHTable to work with the new HBase API.  We would continue
   using a mock server approach for HBase.
   2. Research replacing MockHTable with an in-memory HBase server.
   3. Replace MockHTable with a Docker container running HBase.

For Kafka:

   1. Replace KafkaComponent with a mock server implementation.
   2. Update KafkaComponent to work with the new API.  We would probably
   need to leverage some internal Kafka classes.  I do not see a testing API
   documented publicly.
   3. Replace KafkaComponent with a Docker container running Kafka.

What other options are there?  Whatever we choose I think we should follow
a similar approach for both (mock servers, in memory servers, Docker, other
options I'm not thinking of).

This will not shock anyone, but I would be in favor of Docker containers.
They have the advantage of classpath isolation, easy upgrades, and accurate
integration testing.  The downside is we will have to adjust our tests and
Travis script to incorporate these Docker containers into our build
process.  We have discussed this at length in the past and it has generally
stalled for various reasons.  Maybe if we move a few services at a time it
might be more palatable?  As for the other 2 approaches, I think if either
worked well we wouldn't be having this discussion.  Mock servers are hard
to maintain, and I don't see in-memory testing classes documented in the
javadocs for either service.

Thoughts?
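Whichever backend we pick (mock, in-memory, or Docker), the test-facing contract can stay the same, which is what makes the migration tractable. Below is a minimal pure-Java sketch of that idea; the interface and class names are illustrative (loosely modeled on the in-memory component pattern the integration tests already use), not actual Metron classes:

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative lifecycle contract: tests depend only on this interface, so a
// component can be backed by a mock, an in-memory server, or a Docker
// container without the tests themselves changing.
interface TestComponent {
  void start() throws Exception;
  void stop();
}

// A stand-in "Kafka" component. A Docker-backed implementation would start
// and stop a container in start()/stop() instead of flipping a flag.
class FakeQueueComponent implements TestComponent {
  private final List<String> messages = new ArrayList<>();
  private boolean running = false;

  @Override public void start() { running = true; }
  @Override public void stop()  { running = false; }

  void send(String msg) {
    if (!running) throw new IllegalStateException("component not started");
    messages.add(msg);
  }

  List<String> read() { return new ArrayList<>(messages); }
}

public class ComponentLifecycleSketch {
  public static void main(String[] args) throws Exception {
    FakeQueueComponent kafka = new FakeQueueComponent();
    kafka.start();
    kafka.send("{\"ip_src_addr\":\"10.0.0.1\"}");
    System.out.println(kafka.read().size()); // prints 1
    kafka.stop();
  }
}
```

The point of the sketch: if KafkaComponent and MockHTable are rewritten behind a stable lifecycle interface like this, swapping a mock for a container later becomes an implementation detail rather than another test rewrite.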


Re: [DISCUSS] Architecture documentation

2019-02-25 Thread Ryan Merriman
I feel like the code itself is pretty well documented.  I updated existing
javadocs and added javadocs to classes that didn't have them before this
PR.  In my opinion the level of documentation for these classes has
increased significantly.

On Mon, Feb 25, 2019 at 1:52 PM Michael Miklavcic <
michael.miklav...@gmail.com> wrote:

> Tentatively agreed on further clarification of what we consider in/out of
> scope for documentation re: document something that wasn't documented
> before. Ryan, can you give a quick summary of what you *have* added/updated
> in documentation on this PR vs what you want to leave out?
>
> My initial concern in punting on docs right now is that part of what made
> this PR/task more challenging in the first place was not having
> documentation. We risk losing context and detail again if we don't do this
> immediately. Would it be reasonable to split it up as follows?:
>
>1. Additional overarching documentation feels out of scope - make it a
>follow on (see comments below).
>2. Adding documentation to our existing README's and java code comments
>that describe the new/modified functionality should be in scope because
>it's part of the unit of work. I expect that a developer should be able
> to
>look at the code, tests, comments, and README's and understand how this
>code functions without having to start from scratch.
>
> The way we've handled follow-on work before, at least as far as feature
> branches are concerned, was to create Jiras and link them to the
> appropriate discussions for context. Maybe we can take that one step
> further and do the release manager a favor by also labeling the
> required/requested release on the Jira as a gating factor. This follows our
> pattern for intermittent test failure reporting, e.g.
>
> https://issues.apache.org/jira/browse/METRON-1946?jql=project%20%3D%20METRON%20AND%20resolution%20%3D%20Unresolved%20AND%20labels%20%3D%20test-failure%20ORDER%20BY%20priority%20DESC%2C%20updated%20DESC
> .
>
> I'm also in favor of continuing to document architecture and technical
> details as part of the code base as Ryan and Jon have suggested. I think we
> should have an "architecture.md" in metron root that replaces this -
>
> https://github.com/apache/metron/blob/d7d4fd9afb19e2bd2e66babb7e1514a19eae07d0/README.md#navigating-the-architecture
> and covers the broad architecture with links to the appropriate modules for
> detail. Minimally, it would be nice if we had a simple diagram showing the
> basic flow of data in Metron. I think we probably want an updated version
> of this wiki entry from back in the day -
> https://cwiki.apache.org/confluence/display/METRON/Metron+Architecture
>
> Best,
> Mike
>
>
> On Mon, Feb 25, 2019 at 7:18 AM Nick Allen  wrote:
>
> > I don't think we should hold up this work to document something that
> wasn't
> > previously documented.  A follow-on is sufficient.
> >
> > On Mon, Feb 25, 2019 at 8:50 AM Ryan Merriman 
> wrote:
> >
> > > Recently I submitted a PR <https://github.com/apache/metron/pull/1330>
> > > that
> > > introduces a large number of changes to a critical part of our code
> base.
> > > Reviewers feel like it is significant enough to document at an
> > > architectural level (and I agree).  There are a couple points I would
> > like
> > > to clarify.
> > >
> > > Generally architectural documentation lives in the README of the
> > > appropriate module.  Do we want to continue documenting architecture
> > here?
> > > I think it makes sense because it will be versioned along with the
> code.
> > > Just wanted to confirm there are no objections to continuing this
> > practice.
> > >
> > > A reviewer suggested we could accept the PR as is and leave the
> > > architectural documentation as a follow on.  I think this makes sense
> > > because it can be tedious to maintain a large PR as other smaller
> commits
> > > are accepted into master.  An important requirement is the
> documentation
> > > follow on must be completed in a timely manner, before the next
> release.
> > > Are there any objections to doing it this way?
> > >
> >
>


[DISCUSS] Architecture documentation

2019-02-25 Thread Ryan Merriman
Recently I submitted a PR that
introduces a large number of changes to a critical part of our code base.
Reviewers feel like it is significant enough to document at an
architectural level (and I agree).  There are a couple points I would like
to clarify.

Generally architectural documentation lives in the README of the
appropriate module.  Do we want to continue documenting architecture here?
I think it makes sense because it will be versioned along with the code.
Just wanted to confirm there are no objections to continuing this practice.

A reviewer suggested we could accept the PR as is and leave the
architectural documentation as a follow on.  I think this makes sense
because it can be tedious to maintain a large PR as other smaller commits
are accepted into master.  An important requirement is the documentation
follow on must be completed in a timely manner, before the next release.
Are there any objections to doing it this way?


Re: [DISCUSS] Writer class refactor

2019-01-22 Thread Ryan Merriman
Thanks Mike, very helpful to have all that context.  I'm in agreement with
everything you've said.  Accepting duplicates may be a tradeoff we make to
keep performance high.

Your comments are centered around Kafka but how does this apply to other
writers?  Since we're now handling multiple messages that come from a
single tuple, how should we handle partial failures?  The flush() method on
the Kafka writer waits until all messages have been written so we don't
have to worry about partial failures/successes there.  What about the ES
writer?  The way it's implemented now, it returns a status of which messages
were successfully written and which were not.  Is it possible to
make a bulk write with the ES client an atomic operation?  If not I think
we'll have to accept duplicates (if we're not already).  I think this is an
issue in general for any writer we may implement and we need to be clear
about how messages are acked in this case.

I agree with your suggestion of using Map> but have
a small change I would like to propose.  Instead of Tuple can we use a
transaction id (String type)?  I would prefer to see Storm dependencies be
moved up to the bolts.  Maintaining a relationship of Tuples to transaction
ids there should be trivial.
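The proposal above — keying the writer API by a transaction id string rather than a Storm Tuple — can be sketched in plain Java. Everything here is hypothetical (the method name, the use of raw strings for messages); the only point is that the writer layer can decide what to ack without any Storm dependency:

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch: a bulk write keyed by transaction id. The bolt keeps
// the Tuple -> transaction-id mapping; the writer never sees a Tuple.
public class TxnAckSketch {

  // Returns the set of transaction ids whose messages were ALL written.
  static Set<String> write(Map<String, List<String>> messagesByTxn,
                           Set<String> failedMessages) {
    Set<String> acked = new HashSet<>();
    for (Map.Entry<String, List<String>> e : messagesByTxn.entrySet()) {
      // Only ack a transaction when every one of its messages succeeded;
      // otherwise the bolt replays the whole tuple (at-least-once, so
      // duplicates of the already-written messages are possible).
      boolean allWritten = true;
      for (String msg : e.getValue()) {
        if (failedMessages.contains(msg)) { allWritten = false; break; }
      }
      if (allWritten) acked.add(e.getKey());
    }
    return acked;
  }

  public static void main(String[] args) {
    Map<String, List<String>> batch = new HashMap<>();
    batch.put("txn-1", Arrays.asList("m1", "m2")); // one tuple -> two messages
    batch.put("txn-2", Arrays.asList("m3"));
    Set<String> failures = new HashSet<>(Arrays.asList("m2"));
    Set<String> acked = write(batch, failures);
    // txn-1 is replayed because m2 failed; txn-2 is acked.
    System.out.println(acked.contains("txn-2") && !acked.contains("txn-1"));
  }
}
```

This also makes the partial-failure question concrete: a writer like ES that reports per-message results maps cleanly onto this shape, while a writer like Kafka's flush() either succeeds wholesale or leaves every transaction un-acked.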

On Fri, Jan 18, 2019 at 5:25 PM zeo...@gmail.com  wrote:

> Totally on board with everybody's comments above this point.
>
> Jon
>
> On Fri, Jan 18, 2019, 6:07 PM Michael Miklavcic <
> michael.miklav...@gmail.com> wrote:
>
>> Thanks for the write up, Ryan. I had to touch on some of this when
>> refactoring the kafka writer away from the async model so we could
>> guarantee delivery. We had potential to drop messages before that change
>> because of the async producer calls, which would ack the Storm tuple as
>> soon as the writer returned.
>>
>>- https://github.com/apache/metron/pull/1045
>>
>> We'll want to talk about these fixes/updates in context of our message
>> delivery semantics, both in Storm and Kafka. As it currently stands, we do
>> NOT use Storm Trident, which means we have at-least-once message
>> processing
>> in Storm. There is an inherent possibility that we will publish duplicate
>> messages in some instances. From a Kafka perspective, we have the same
>> issue. As of Kafka 0.11.0, they provide a way to get exactly-once
>> semantics, but I'm not sure we've done much to explicitly achieve that.
>>
>>- https://kafka.apache.org/10/documentation.html#semantics
>>
>> From a Kafka delivery guarantee perspective, it appears we're currently
>> setting # required acks to 1 by default. This means we get commit
>> confirmation as soon as the leader has written the message to its local
>> log. In this case, should the leader fail immediately after acknowledging
>> the record but before the followers have replicated it, then the record
>> will be lost. We could investigate setting acks=all or acks=-1, but this
>> would be a tradeoff in performance for us.
>>
>>    - https://github.com/apache/metron/blob/341960b91f8fe742d5cf947633b7edd2275587d5/metron-platform/metron-writer/src/main/java/org/apache/metron/writer/kafka/KafkaWriter.java#L87
>>    - https://kafka.apache.org/10/documentation/#producerconfigs
>>
>> Per the KafkaProducer documentation, the flush() command will wait until
>> all messages are batched and sent, and will return with either success
>> (acked) or an error. "A request is considered completed when it is
>> successfully acknowledged according to the acks configuration you have
>> specified or else it results in an error."
>>
>>-
>>
>> https://kafka.apache.org/10/javadoc/org/apache/kafka/clients/producer/KafkaProducer.html
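As a concrete (hypothetical) illustration of the acks tradeoff above, a durability-first producer configuration would look something like this. The property keys are standard Kafka producer configs; nothing here reflects a decision about Metron's defaults.

```java
import java.util.Properties;

public class DurableProducerConfig {
    // Sketch only: producer settings trading latency for durability.
    // "acks=all" waits for the full in-sync replica set to acknowledge,
    // unlike the acks=1 default KafkaWriter currently uses.
    public static Properties build(String bootstrapServers) {
        Properties props = new Properties();
        props.put("bootstrap.servers", bootstrapServers);
        props.put("acks", "all");  // "-1" is equivalent
        props.put("retries", 3);
        props.put("key.serializer",
            "org.apache.kafka.common.serialization.StringSerializer");
        props.put("value.serializer",
            "org.apache.kafka.common.serialization.StringSerializer");
        return props;
    }
}
```

Combined with the existing flush() call, this would mean a batch is only considered written once every replica in the ISR has it, at the cost of throughput.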
>>
>> With this combination of factors, I believe we can continue to guarantee
>> at-least-once semantics in the writer, regardless of batch size. To your
>> point about not passing 2 separate lists, I suggest that we modify the API
>> by passing in something like Map<Tuple, List<Message>> so that the
>> tuples always get acked with respect to their messages. This way we can
>> avoid the tuple-message batch boundary problem by ensuring we only ack a
>> tuple when all associated messages are successfully written to Kafka.
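A minimal sketch of that acking rule — only ack a source tuple once every message derived from it is confirmed written — could look like this, with generic types standing in for Storm's Tuple and Metron's message class (names are illustrative, not existing API):

```java
import java.util.*;

public class AckBySource {
    // Given the proposed source -> messages map and the set of messages the
    // writer reported as successfully written, return only the sources whose
    // messages ALL succeeded; those are the tuples that are safe to ack.
    public static <T, M> List<T> safeToAck(Map<T, List<M>> batch, Set<M> written) {
        List<T> ackable = new ArrayList<>();
        for (Map.Entry<T, List<M>> entry : batch.entrySet()) {
            if (written.containsAll(entry.getValue())) {
                ackable.add(entry.getKey());
            }
        }
        return ackable;
    }
}
```

A tuple with a partially written message list would simply stay un-acked and be replayed, preserving at-least-once semantics across the tuple-message batch boundary.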
>>
>> Best,
>> Mike
>>
>>
>> On Fri, Jan 18, 2019 at 1:31 PM Otto Fowler 
>> wrote:
>>
>> > Agreed
>> >
>> >
>> > On January 18, 2019 at 14:52:32, Ryan Merriman (merrim...@gmail.com)
>> > wrote:
>> >
>> > I am on board with that. In that case, I think it's even more important
>> > that we get the Writer interfaces right.
>> >
>> > On Fri, Ja

Re: [DISCUSS] Writer class refactor

2019-01-18 Thread Ryan Merriman
I am on board with that.  In that case, I think it's even more important
that we get the Writer interfaces right.

On Fri, Jan 18, 2019 at 1:34 PM Otto Fowler  wrote:

> I think that the writers should be loaded as, and act as extension points,
> such that it is possible to have 3rd party writers, and would structure
> them as such.
>
>
>
> On January 18, 2019 at 13:55:00, Ryan Merriman (merrim...@gmail.com)
> wrote:
>
> Recently there was a bug reported by a user where a parser that emits
> multiple messages from a single tuple doesn't work correctly:
> https://issues.apache.org/jira/browse/METRON-1968. This has exposed a
> problem with how the writer classes work.
>
> The fundamental issue is this: the writer classes operate under the
> assumption that there is a 1 to 1 mapping between tuples and messages to
> be
> written. A couple of examples:
>
> KafkaWriter
> <
> https://github.com/apache/metron/blob/master/metron-platform/metron-writer/src/main/java/org/apache/metron/writer/kafka/KafkaWriter.java#L236>
>
> -
> This class writes messages by iterating through the list of tuples and
> fetching the message with the same index. This is the cause of the Jira
> above. We could iterate through the message list instead but then we don't
> know which tuples have been fully processed. It would be possible for a
> batch to be flushed before all messages from a tuple are passed to the
> writer.
>
> BulkWriterComponent
> <
> https://github.com/apache/metron/blob/master/metron-platform/metron-writer/src/main/java/org/apache/metron/writer/BulkWriterComponent.java#L250>
>
> - The tuple list size is used to determine when a batch should be flushed.
> While inherently incorrect in my opinion (should be message list size),
> this also causes an issue where only the first message from the last tuple
> in a batch is written.
>
> I do not believe there are easy fixes to these problems. There is no way
> to properly store the relationship between tuples and messages to be
> written with the current BulkMessageWriter interface and
> BulkWriterResponse
> class. If we did have a way, how should we handle partial failures? If
> multiple messages are parsed from a tuple but only half of them are
> written
> successfully, what should happen? Should we replay the tuple? Should we
> just report the failed messages and continue on? I think it may be a good
> time to review our writer classes and consider a refactor. Do others
> agree? Are there easy fixes I'm missing?
>
> Assuming there is interest in refactoring, I will throw out some ideas for
> consideration. For those not as familiar with the writer classes, they are
> organized as follows (in order from lowest to highest level):
>
> Writers - These classes do the actual writing and implement the
> BulkMessageWriter or MessageWriter interfaces. There are 6 implementations
> I can see including KafkaWriter, SolrWriter, ElasticsearchWriter,
> HdfsWriter, etc. There is also an implementation that adapts a
> MessageWriter to a BulkMessageWriter (WriterToBulkWriter). The result of a
> writing operation is a BulkWriterResponse containing a list of either
> successful or failed tuples.
>
> Writer Containers - This includes the BulkWriterComponent and
> WriterHandler
> classes. These are responsible for batching and flushing messages,
> handling errors and acking tuples.
>
> Bolts - This includes ParserBolt, WriterBolt and BulkMessageWriterBolt.
> These classes implement the Storm Bolt interfaces, set up
> writers/components
> and execute tuples.
>
> I think the first step is to reevaluate the separation of concerns for
> these classes. Here is how I would change from what we currently have:
>
> Writers - These classes should only be concerned with writing messages and
> reporting what happened. They would also manage the lifecycle and
> configuration of the underlying client libraries as they do now. Instead
> of accepting 2 separate lists, they should accept a data structure that
> accurately represents the relationship between tuples and messages.
>
> Writer Containers - These classes would continue handling batching and
> flushing but would only report the results of a flush rather than actually
> doing the acking or error handling.
>
> Bolts - These would now be responsible for acking and error reporting on
> tuples. They would transform a tuple into something the Writer Containers
> can accept as input.
>
> I think working through this and adjusting the contracts between the
> different layers will be necessary to fix the bugs described above. While
> we're at it I think there are other improvements we could also make:
>
> Decouple Storm - It would be beneficial to remove the depen

[DISCUSS] Writer class refactor

2019-01-18 Thread Ryan Merriman
Recently there was a bug reported by a user where a parser that emits
multiple messages from a single tuple doesn't work correctly:
https://issues.apache.org/jira/browse/METRON-1968.  This has exposed a
problem with how the writer classes work.

The fundamental issue is this:  the writer classes operate under the
assumption that there is a 1 to 1 mapping between tuples and messages to be
written.  A couple of examples:

KafkaWriter

-
This class writes messages by iterating through the list of tuples and
fetching the message with the same index.  This is the cause of the Jira
above.  We could iterate through the message list instead but then we don't
know which tuples have been fully processed.  It would be possible for a
batch to be flushed before all messages from a tuple are passed to the
writer.

BulkWriterComponent

- The tuple list size is used to determine when a batch should be flushed.
While inherently incorrect in my opinion (should be message list size),
this also causes an issue where only the first message from the last tuple
in a batch is written.

I do not believe there are easy fixes to these problems.  There is no way
to properly store the relationship between tuples and messages to be
written with the current BulkMessageWriter interface and BulkWriterResponse
class.  If we did have a way, how should we handle partial failures?  If
multiple messages are parsed from a tuple but only half of them are written
successfully, what should happen?  Should we replay the tuple?  Should we
just report the failed messages and continue on?  I think it may be a good
time to review our writer classes and consider a refactor.  Do others
agree?  Are there easy fixes I'm missing?

Assuming there is interest in refactoring, I will throw out some ideas for
consideration.  For those not as familiar with the writer classes, they are
organized as follows (in order from lowest to highest level):

Writers - These classes do the actual writing and implement the
BulkMessageWriter or MessageWriter interfaces.  There are 6 implementations
I can see including KafkaWriter, SolrWriter, ElasticsearchWriter,
HdfsWriter, etc.  There is also an implementation that adapts a
MessageWriter to a BulkMessageWriter (WriterToBulkWriter).  The result of a
writing operation is a BulkWriterResponse containing a list of either
successful or failed tuples.

Writer Containers - This includes the BulkWriterComponent and WriterHandler
classes.  These are responsible for batching and flushing messages,
handling errors and acking tuples.

Bolts - This includes ParserBolt, WriterBolt and BulkMessageWriterBolt.
These classes implement the Storm Bolt interfaces, set up writers/components
and execute tuples.

I think the first step is to reevaluate the separation of concerns for
these classes.  Here is how I would change from what we currently have:

Writers - These classes should only be concerned with writing messages and
reporting what happened.  They would also manage the lifecycle and
configuration of the underlying client libraries as they do now.  Instead
of accepting 2 separate lists, they should accept a data structure that
accurately represents the relationship between tuples and messages.

Writer Containers - These classes would continue handling batching and
flushing but would only report the results of a flush rather than actually
doing the acking or error handling.

Bolts - These would now be responsible for acking and error reporting on
tuples.  They would transform a tuple into something the Writer Containers
can accept as input.

I think working through this and adjusting the contracts between the
different layers will be necessary to fix the bugs described above.  While
we're at it I think there are other improvements we could also make:

Decouple Storm - It would be beneficial to remove the dependency on tuples
in our writers and writer containers.  We could replace this with a simple
abstraction (an id would probably work fine).  This will allow us to more
easily port Metron to other streaming platforms.
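One possible shape for that id-based abstraction is an id paired with its message, with no Storm types anywhere. The class below is a hypothetical sketch, not a proposal for exact naming:

```java
public class BulkMessage<M> {
    private final String id;  // platform-neutral handle replacing the Storm Tuple
    private final M message;

    public BulkMessage(String id, M message) {
        this.id = id;
        this.message = message;
    }

    public String getId() { return id; }
    public M getMessage() { return message; }

    @Override
    public String toString() {
        return "BulkMessage{id=" + id + "}";
    }
}
```

Writers and writer containers would pass these around and report success/failure per id; only the bolt layer would know how to map an id back to a Tuple for acking.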

Remove MessageWriter Interface - This is not being actively used as far as
I can tell.  Is that true?  Removing this will make our code simpler and
easier to follow (WriterHandler and WriterToBulkWriter classes can probably
go away).  I don't see any reason future writers, even those without bulk
writing capabilities, could not fit into the BulkMessageWriter interface.
A writer could either iterate through messages and write one at a time or
throw an exception.  As far as I know, the writer interfaces are not
something we advertise as extension points.  Is that true?
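The one-at-a-time fallback described above — a non-bulk backend satisfying a bulk-style contract by iterating and collecting per-message results — could be sketched as follows (hypothetical names; the real BulkMessageWriter interface carries more context than this):

```java
import java.util.*;

public class OneAtATimeWriter {
    public interface SingleWriter<M> {
        void write(M message) throws Exception;
    }

    // Writes each message individually and returns the indexes that
    // succeeded; failures are recorded instead of aborting the batch,
    // mirroring BulkWriterResponse's success/failure split.
    public static <M> List<Integer> writeAll(SingleWriter<M> writer, List<M> messages) {
        List<Integer> successes = new ArrayList<>();
        for (int i = 0; i < messages.size(); i++) {
            try {
                writer.write(messages.get(i));
                successes.add(i);
            } catch (Exception e) {
                // a real implementation would report this failure upstream
            }
        }
        return successes;
    }
}
```

This is essentially what WriterToBulkWriter does today, which is why the adapter (and the MessageWriter interface behind it) could plausibly go away.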

Consolidate our BulkMessageWriterBolt and WriterBolt classes - Is there 

Re: [DISCUSS] Knox SSO feature branch review and features

2018-11-19 Thread Ryan Merriman
I just put up a PR that adds Metron as a Knox service here:
https://github.com/apache/metron/pull/1275.  This should give everyone a
good idea of what is involved.  I added a section on outstanding items that
highlights some of the things we have been discussing here.

On Fri, Nov 16, 2018 at 10:54 AM Ryan Merriman  wrote:

> I would also add that defaulting to Knox being on simplifies things at a
> technical level.
>
> On Fri, Nov 16, 2018 at 10:52 AM Michael Miklavcic <
> michael.miklav...@gmail.com> wrote:
>
>> That's fantastic, thanks for that detail.
>>
>> Also, I'm in agreement with the recent comments from Otto and Simon.
>>
>> On Fri, Nov 16, 2018 at 9:49 AM Ryan Merriman 
>> wrote:
>>
>> > I was still able to spin up the UI locally and debug in my testing.  I
>> am
>> > in complete agreement, we need to ensure the developer experience
>> doesn't
>> > change.
>> >
>> > On Fri, Nov 16, 2018 at 10:47 AM Michael Miklavcic <
>> > michael.miklav...@gmail.com> wrote:
>> >
>> > > Ryan, what's remote debugging look like for UI testing with Knox
>> enabled?
>> > > Anything we lose from a dev testability standpoint? The discussion of
>> > > defaults sounds reasonable to me, and I'd like to understand any other
>> > > tradeoffs there may be for non-prod deployments like full dev.
>> > >
>> > > On Fri, Nov 16, 2018 at 7:20 AM Ryan Merriman 
>> > wrote:
>> > >
>> > > > Most of the research I've done around adding Metron as a Knox
>> service
>> > is
>> > > > based on how other projects do it.  The documentation is not easy to
>> > > follow
>> > > > so I learned by reading other service definition files.  The
>> assumption
>> > > > that we are doing things drastically different is false.
>> > > >
>> > > > I completely agree with Simon.  Why would we want to be dependent on
>> > > Knox's
>> > > > release cycle?  How does that benefit us?  It may reduce some
>> > operational
>> > > > complexity but it makes our install process more complicated
>> because we
>> > > > require a certain version of Knox (who knows when that gets
>> released).
>> > > > What do we do in the meantime?  I would also like to point out that
>> > > Metron
>> > > > is inherently different than other Hadoop stack services.  We are a
>> > > > full-blown application with multiple UIs so the way we expose
>> services
>> > > > through Knox may be a little different.
>> > > >
>> > > > I think this will be easier to discuss when we can all see what is
>> > > actually
>> > > > involved.  I am working on a PR that adds Metron as a Knox service
>> and
>> > > will
>> > > > have that out soon.  That should give everyone more context.
>> > > >
>> > > > On Fri, Nov 16, 2018 at 7:39 AM Simon Elliston Ball <
>> > > > si...@simonellistonball.com> wrote:
>> > > >
>> > > > > You could say the same thing about Ambari, but that provides
>> mpacks.
>> > > Knox
>> > > > > is also designed to be extensible through Knox service stacks
>> since
>> > > they
>> > > > > realized they can’t support every project. The challenge is that
>> the
>> > > docs
>> > > > > have not made it as easy as they could for the ecosystem to plug
>> into
>> > > > Knox,
>> > > > > which has led to some confusion around this being a recommended
>> > pattern
>> > > > > (which it is).
>> > > > >
>> > > > > The danger of trying to get your bits into Knox is that that ties
>> you
>> > > to
>> > > > > their release cycle (a problem Ambari has felt hard, hence their
>> > > > community
>> > > > > is moving away from the everything inside model towards
>> everything is
>> > > an
>> > > > > mpack).
>> > > > >
>> > > > > A number of implementations of Knox also use the approach Ryan is
>> > > > > suggesting for their own organization specific end points, so it’s
>> > not
>> > > > like
>> > > > > this is an uncommon, or anti-pattern, it’s more the way Knox is
>> > > designed
>>

Re: [DISCUSS] Knox SSO feature branch review and features

2018-11-16 Thread Ryan Merriman
I would also add that defaulting to Knox being on simplifies things at a
technical level.

On Fri, Nov 16, 2018 at 10:52 AM Michael Miklavcic <
michael.miklav...@gmail.com> wrote:

> That's fantastic, thanks for that detail.
>
> Also, I'm in agreement with the recent comments from Otto and Simon.
>
> On Fri, Nov 16, 2018 at 9:49 AM Ryan Merriman  wrote:
>
> > I was still able to spin up the UI locally and debug in my testing.  I am
> > in complete agreement, we need to ensure the developer experience doesn't
> > change.
> >
> > On Fri, Nov 16, 2018 at 10:47 AM Michael Miklavcic <
> > michael.miklav...@gmail.com> wrote:
> >
> > > Ryan, what's remote debugging look like for UI testing with Knox
> enabled?
> > > Anything we lose from a dev testability standpoint? The discussion of
> > > defaults sounds reasonable to me, and I'd like to understand any other
> > > tradeoffs there may be for non-prod deployments like full dev.
> > >
> > > On Fri, Nov 16, 2018 at 7:20 AM Ryan Merriman 
> > wrote:
> > >
> > > > Most of the research I've done around adding Metron as a Knox service
> > is
> > > > based on how other projects do it.  The documentation is not easy to
> > > follow
> > > > so I learned by reading other service definition files.  The
> assumption
> > > > that we are doing things drastically different is false.
> > > >
> > > > I completely agree with Simon.  Why would we want to be dependent on
> > > Knox's
> > > > release cycle?  How does that benefit us?  It may reduce some
> > operational
> > > > complexity but it makes our install process more complicated because
> we
> > > > require a certain version of Knox (who knows when that gets
> released).
> > > > What do we do in the meantime?  I would also like to point out that
> > > Metron
> > > > is inherently different than other Hadoop stack services.  We are a
> > > > full-blown application with multiple UIs so the way we expose
> services
> > > > through Knox may be a little different.
> > > >
> > > > I think this will be easier to discuss when we can all see what is
> > > actually
> > > > involved.  I am working on a PR that adds Metron as a Knox service
> and
> > > will
> > > > have that out soon.  That should give everyone more context.
> > > >
> > > > On Fri, Nov 16, 2018 at 7:39 AM Simon Elliston Ball <
> > > > si...@simonellistonball.com> wrote:
> > > >
> > > > > You could say the same thing about Ambari, but that provides
> mpacks.
> > > Knox
> > > > > is also designed to be extensible through Knox service stacks since
> > > they
> > > > > realized they can’t support every project. The challenge is that
> the
> > > docs
> > > > > have not made it as easy as they could for the ecosystem to plug
> into
> > > > Knox,
> > > > > which has led to some confusion around this being a recommended
> > pattern
> > > > > (which it is).
> > > > >
> > > > > The danger of trying to get your bits into Knox is that that ties
> you
> > > to
> > > > > their release cycle (a problem Ambari has felt hard, hence their
> > > > community
> > > > > is moving away from the everything inside model towards everything
> is
> > > an
> > > > > mpack).
> > > > >
> > > > > A number of implementations of Knox also use the approach Ryan is
> > > > > suggesting for their own organization specific end points, so it’s
> > not
> > > > like
> > > > > this is an uncommon, or anti-pattern, it’s more the way Knox is
> > > designed
> > > > to
> > > > > work in the future, than the legacy of it only being able to
> handle a
> > > > > subset of Hadoop projects.
> > > > >
> > > > > Knox remains optional in our scenario, but we keep control over the
> > > > > shipping of things like rewrite rules, which allows Metron to
> control
> > > its
> > > > > release destiny should things like url patterns in the ui need to
> > > change
> > > > > (with a new release of angular / new module / new rest endpoint
> etc)
> > > > > instead of making a Metron release dependent on a Knox release.
> > > > >
> > > > > Imag

Re: [DISCUSS] Knox SSO feature branch review and features

2018-11-16 Thread Ryan Merriman
I was still able to spin up the UI locally and debug in my testing.  I am
in complete agreement, we need to ensure the developer experience doesn't
change.

On Fri, Nov 16, 2018 at 10:47 AM Michael Miklavcic <
michael.miklav...@gmail.com> wrote:

> Ryan, what's remote debugging look like for UI testing with Knox enabled?
> Anything we lose from a dev testability standpoint? The discussion of
> defaults sounds reasonable to me, and I'd like to understand any other
> tradeoffs there may be for non-prod deployments like full dev.
>
> On Fri, Nov 16, 2018 at 7:20 AM Ryan Merriman  wrote:
>
> > Most of the research I've done around adding Metron as a Knox service is
> > based on how other projects do it.  The documentation is not easy to
> follow
> > so I learned by reading other service definition files.  The assumption
> > that we are doing things drastically different is false.
> >
> > I completely agree with Simon.  Why would we want to be dependent on
> Knox's
> > release cycle?  How does that benefit us?  It may reduce some operational
> > complexity but it makes our install process more complicated because we
> > require a certain version of Knox (who knows when that gets released).
> > What do we do in the meantime?  I would also like to point out that
> Metron
> > is inherently different than other Hadoop stack services.  We are a
> > full-blown application with multiple UIs so the way we expose services
> > through Knox may be a little different.
> >
> > I think this will be easier to discuss when we can all see what is
> actually
> > involved.  I am working on a PR that adds Metron as a Knox service and
> will
> > have that out soon.  That should give everyone more context.
> >
> > On Fri, Nov 16, 2018 at 7:39 AM Simon Elliston Ball <
> > si...@simonellistonball.com> wrote:
> >
> > > You could say the same thing about Ambari, but that provides mpacks.
> Knox
> > > is also designed to be extensible through Knox service stacks since
> they
> > > realized they can’t support every project. The challenge is that the
> docs
> > > have not made it as easy as they could for the ecosystem to plug into
> > Knox,
> > > which has led to some confusion around this being a recommended pattern
> > > (which it is).
> > >
> > > The danger of trying to get your bits into Knox is that that ties you
> to
> > > their release cycle (a problem Ambari has felt hard, hence their
> > community
> > > is moving away from the everything inside model towards everything is
> an
> > > mpack).
> > >
> > > A number of implementations of Knox also use the approach Ryan is
> > > suggesting for their own organization specific end points, so it’s not
> > like
> > > this is an uncommon, or anti-pattern, it’s more the way Knox is
> designed
> > to
> > > work in the future, than the legacy of it only being able to handle a
> > > subset of Hadoop projects.
> > >
> > > Knox remains optional in our scenario, but we keep control over the
> > > shipping of things like rewrite rules, which allows Metron to control
> its
> > > release destiny should things like url patterns in the ui need to
> change
> > > (with a new release of angular / new module / new rest endpoint etc)
> > > instead of making a Metron release dependent on a Knox release.
> > >
> > > Imagine how we would have done with the Ambari side if we’d had to wait
> > > for them to release every time we needed to change something in the
> > > mpack... we don’t want that happening with Knox.
> > >
> > > Simon
> > >
> > > > On 16 Nov 2018, at 13:22, Otto Fowler 
> wrote:
> > > >
> > > >
> > >
> >
> https://issues.apache.org/jira/browse/KNOX-841?jql=project%20%3D%20KNOX%20AND%20text%20~%20support
> > > >
> > > > Solr is angular for example.
> > > >
> > > >
> > > > On November 16, 2018 at 08:12:55, Otto Fowler (
> ottobackwa...@gmail.com
> > )
> > > > wrote:
> > > >
> > > > Ok,  here is something I don’t understand, but would like to.
> > > >
> > > > Knox comes configured with built-in services for a number of other
> > > > Apache products and UIs.
> > > > It would seem to me, that the best integration with Knox would be to
> do
> > > > what these other products have done.
> > > >
> > > >
> > > > 1. Do whatever you have to do to make 

Re: [DISCUSS] Knox SSO feature branch review and features

2018-11-16 Thread Ryan Merriman
Most of the research I've done around adding Metron as a Knox service is
based on how other projects do it.  The documentation is not easy to follow
so I learned by reading other service definition files.  The assumption
that we are doing things drastically different is false.

I completely agree with Simon.  Why would we want to be dependent on Knox's
release cycle?  How does that benefit us?  It may reduce some operational
complexity but it makes our install process more complicated because we
require a certain version of Knox (who knows when that gets released).
What do we do in the meantime?  I would also like to point out that Metron
is inherently different than other Hadoop stack services.  We are a
full-blown application with multiple UIs so the way we expose services
through Knox may be a little different.

I think this will be easier to discuss when we can all see what is actually
involved.  I am working on a PR that adds Metron as a Knox service and will
have that out soon.  That should give everyone more context.

On Fri, Nov 16, 2018 at 7:39 AM Simon Elliston Ball <
si...@simonellistonball.com> wrote:

> You could say the same thing about Ambari, but that provides mpacks. Knox
> is also designed to be extensible through Knox service stacks since they
> realized they can’t support every project. The challenge is that the docs
> have not made it as easy as they could for the ecosystem to plug into Knox,
> which has led to some confusion around this being a recommended pattern
> (which it is).
>
> The danger of trying to get your bits into Knox is that that ties you to
> their release cycle (a problem Ambari has felt hard, hence their community
> is moving away from the everything inside model towards everything is an
> mpack).
>
> A number of implementations of Knox also use the approach Ryan is
> suggesting for their own organization specific end points, so it’s not like
> this is an uncommon, or anti-pattern, it’s more the way Knox is designed to
> work in the future, than the legacy of it only being able to handle a
> subset of Hadoop projects.
>
> Knox remains optional in our scenario, but we keep control over the
> shipping of things like rewrite rules, which allows Metron to control its
> release destiny should things like url patterns in the ui need to change
> (with a new release of angular / new module / new rest endpoint etc)
> instead of making a Metron release dependent on a Knox release.
>
> Imagine how we would have done with the Ambari side if we’d had to wait
> for them to release every time we needed to change something in the
> mpack... we don’t want that happening with Knox.
>
> Simon
>
> > On 16 Nov 2018, at 13:22, Otto Fowler  wrote:
> >
> >
> https://issues.apache.org/jira/browse/KNOX-841?jql=project%20%3D%20KNOX%20AND%20text%20~%20support
> >
> > Solr is angular for example.
> >
> >
> > On November 16, 2018 at 08:12:55, Otto Fowler (ottobackwa...@gmail.com)
> > wrote:
> >
> > Ok,  here is something I don’t understand, but would like to.
> >
> > Knox comes configured with built-in services for a number of other Apache
> > products and UIs.
> > It would seem to me, that the best integration with Knox would be to do
> > what these other products have done.
> >
> >
> > 1. Do whatever you have to do to make your own stuff compatible.
> > 2. Create a knox service definition and provide it or try to get it into
> > knox itself
> >
> > This would make the knox integration with metron optional and pluggable
> > wouldn’t it?
> >
> > Then knox with metron would just be the same as knox with anything else.
> > Please help me if I am wrong, but we seem to be going our own way here.
> > Why don’t we just do what these other products have done?
> > Why don’t we try to get apache metron services accepted to the knox
> > project?  Why don’t we model our knox integration with how XYZ does it?
> > Have we looked at how others integrate?   Having all the code and being
> > able to track stuff is kind of the point of this whole thing isn’t it?
> >
> > Maybe this is implied and I’m missing it, if so I apologize.
> >
> > I think consistency with the rest of the hadoop stack with knox helps us.
> >
> >
> >
> > On November 15, 2018 at 22:20:00, Ryan Merriman (merrim...@gmail.com)
> wrote:
> >
> > 1) Sorry I misspoke. I meant to say this is not possible in the Alerts UI
> > as far as I know. I put up a PR with a proposed solution here:
> > https://github.com/apache/metron/pull/1266.
> > 2) Yes Knox is a service you can install with Ambari, similar to Ranger
> or
> > Spark. There are some things that are specifically confi

Re: [DISCUSS] Knox SSO feature branch review and features

2018-11-15 Thread Ryan Merriman
1) Sorry I misspoke.  I meant to say this is not possible in the Alerts UI
as far as I know.  I put up a PR with a proposed solution here:
https://github.com/apache/metron/pull/1266.
2) Yes Knox is a service you can install with Ambari, similar to Ranger or
Spark.  There are some things that are specifically configured in Knox and
there are some things specific to Metron.  I will put up a PR with the
changes needed so you can see exactly what is involved.
3) I don't understand what you mean here.  Is this a question?
4) I think it's a little early to predict the Ambari changes required.
This will depend on how tasks 1-3 go.  I imagine it's similar to other
mpack work:  expose some parameters in Ambari and bind those to config
files.  My understanding from this thread so far is that we should focus on
a manual, documented approach to start.

On Thu, Nov 15, 2018 at 7:53 PM Michael Miklavcic <
michael.miklav...@gmail.com> wrote:

> Thanks Ryan,
>
> 1) Can you clarify "not a good way to do this?" Are you saying we don't
> have a way to set this and need to add the config option, or that a
> solution is not obvious and it's unclear what to do? It seems to me you're
> saying the former, but I'd like to be sure.
> 2) Is Knox not a service made available by Ambari similar to Ranger or
> Spark? I'm assuming that similar to Kerberos, there are some things that
> are specifically configured in Knox and others that are app-specific. Some
> explanation of what this looks like would be helpful.
> 3) Sounds like this follows pretty naturally from #1
> 4) Relates to #2. I think we need some guidance on what a manual vs
> MPack/automated install would look like.
>
> Cheers,
> Mike
>
>
> On Thu, Nov 15, 2018 at 4:07 PM Ryan Merriman  wrote:
>
> > Wanted to give an update on the context path issue.  I investigated
> > rewriting url links in the outgoing static assets with Knox and it was not
> > trivial.  Fortunately I found a simple solution that works with or without
> > Knox.  I changed the base tag in index.html from <base href="/"> to
> > <base href="./">, or in other words made the base href relative.
> >
> > I believe I am at the point where I can task this out and provide a high
> > level overview of the changes needed.  I think that each task will be a
> > manageable size and can stand alone so I don't think we need a feature
> > branch.
> >
> > The first task involves a general change to the UI code.  We need a way
> to
> > set the path to the REST service with a configuration setting because it
> is
> > different with and without Knox.  Currently there is not a good way to do
> > this in the UI.  We can use the environment files but that is a build
> time
> > setting and is not flexible.  I can see this capability being useful for
> > other use cases in the future.  I think we could even split this up into
> 2
> > separate tasks, one for the alerts UI and one for the management UI.
> >
> > The second task involves adding Knox to our stack either by default as a
> > dependency in the mpack or with a documented approach.  We would add our
> > REST service, Alerts UI, and Management UI as services in Knox.
> Everything
> > would continue to function as it currently does but with all
> communication
> > going through Knox.  LDAP authentication would be required when using
> Knox
> > and Knox will authenticate with the REST service by passing along an
> > Authorization header.  Enabling Knox would be a manual process that
> > involves deploying assets (Knox descriptor files) and changing
> > configuration.  There would be no change to how the UI functions by
> default
> > (without Knox) and either LDAP or JDBC authentication could still be used.
> >
> > The third task involves enabling SSO with Knox.  We would update the REST
> > service so that it can authenticate with a Knox SSO token.  We would
> > provide documentation on how to update the Knox settings in Ambari to
> > enable SSO.  The Alerts/Management UI would need to expose configuration
> > properties for the REST url and login url since these would be different
> > when Knox is enabled.  We would also need to provide documentation on how
> > to make these UI configuration changes, based on the work done in task 1.
> >
> > An optional fourth task would be exposing configuration settings and
> > enabling Knox with Ambari.  We would eliminate the manual steps necessary
> > for enabling Knox and instead automate those steps with an Ambari input
> > control, similar to how LDAP is enabled.
> >
> > Thoughts on this plan?
> >
> > On Thu, Nov 15, 2018 at 10:22 AM James Sirota 
> wrote:

Re: [DISCUSS] Knox SSO feature branch review and features

2018-11-15 Thread Ryan Merriman
Wanted to give an update on the context path issue.  I investigated
rewriting url links in the outgoing static assets with Knox and it was not
trivial.  Fortunately I found a simple solution that works with or without
Knox.  I changed the base tag in index.html from <base href="/"> to <base href="./">, or in other words made the base href relative.

I believe I am at the point where I can task this out and provide a high
level overview of the changes needed.  I think that each task will be a
manageable size and can stand alone so I don't think we need a feature
branch.

The first task involves a general change to the UI code.  We need a way to
set the path to the REST service with a configuration setting because it is
different with and without Knox.  Currently there is not a good way to do
this in the UI.  We can use the environment files but that is a build time
setting and is not flexible.  I can see this capability being useful for
other use cases in the future.  I think we could even split this up into 2
separate tasks, one for the alerts UI and one for the management UI.

The second task involves adding Knox to our stack either by default as a
dependency in the mpack or with a documented approach.  We would add our
REST service, Alerts UI, and Management UI as services in Knox.  Everything
would continue to function as it currently does but with all communication
going through Knox.  LDAP authentication would be required when using Knox
and Knox will authenticate with the REST service by passing along an
Authorization header.  Enabling Knox would be a manual process that
involves deploying assets (Knox descriptor files) and changing
configuration.  There would be no change to how the UI functions by default
(without Knox) and either LDAP or JDBC authentication could still be used.

The third task involves enabling SSO with Knox.  We would update the REST
service so that it can authenticate with a Knox SSO token.  We would
provide documentation on how to update the Knox settings in Ambari to
enable SSO.  The Alerts/Management UI would need to expose configuration
properties for the REST url and login url since these would be different
when Knox is enabled.  We would also need to provide documentation on how
to make these UI configuration changes, based on the work done in task 1.
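To make the token handling in the third task concrete, here is a minimal sketch (plain JDK, hypothetical class and method names) of pulling claims out of the JWT that Knox SSO issues, by default in the "hadoop-jwt" cookie. It deliberately skips signature verification, which a real REST filter must perform against Knox's public key before trusting any claim:

```java
import java.nio.charset.StandardCharsets;
import java.util.Base64;

/**
 * Sketch only: shows how a REST filter could read the payload of the JWT
 * that Knox SSO issues.  A real implementation MUST verify the token
 * signature against Knox's public key before trusting any claim; that
 * step is omitted here.  Names are hypothetical.
 */
public class KnoxSsoTokenSketch {

    /** Decode the middle (payload) segment of a JWT without verifying it. */
    public static String decodePayload(String jwt) {
        String[] parts = jwt.split("\\.");
        if (parts.length < 2) {
            throw new IllegalArgumentException("Not a JWT: " + jwt);
        }
        byte[] json = Base64.getUrlDecoder().decode(parts[1]);
        return new String(json, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // Build a toy unsigned token just to exercise the decoder.
        Base64.Encoder enc = Base64.getUrlEncoder().withoutPadding();
        String header = enc.encodeToString("{\"alg\":\"none\"}".getBytes(StandardCharsets.UTF_8));
        String payload = enc.encodeToString("{\"sub\":\"metron\"}".getBytes(StandardCharsets.UTF_8));
        String token = header + "." + payload + ".";
        System.out.println(decodePayload(token));  // {"sub":"metron"}
    }
}
```

The claim extraction itself is trivial; the real work in task 3 is wiring signature verification and the login-redirect handling into the existing Spring security chain.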

An optional fourth task would be exposing configuration settings and
enabling Knox with Ambari.  We would eliminate the manual steps necessary
for enabling Knox and instead automate those steps with an Ambari input
control, similar to how LDAP is enabled.

Thoughts on this plan?

On Thu, Nov 15, 2018 at 10:22 AM James Sirota  wrote:

> In my view Knox SSO is such a minor feature when it comes to Metron's
> capabilities than it's not worth supporting multiple scenarios where it
> works with Knox or without Knox.  Where we should be configurable (and are
> configurable) is on the analytics and stream processing.  But this?  As
> long as the UI authenticates securely I don't think anyone is going to care
> what proxy it's using.  The code itself should be written in a way that
> it's pluggable so if we ever wanted to use another proxy or disable it all
> together we could.  But this should not be a configuration we pass on to
> the user.  The added complexity is simply not worth it here.  We have to
> start being opinionated about making sensible choices on behalf of the
> user.  A sensible choice here is to run with Knox and LDAP.  The JDBC
> component should exist for another release to allow the community to
> migrate over to LDAP and then be deprecated.  The code should still be
> pluggable and if anyone wanted to extend it to work with JDBC they could,
> or if people wanted to plug in another proxy they could, but this is not
> something we would officially support.
>
> Thanks,
> James
>
> 12.11.2018, 07:36, "Ryan Merriman" :
> > Let me clarify on exposing both legacy and Knox URLs at the same time.
> The
> > base urls will look something like this:
> >
> > Legacy REST - http://node1:8082/api/v1
> > Legacy Alerts UI - http://node1:4201/alerts-list
> >
> > Knox REST - https://node1:8443/gateway/default/metron/api/v1
> > Knox Alerts UI -
> > https://node1:8443/gateway/default/metron-alerts-ui/alerts-list
> >
> > If Knox were turned on and the alerts UI deployed as is, it would not
> > work. This is because static assets are referenced with
> > http://node1:4201/assets/some-asset.js which does not include the
> correct
> > context path to the alerts UI in knox. To make it work, you have to set
> > the base ref to "/gateway/default/metron-alerts-ui" so that static assets
> > are referenced at
> > https://node1:8443/gateway/default/metron-alerts-ui/assets/some-asset.js
> .
> > When you do that, the legacy alerts UI will no longer work. I guess the
>

Re: [DISCUSS] Knox SSO feature branch review and features

2018-11-12 Thread Ryan Merriman
I'm just coming up to speed on Knox so maybe rewriting assets links are
trivial.  If anyone has a good example of how to do that or can point to
some documentation, please share.

On Mon, Nov 12, 2018 at 8:54 AM Simon Elliston Ball <
si...@simonellistonball.com> wrote:

> Doing the Knox proxy work first certainly does make a lot of sense vs the
> SSO first approach, so I'm in favour of this. It bypasses all the anti-CORS
> proxying stuff the other solution needed by being on the same URL space.
>
> Is there are reason we're not re-writing the asset link URLs in Knox? We
> should have a reverse content rewrite rule to avoid that problem and make
> it entirely transparent whether there is Knox or not. We shouldn't be
> changing anything about the UI services themselves. If the rewrite service
> is complete, there is no change to base ref in the UI code, Knox would
> effectively apply it by content filtering. Note also that the gateway URL
> is configurable and likely to vary from Knox to Knox, so baking it into the
> ng build will break non-full-dev builds. (e.g. gateway/default could well
> be gateway/xyz).
>
> I would also like to discuss removing the JDBC auth, because it's a set of
> plaintext passwords in a mysql DB... it introduces a problematic dependency
> (mysql) a ton of java dependencies we could cut out (JPA, eclipselink) and
> opens up a massive security hole. I personally know of several
> organisations who are blocked from using Metron by the presence of the JDBC
> authentication method in its current form.
>
> Simon
>
> On Mon, 12 Nov 2018 at 14:36, Ryan Merriman  wrote:
>
> > Let me clarify on exposing both legacy and Knox URLs at the same time.
> The
> > base urls will look something like this:
> >
> > Legacy REST - http://node1:8082/api/v1
> > > Legacy Alerts UI - http://node1:4201/alerts-list
> >
> > Knox REST - https://node1:8443/gateway/default/metron/api/v1
> > Knox Alerts UI -
> > https://node1:8443/gateway/default/metron-alerts-ui/alerts-list
> >
> > If Knox were turned on and the alerts UI deployed as is, it would not
> > work.  This is because static assets are referenced with
> > http://node1:4201/assets/some-asset.js which does not include the
> correct
> > context path to the alerts UI in knox.  To make it work, you have to set
> > the base ref to "/gateway/default/metron-alerts-ui" so that static assets
> > are referenced at
> > https://node1:8443/gateway/default/metron-alerts-ui/assets/some-asset.js
> .
> > When you do that, the legacy alerts UI will no longer work.  I guess the
> > point I'm trying to make is that we would have to switch between them or
> > have 2 separate application running.  I imagine most users only need one
> or
> > the other running so probably not an issue.
> >
> > Jon, the primary upgrade consideration I see is with authentication.  To
> be
> > able to use Knox, you would have to upgrade to LDAP-based authentication
> if
> > you were still using JDBC-based authentication in REST.  The urls would
> > also change obviously.
> >
> > On Sun, Nov 11, 2018 at 6:38 PM zeo...@gmail.com 
> wrote:
> >
> > > Phew, that was quite the thread to catch up on.
> > >
> > > I agree that this should be optional/pluggable to start, and I'm
> > interested
> > > to hear the issues as they relate to upgrading an existing cluster
> (given
> > > the suggested approach) and exposing both legacy and knox URLs at the
> > same
> > > time.
> > >
> > > Jon
> > >
> > > On Fri, Nov 9, 2018, 4:46 PM Michael Miklavcic <
> > > michael.miklav...@gmail.com>
> > > wrote:
> > >
> > > > A couple more things, and I think this goes without saying - whatever
> > we
> > > do
> > > > with Knox should NOT
> > > >
> > > >1. Require unit and integration tests to use Knox
> > > >2. Break fulldev
> > > >
> > > > Also, I don't know that I saw you mention this, but I'm unsure how we
> > > > should leverage Knox as a core piece of the platform. i.e. should we
> > make
> > > > this required or optional? I'm open to hearing opinions on this, but
> > I'm
> > > > inclined to keep this a pluggable option.
> > > >
> > > > Mike
> > > >
> > > >
> > > > On Fri, Nov 9, 2018 at 2:42 PM Michael Miklavcic <
> > > > michael.miklav...@gmail.com> wrote:
> > > >
> > > > > Thanks for the update Ryan. Per my earlier comments, I thought it
> >

Re: [DISCUSS] Knox SSO feature branch review and features

2018-11-09 Thread Ryan Merriman
curity.
> > > > >>
> > > > >>  The version of Knox used is the default from HDP. The link
> version
> > > you
> > > > >>  mention is a docs link. I'll update it to be the older version,
> > which
> > > > is
> > > > >>  the same and we can decide if we want to maintain the freshness
> of
> > it
> > > > when
> > > > >>  we look to upgrade underlying patterns. Either way, the content
> is
> > > the
> > > > >>  same.
> > > > >>
> > > > >>  I did consider other hosting mechanisms, including Undertow a
> > > > >>
> > > > >>  If you have a different suggestion to using the Spring default
> ways
> > > of
> > > > >>  doing things, or we want to use a framework other than Spring for
> > > this,
> > > > >>  then maybe we could change to that, but the route chosen here is
> > > > definitely
> > > > >>  the easy path in the context of the decision made to use Spring
> in
> > > > metron
> > > > >>  rest, and if anything opens up our choices while minimising, in
> > fact
> > > > >>  reducing, our dependency management overhead.
> > > > >>
> > > > >>  I hope that explains some of the thinking behind the choices
> made,
> > > but
> > > > the
> > > > >>  guiding principals I followed were:
> > > > >>  * Don't fight the framework if you don't have to
> > > > >>  * Reduce the need for additional installation pieces and third
> > party
> > > > repos
> > > > >>  * Minimize dependencies we would have to manage
> > > > >>  * Avoid excessive change of the architecture, or forcing users to
> > > adopt
> > > > >>  Knox if they didn't want the SSL overhead.
> > > > >>
> > > > >>  Simon
> > > > >>
> > > > >>  On Tue, 18 Sep 2018 at 02:46, Michael Miklavcic <
> > > > >>  michael.miklav...@gmail.com> wrote:
> > > > >>
> > > > >>>  Thanks for the write-up Ryan, this is a great start. I have some
> > > > further
> > > > >>>  questions based on your feedback and in addition to my initial
> > > thread.
> > > > >>>
> > > > >>>  Just for clarification, what version of Knox are we using? HDP
> > > 2.6.5,
> > > > >>>  which
> > > > >>>  is what we currently run full dev against, supports 0.12.0.
> > > > >>>
> > > > >>>
> > > >
> > >
> >
> https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.5/bk_release-notes/content/comp_versions.html
> > > > >>>  .
> > > > >>>  I see references to Knox 1.1.0 (latest) in this committed PR -
> > > > >>>
> > > > >>>
> > > >
> > >
> >
> https://github.com/apache/metron/pull//files#diff-70b412194819f3cb829566f05d77c1a6R122
> > > > >>>  .
> > > > >>>  This is probably just a super small mismatch, and it probably
> goes
> > > > without
> > > > >>>  saying, but I just want to be doubly sure that we're installing
> > the
> > > > >>>  default
> > > > >>>  via the standard install mechanism as opposed to something
> > separate
> > > > and
> > > > >>>  manual.
> > > > >>>
> > > > >>>  On the subject of Zuul wrt Nodejs filters. I'd like to hear some
> > > more
> > > > >>>  detail on:
> > > > >>>
> > > > >>> 1. Why do we need filtering via Zuul? For instance, is
> > filtering
> > > > >>>  routing
> > > > >>> not handled by Knox? From the beginner docs: "The gateway
> > itself
> > > > is a
> > > > >>>  layer
> > > > >>> over an embedded Jetty JEE server. At the very highest level
> > the
> > > > >>>  gateway
> > > > >>> processes requests by using request URLs to lookup specific
> JEE
> > > > Servlet
> > > > >>> Filter chain that is used to process the request. The gateway
> > > > framework
>

Re: [DISCUSS] Deprecate split-join enrichment topology in favor of unified enrichment topology

2018-11-01 Thread Ryan Merriman
+1

On Thu, Nov 1, 2018 at 5:38 PM Casey Stella  wrote:

> +1
> On Thu, Nov 1, 2018 at 18:34 Nick Allen  wrote:
>
> > +1
> >
> > On Thu, Nov 1, 2018, 6:27 PM Justin Leet  wrote:
> >
> > > +1, I haven't seen any case where the split-join topology isn't made
> > > obsolete by the unified topology.
> > >
> > > On Thu, Nov 1, 2018 at 6:17 PM Michael Miklavcic <
> > > michael.miklav...@gmail.com> wrote:
> > >
> > > > Fellow Metronians,
> > > >
> > > > We've had the unified enrichment topology around for a number of
> months
> > > > now, it has proved itself stable, and there is yet to be a time that
> I
> > > have
> > > > seen the split-join topology outperform the unified one. Here are
> some
> > > > simple reasons to deprecate the split-join topology.
> > > >
> > > >1. Unified topology performs better.
> > > >2. The configuration, especially for performance tuning is much,
> > much
> > > >simpler in the unified model.
> > > >3. The footprint within the cluster is smaller.
> > > >4. One of the first activities for any install is that we spend
> time
> > > >instructing users to switch to the unified topology.
> > > >5. One less moving part to maintain.
> > > >
> > > > I'd like to recommend that we deprecate the split-join topology and
> > make
> > > > the unified enrichment topology the new default.
> > > >
> > > > Best,
> > > > Mike
> > > >
> > >
> >
>


Re: [DISCUSS] Stellar REST client

2018-10-31 Thread Ryan Merriman
Just FYI, I put a PR up for this feature:
https://github.com/apache/metron/pull/1250.  I believe I've addressed
everyone's questions either in the PR description or the code itself.  I
think we can continue the discussion there.

On Fri, Oct 19, 2018 at 12:02 PM Otto Fowler 
wrote:

> I believe the issue of introducing and supporting higher latency
> enrichments is a systemic one, and should be solved as such,
> with the rest and other higher latency enrichments build on top of that
> framework.
>
>
>
>
> On October 19, 2018 at 12:22:28, Ryan Merriman (merrim...@gmail.com)
> wrote:
>
> Thanks Casey, good questions.
>
> As far as the verbs go, just thinking we might want to support calls other
> than GET at some point. For the use case stated (enriching messages from
> 3rd party services) GET is all we need. Probably a moot point anyways
> since every http library will support the different HTTP verbs.
>
> Agreed on the caching. I will defer to those that are more familiar with
> the Stellar internals on what the correct approach is.
>
> I was thinking the same thing with regards to the client libraries. Apache
> HttpComponents is probably the safest choice but OkHttp looks nice and
> could reduce effort and complexity as long as it meets our requirements.
>
> On Fri, Oct 19, 2018 at 10:58 AM Casey Stella  wrote:
>
> > I think it makes a lot of sense. A couple of questions:
> >
> > - What actions do you see the REST verbs corresponding to? I would
> > understand GET (which is in effect "evaluate an expression", right?),
> > but
> > I'm not sure about the others.
> > - We should probably be careful about caching stellar expressions. Not
> > all stellar expressions are deterministic (e.g. PROFILE_GET may not be
> > as
> > the lookback window is bound to current time). Ultimately, I think we
> > should probably bake whether a function is deterministic into stellar so
> > that *stellar* can cache where appropriate (e.g. if every part of an
> > expression is deterministic, then pull from cache otherwise recompute).
> > All of this to say, if you're going to make it configurable, IMO we
> > should
> > make it a configuration that the user passes in at request time so they
> > have the control over whether the expression is safe to cache or
> > otherwise.
> >
> > Without more compelling reasons to not do so, I'd suggest we use HTTP
> > Components as it's another apache project and under active
> > development/support. I'd also be ok with OkHttp if it's actively
> > maintained.
> >
> > On Fri, Oct 19, 2018 at 11:46 AM Ryan Merriman 
> > wrote:
> >
> > > I want to open up discussion around adding a Stellar REST client
> > function.
> > > There are services available to enrich security telemetry and they are
> > > commonly exposed through a REST interface. The primary purpose of this
> > > discuss thread is to collect requirements from the community and agree on
> a
> > > general architectural approach.
> > >
> > > At a minimum I see a Stellar REST client supporting:
> > >
> > > - Common HTTP verbs including GET, POST, DELETE, etc
> > > - Option to provide headers and request parameters as needed
> > > - Support for basic authentication
> > > - Proper request and error handling (we can discuss further how this
> > > should work)
> > > - SSL support
> > > - Option to use a proxy server (including authentication)
> > > - JSON format
> > >
> > > In addition to these functional requirements, I would also propose we
> > > include these performance requirements:
> > >
> > > - Provide a configurable caching layer
> > > - Provide a mechanism for pooling connections
> > > - Provide clear documentation and guidance on how to properly use this
> > > feature since there is a significant risk of introducing latency
> > issues
> > >
> > > What else would you like to see included?
> > >
> > > I think the primary architectural decision we need to make (based on
> the
> > > agreed upon requirements of course) is an appropriate Java HTTP/REST
> > client
> > > library. Ideally we choose a library that supports everything we need
> > > OOTB. I think the majority of the work for this feature will involve
> > > wrapping this library in a Stellar function and exposing the
> > configuration
> > > knobs through Metron's configuration interface (Ambari, Zookeeper,
> > etc). I
> > > have done some very light research and here is my initial list:
>

Re: [DISCUSS] Stellar REST client

2018-10-19 Thread Ryan Merriman
Thanks Casey, good questions.

As far as the verbs go, just thinking we might want to support calls other
than GET at some point.  For the use case stated (enriching messages from
3rd party services) GET is all we need.  Probably a moot point anyways
since every http library will support the different HTTP verbs.

Agreed on the caching.  I will defer to those that are more familiar with
the Stellar internals on what the correct approach is.

I was thinking the same thing with regards to the client libraries.  Apache
HttpComponents is probably the safest choice but OkHttp looks nice and
could reduce effort and complexity as long as it meets our requirements.

On Fri, Oct 19, 2018 at 10:58 AM Casey Stella  wrote:

> I think it makes a lot of sense.  A couple of questions:
>
>- What actions do you see the REST verbs corresponding to?  I would
>understand GET (which is in effect "evaluate an expression", right?),
> but
>I'm not sure about the others.
>- We should probably be careful about caching stellar expressions.  Not
>all stellar expressions are deterministic (e.g. PROFILE_GET may not be
> as
>the lookback window is bound to current time).  Ultimately, I think we
>should probably bake whether a function is deterministic into stellar so
>that *stellar* can cache where appropriate (e.g. if every part of an
>expression is deterministic, then pull from cache otherwise recompute).
>All of this to say, if you're going to make it configurable, IMO we
> should
>make it a configuration that the user passes in at request time so they
>have the control over whether the expression is safe to cache or
> otherwise.
>
> Without more compelling reasons to not do so, I'd suggest we use HTTP
> Components as it's another apache project and under active
> development/support.  I'd also be ok with OkHttp if it's actively
> maintained.
>
> On Fri, Oct 19, 2018 at 11:46 AM Ryan Merriman 
> wrote:
>
> > I want to open up discussion around adding a Stellar REST client
> function.
> > There are services available to enrich security telemetry and they are
> > commonly exposed through a REST interface.  The primary purpose of this
> > discuss thread is to collect requirements from the community and agree on a
> > general architectural approach.
> >
> > At a minimum I see a Stellar REST client supporting:
> >
> >- Common HTTP verbs including GET, POST, DELETE, etc
> >- Option to provide headers and request parameters as needed
> >- Support for basic authentication
> >- Proper request and error handling (we can discuss further how this
> >should work)
> >- SSL support
> >- Option to use a proxy server (including authentication)
> >- JSON format
> >
> > In addition to these functional requirements, I would also propose we
> > include these performance requirements:
> >
> >- Provide a configurable caching layer
> >- Provide a mechanism for pooling connections
> >- Provide clear documentation and guidance on how to properly use this
> >feature since there is a significant risk of introducing latency
> issues
> >
> > What else would you like to see included?
> >
> > I think the primary architectural decision we need to make (based on the
> > agreed upon requirements of course) is an appropriate Java HTTP/REST
> client
> > library.  Ideally we choose a library that supports everything we need
> > OOTB.  I think the majority of the work for this feature will involve
> > wrapping this library in a Stellar function and exposing the
> configuration
> > knobs through Metron's configuration interface (Ambari, Zookeeper,
> etc).  I
> > have done some very light research and here is my initial list:
> >
> >- Apache HttpComponents - https://hc.apache.org/
> >- Has support for all of the features listed above as far as I can
> tell
> >   - Doesn't introduce a large number of new dependencies (am I wrong
> >   here?)
> >   - Is sort of included already (we will need to upgrade from
> >   httpclient)
> >   - Lower level
> >- Google HTTP Client Library for Java -
> >
> >
> https://developers.google.com/api-client-library/java/google-http-java-client/
> >- Higher level API with pluggable components
> >   - Introduces dependencies (we've had issues with Guava in the past)
> >- Netflix Ribbon - https://github.com/Netflix/ribbon
> >   - Has a lot of nice features that may be useful in the future
> >   - Introduces dependencies (including guava)
> >   - Hasn't been committed to in the

[DISCUSS] Stellar REST client

2018-10-19 Thread Ryan Merriman
I want to open up discussion around adding a Stellar REST client function.
There are services available to enrich security telemetry and they are
commonly exposed through a REST interface.  The primary purpose of this
discuss thread is to collect requirements from the community and agree on a
general architectural approach.

At a minimum I see a Stellar REST client supporting:

   - Common HTTP verbs including GET, POST, DELETE, etc
   - Option to provide headers and request parameters as needed
   - Support for basic authentication
   - Proper request and error handling (we can discuss further how this
   should work)
   - SSL support
   - Option to use a proxy server (including authentication)
   - JSON format

In addition to these functional requirements, I would also propose we
include these performance requirements:

   - Provide a configurable caching layer
   - Provide a mechanism for pooling connections
   - Provide clear documentation and guidance on how to properly use this
   feature since there is a significant risk of introducing latency issues
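
On the caching requirement specifically, a rough sketch of the semantics I have in mind: a maximum size (LRU eviction) plus a time-to-live per entry, both configurable. In practice we would likely use an existing cache library rather than hand-rolling one, but this stdlib-only example (hypothetical names) shows the two knobs:

```java
import java.util.LinkedHashMap;
import java.util.Map;

/**
 * Sketch only: illustrates the cache semantics a Stellar REST client could
 * expose (configurable max size + TTL).  A real implementation would likely
 * reuse an existing cache library.  Names are hypothetical.
 */
public class ResponseCacheSketch {

    private static final class Entry {
        final String value;
        final long expiresAt;
        Entry(String value, long expiresAt) {
            this.value = value;
            this.expiresAt = expiresAt;
        }
    }

    private final long ttlMillis;
    private final Map<String, Entry> entries;

    public ResponseCacheSketch(final int maxSize, long ttlMillis) {
        this.ttlMillis = ttlMillis;
        // An access-order LinkedHashMap gives us LRU eviction for free.
        this.entries = new LinkedHashMap<String, Entry>(16, 0.75f, true) {
            @Override
            protected boolean removeEldestEntry(Map.Entry<String, Entry> eldest) {
                return size() > maxSize;
            }
        };
    }

    /** Return the cached response for a URL, or null if absent or expired. */
    public synchronized String get(String url) {
        Entry e = entries.get(url);
        if (e == null) {
            return null;
        }
        if (System.currentTimeMillis() > e.expiresAt) {
            entries.remove(url);  // expired: drop it and report a miss
            return null;
        }
        return e.value;
    }

    public synchronized void put(String url, String response) {
        entries.put(url, new Entry(response, System.currentTimeMillis() + ttlMillis));
    }
}
```

Whether the TTL should be a global setting or per-call (as Casey suggested for determinism reasons) is one of the things I would like to settle in this thread.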

What else would you like to see included?

I think the primary architectural decision we need to make (based on the
agreed upon requirements of course) is an appropriate Java HTTP/REST client
library.  Ideally we choose a library that supports everything we need
OOTB.  I think the majority of the work for this feature will involve
wrapping this library in a Stellar function and exposing the configuration
knobs through Metron's configuration interface (Ambari, Zookeeper, etc).  I
have done some very light research and here is my initial list:

   - Apache HttpComponents - https://hc.apache.org/
   - Has support for all of the features listed above as far as I can tell
  - Doesn't introduce a large number of new dependencies (am I wrong
  here?)
  - Is sort of included already (we will need to upgrade from
  httpclient)
  - Lower level
   - Google HTTP Client Library for Java -
   
https://developers.google.com/api-client-library/java/google-http-java-client/
   - Higher level API with pluggable components
  - Introduces dependencies (we've had issues with Guava in the past)
   - Netflix Ribbon - https://github.com/Netflix/ribbon
  - Has a lot of nice features that may be useful in the future
  - Introduces dependencies (including guava)
  - Hasn't been committed to in the last 5-6 months
   - Unirest - https://github.com/Kong/unirest-java
  - Lightweight API built on top of HttpComponents
  - Pluggable serialization library (jackson is an issue for us so this
  is nice)
  - Also has not received a commit in a while
   - OkHttp - http://square.github.io/okhttp/
   - Good documentation and looks easy to use
  - Actively maintained

Obviously we have a lot of choices.  I think it comes down to balancing the
tradeoff between ease of use (HttpComponents will likely require the most
work since it is lower level) and capability.  Introducing additional
dependencies is something we should also be mindful of because of our
shading practices.

This should get us started.  Let me know what you think!
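Not to presuppose the library choice, but to make the call shape concrete, here is a rough sketch using the JDK 11 built-in HTTP client (hypothetical names): a GET with basic authentication and a connect timeout, which is roughly what the Stellar function would wrap regardless of which library we pick:

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.nio.charset.StandardCharsets;
import java.time.Duration;
import java.util.Base64;

/**
 * Sketch only: none of the candidate libraries is assumed here; the JDK 11
 * java.net.http client is used purely to illustrate the call shape a
 * Stellar REST function would wrap.  Names are hypothetical.
 */
public class StellarRestSketch {

    /** Value for the Authorization header used by basic authentication. */
    public static String basicAuthHeader(String user, String password) {
        String raw = user + ":" + password;
        return "Basic " + Base64.getEncoder()
                .encodeToString(raw.getBytes(StandardCharsets.UTF_8));
    }

    /** Issue a GET and return the response body, e.g. a JSON enrichment. */
    public static String get(String url, String user, String password) throws Exception {
        HttpClient client = HttpClient.newBuilder()
                .connectTimeout(Duration.ofSeconds(5))  // fail fast: latency matters in a topology
                .build();
        HttpRequest request = HttpRequest.newBuilder(URI.create(url))
                .header("Authorization", basicAuthHeader(user, password))
                .GET()
                .build();
        return client.send(request, HttpResponse.BodyHandlers.ofString()).body();
    }

    public static void main(String[] args) {
        System.out.println(basicAuthHeader("user", "pass"));  // Basic dXNlcjpwYXNz
    }
}
```

The timeout, caching, and connection-pooling knobs would sit around this core call and be surfaced through the usual Metron configuration mechanisms.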


Re: [DISCUSS] Batch Profiler Feature Branch

2018-09-27 Thread Ryan Merriman
+1 from me.  Great work.

On Thu, Sep 27, 2018 at 12:41 PM Justin Leet  wrote:

> I'm +1 on merging the feature branch into master. There's a lot of good
> work here, and it's definitely been nice to see the couple remaining
> improvements make it in.
>
> Thanks a lot for the contribution, this is great stuff!
>
> On Wed, Sep 26, 2018 at 6:26 PM Nick Allen  wrote:
>
> > Or support to be offered for merging this feature branch into master?
> >
> > On Wed, Sep 26, 2018 at 6:20 PM Nick Allen  wrote:
> >
> > > Thanks for the review.  With
> https://github.com/apache/metron/pull/1209
> > complete,
> > > I think the feature branch is ready to be merged.  Sounds like I have
> > > Mike's support.  Anyone else have comments, concerns, questions?
> > >
> > > On Tue, Sep 25, 2018 at 10:33 PM Michael Miklavcic <
> > > michael.miklav...@gmail.com> wrote:
> > >
> > >> I just made a couple minor comments on that PR, and I am in agreement
> > >> about
> > >> the readiness for merging with master. Good stuff Nick.
> > >>
> > >> On Fri, Sep 21, 2018 at 12:37 PM Nick Allen 
> wrote:
> > >>
> > >> > Here is a PR that adds the input time constraints to the Batch
> > Profiler
> > >> > (METRON-1787);  https://github.com/apache/metron/pull/1209.
> > >> >
> > >> > It seems that the consensus is that this is probably the last
> feature
> > we
> > >> > need before merging the FB into master.  The other two can wait
> until
> > >> after
> > >> > the feature branch has been merged.  Let me know if you disagree.
> > >> >
> > >> > Thanks
> > >> >
> > >> >
> > >> > On Thu, Sep 20, 2018 at 1:55 PM Nick Allen 
> > wrote:
> > >> >
> > >> > > Yeah, agreed.  Per use case 3, when deploying to production there
> > >> really
> > >> > > wouldn't be a huge overlap like 3 months of already profiled data.
> > >> Its
> > >> > day
> > >> > > 1, the profile was just deployed around the same time as you are
> > >> running
> > >> > > the Batch Profiler, so the overlap is in minutes, maybe hours.
> But
> > I
> > >> can
> > >> > > definitely see the usefulness of the feature for re-runs, etc as
> you
> > >> have
> > >> > > described.
> > >> > >
> > >> > > Based on this discussion, I created a few JIRAs.  Thanks all for
> the
> > >> > great
> > >> > > feedback and keep it coming.
> > >> > >
> > >> > > [1] METRON-1787 - Input Time Constraints for Batch Profiler
> > >> > > [2] METRON-1788 - Fetch Profile Definitions from Zk for Batch
> > Profiler
> > >> > > [3] METRON-1789 - MPack Should Define Default Input Path for Batch
> > >> > > Profiler
> > >> > >
> > >> > >
> > >> > > --
> > >> > > [1] https://issues.apache.org/jira/browse/METRON-1787
> > >> > > [2] https://issues.apache.org/jira/browse/METRON-1788
> > >> > > [3] https://issues.apache.org/jira/browse/METRON-1789
> > >> > >
> > >> > >
> > >> > >
> > >> > >
> > >> > >
> > >> > >
> > >> > > On Thu, Sep 20, 2018 at 1:34 PM Michael Miklavcic <
> > >> > > michael.miklav...@gmail.com> wrote:
> > >> > >
> > >> > >> I think we might want to allow the flexibility to choose the date
> > >> range
> > >> > >> then. I don't yet feel like I have a good enough understanding of
> > all
> > >> > the
> > >> > >> ways in which users would want to seed to force them to run the
> > batch
> > >> > job
> > >> > >> over all the data. It might also make it easier to deal with
> > >> > remediation,
> > >> > >> ie an error doesn't force you to re-run over the entire history.
> > Same
> > >> > goes
> > >> > >> for testing out the profile seeding batch job in the first place.
> > >> > >>
> > >> > >> On Thu, Sep 20, 2018 at 11:23 AM Nick Allen 
> > >> wrote:
> > >> > >>
> > >> > >> > Assuming you have 9 months of data archived, yes.
> > >> > >> >
> > >> > >> > On Thu, Sep 20, 2018 at 1:22 PM Michael Miklavcic <
> > >> > >> > michael.miklav...@gmail.com> wrote:
> > >> > >> >
> > >> > >> > > So in the case of 3 - if you had 6 months of data that hadn't
> > >> been
> > >> > >> > profiled
> > >> > >> > > and another 3 that had been profiled (9 months total data),
> in
> > >> its
> > >> > >> > current
> > >> > >> > > form the batch job runs over all 9 months?
> > >> > >> > >
> > >> > >> > > On Thu, Sep 20, 2018 at 11:13 AM Nick Allen <
> > n...@nickallen.org>
> > >> > >> wrote:
> > >> > >> > >
> > >> > >> > > > > How do we establish "tm" from 1.1 above? Any concerns
> about
> > >> > >> overlap
> > >> > >> > or
> > >> > >> > > > gaps after the seeding is performed?
> > >> > >> > > >
> > >> > >> > > > Good point.  Right now, if the Streaming and Batch Profiler
> > >> > overlap
> > >> > >> the
> > >> > >> > > > last write wins.  And presumably the output of the
> Streaming
> > >> and
> > >> > >> Batch
> > >> > >> > > > Profiler are the same, so no worries, right? :)
> > >> > >> > > >
> > >> > >> > > > So it kind of works, but it is definitely not ideal for use
> > >> case
> > >> > >> 3.  I
> > >> > >> > > > could add --begin and --end args to constrain the time
> frame
> > >> over
> > >> > >> which
> > >> > >> > > the
> > >> > >> > > > 

Re: [DISCUSS] Knox SSO feature branch review and features

2018-09-17 Thread Ryan Merriman
I have reviewed a couple different PRs so I'll add some context where I
can.  Obviously Simon would be the most qualified to answer but I'll add my
thoughts.

For question 1, while they may not all be necessary I think it does make
sense to include them in this feature branch if our primary goal is
integrating Knox SSO.  We could push off removing JDBC authentication for
reasons I'll get to in my response to question 2.  If we want to do one at
a time (switch to spring boot, add Zuul as a dependency, then add Knox SSO)
then that's ok, but I do think there are dependencies and they should be done in
order.  For example, adding Knox SSO requires some work around request
filtering.  If we were to do this before moving to Spring Boot we would
need to implement the filters in Node.js, which would be throwaway work once we
get around to migrating away from that.  For Zuul, I believe its purpose
is to facilitate the filtering (although it does a lot more), so it doesn't
make sense to add that separately from the Knox SSO work.

For question 2, I think you bring up a good point.  We probably don't want
to just rip our current authentication method out.  We might want to
consider deprecating it instead and making Knox SSO and LDAP authentication
optional.

For question 3, this is a bigger shift than just a component upgrade.  It's
more like shifting platforms, from Elasticsearch to Solr for example.  Like
I alluded to in my response to question 1, I don't think we should require
throwaway work just because we want to review these parts separately.

For question 4, I will defer to Simon.  I don't believe we necessarily
require Zuul so I will let him elaborate on why he chose that library and
what the potential impact is of adding it to our project.

For question 5 and 6, I will also defer to Simon on this.  The focus of
this feature as I understand it is a consistent authentication mechanism
and support for SSO.  I will let him lay out his vision for microservices.

Knox SSO would be a great improvement and is what I think we should focus
on in this feature branch.  Microservices are something we should certainly
discuss, but it might be a bit of a distraction, and I wouldn't want to hold
up the other useful parts.

On Fri, Sep 14, 2018 at 8:38 PM Michael Miklavcic <
michael.miklav...@gmail.com> wrote:

> Hey all,
>
> I started looking through the Knox SSO feature branch (see here
> https://issues.apache.org/jira/browse/METRON-1663). This is some great new
> security functionality work and it looks like it will bring some important
> new features to the Metron platform. I'm coming at this pretty green, so I
> do have some questions regarding the proposed changes from a high level
> architectural perspective. There are a few changes within the current FB
> PR's that I think could use some further explanation. At first glance, it
> seems we could potentially simplify this branch a great deal and get it
> completed much sooner if we narrowed the focus a bit. But I could certainly
> be wrong here and happy for other opinions. I searched through the mailing
> list history to see if there is any additional background and the main
> DISCUSS thread I could find was regarding initially setting up the feature
> branch, which talked about adding Knox and LDAP.
>
> https://lists.apache.org/thread.html/cac2e6314284015b487121e77abf730abbb7ebec4ace014b19093b4c@%3Cdev.metron.apache.org%3E
> .
> If I've missed any follow-up, please let me know.
>
> Looking at the broader set of Jiras associated with 1663 and the first PR
> 1665, it looks like there are 4 main thrusts of this branch right now:
>
>1.  Knox/SSO
>2.  Node migrated to Spring Boot
>3.  JDBC removed completely in favor of LDAP
>4.  Introduction of Zuul, also microservices?
>
> I strongly urge for the purpose of reviewing this feature branch that we
> base much of the discussion off of
> https://issues.apache.org/jira/browse/METRON-1755, the architecture
> diagram. Minimally, an explanation of the current architecture along with
> discussion around the additional proposed changes and rationale would be
> useful for evaluation. I don't have a solid enough understanding yet of the
> full scope of changes and how they differ from the existing architecture
> just from looking at the PR's alone.
>
>1. The first question is a general one regarding the necessity of the 3
>additional features alongside Knox - migrating Node to Spring Boot,
>removing JDBC altogether, adding dependencies on Netflix's Zuul
> framework.
>Are these necessary for adding Knox/SSO? They seem like potentially
>separate features, imo.
>2. It looks like LDAP will be a required component for interacting with
>Metron via the UI's. I see this PR
>https://github.com/apache/metron/pull/1186 which removes JDBC
>authentication. Are we ready to remove it completely or would it be
> better
>to leave it as a minimal installation option? What is the proposed
>migration path 

Re: [DISCUSS] Internal Metron fields

2018-09-07 Thread Ryan Merriman
Internal means it’s not configurable, doesn’t contain our default separator 
(dots) and is namespaced with metron.  We can definitely improve on DRY but 
there’s more to it than that.  For example, having 2 different versions of this 
field name (ES and Solr) adds a significant amount of complexity for no real 
benefit.

> On Sep 7, 2018, at 5:12 PM, Michael Miklavcic  
> wrote:
> 
> Can you elaborate on what you mean by "convert to internal?" From your
> description, it looks like the challenge is from our violations of DRY when
> it comes to constants referencing those keys, which would be eliminated by
> refactoring.
> 
>> On Fri, Sep 7, 2018, 3:50 PM Ryan Merriman  wrote:
>> 
>> I recently worked on a PR that involved changing the default behavior of
>> the ElasticsearchWriter to store data using field names with the default
>> Metron separator, dots.  One of the unfortunate consequences of this is
>> that although dots are allowed in more recent versions of ES, it changes
>> how these fields are stored.  Having a dot in a field name causes ES to
>> treat it as an object field type.  We're not quite comfortable with this
>> because it could introduce unforeseen side effects that may not be
>> obvious.  Here's the PR:  https://github.com/apache/metron/pull/1181
>> 
>> As I worked through it I noticed there are a couple fields that include
>> separators where it's not actually necessary.  They are not nested by
>> nature and are internal to Metron.  The fact that they are internal means
>> they show up in constants and are hardcoded in several different places.
>> That made the work in the PR above much harder and tedious than it should
>> have been.  There are 2 in particular that I had to deal with:  source:type
>> and threat:triage:score in metaalerts.
>> 
>> Is it worth considering converting these to internal Metron fields so that
>> they stay constant and this isn't a problem in the future?  I could see
>> these fields following the same pattern as 'metron_alert'.  However this
>> would cause pain when upgrading because existing data would need to be
>> updated with these new fields.
>> 
>> Just an idea.  Curious if others have an opinion on the subject.
>> 


[DISCUSS] Internal Metron fields

2018-09-07 Thread Ryan Merriman
I recently worked on a PR that involved changing the default behavior of
the ElasticsearchWriter to store data using field names with the default
Metron separator, dots.  One of the unfortunate consequences of this is
that although dots are allowed in more recent versions of ES, it changes
how these fields are stored.  Having a dot in a field name causes ES to
treat it as an object field type.  We're not quite comfortable with this
because it could introduce unforeseen side effects that may not be
obvious.  Here's the PR:  https://github.com/apache/metron/pull/1181
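
To make the ES behavior concrete: Elasticsearch effectively treats a dotted field name as a path into an object, roughly equivalent to expanding the flat key into nested JSON. A rough sketch of that interpretation (illustrative only, not Metron or Elasticsearch code):

```python
def expand_dotted(doc):
    """Expand dotted field names into nested objects, roughly the way
    Elasticsearch interprets them when it builds a mapping."""
    result = {}
    for key, value in doc.items():
        parts = key.split(".")
        node = result
        for part in parts[:-1]:
            # each dot segment becomes a nested object level
            node = node.setdefault(part, {})
        node[parts[-1]] = value
    return result

# A flat Metron-style field becomes an object field type:
print(expand_dotted({"threat.triage.score": 10, "source.type": "bro"}))
# {'threat': {'triage': {'score': 10}}, 'source': {'type': 'bro'}}
```

This is why a writer that emits dots can silently change the mapping of fields that were previously stored flat with ':' separators.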

As I worked through it I noticed there are a couple fields that include
separators where it's not actually necessary.  They are not nested by
nature and are internal to Metron.  The fact that they are internal means
they show up in constants and are hardcoded in several different places.
That made the work in the PR above much harder and tedious than it should
have been.  There are 2 in particular that I had to deal with:  source:type
and threat:triage:score in metaalerts.

Is it worth considering converting these to internal Metron fields so that
they stay constant and this isn't a problem in the future?  I could see
these fields following the same pattern as 'metron_alert'.  However this
would cause pain when upgrading because existing data would need to be
updated with these new fields.

Just an idea.  Curious if others have an opinion on the subject.


Re: [DISCUSS] Pcap query branch completion

2018-08-20 Thread Ryan Merriman
The feature branch has been merged into master.

On Thu, Aug 16, 2018 at 5:53 PM, Michael Miklavcic <
michael.miklav...@gmail.com> wrote:

> I'm +1, thanks for adding that fix, Ryan. (Note, for purposes of vote, I
> was a contributor in the feature branch).
>
> Mike
>
> On Thu, Aug 16, 2018, 4:17 PM Ryan Merriman  wrote:
>
> > We discovered a bug in our testing and felt it should be fixed before we
> > merge.  There is a PR up for review that already has a +1:
> > https://github.com/apache/metron/pull/1168.  I don't anticipate this
> > changing anyone's vote but wanted to be clear about the state of the
> > branch.  If anyone is concerned with this and would like more discussion
> > before we merge, let me know.
> >
> > On Thu, Aug 16, 2018 at 8:25 AM, James Sirota 
> wrote:
> >
> > > +1 on the merge as well
> > >
> > > 16.08.2018, 05:46, "Casey Stella" :
> > > > I'm +1 on the merge. This is great work and congrats to those who
> > > > contributed to it!
> > > >
> > > > On Thu, Aug 16, 2018 at 8:27 AM Otto Fowler  >
> > > wrote:
> > > >
> > > >>  Looks good, thanks!
> > > >>
> > > >>  On August 15, 2018 at 19:38:12, Ryan Merriman (merrim...@gmail.com
> )
> > > wrote:
> > > >>
> > > >>  Otto, I believe the items you requested are in the feature branch
> > now.
> > > Is
> > > >>  there anything outstanding that we missed? The Jiras for the Pcap
> > > feature
> > > >>  branch should be up to date:
> > > >>  https://issues.apache.org/jira/browse/METRON-1554
> > > >>
> > > >>  On Mon, Aug 13, 2018 at 5:13 PM, Ryan Merriman <
> merrim...@gmail.com>
> > > >>  wrote:
> > > >>
> > > >>  > - Date range limits on queries
> > > >>  >
> > > >>  > I will add a warning in the Job cleanup PR. That seems like an
> > > >>  > appropriate place for it (ie. make sure you don't cause health
> > > issues in
> > > >>  > your cluster).
> > > >>  >
> > > >>  > - UI should manage a queue/history of jobs
> > > >>  >
> > > >>  > I can add some documentation around killing jobs manually with
> the
> > > YARN
> > > >>  > CLI. However if they haven't set up a YARN queue, I'm not sure
> how
> > > you
> > > >>  > would view only Pcap jobs. I'm also not sure how you would get
> the
> > > >>  > application id for the job to kill because it's not displayed
> > > anywhere in
> > > >>  > the UI. However, I believe we are wired for a job name but REST
> > > doesn't
> > > >>  > set this. Maybe we could get a proper job name associated with
> pcap
> > > >>  > queries and then this would be possible to document?
> > > >>  >
> > > >>  > - Documentation/blueprint for YARN configuration
> > > >>  >
> > > >>  > You make a good point. A YARN tuning guide for Metron does sound
> > > useful.
> > > >>  > I will add a follow on Jira.
> > > >>  >
> > > >>  > On Mon, Aug 13, 2018 at 4:53 PM, Otto Fowler <
> > > ottobackwa...@gmail.com>
> > > >>  > wrote:
> > > >>  >
> > > >>  >>
> > > >>  >> - Date range limits on queries
> > > >>  >>
> > > >>  >> I took the point the wrong way apparently, sorry, I withdraw. I
> > > thought
> > > >>  >> you meant allow specifying a limit on the query, not the system
> > > imposing
> > > >>  a
> > > >>  >> limit.
> > > >>  >> This should be documented with a warning or something
> > > >>  >>
> > > >>  >> - UI should manage a queue/history of jobs
> > > >>  >>
> > > >>  >> I was thinking that if there where multiple users/jobs, there
> > should
> > > >>  >> be some thought or documentation + script on how to manage them.
> > > >>  >> “To see all the jobs still running on your cluster, across users
> > > and ui
> > > >>  >> instances do X”
> > > >>  >> “If there is an issue with the jobs you can’t resolve in the UI
> > for

Re: [DISCUSS] Pcap query branch completion

2018-08-16 Thread Ryan Merriman
We discovered a bug in our testing and felt it should be fixed before we
merge.  There is a PR up for review that already has a +1:
https://github.com/apache/metron/pull/1168.  I don't anticipate this
changing anyone's vote but wanted to be clear about the state of the
branch.  If anyone is concerned with this and would like more discussion
before we merge, let me know.

On Thu, Aug 16, 2018 at 8:25 AM, James Sirota  wrote:

> +1 on the merge as well
>
> 16.08.2018, 05:46, "Casey Stella" :
> > I'm +1 on the merge. This is great work and congrats to those who
> > contributed to it!
> >
> > On Thu, Aug 16, 2018 at 8:27 AM Otto Fowler 
> wrote:
> >
> >>  Looks good, thanks!
> >>
> >>  On August 15, 2018 at 19:38:12, Ryan Merriman (merrim...@gmail.com)
> wrote:
> >>
> >>  Otto, I believe the items you requested are in the feature branch now.
> Is
> >>  there anything outstanding that we missed? The Jiras for the Pcap
> feature
> >>  branch should be up to date:
> >>  https://issues.apache.org/jira/browse/METRON-1554
> >>
> >>  On Mon, Aug 13, 2018 at 5:13 PM, Ryan Merriman 
> >>  wrote:
> >>
> >>  > - Date range limits on queries
> >>  >
> >>  > I will add a warning in the Job cleanup PR. That seems like an
> >>  > appropriate place for it (ie. make sure you don't cause health
> issues in
> >>  > your cluster).
> >>  >
> >>  > - UI should manage a queue/history of jobs
> >>  >
> >>  > I can add some documentation around killing jobs manually with the
> YARN
> >>  > CLI. However if they haven't set up a YARN queue, I'm not sure how
> you
> >>  > would view only Pcap jobs. I'm also not sure how you would get the
> >>  > application id for the job to kill because it's not displayed
> anywhere in
> >>  > the UI. However, I believe we are wired for a job name but REST
> doesn't
> >>  > set this. Maybe we could get a proper job name associated with pcap
> >>  > queries and then this would be possible to document?
> >>  >
> >>  > - Documentation/blueprint for YARN configuration
> >>  >
> >>  > You make a good point. A YARN tuning guide for Metron does sound
> useful.
> >>  > I will add a follow on Jira.
> >>  >
> >>  > On Mon, Aug 13, 2018 at 4:53 PM, Otto Fowler <
> ottobackwa...@gmail.com>
> >>  > wrote:
> >>  >
> >>  >>
> >>  >> - Date range limits on queries
> >>  >>
> >>  >> I took the point the wrong way apparently, sorry, I withdraw. I
> thought
> >>  >> you meant allow specifying a limit on the query, not the system
> imposing
> >>  a
> >>  >> limit.
> >>  >> This should be documented with a warning or something
> >>  >>
> >>  >> - UI should manage a queue/history of jobs
> >>  >>
> >>  >> I was thinking that if there were multiple users/jobs, there should
> >>  >> be some thought or documentation + script on how to manage them.
> >>  >> “To see all the jobs still running on your cluster, across users
> and ui
> >>  >> instances do X”
> >>  >> “If there is an issue with the jobs you can’t resolve in the UI for
> that
> >>  >> user, or you are an admin and want to do something then X"
> >>  >>
> >>  >> - Documentation/blueprint for YARN configuration
> >>  >>
> >>  >> I agree with what you are saying. Although, we offer guidance on
> storm
> >>  >> tuning, and that is conceptually the same isn’t it? That is why it
> comes
> >>  >> to mind.
> >>  >> Maybe this can be a follow on, in the tuning guide?
> >>  >>
> >>  >> On August 13, 2018 at 17:36:41, Ryan Merriman (merrim...@gmail.com)
> >>  >> wrote:
> >>  >>
> >>  >> - Date range limits on queries
> >>  >>
> >>  >> Can you describe what you think is needed here? Each Metron user
> could
> >>  >> have different volumes of pcap data spread out over different time
> >>  >> periods. Are you saying we should limit the data range to something
> >>  either
> >>  >>
> >>  >> constant or configurable? Are we sure all users would want this? Am
> I
> >>  >> misinterpreting this requirement?
> >>  >>
> >>  >> - UI should manage a queue/history of jobs
> >>  >>
> >>  >> What should we document here? Reading that bullet point again, it's
> sort
> >>  >> of vague and not very descriptive. What I am referring to is a
> design
> >>  that
> >>  >>
> >>  >> provides users a way to view and manage jobs in the UI. Currently
> jobs
> >>  can
> >>  >>
> >>  >> only be run 1 at a time and progress is shown with a status bar, so
> it's
> >>  >> somewhat interactive.
> >>  >>
> >>  >> - Documentation/blueprint for YARN configuration
> >>  >>
> >>  >>
> >>  >
>
> ---
> Thank you,
>
> James Sirota
> PMC- Apache Metron
> jsirota AT apache DOT org
>
>


Re: [DISCUSS] Pcap query branch completion

2018-08-15 Thread Ryan Merriman
Otto, I believe the items you requested are in the feature branch now.  Is
there anything outstanding that we missed?  The Jiras for the Pcap feature
branch should be up to date:
https://issues.apache.org/jira/browse/METRON-1554

On Mon, Aug 13, 2018 at 5:13 PM, Ryan Merriman  wrote:

> - Date range limits on queries
>
> I will add a warning in the Job cleanup PR.  That seems like an
> appropriate place for it (ie. make sure you don't cause health issues in
> your cluster).
>
> - UI should manage a queue/history of jobs
>
> I can add some documentation around killing jobs manually with the YARN
> CLI.  However if they haven't set up a YARN queue, I'm not sure how you
> would view only Pcap jobs.  I'm also not sure how you would get the
> application id for the job to kill because it's not displayed anywhere in
> the UI.  However, I believe we are wired for a job name but REST doesn't
> set this.  Maybe we could get a proper job name associated with pcap
> queries and then this would be possible to document?
>
> - Documentation/blueprint for YARN configuration
>
> You make a good point.  A YARN tuning guide for Metron does sound useful.
> I will add a follow on Jira.
>
> On Mon, Aug 13, 2018 at 4:53 PM, Otto Fowler 
> wrote:
>
>>
>> - Date range limits on queries
>>
>> I took the point the wrong way apparently, sorry, I withdraw.  I thought
>> you meant allow specifying a limit on the query, not the system imposing a
>> limit.
>> This should be documented with a warning or something
>>
>> - UI should manage a queue/history of jobs
>>
>> I was thinking that if there were multiple users/jobs, there should
>> be some thought or documentation + script on how to manage them.
>> “To see all the jobs still running on your cluster, across users and ui
>> instances do X”
>> “If there is an issue with the jobs you can’t resolve in the UI for that
>> user, or you are an admin and want to do something then X"
>>
>> - Documentation/blueprint for YARN configuration
>>
>> I agree with what you are saying.  Although, we offer guidance on storm
>> tuning, and that is conceptually the same isn’t it?  That is why it comes
>> to mind.
>> Maybe this can be a follow on, in the tuning guide?
>>
>> On August 13, 2018 at 17:36:41, Ryan Merriman (merrim...@gmail.com)
>> wrote:
>>
>> - Date range limits on queries
>>
>> Can you describe what you think is needed here? Each Metron user could
>> have different volumes of pcap data spread out over different time
>> periods. Are you saying we should limit the data range to something either
>>
>> constant or configurable? Are we sure all users would want this? Am I
>> misinterpreting this requirement?
>>
>> - UI should manage a queue/history of jobs
>>
>> What should we document here? Reading that bullet point again, it's sort
>> of vague and not very descriptive. What I am referring to is a design that
>>
>> provides users a way to view and manage jobs in the UI. Currently jobs can
>>
>> only be run 1 at a time and progress is shown with a status bar, so it's
>> somewhat interactive.
>>
>> - Documentation/blueprint for YARN configuration
>>
>>
>


Re: [DISCUSS] Pcap query branch completion

2018-08-13 Thread Ryan Merriman
- Date range limits on queries

I will add a warning in the Job cleanup PR.  That seems like an appropriate
place for it (ie. make sure you don't cause health issues in your cluster).

- UI should manage a queue/history of jobs

I can add some documentation around killing jobs manually with the YARN
CLI.  However if they haven't set up a YARN queue, I'm not sure how you
would view only Pcap jobs.  I'm also not sure how you would get the
application id for the job to kill because it's not displayed anywhere in
the UI.  However, I believe we are wired for a job name but REST doesn't
set this.  Maybe we could get a proper job name associated with pcap
queries and then this would be possible to document?

- Documentation/blueprint for YARN configuration

You make a good point.  A YARN tuning guide for Metron does sound useful.
I will add a follow on Jira.
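
In the meantime, if jobs did carry a recognizable name, pcap jobs could be picked out of the YARN CLI output and killed by application id. A hedged sketch of that idea — the `pcap_` prefix and the tab-separated column layout below are assumptions for illustration, not the real `yarn application -list` format:

```python
# Hypothetical, simplified output of `yarn application -list`
# (columns: app id, job name, type, user, queue, state)
SAMPLE_YARN_LIST = """\
application_1534200000000_0001\tpcap_query_user1\tMAPREDUCE\tmetron\tdefault\tRUNNING
application_1534200000000_0002\tsome_other_job\tSPARK\thdfs\tdefault\tRUNNING
application_1534200000000_0003\tpcap_query_user2\tMAPREDUCE\tmetron\tdefault\tFINISHED
"""

def pcap_application_ids(yarn_list_output, prefix="pcap_"):
    """Pick out application ids whose job name starts with the given prefix,
    e.g. so an admin could kill them with `yarn application -kill <id>`."""
    ids = []
    for line in yarn_list_output.splitlines():
        fields = line.split("\t")
        if len(fields) >= 2 and fields[1].startswith(prefix):
            ids.append(fields[0])
    return ids

print(pcap_application_ids(SAMPLE_YARN_LIST))
# ['application_1534200000000_0001', 'application_1534200000000_0003']
```

This only works if REST sets a predictable job name, which is the gap described above.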

On Mon, Aug 13, 2018 at 4:53 PM, Otto Fowler 
wrote:

>
> - Date range limits on queries
>
> I took the point the wrong way apparently, sorry, I withdraw.  I thought
> you meant allow specifying a limit on the query, not the system imposing a
> limit.
> This should be documented with a warning or something
>
> - UI should manage a queue/history of jobs
>
> I was thinking that if there were multiple users/jobs, there should
> be some thought or documentation + script on how to manage them.
> “To see all the jobs still running on your cluster, across users and ui
> instances do X”
> “If there is an issue with the jobs you can’t resolve in the UI for that
> user, or you are an admin and want to do something then X"
>
> - Documentation/blueprint for YARN configuration
>
> I agree with what you are saying.  Although, we offer guidance on storm
> tuning, and that is conceptually the same isn’t it?  That is why it comes
> to mind.
> Maybe this can be a follow on, in the tuning guide?
>
> On August 13, 2018 at 17:36:41, Ryan Merriman (merrim...@gmail.com) wrote:
>
> - Date range limits on queries
>
> Can you describe what you think is needed here? Each Metron user could
> have different volumes of pcap data spread out over different time
> periods. Are you saying we should limit the data range to something either
>
> constant or configurable? Are we sure all users would want this? Am I
> misinterpreting this requirement?
>
> - UI should manage a queue/history of jobs
>
> What should we document here? Reading that bullet point again, it's sort
> of vague and not very descriptive. What I am referring to is a design that
>
> provides users a way to view and manage jobs in the UI. Currently jobs can
>
> only be run 1 at a time and progress is shown with a status bar, so it's
> somewhat interactive.
>
> - Documentation/blueprint for YARN configuration
>
>


Re: [DISCUSS] Pcap query branch completion

2018-08-13 Thread Ryan Merriman
Thanks for the feedback, Otto.  I have created a sub-task for the Job
cleanup documentation:
https://issues.apache.org/jira/browse/METRON-1737.  I completely agree with
you there, this needs to be documented.  For the others you marked "Follow
on" I will create follow on tasks in Jira.

I have a few questions about a couple others you commented on

- Date range limits on queries

Can you describe what you think is needed here?  Each Metron user could
have different volumes of pcap data spread out over different time
periods.  Are you saying we should limit the data range to something either
constant or configurable?  Are we sure all users would want this?  Am I
misinterpreting this requirement?

- UI should manage a queue/history of jobs

What should we document here?  Reading that bullet point again, it's sort
of vague and not very descriptive.  What I am referring to is a design that
provides users a way to view and manage jobs in the UI.  Currently jobs can
only be run 1 at a time and progress is shown with a status bar, so it's
somewhat interactive.

- Documentation/blueprint for YARN configuration

We are set up for YARN scheduling in that we offer a configuration setting
to submit a Pcap query to a specified YARN queue (this part is
documented).  Any YARN setup or tuning would be out of scope since
everyone's YARN settings will be different and potentially expand beyond
the Metron use case.  I think a Hadoop admin is likely to have this
knowledge and to have already set up YARN queues.  Do you disagree?



On Mon, Aug 13, 2018 at 8:21 AM, Otto Fowler 
wrote:

> - Job cleanup/TTL
>
> Documented at least, or a helper script to help yourself if you are in a
> situation
>
>
> - Expose the Query filter (vs Fixed) in the UI
>
> Follow on
>
>
> - Date range limits on queries
>
> I don’t see how this won’t be immediately required. I would do this for
> minimum viable.
>
>
> - Pcap query as a separate UI
>
> Follow on
>
>
> - UI should manage a queue/history of jobs
>
> Follow on, but maybe we need documentation
>
>
> - BPF filtering
>
> This is going to be a PITA, follow on
>
>
> - Sharing Pcap jobs with other users
>
> Follow on
>
>
> - Provide a way in the UI to populate a pcap query from an alert/metaalert
>
> Follow on
>
>
> - Documentation/blueprint for YARN configuration
>
> Should have
>
>
>


[DISCUSS] Pcap query branch completion

2018-08-12 Thread Ryan Merriman
We are nearing a fully functional Pcap query feature branch.  I want to
take a moment before we merge to review the original discussion threads and
make sure the community is happy with the state of this feature branch
before we accept it into master.

The original discuss threads are located at:
- Back end architecture thread:
https://lists.apache.org/thread.html/1db7c6fa1b0f364f8c03520db9989b4f7a446de82eb4d9786055048c@%3Cdev.metron.apache.org%3E
- UI requirements:
https://lists.apache.org/thread.html/e62e361971092e49446e2012550319f06c8c31944224bcd6326718d9@%3Cdev.metron.apache.org%3E

The JIRA epic can be found here:
https://issues.apache.org/jira/browse/METRON-1554.  The state of each task
should be accurate.  We expect all tasks to be finished within the next
couple days (except for https://issues.apache.org/jira/browse/METRON-1561).

I reviewed the original discuss threads and overall I think we accomplished
a lot.  We were able to create abstractions around managing and submitting
jobs.  We were able to configure the YARN queue for Pcap queries so we are
set up for multi-tenancy in the future.  We have basic guards in place to
keep users from overwhelming the cluster.  We were able to expose results
in the UI and as a binary download.  We have basic authorization in place
that can be expanded later.

We expect the outstanding Jira mentioned above to be converted to a follow
on Jira.  There are several other ideas that were brought up in the discuss
threads but not done in the feature branch.  They do not currently have
Jiras:

- Job cleanup/TTL
- Expose the Query filter (vs Fixed) in the UI
- Date range limits on queries
- Pcap query as a separate UI
- UI should manage a queue/history of jobs
- BPF filtering
- Sharing Pcap jobs with other users
- Provide a way in the UI to populate a pcap query from an alert/metaalert
- Documentation/blueprint for YARN configuration

I'm sure I missed some so please chime in with any you want to add.  Which
of these do we still feel should be done?  Are there any features or
changes you feel need to be done before this feature branch is merged?  I
will create the appropriate Jiras as needed.

Ryan


Re: [DISCUSS] Deprecating metron-api

2018-06-29 Thread Ryan Merriman
Adding user list.  Is anyone out there currently using the metron-api
module to query pcap data?

On Fri, Jun 29, 2018 at 4:35 PM, Casey Stella  wrote:

> I have no objection and would consider it to be a prerequisite to bringing
> in the PR unless there's someone depending on it out there.  You might want
> to cc user@ as well, to get a broader set of input for the "are people
> using it?" question.
>
> On Fri, Jun 29, 2018 at 5:21 PM Ryan Merriman  wrote:
>
> > We are currently working on adding pcap query capabilities to the Alerts
> UI
> > as part of https://issues.apache.org/jira/browse/METRON-1554.  This
> > involves exposing pcap endpoints in our REST application which will make
> > metron-api obsolete.
> >
> > Is anyone currently using this module?  Are there any objections to
> > deprecating it and removing it from our codebase once this feature branch
> > is complete?
> >
>


[DISCUSS] Deprecating metron-api

2018-06-29 Thread Ryan Merriman
We are currently working on adding pcap query capabilities to the Alerts UI
as part of https://issues.apache.org/jira/browse/METRON-1554.  This
involves exposing pcap endpoints in our REST application which will make
metron-api obsolete.

Is anyone currently using this module?  Are there any objections to
deprecating it and removing it from our codebase once this feature branch
is complete?


Re: [DISCUSS] Field conversions

2018-06-05 Thread Ryan Merriman
I agree completely.  I will leave this thread open for a day or two to give
others a chance to weigh in.  If no one opposes, I will create Jiras for
removing field transformations and transforming existing data.
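
For what it's worth, the per-backend translation discussed in this thread amounts to a one-line mapping per index. A hedged sketch of what such a translation service might do — the backend names and canonical dotted form are illustrative, not the actual Metron API:

```python
def translate_field_name(field, backend):
    """Translate a canonical dotted field name into the form a given
    index backend stores, per the scheme discussed in this thread."""
    if backend == "elasticsearch":
        # ES 2.x could not store dots, so the legacy form uses ':'
        return field.replace(".", ":")
    if backend == "solr":
        # Solr keeps the canonical dotted form as-is
        return field
    raise ValueError("unknown backend: %s" % backend)

print(translate_field_name("source.type", "elasticsearch"))  # source:type
print(translate_field_name("source.type", "solr"))           # source.type
```

Normalizing on '.' everywhere would make this mapping (and the complexity it drags into the UIs) unnecessary.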

On Tue, Jun 5, 2018 at 8:21 AM, Casey Stella  wrote:

> Well, on write it is a transformation, on read it's a translation.  This is
> to say that you're providing a mapping on read to translate field names
> given the index you're using.  The other approach that I was considering
> last night is a field transformation REST call which translates field names
> that the UI could call.  So, the UI would pass 'source.type' to the field
> translation service and in Solr it'd return source.type and in ES it'd
> return source:type.  Underneath the hood the service would use the same
> transformation as the writer uses.  That's another way to skin this cat.
>
> Ultimately, I think we should just ditch this field transformation
> business, as Laurens said, as long as we have a utility to transform
> existing data.
>
> On Tue, Jun 5, 2018 at 8:54 AM Ryan Merriman  wrote:
>
> > Having 2 different patterns for configuring field name transformations on
> > read vs write is confusing to me.  I agree with both of you that
> > normalizing on '.' and not having to do the translation at all would be
> > ideal.  Like you both suggested, we would need some utility or script to
> > convert preexisting data to match this format.  There could also be some
> > adjustments a user would need to make in the UI but I feel like we could
> > document around that.  Are there any objections to doing it this way?
> >
> >
> >
> > On Mon, Jun 4, 2018 at 4:30 PM, Laurens Vets  wrote:
> >
> > > ES 2.x support officially ended 4 months ago (
> > > https://www.elastic.co/support/eol), so why still support ':' at all?
> :)
> > > Additionally, 2.x isn't even supported at all on the last 2 Ubuntu LTS
> > > releases (16.04 & 18.04).
> > >
> > > Therefore, move everything to use '.' and provide a conversion/upgrade
> > > script to change ':' to '.'?
> > >
> > >
> > > On 2018-06-04 13:55, Ryan Merriman wrote:
> > >
> > >> We've been dealing with a reoccurring challenge in Metron.  It is
> common
> > >> for various fields to contain '.' characters for the purpose of making
> > >> them
> > >> more readable, namespacing, etc.  At one point we only supported
> > >> Elasticsearch 2.3 which did not allow dots and forced us to use ':'
> > >> instead.  This limitation does not exist in later versions of
> > >> Elasticsearch
> > >> or Solr.
> > >>
> > >> Now we're in a situation where we need to allow a user to use either
> one
> > >> because they may still be using ES 2.3 or have data with ':'
> characters
> > in
> > >> field names.  We've attempted to make this configurable in a couple
> > >> different PRs:
> > >>
> > >> https://github.com/apache/metron/pull/1022
> > >> https://github.com/apache/metron/pull/1010
> > >> https://github.com/apache/metron/pull/1038
> > >>
> > >> The approaches taken in these are not consistent and fall short in
> > >> different ways.  The first (METRON-1569 Allow user to change field
> name
> > >> conversion when indexing) only applies to indexing and not querying.
> > The
> > >> others only apply to a single field which does not scale well.  Now we
> > >> have
> > >> an issue with another field in
> > >> https://issues.apache.org/jira/browse/METRON-1600.  Rather than
> > >> continuing
> > >> with a patchwork of different fixes I want to attempt to design a
> > >> system-wide solution.
> > >>
> > >> My first thought is to expand
> > https://github.com/apache/metron/pull/1022
> > >> to
> > >> apply globally.  However this is not trivial and would require
> > significant
> > >> changes.  It would also make https://github.com/apache/
> metron/pull/1010
> > >> obsolete and we might end up having to revert all of it.
> > >>
> > >> Does anyone have any ideas or opinions?  I am still researching
> > solutions
> > >> but would love some guidance from the community.
> > >>
> > >
> >
>


Re: [DISCUSS] Field conversions

2018-06-05 Thread Ryan Merriman
Having 2 different patterns for configuring field name transformations on
read vs write is confusing to me.  I agree with both of you that
normalizing on '.' and not having to do the translation at all would be
ideal.  Like you both suggested, we would need some utility or script to
convert preexisting data to match this format.  There could also be some
adjustments a user would need to make in the UI but I feel like we could
document around that.  Are there any objections to doing it this way?
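To make the conversion idea concrete, here is a minimal sketch of what such a utility could do to a single indexed document. This is illustrative only: the real tool would have to reindex data in place in Elasticsearch or Solr, and the function name and field names below are hypothetical, not part of Metron.

```python
def convert_field_names(doc, old_sep=":", new_sep="."):
    """Return a copy of an indexed document with legacy separators in
    field names replaced, e.g. 'source:type' -> 'source.type'.
    Values are left untouched; only the keys are rewritten."""
    return {key.replace(old_sep, new_sep): value for key, value in doc.items()}

# Hypothetical legacy document written under the ES 2.3 ':' convention.
legacy_doc = {"source:type": "bro", "ip_src_addr": "10.0.0.1", "threat:triage:score": 8}
print(convert_field_names(legacy_doc))
# {'source.type': 'bro', 'ip_src_addr': '10.0.0.1', 'threat.triage.score': 8}
```

The same key-rewriting step would run inside whatever bulk reindex mechanism the utility uses, so writers and readers can both assume '.' afterward.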



On Mon, Jun 4, 2018 at 4:30 PM, Laurens Vets  wrote:

> ES 2.x support officially ended 4 months ago (
> https://www.elastic.co/support/eol), so why still support ':' at all? :)
> Additionally, 2.x isn't even supported at all on the last 2 Ubuntu LTS
> releases (16.04 & 18.04).
>
> Therefore, move everything to use '.' and provide a conversion/upgrade
> script to change ':' to '.'?
>
>
> On 2018-06-04 13:55, Ryan Merriman wrote:
>
> >> We've been dealing with a recurring challenge in Metron.  It is common
>> for various fields to contain '.' characters for the purpose of making
>> them
>> more readable, namespacing, etc.  At one point we only supported
>> Elasticsearch 2.3 which did not allow dots and forced us to use ':'
>> instead.  This limitation does not exist in later versions of
>> Elasticsearch
>> or Solr.
>>
>> Now we're in a situation where we need to allow a user to use either one
>> because they may still be using ES 2.3 or have data with ':' characters in
>> field names.  We've attempted to make this configurable in a couple
>> different PRs:
>>
>> https://github.com/apache/metron/pull/1022
>> https://github.com/apache/metron/pull/1010
>> https://github.com/apache/metron/pull/1038
>>
>> The approaches taken in these are not consistent and fall short in
>> different ways.  The first (METRON-1569 Allow user to change field name
>> conversion when indexing) only applies to indexing and not querying.  The
>> others only apply to a single field which does not scale well.  Now we
>> have
>> an issue with another field in
>> https://issues.apache.org/jira/browse/METRON-1600.  Rather than
>> continuing
>> with a patchwork of different fixes I want to attempt to design a
>> system-wide solution.
>>
>> My first thought is to expand https://github.com/apache/metron/pull/1022
>> to
>> apply globally.  However this is not trivial and would require significant
>> changes.  It would also make https://github.com/apache/metron/pull/1010
>> obsolete and we might end up having to revert all of it.
>>
>> Does anyone have any ideas or opinions?  I am still researching solutions
>> but would love some guidance from the community.
>>
>


[DISCUSS] Field conversions

2018-06-04 Thread Ryan Merriman
We've been dealing with a recurring challenge in Metron.  It is common
for various fields to contain '.' characters for the purpose of making them
more readable, namespacing, etc.  At one point we only supported
Elasticsearch 2.3 which did not allow dots and forced us to use ':'
instead.  This limitation does not exist in later versions of Elasticsearch
or Solr.

Now we're in a situation where we need to allow a user to use either one
because they may still be using ES 2.3 or have data with ':' characters in
field names.  We've attempted to make this configurable in a couple
different PRs:

https://github.com/apache/metron/pull/1022
https://github.com/apache/metron/pull/1010
https://github.com/apache/metron/pull/1038

The approaches taken in these are not consistent and fall short in
different ways.  The first (METRON-1569 Allow user to change field name
conversion when indexing) only applies to indexing and not querying.  The
others only apply to a single field which does not scale well.  Now we have
an issue with another field in
https://issues.apache.org/jira/browse/METRON-1600.  Rather than continuing
with a patchwork of different fixes I want to attempt to design a
system-wide solution.

My first thought is to expand https://github.com/apache/metron/pull/1022 to
apply globally.  However this is not trivial and would require significant
changes.  It would also make https://github.com/apache/metron/pull/1010
obsolete and we might end up having to revert all of it.

Does anyone have any ideas or opinions?  I am still researching solutions
but would love some guidance from the community.


Re: [DISCUSS] Pcap panel architecture

2018-05-11 Thread Ryan Merriman
Yes there will be an admin role that can read and delete all.

On Fri, May 11, 2018 at 4:11 PM, Otto Fowler <ottobackwa...@gmail.com>
wrote:

> Do we at least require a admin/super user?  See’s all the queues and jobs?
>
>
> On May 11, 2018 at 17:03:34, Ryan Merriman (merrim...@gmail.com) wrote:
>
> Thanks everyone for the input and feedback. I will attempt to summarize so
> we can come to a consensus and get this tasked out.
>
> The following endpoints will be included:
>
> - GET /api/v1/pcap/metadata?basePath - This endpoint will return
> metadata of pcap data stored in HDFS. This would include pcap size, date
> ranges (how far back can I go), etc. It would accept an optional HDFS
> basePath parameter for cases where pcap data is stored in multiple places
> and/or different from the default location.
> - POST /api/v1/pcap/fixed - This endpoint would accept a fixed pcap
> request, submit a pcap job, and return a job id. The request would be an
> object containing the options documented here for the fixed filter:
> https://github.com/apache/metron/tree/master/metron-platform/metron-pcap-
> backend#query-filter-utility
> <https://github.com/apache/metron/tree/master/metron-
> platform/metron-pcap-backend#query-filter-utility>.
> A job will be associated with a user that submits it. An exception will be
> returned for violating constraints like too many queries submitted, query
> parameters out of limits, etc. A record of the user and job id will be
> persisted to a data store so a list of a user's jobs can later be
> retrieved.
> - POST /api/v1/pcap/query - This endpoint would accept a query pcap
> request, submit a pcap job, and return a job id. The request would be
> an object containing the options documented here for the query filter:
> https://github.com/apache/metron/tree/master/metron-platform/metron-pcap-
> backend#query-filter-utility
> <https://github.com/apache/metron/tree/master/metron-
> platform/metron-pcap-backend#query-filter-utility>.
> A job will be associated with a user that submits it. An exception will be
> returned for violating constraints like too many queries submitted, query
> parameters out of limits, etc. A record of the user and job id will be
> persisted to a data store so a list of a user's jobs can later be
> retrieved.
> - GET /api/v1/pcap/status/<jobId> - This endpoint will return the YARN
> status of a running/completed job.
> - GET /api/v1/pcap/stop/<jobId> - This endpoint would kill a running
> pcap job. If the job has already completed this is a noop.
> - GET /api/v1/pcap/list - This endpoint will list a user's submitted
> pcap queries. Items in the list would contain job id, status (is it
> finished?), start/end time, and number of pages.
> - GET /api/v1/pcap/pdml/<jobId>/<page> - This endpoint will return
> pcap results for the given page in pdml format (
> https://wiki.wireshark.org/PDML <https://wiki.wireshark.org/PDML>). Are
> there other formats we want to support?
> - GET /api/v1/pcap/raw/<jobId>/<page> - This endpoint will allow a
> user to download raw pcap results for the given page.
> - DELETE /api/v1/pcap/<jobId> - This endpoint will delete pcap query
> results.
>
> With respect to security, users will only be able to see their list of
> jobs
> and query results. We will also include an admin role that will be able to
> read and delete all. Jobs will be submitted as the metron service user and
> user to job relationships will be managed and persisted by the REST
> application.
>
> This is a substantial feature and compromises should be made to get an
> initial version out (baby steps). Here are some areas we will compromise
> on and enhance in the future:
>
> - Security - Initially we will rely on Spring Security for authorization
> and authentication. Eventually this feature will fit into a broader
> security strategy. This could mean an authentication strategy that is
> consistent with the rest of Metron, integration with Ranger for
> authorization, and submitting jobs as individual users rather than a
> service user. We can also explore sharing access between users and more
> fine-grained ACL-based security.
> - Priority and scheduling - Eventually we should form a strategy for
> prioritizing jobs, imposing limits, etc with YARN scheduling. Submitting
> jobs with individual users will give us even more flexibility in this
> area.
> - Job cleanup - Initially cleanup will be a manual process through
> exposed endpoints. Later we can explore automated cleanup
> strategies and introduce data retention policies.
> - Filter options - Initially we will expose the 2 filter options that
> currently exist in Metron: fixed and query. Eventually we can add more
> filters like bpf.
> - Data directory - This could be a TOC of differ

Re: [DISCUSS] Pcap panel architecture

2018-05-11 Thread Ryan Merriman
I will task it out in Jira and we can get started.


On Fri, May 11, 2018 at 11:52 AM, Simon Elliston Ball <
si...@simonellistonball.com> wrote:

> On the sharing and securing points, it seems like having the result of a
> query run hang around in HDFS on a given URL would work, we can then use
> Ranger policies (RBAC, or even ABAC later) to control access, this also
> solves the page storage problem, and gives us a kind of two step approach,
> both of which could (maybe, possibly, but probably not) be large enough to
> need distribution, i.e. the initial search everything and the subsequent
> sort / page through the results. Does anyone imagine sorting? Maybe
> sub-filtering, but PCAP is heavily time based, so timestamp sort ok?
>
> I also suspect it's worth considering the lifecycle of our stored result
> sets as being meta-data driven. If I'm doing a speculative search I don't
> really care if an admin cleans that up at the end of the day / week /
> disk space nervousness limit. However, if I find something good, I might
> want to mark the result set as immune from automatic deletion.
>
> The other issue I would raise, which has implications for our PCAP capture,
> and also impacts Otto's suggestion of 'self-uploaded' PCAPs is how we
> namespace PCAP collection and retrieval. The problem here is that I might
> have PCAPs from multiple locations which have conflicting private IP
> ranges, so I can't logically dump them all in the same repository. Solving
> the collection end of that is probably a separate unit of effort, but this
> retrieval architecture should support multiple file system locations.
>
> If we wanted to get fancy about it, we should look at using the stored
> result sets as a kind of cache, for other queries, as people refine and
> narrow down queries, it may make sense to be more sophisticated about where
> our query jobs pull from (i.e. filter the subset from a previous resultset,
> rather than scanning petabytes of source data). This may imply some kind of
> TOC for the cache. The underlying immutability of the PCAP store should
> make this fairly tractable.
>
> FYI, I've been doing a lot of thinking around data security, API and
> configuration security and auditing recently, but I suspect that is a
> different discuss thread. I'll kick something off shortly with a few
> thoughts.
>
> I see a lot of this as long term goals to be honest, so as Jon says, we can
> definitely take a few baby steps to start.
>
> Simon
>
> On 11 May 2018 at 15:40, Otto Fowler <ottobackwa...@gmail.com> wrote:
>
> > Don’t lose the use case for manually uploading PCAPS for analysis Jon.
> >
> >
> > On May 11, 2018 at 10:14:02, zeo...@gmail.com (zeo...@gmail.com) wrote:
> >
> > I think baby steps are fine - admin gets access to all, otherwise you
> only
> > see your own pcaps, but we file a jira for a future add of API security,
> > which more mature SOCs that align with the Metron personas will need.
> >
> > Jon
> >
> > On Fri, May 11, 2018, 09:27 Ryan Merriman <merrim...@gmail.com> wrote:
> >
> > > That's a good point Jon. There are different levels of effort
> associated
> > > with different options. If we want to allow pcaps to be shared with
> > > specific users, we will need to introduce ACL security in our REST
> > > application using something like the ACL capability that comes with
> > Spring
> > > Security or Ranger. This would be more complex to design and implement.
> > > If we want something more broad like admin roles that can see all or
> > > allowing pcap files to become public, this would be less work. Do you
> > > think ACL security is required or would the other options be
> acceptable?
> > >
> > > On Thu, May 10, 2018 at 2:47 PM, zeo...@gmail.com <zeo...@gmail.com>
> > > wrote:
> > >
> > > > At the very least there needs to be the ability to share downloaded
> > PCAPs
> > > > with other users and/or have roles that can see all pcaps. A platform
> > > > engineer may want to clean up old pcaps after x time, or a manager may
> > ask
> > > > an analyst to find all of the traffic that exhibits xyz behavior,
> dump
> > a
> > > > pcap, and then point him to it so the manager can review. Since the
> > > > pcap may be huge, we wouldn't want to try to push people to sending
> it
> > > via
> > > > email, uploading to a file server, finding an external hard drive,
> etc.
> > > >
> > > > Jon
> > > >
> > > > On Thu, May 10, 2018 at 10:16 AM Ryan Merriman <merrim...@gmail.com>
> >

Re: [DISCUSS] Release Manager

2018-05-10 Thread Ryan Merriman
Yes +1 to Justin being RM.  Thank you for taking that on.

On Thu, May 10, 2018 at 11:08 AM, Casey Stella <ceste...@gmail.com> wrote:

> I'm +1 to Justin being RM; he's going to have big shoes to fill with Matt
> gone. ;) Also, if it wasn't obvious, deep and hearty thanks to Matt again
> for being our RM.
>
> On Thu, May 10, 2018 at 12:06 PM Ryan Merriman <merrim...@gmail.com>
> wrote:
>
> > Thanks for all your help Matt.
> >
> > On Thu, May 10, 2018 at 10:53 AM, Michael Miklavcic <
> > michael.miklav...@gmail.com> wrote:
> >
> > > Thanks Matt for doing this for the community.
> > >
> > > Justin Leet as new lord commander of the Night's Watch? Aye, dilly,
> > dilly.
> > >
> > > On Thu, May 10, 2018 at 9:07 AM, Justin Leet <justinjl...@gmail.com>
> > > wrote:
> > >
> > > > I'd be happy to volunteer to take over for a while.
> > > >
> > > > Thanks to Matt for all the help through the last couple releases!
> > > >
> > > > Justin
> > > >
> > > > On Thu, May 10, 2018 at 11:06 AM, Casey Stella <ceste...@gmail.com>
> > > wrote:
> > > >
> > > > > Hi All,
> > > > >
> > > > > Matt Foley, our esteemed Release manager for the last couple
> > releases,
> > > > has
> > > > > asked to be relieved.  So, I'm calling on volunteers for the next
> > > release
> > > > > manager.  It should be a committer and there are a few things that
> > > > require
> > > > > a PMC member, I believe, but the release manager can ask for help
> > from
> > > a
> > > > > PMC member.
> > > > >
> > > > > So, Matt's watch has ended, who wants to volunteer?
> > > > >
> > > > > Casey
> > > > >
> > > >
> > >
> >
>


Re: [DISCUSS] Release Manager

2018-05-10 Thread Ryan Merriman
Thanks for all your help Matt.

On Thu, May 10, 2018 at 10:53 AM, Michael Miklavcic <
michael.miklav...@gmail.com> wrote:

> Thanks Matt for doing this for the community.
>
> Justin Leet as new lord commander of the Night's Watch? Aye, dilly, dilly.
>
> On Thu, May 10, 2018 at 9:07 AM, Justin Leet 
> wrote:
>
> > I'd be happy to volunteer to take over for a while.
> >
> > Thanks to Matt for all the help through the last couple releases!
> >
> > Justin
> >
> > On Thu, May 10, 2018 at 11:06 AM, Casey Stella 
> wrote:
> >
> > > Hi All,
> > >
> > > Matt Foley, our esteemed Release manager for the last couple releases,
> > has
> > > asked to be relieved.  So, I'm calling on volunteers for the next
> release
> > > manager.  It should be a committer and there are a few things that
> > require
> > > a PMC member, I believe, but the release manager can ask for help from
> a
> > > PMC member.
> > >
> > > So, Matt's watch has ended, who wants to volunteer?
> > >
> > > Casey
> > >
> >
>


Re: [DISCUSS] Pcap panel architecture

2018-05-10 Thread Ryan Merriman
Mike, I believe the /pcapGetter/getPcapsByIdentifiers endpoint exposes the
fixed query option which we have covered.  I agree with you that
deprecating the metron-api module should be a goal of this feature.

On Wed, May 9, 2018 at 1:36 PM, Michael Miklavcic <
michael.miklav...@gmail.com> wrote:

> This looks like a pretty good start Ryan. Does the metadata endpoint cover
> this https://github.com/apache/metron/tree/master/
> metron-platform/metron-api#the-pcapgettergetpcapsbyidentifiers-endpoint
> from the original metron-api? If so, then we would be able to deprecate the
> existing metron-api project. If we later go to micro-services, a pcap
> module would spin back into the fold, but it would probably look different
> from metron-api.
>
> I commented on the UI thread, but to reiterate for the purpose of backend
> functionality here I don't believe there is a way to "PAUSE" or "SUSPEND"
> jobs. That said, I think GET /api/v1/pcap/stop/<jobId> is sufficient for
> the job management operations.
>
> On Wed, May 9, 2018 at 11:00 AM, Ryan Merriman <merrim...@gmail.com>
> wrote:
>
> > Now that we are confident we can submit an MR job from our current
> REST
> > application, is this the desired approach?  Just want to confirm.
> >
> > Next I think we should map out what the REST interface will look like.
> > Here are the endpoints I'm thinking about:
> >
> > GET /api/v1/pcap/metadata?basePath
> >
> > This endpoint will return metadata of pcap data stored in HDFS.  This
> would
> > include pcap size, date ranges (how far back can I go), etc.  It would
> > accept an optional HDFS basePath parameter for cases where pcap data is
> > stored in multiple places and/or different from the default location.
> >
> > POST /api/v1/pcap/query
> >
> > This endpoint would accept a pcap request, submit a pcap query job, and
> > return a job id.  The request would be an object containing the
> parameters
> > documented here:  https://github.com/apache/metron/tree/master/
> > metron-platform/metron-pcap-backend#query-filter-utility.  A query/job
> > would be associated with a user that submits it.  An exception will be
> > returned for violating constraints like too many queries submitted, query
> > parameters out of limits, etc.
> >
> > GET /api/v1/pcap/status/<jobId>
> >
> > This endpoint will return the status of a running job.  I imagine this is
> > just a proxy to the YARN REST api.  We can discuss the implementation
> > behind these endpoints later.
> >
> > GET /api/v1/pcap/stop/<jobId>
> >
> > This endpoint would kill a running pcap job.  If the job has already
> > completed this is a noop.
> >
> > GET /api/v1/pcap/list
> >
> > This endpoint will list a user's submitted pcap queries.  Items in the
> list
> > would contain job id, status (is it finished?), start/end time, and
> number
> > of pages.  Maybe there is some overlap with the status endpoint above and
> > the status endpoint is not needed?
> >
> > GET /api/v1/pcap/pdml/<jobId>/<page>
> >
> > This endpoint will return pcap results for the given page in pdml format
> (
> > https://wiki.wireshark.org/PDML).  Are there other formats we want to
> > support?
> >
> > GET /api/v1/pcap/raw/<jobId>/<page>
> >
> > This endpoint will allow a user to download raw pcap results for the
> given
> > page.
> >
> > DELETE /api/v1/pcap/<jobId>
> >
> > This endpoint will delete pcap query results.  Not sure yet how this fits
> > in with our broader cleanup strategy.
> >
> > This should get us started.  What did I miss and what would you change
> > about these?  I did not include much detail related to security, cleanup
> > strategy, or underlying implementation details but these are items we
> > should discuss at some point.
> >
> > On Tue, May 8, 2018 at 5:38 PM, Michael Miklavcic <
> > michael.miklav...@gmail.com> wrote:
> >
> > > Sweet! That's great news. The pom changes are a lot simpler than I
> > > expected. Very nice.
> > >
> > > On Tue, May 8, 2018 at 4:35 PM, Ryan Merriman <merrim...@gmail.com>
> > wrote:
> > >
> > > > Finally figured it out.  Commit is here:
> > > > https://github.com/merrimanr/incubator-metron/commit/
> > > > 22fe5e9ff3c167b42ebeb7a9f1000753a409aff1
> > > >
> > > > It came down to figuring out the right combination of maven
> > dependencies
> > > > and passing in the HDP version to REST as a Java system property.  I
> > also
> > > > included some HDFS setup tasks.

Re: [DISCUSS] Pcap panel architecture

2018-05-10 Thread Ryan Merriman
Security is another important topic related to our pcap architecture.  This
may spill over into a more general, system-wide discussion and we can start
a separate thread for that if necessary.

I'm assuming we want to manage pcap queries by user.  One important
question is which user do we use to submit MR jobs?  Right now they are
submitted with the "metron" service user.  If we continue with this
approach (all jobs run as the metron service user) it will require less
Kerberos work and configuration but we will need to manage user to job
relationships in the REST layer.  It would also give us less flexibility
since all assets are permissioned for a single user.  If we decided to have
REST impersonate users and submit jobs that way there would be substantial
work (maybe?) to get to that point since we don't do it now.  We would need
to add LDAP authentication to REST, sync OS users with LDAP, and all the
other stuff that goes along with setting that up.  Maybe we want to do this
anyways in the future.  Has anyone done this before and has a clear
understanding of what's involved?  If this were the ideal approach we need
to think about how we get to that point.  Managing user to job
relationships in REST would be throwaway in that case since that
information would now be stored in YARN.

For authorization I'm assuming that each user should only have access to
information about their queries and query results.  Any actions like
downloading results or cleaning up queries (deleting results) would also be
limited to that user.  Does this sound reasonable?  Do we want to add an
admin role that can do everything?  Is there anything anyone else wants to
discuss with regards to authorization or security in general?
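If jobs keep running as the "metron" service user, the REST layer carries the user-to-job bookkeeping itself. A minimal sketch of that bookkeeping (class and method names are illustrative; a real implementation would persist to a data store rather than memory):

```python
class JobRegistry:
    """Tracks which user submitted which pcap job, and answers the
    authorization question: can this user see/delete this job?
    In-memory only here; Metron would back this with a real store."""

    def __init__(self):
        self._jobs_by_user = {}

    def register(self, user, job_id):
        # Record ownership at submission time, since YARN only sees
        # the shared service user.
        self._jobs_by_user.setdefault(user, set()).add(job_id)

    def list_jobs(self, user):
        return sorted(self._jobs_by_user.get(user, set()))

    def can_access(self, user, job_id, is_admin=False):
        # Admins can see every job; other users only their own.
        return is_admin or job_id in self._jobs_by_user.get(user, set())

registry = JobRegistry()
registry.register("alice", "job-1")
registry.register("bob", "job-2")
```

Note that if REST ever impersonates users and submits jobs as them, this layer becomes redundant, because YARN itself would then record ownership.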

On Wed, May 9, 2018 at 1:22 PM, Ryan Merriman <merrim...@gmail.com> wrote:

> Thanks for the feedback Jon.  I am not as familiar with BPF filtering as
> you probably are.  Do you have an idea of how much effort would be involved in
> implementing this?  I suspect this would be another PcapFilter (
> https://github.com/apache/metron/blob/master/metron-
> platform/metron-pcap/src/main/java/org/apache/metron/pcap/
> filter/PcapFilter.java) similar to fixed and query so regardless of when
> we decide to add it our endpoint strategy should support 1 to n filters.
> Maybe we have an endpoint for each type of filter:
>
> POST /api/v1/pcap/fixed
> POST /api/v1/pcap/query
> POST /api/v1/pcap/bpf
>
> This would allow us to accept requests that are structured differently and
> specific to the type of filter.
>
> On Wed, May 9, 2018 at 12:32 PM, zeo...@gmail.com <zeo...@gmail.com>
> wrote:
>
>> This looks really great and gets me excited to maybe revisit some old
>> conversations about PCAP capture in Metron.  The only thing that I think
>> it's missing is the ability to filter using bpf.  I think the same thing
>> can technically be accomplished by using packet_filter and I wouldn't
>> throw
>> a fit if that's considered a follow-on, but bpf is the standard language
>> that people who do packet munging for a living know.
>>
>> Jon
>>
>> On Wed, May 9, 2018 at 1:00 PM Ryan Merriman <merrim...@gmail.com> wrote:
>>
>> > Now that we are confident we can submit an MR job from our current
>> REST
>> > application, is this the desired approach?  Just want to confirm.
>> >
>> > Next I think we should map out what the REST interface will look like.
>> > Here are the endpoints I'm thinking about:
>> >
>> > GET /api/v1/pcap/metadata?basePath
>> >
>> > This endpoint will return metadata of pcap data stored in HDFS.  This
>> would
>> > include pcap size, date ranges (how far back can I go), etc.  It would
>> > accept an optional HDFS basePath parameter for cases where pcap data is
>> > stored in multiple places and/or different from the default location.
>> >
>> > POST /api/v1/pcap/query
>> >
>> > This endpoint would accept a pcap request, submit a pcap query job, and
>> > return a job id.  The request would be an object containing the
>> parameters
>> > documented here:  https://github.com/apache/metron/tree/master/
>> > metron-platform/metron-pcap-backend#query-filter-utility.  A query/job
>> > would be associated with a user that submits it.  An exception will be
>> > returned for violating constraints like too many queries submitted,
>> query
>> > parameters out of limits, etc.
>> >
>> > GET /api/v1/pcap/status/<jobId>
>> >
>> > This endpoint will return the status of a running job.  I imagine this
>> is
>> > just a proxy to the YARN REST api.  We can discuss the implementation
>> > behind these

Re: [DISCUSS] Pcap panel architecture

2018-05-09 Thread Ryan Merriman
Thanks for the feedback Jon.  I am not as familiar with BPF filtering as
you probably are.  Do you have an idea of how much effort would be involved in
implementing this?  I suspect this would be another PcapFilter (
https://github.com/apache/metron/blob/master/metron-platform/metron-pcap/src/main/java/org/apache/metron/pcap/filter/PcapFilter.java)
similar to fixed and query so regardless of when we decide to add it our
endpoint strategy should support 1 to n filters.  Maybe we have an endpoint
for each type of filter:

POST /api/v1/pcap/fixed
POST /api/v1/pcap/query
POST /api/v1/pcap/bpf

This would allow us to accept requests that are structured differently and
specific to the type of filter.
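A rough sketch of how a client (or the UI) could route filter-specific bodies to these three endpoints. The payload shapes below are hypothetical examples, not the final API contract; only the endpoint paths come from the proposal above:

```python
# One endpoint per filter type, each accepting a differently shaped body.
FILTER_ENDPOINTS = {
    "fixed": "/api/v1/pcap/fixed",   # e.g. field/value pairs like ip_src_addr
    "query": "/api/v1/pcap/query",   # e.g. a Stellar query filter expression
    "bpf":   "/api/v1/pcap/bpf",     # e.g. a raw BPF expression string
}

def build_request(filter_type, filter_body):
    """Route a filter-specific body to its endpoint, leaving the shared
    job-submission machinery behind the endpoints filter-agnostic."""
    if filter_type not in FILTER_ENDPOINTS:
        raise ValueError("unsupported filter type: %s" % filter_type)
    return {"method": "POST", "path": FILTER_ENDPOINTS[filter_type], "body": filter_body}

bpf_req = build_request("bpf", {"expression": "tcp port 443"})
# bpf_req["path"] == "/api/v1/pcap/bpf"
```

Adding a new filter later (bpf, or anything else) then only means registering one more endpoint and its request shape.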

On Wed, May 9, 2018 at 12:32 PM, zeo...@gmail.com <zeo...@gmail.com> wrote:

> This looks really great and gets me excited to maybe revisit some old
> conversations about PCAP capture in Metron.  The only thing that I think
> it's missing is the ability to filter using bpf.  I think the same thing
> can technically be accomplished by using packet_filter and I wouldn't throw
> a fit if that's considered a follow-on, but bpf is the standard language
> that people who do packet munging for a living know.
>
> Jon
>
> On Wed, May 9, 2018 at 1:00 PM Ryan Merriman <merrim...@gmail.com> wrote:
>
> > Now that we are confident we can submit an MR job from our current
> REST
> > application, is this the desired approach?  Just want to confirm.
> >
> > Next I think we should map out what the REST interface will look like.
> > Here are the endpoints I'm thinking about:
> >
> > GET /api/v1/pcap/metadata?basePath
> >
> > This endpoint will return metadata of pcap data stored in HDFS.  This
> would
> > include pcap size, date ranges (how far back can I go), etc.  It would
> > accept an optional HDFS basePath parameter for cases where pcap data is
> > stored in multiple places and/or different from the default location.
> >
> > POST /api/v1/pcap/query
> >
> > This endpoint would accept a pcap request, submit a pcap query job, and
> > return a job id.  The request would be an object containing the
> parameters
> > documented here:  https://github.com/apache/metron/tree/master/
> > metron-platform/metron-pcap-backend#query-filter-utility.  A query/job
> > would be associated with a user that submits it.  An exception will be
> > returned for violating constraints like too many queries submitted, query
> > parameters out of limits, etc.
> >
> > GET /api/v1/pcap/status/<jobId>
> >
> > This endpoint will return the status of a running job.  I imagine this is
> > just a proxy to the YARN REST api.  We can discuss the implementation
> > behind these endpoints later.
> >
> > GET /api/v1/pcap/stop/<jobId>
> >
> > This endpoint would kill a running pcap job.  If the job has already
> > completed this is a noop.
> >
> > GET /api/v1/pcap/list
> >
> > This endpoint will list a user's submitted pcap queries.  Items in the
> list
> > would contain job id, status (is it finished?), start/end time, and
> number
> > of pages.  Maybe there is some overlap with the status endpoint above and
> > the status endpoint is not needed?
> >
> > GET /api/v1/pcap/pdml/<jobId>/<page>
> >
> > This endpoint will return pcap results for the given page in pdml format
> (
> > https://wiki.wireshark.org/PDML).  Are there other formats we want to
> > support?
> >
> > GET /api/v1/pcap/raw/<jobId>/<page>
> >
> > This endpoint will allow a user to download raw pcap results for the
> given
> > page.
> >
> > DELETE /api/v1/pcap/<jobId>
> >
> > This endpoint will delete pcap query results.  Not sure yet how this fits
> > in with our broader cleanup strategy.
> >
> > This should get us started.  What did I miss and what would you change
> > about these?  I did not include much detail related to security, cleanup
> > strategy, or underlying implementation details but these are items we
> > should discuss at some point.
> >
> > On Tue, May 8, 2018 at 5:38 PM, Michael Miklavcic <
> > michael.miklav...@gmail.com> wrote:
> >
> > > Sweet! That's great news. The pom changes are a lot simpler than I
> > > expected. Very nice.
> > >
> > > On Tue, May 8, 2018 at 4:35 PM, Ryan Merriman <merrim...@gmail.com>
> > wrote:
> > >
> > > > Finally figured it out.  Commit is here:
> > > > https://github.com/merrimanr/incubator-metron/commit/
> > > > 22fe5e9ff3c167b42ebeb7a9f1000753a409aff1
> > > >
> > > > It came down to figuring out the right combination of maven

Re: [DISCUSS] Pcap panel architecture

2018-05-09 Thread Ryan Merriman
Now that we are confident we can submit an MR job from our current REST
application, is this the desired approach?  Just want to confirm.

Next I think we should map out what the REST interface will look like.
Here are the endpoints I'm thinking about:

GET /api/v1/pcap/metadata?basePath

This endpoint will return metadata of pcap data stored in HDFS.  This would
include pcap size, date ranges (how far back can I go), etc.  It would
accept an optional HDFS basePath parameter for cases where pcap data is
stored in multiple places and/or different from the default location.

POST /api/v1/pcap/query

This endpoint would accept a pcap request, submit a pcap query job, and
return a job id.  The request would be an object containing the parameters
documented here:  https://github.com/apache/metron/tree/master/metron-platform/metron-pcap-backend#query-filter-utility.  A query/job
would be associated with a user that submits it.  An exception will be
returned for violating constraints like too many queries submitted, query
parameters out of limits, etc.

GET /api/v1/pcap/status/

This endpoint will return the status of a running job.  I imagine this is
just a proxy to the YARN REST api.  We can discuss the implementation
behind these endpoints later.

GET /api/v1/pcap/stop/

This endpoint would kill a running pcap job.  If the job has already
completed this is a noop.

GET /api/v1/pcap/list

This endpoint will list a user's submitted pcap queries.  Items in the list
would contain job id, status (is it finished?), start/end time, and number
of pages.  Maybe there is enough overlap with the status endpoint above
that a separate status endpoint is not needed?

GET /api/v1/pcap/pdml//

This endpoint will return pcap results for the given page in pdml format (
https://wiki.wireshark.org/PDML).  Are there other formats we want to
support?

GET /api/v1/pcap/raw//

This endpoint will allow a user to download raw pcap results for the given
page.

DELETE /api/v1/pcap/

This endpoint will delete pcap query results.  Not sure yet how this fits
in with our broader cleanup strategy.

This should get us started.  What did I miss and what would you change
about these?  I did not include much detail related to security, cleanup
strategy, or underlying implementation details but these are items we
should discuss at some point.
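To make the proposed contract above concrete, here is a small client-side sketch. Nothing in it is a shipped Metron API: the paths mirror the proposal in this mail, and the `node1:8082` base URL is just the full-dev placeholder.

```python
from urllib.parse import urlencode


class PcapRestClient:
    """URL builder for the *proposed* pcap REST endpoints.

    Illustrative only: the endpoint paths follow the proposal in this
    thread, and the default base URL is a full-dev placeholder.
    """

    def __init__(self, base_url="http://node1:8082/api/v1/pcap"):
        self.base_url = base_url.rstrip("/")

    def metadata_url(self, base_path=None):
        # optional basePath parameter for non-default pcap locations
        suffix = "?" + urlencode({"basePath": base_path}) if base_path else ""
        return f"{self.base_url}/metadata{suffix}"

    def query_url(self):
        # POST target; the body carries the query-filter parameters
        return f"{self.base_url}/query"

    def status_url(self, job_id):
        return f"{self.base_url}/status/{job_id}"

    def pdml_url(self, job_id, page):
        return f"{self.base_url}/pdml/{job_id}/{page}"

    def raw_url(self, job_id, page):
        return f"{self.base_url}/raw/{job_id}/{page}"


client = PcapRestClient()
print(client.status_url("job_1234"))  # http://node1:8082/api/v1/pcap/status/job_1234
```

A client (UI or script) would POST to `query_url()`, keep the returned job id, and then poll `status_url(job_id)` until results are ready.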

On Tue, May 8, 2018 at 5:38 PM, Michael Miklavcic <
michael.miklav...@gmail.com> wrote:

> Sweet! That's great news. The pom changes are a lot simpler than I
> expected. Very nice.
>
> On Tue, May 8, 2018 at 4:35 PM, Ryan Merriman <merrim...@gmail.com> wrote:
>
> > Finally figured it out.  Commit is here:
> > https://github.com/merrimanr/incubator-metron/commit/
> > 22fe5e9ff3c167b42ebeb7a9f1000753a409aff1
> >
> > It came down to figuring out the right combination of maven dependencies
> > and passing in the HDP version to REST as a Java system property.  I also
> > included some HDFS setup tasks.  I tested this in full dev and can now
> > successfully run a pcap query and get results.  All you should have to do
> > is generate some pcap data first.
> >
> > On Tue, May 8, 2018 at 4:17 PM, Michael Miklavcic <
> > michael.miklav...@gmail.com> wrote:
> >
> > > @Ryan - pulled your branch and experimented with a few things. In doing
> > so,
> > > it dawned on me that by adding the yarn and hadoop classpath, you
> > probably
> > > didn't introduce a new classpath issue, rather you probably just moved
> > onto
> > > the next classpath issue, ie hbase per your exception about hbase jaxb.
> > > Anyhow, I put up a branch with some pom changes worth trying in
> > conjunction
> > > with invoking the rest app startup via "/usr/bin/yarn jar"
> > >
> > > https://github.com/mmiklavc/metron/tree/ryan-rest-test
> > >
> > > https://github.com/mmiklavc/metron/commit/
> 5ca23580fc6e043fafae2327c80b65
> > > b20ca1c0c9
> > >
> > > Mike
> > >
> > >
> > > On Tue, May 8, 2018 at 7:44 AM, Simon Elliston Ball <
> > > si...@simonellistonball.com> wrote:
> > >
> > > > That would be a step closer to something more like a micro-service
> > > > architecture. However, I would want to make sure we think about the
> > > > operational complexity, and mpack implications of having another
> server
> > > > installed and running somewhere on the cluster (also, ssl, kerberos,
> > etc
> > > > etc requirements for that service).
> > > >
> > > > On 8 May 2018 at 14:27, Ryan Merriman <merrim...@gmail.com> wrote:
> > > >
> > > > > +1 to having metron-api as it's own service and using a gateway
> type

Re: [DISCUSS] Pcap panel architecture

2018-05-08 Thread Ryan Merriman
Finally figured it out.  Commit is here:
https://github.com/merrimanr/incubator-metron/commit/22fe5e9ff3c167b42ebeb7a9f1000753a409aff1

It came down to figuring out the right combination of maven dependencies
and passing in the HDP version to REST as a Java system property.  I also
included some HDFS setup tasks.  I tested this in full dev and can now
successfully run a pcap query and get results.  All you should have to do
is generate some pcap data first.

On Tue, May 8, 2018 at 4:17 PM, Michael Miklavcic <
michael.miklav...@gmail.com> wrote:

> @Ryan - pulled your branch and experimented with a few things. In doing so,
> it dawned on me that by adding the yarn and hadoop classpath, you probably
> didn't introduce a new classpath issue, rather you probably just moved onto
> the next classpath issue, ie hbase per your exception about hbase jaxb.
> Anyhow, I put up a branch with some pom changes worth trying in conjunction
> with invoking the rest app startup via "/usr/bin/yarn jar"
>
> https://github.com/mmiklavc/metron/tree/ryan-rest-test
>
> https://github.com/mmiklavc/metron/commit/5ca23580fc6e043fafae2327c80b65
> b20ca1c0c9
>
> Mike
>
>
> On Tue, May 8, 2018 at 7:44 AM, Simon Elliston Ball <
> si...@simonellistonball.com> wrote:
>
> > That would be a step closer to something more like a micro-service
> > architecture. However, I would want to make sure we think about the
> > operational complexity, and mpack implications of having another server
> > installed and running somewhere on the cluster (also, ssl, kerberos, etc
> > etc requirements for that service).
> >
> > On 8 May 2018 at 14:27, Ryan Merriman <merrim...@gmail.com> wrote:
> >
> > > +1 to having metron-api as it's own service and using a gateway type
> > > pattern.
> > >
> > > On Tue, May 8, 2018 at 8:13 AM, Otto Fowler <ottobackwa...@gmail.com>
> > > wrote:
> > >
> > > > Why not have metron-api as it’s own service and use a ‘gateway’ type
> > > > pattern in rest?
> > > >
> > > >
> > > > On May 8, 2018 at 08:45:33, Ryan Merriman (merrim...@gmail.com)
> wrote:
> > > >
> > > > Moving the yarn classpath command earlier in the classpath now gives
> > this
> > > > error:
> > > >
> > > > Caused by: java.lang.NoSuchMethodError:
> > > > javax.servlet.ServletContext.getVirtualServerName()Ljava/
> lang/String;
> > > >
> > > > I will experiment with other combinations, I suspect we will need
> > > > finer-grain control over the order.
> > > >
> > > > The grep matches class names inside jar files. I use this all the
> time
> > > and
> > > > it's really useful.
> > > >
> > > > The metron-rest jar is already shaded.
> > > >
> > > > Reverse engineering the yarn jar command was the next thing I was
> going
> > > to
> > > > try. Will let you know how it goes.
> > > >
> > > > On Tue, May 8, 2018 at 12:36 AM, Michael Miklavcic <
> > > > michael.miklav...@gmail.com> wrote:
> > > >
> > > > > What order did you add the hadoop or yarn classpath? The "shaded"
> > > > package
> > > > > stands out to me in this name "org.apache.hadoop.hbase.*shaded*
> > > > > .org.codehaus.jackson.jaxrs.JacksonJaxbJsonProvider." Maybe try
> > adding
> > > > > those packages earlier on the classpath.
> > > > >
> > > > > I think that find command needs a "jar tvf", otherwise you're
> looking
> > > > for a
> > > > > class name in jar file names.
> > > > >
> > > > > Have you tried shading the rest jar?
> > > > >
> > > > > I'd also look at the classpath you get when running "yarn jar" to
> > start
> > > > the
> > > > > existing pcap service, per the instructions in
> metron-api/README.md.
> > > > >
> > > > >
> > > > > On Mon, May 7, 2018 at 3:28 PM, Ryan Merriman <merrim...@gmail.com
> >
> > > > wrote:
> > > > >
> > > > > > To explore the idea of merging metron-api into metron-rest and
> > > running
> > > > > pcap
> > > > > > queries inside our REST application, I created a simple test
> here:
> > > > > > https://github.com/merrimanr/incubator-metron/tree/pcap-
> rest-test.
> > A
> > > > > > summar

Re: [DISCUSS] Pcap panel architecture

2018-05-08 Thread Ryan Merriman
+1 to having metron-api as its own service and using a gateway type
pattern.

On Tue, May 8, 2018 at 8:13 AM, Otto Fowler <ottobackwa...@gmail.com> wrote:

> Why not have metron-api as it’s own service and use a ‘gateway’ type
> pattern in rest?
>
>
> On May 8, 2018 at 08:45:33, Ryan Merriman (merrim...@gmail.com) wrote:
>
> Moving the yarn classpath command earlier in the classpath now gives this
> error:
>
> Caused by: java.lang.NoSuchMethodError:
> javax.servlet.ServletContext.getVirtualServerName()Ljava/lang/String;
>
> I will experiment with other combinations, I suspect we will need
> finer-grain control over the order.
>
> The grep matches class names inside jar files. I use this all the time and
> it's really useful.
>
> The metron-rest jar is already shaded.
>
> Reverse engineering the yarn jar command was the next thing I was going to
> try. Will let you know how it goes.
>
> On Tue, May 8, 2018 at 12:36 AM, Michael Miklavcic <
> michael.miklav...@gmail.com> wrote:
>
> > What order did you add the hadoop or yarn classpath? The "shaded"
> package
> > stands out to me in this name "org.apache.hadoop.hbase.*shaded*
> > .org.codehaus.jackson.jaxrs.JacksonJaxbJsonProvider." Maybe try adding
> > those packages earlier on the classpath.
> >
> > I think that find command needs a "jar tvf", otherwise you're looking
> for a
> > class name in jar file names.
> >
> > Have you tried shading the rest jar?
> >
> > I'd also look at the classpath you get when running "yarn jar" to start
> the
> > existing pcap service, per the instructions in metron-api/README.md.
> >
> >
> > On Mon, May 7, 2018 at 3:28 PM, Ryan Merriman <merrim...@gmail.com>
> wrote:
> >
> > > To explore the idea of merging metron-api into metron-rest and running
> > pcap
> > > queries inside our REST application, I created a simple test here:
> > > https://github.com/merrimanr/incubator-metron/tree/pcap-rest-test. A
> > > summary of what's included:
> > >
> > > - Added pcap as a dependency in the metron-rest pom.xml
> > > - Added a pcap query controller endpoint at
> > > http://node1:8082/swagger-ui.html#!/pcap-query-controller/
> > queryUsingGET
> > > - Added a pcap query service that runs a simple, hardcoded query
> > >
> > > Generate some pcap data using pycapa (
> > > https://github.com/apache/metron/tree/master/metron-sensors/pycapa)
> and
> > > the
> > > pcap topology (
> > > https://github.com/apache/metron/tree/master/metron-
> > > platform/metron-pcap-backend#starting-the-topology).
> > > After this initial setup there should be data in HDFS at
> > > "/apps/metron/pcap". I believe this should be enough to exercise the
> > > issue. Just hit the endpoint referenced above. I tested this in an
> > > already running full dev by building and deploying the metron-rest
> jar.
> > I
> > > did not rebuild full dev with this change but I would still expect it
> to
> > > work. Let me know if it doesn't.
> > >
> > > The first error I see when I hit this endpoint is:
> > >
> > > java.lang.NoClassDefFoundError:
> > > org/apache/hadoop/yarn/webapp/YarnJacksonJaxbJsonProvider.
> > >
> > > Here are the things I've tried so far:
> > >
> > > - Run the REST application with the YARN jar command since this is how
> > > all our other YARN/MR-related applications are started (metron-api,
> > > MAAS,
> > > pcap query, etc). I wouldn't expect this to work since we have
> > runtime
> > > dependencies on our shaded elasticsearch and parser jars and I'm not
> > > aware
> > > of a way to add additional jars to the classpath with the YARN jar
> > > command
> > > (is there a way?). Either way I get this error:
> > >
> > > 18/05/04 19:49:56 WARN reflections.Reflections: could not create Dir
> > using
> > > jarFile from url file:/usr/hdp/2.6.4.0-91/hadoop/lib/ojdbc6.jar.
> > skipping.
> > > java.lang.NullPointerException
> > >
> > >
> > > - I tried adding `yarn classpath` and `hadoop classpath` to the
> > > classpath in /usr/metron/0.4.3/bin/metron-rest.sh (REST start
> > > script). I
> > > get this error:
> > >
> > > java.lang.ClassNotFoundException:
> > > org.apache.hadoop.hbase.shaded.org.codehaus.jackson.
> > > jaxrs.JacksonJaxbJsonProvider
> > >
> > >
> > > - I sea

Re: [DISCUSS] Pcap panel architecture

2018-05-08 Thread Ryan Merriman
Moving the yarn classpath command earlier in the classpath now gives this
error:

Caused by: java.lang.NoSuchMethodError:
javax.servlet.ServletContext.getVirtualServerName()Ljava/lang/String;

I will experiment with other combinations; I suspect we will need
finer-grained control over the order.

The grep matches class names inside jar files.  I use this all the time and
it's really useful.

The metron-rest jar is already shaded.

Reverse engineering the yarn jar command was the next thing I was going to
try.  Will let you know how it goes.

On Tue, May 8, 2018 at 12:36 AM, Michael Miklavcic <
michael.miklav...@gmail.com> wrote:

> What order did you add the hadoop or yarn classpath? The "shaded" package
> stands out to me in this name "org.apache.hadoop.hbase.*shaded*
> .org.codehaus.jackson.jaxrs.JacksonJaxbJsonProvider." Maybe try adding
> those packages earlier on the classpath.
>
> I think that find command needs a "jar tvf", otherwise you're looking for a
> class name in jar file names.
>
> Have you tried shading the rest jar?
>
> I'd also look at the classpath you get when running "yarn jar" to start the
> existing pcap service, per the instructions in metron-api/README.md.
>
>
> On Mon, May 7, 2018 at 3:28 PM, Ryan Merriman <merrim...@gmail.com> wrote:
>
> > To explore the idea of merging metron-api into metron-rest and running
> pcap
> > queries inside our REST application, I created a simple test here:
> > https://github.com/merrimanr/incubator-metron/tree/pcap-rest-test.  A
> > summary of what's included:
> >
> >- Added pcap as a dependency in the metron-rest pom.xml
> >- Added a pcap query controller endpoint at
> >http://node1:8082/swagger-ui.html#!/pcap-query-controller/
> queryUsingGET
> >- Added a pcap query service that runs a simple, hardcoded query
> >
> > Generate some pcap data using pycapa (
> > https://github.com/apache/metron/tree/master/metron-sensors/pycapa) and
> > the
> > pcap topology (
> > https://github.com/apache/metron/tree/master/metron-
> > platform/metron-pcap-backend#starting-the-topology).
> > After this initial setup there should be data in HDFS at
> > "/apps/metron/pcap".  I believe this should be enough to exercise the
> > issue.  Just hit the endpoint referenced above.  I tested this in an
> > already running full dev by building and deploying the metron-rest jar.
> I
> > did not rebuild full dev with this change but I would still expect it to
> > work.  Let me know if it doesn't.
> >
> > The first error I see when I hit this endpoint is:
> >
> > java.lang.NoClassDefFoundError:
> > org/apache/hadoop/yarn/webapp/YarnJacksonJaxbJsonProvider.
> >
> > Here are the things I've tried so far:
> >
> >- Run the REST application with the YARN jar command since this is how
> >all our other YARN/MR-related applications are started (metron-api,
> > MAAS,
> >pcap query, etc).  I wouldn't expect this to work since we have
> runtime
> >dependencies on our shaded elasticsearch and parser jars and I'm not
> > aware
> >of a way to add additional jars to the classpath with the YARN jar
> > command
> >(is there a way?).  Either way I get this error:
> >
> > 18/05/04 19:49:56 WARN reflections.Reflections: could not create Dir
> using
> > jarFile from url file:/usr/hdp/2.6.4.0-91/hadoop/lib/ojdbc6.jar.
> skipping.
> > java.lang.NullPointerException
> >
> >
> >- I tried adding `yarn classpath` and `hadoop classpath` to the
> >classpath in /usr/metron/0.4.3/bin/metron-rest.sh (REST start
> > script).  I
> >get this error:
> >
> > java.lang.ClassNotFoundException:
> > org.apache.hadoop.hbase.shaded.org.codehaus.jackson.
> > jaxrs.JacksonJaxbJsonProvider
> >
> >
> >- I searched for the class in the previous attempt but could not find
> it
> >in full dev:
> >
> > find / -name "*.jar" 2>/dev/null | xargs grep
> > org/apache/hadoop/hbase/shaded/org/codehaus/jackson/
> > jaxrs/JacksonJaxbJsonProvider
> > 2>/dev/null
> >
> >
> >- Further up in the stack trace I see the error happens when
> initiating
> >the org.apache.hadoop.yarn.util.timeline.TimelineUtils class.  I
> tried
> >setting "yarn.timeline-service.enabled" in Ambari to false and then I
> > get
> >this error:
> >
> > Unable to parse
> > '/hdp/apps/${hdp.version}/mapreduce/mapreduce.tar.gz#mr-framework' as a
> > URI, check the setting for mapreduce.application.framework.

Re: [DISCUSS] Pcap panel architecture

2018-05-07 Thread Ryan Merriman
To explore the idea of merging metron-api into metron-rest and running pcap
queries inside our REST application, I created a simple test here:
https://github.com/merrimanr/incubator-metron/tree/pcap-rest-test.  A
summary of what's included:

   - Added pcap as a dependency in the metron-rest pom.xml
   - Added a pcap query controller endpoint at
   http://node1:8082/swagger-ui.html#!/pcap-query-controller/queryUsingGET
   - Added a pcap query service that runs a simple, hardcoded query

Generate some pcap data using pycapa (
https://github.com/apache/metron/tree/master/metron-sensors/pycapa) and the
pcap topology (
https://github.com/apache/metron/tree/master/metron-platform/metron-pcap-backend#starting-the-topology).
After this initial setup there should be data in HDFS at
"/apps/metron/pcap".  I believe this should be enough to exercise the
issue.  Just hit the endpoint referenced above.  I tested this in an
already running full dev by building and deploying the metron-rest jar.  I
did not rebuild full dev with this change but I would still expect it to
work.  Let me know if it doesn't.

The first error I see when I hit this endpoint is:

java.lang.NoClassDefFoundError:
org/apache/hadoop/yarn/webapp/YarnJacksonJaxbJsonProvider.

Here are the things I've tried so far:

   - Run the REST application with the YARN jar command since this is how
   all our other YARN/MR-related applications are started (metron-api, MAAS,
   pcap query, etc).  I wouldn't expect this to work since we have runtime
   dependencies on our shaded elasticsearch and parser jars and I'm not aware
   of a way to add additional jars to the classpath with the YARN jar command
   (is there a way?).  Either way I get this error:

18/05/04 19:49:56 WARN reflections.Reflections: could not create Dir using
jarFile from url file:/usr/hdp/2.6.4.0-91/hadoop/lib/ojdbc6.jar. skipping.
java.lang.NullPointerException


   - I tried adding `yarn classpath` and `hadoop classpath` to the
   classpath in /usr/metron/0.4.3/bin/metron-rest.sh (REST start script).  I
   get this error:

java.lang.ClassNotFoundException:
org.apache.hadoop.hbase.shaded.org.codehaus.jackson.jaxrs.JacksonJaxbJsonProvider


   - I searched for the class in the previous attempt but could not find it
   in full dev:

find / -name "*.jar" 2>/dev/null | xargs grep
org/apache/hadoop/hbase/shaded/org/codehaus/jackson/jaxrs/JacksonJaxbJsonProvider
2>/dev/null


   - Further up in the stack trace I see the error happens when initiating
   the org.apache.hadoop.yarn.util.timeline.TimelineUtils class.  I tried
   setting "yarn.timeline-service.enabled" in Ambari to false and then I get
   this error:

Unable to parse
'/hdp/apps/${hdp.version}/mapreduce/mapreduce.tar.gz#mr-framework' as a
URI, check the setting for mapreduce.application.framework.path


   - I've tried adding different hadoop, hbase, yarn and mapreduce Maven
   dependencies without any success
  - hadoop-yarn-client
  - hadoop-yarn-common
  - hadoop-mapreduce-client-core
  - hadoop-yarn-server-common
  - hadoop-yarn-api
  - hbase-server

I will keep exploring other possible solutions.  Let me know if anyone has
any ideas.
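As a debugging aside on the class-search attempt above: grepping the raw jar bytes can work (entry names are stored uncompressed), but reading each jar's entry list is less ambiguous. A sketch of that, with illustrative paths:

```python
import zipfile
from pathlib import Path


def find_class(root, class_entry):
    """Yield jar files under *root* containing an entry ending in *class_entry*.

    Same intent as the `find ... | xargs grep` one-liner above, but it
    inspects each jar's zip entry list instead of grepping raw bytes,
    so shaded/relocated packages show up under their full entry path.
    """
    for jar in Path(root).rglob("*.jar"):
        try:
            with zipfile.ZipFile(jar) as zf:
                if any(n.endswith(class_entry) for n in zf.namelist()):
                    yield jar
        except (zipfile.BadZipFile, OSError):
            continue  # unreadable or corrupt jar: skip, like `2>/dev/null`
```

For example, `find_class("/usr/hdp", "jaxrs/JacksonJaxbJsonProvider.class")` would list every HDP jar carrying that provider, shaded or not.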

On Mon, May 7, 2018 at 9:02 AM, Otto Fowler <ottobackwa...@gmail.com> wrote:

> I can imagine a new generic service(s) capability whose job ( pun intended
> ) is to
> abstract the submittal, tracking, and storage of results to yarn.
>
> It would be extended with storage providers, queue provider, possibly some
> set of policies or rather strategies.
>
> The pcap ‘report’ would be a client to that service, the specializes the
> service operation for the way we want pcap to work.
>
> We can then re-use the generic service for other long running yarn
> things…..
>
>
> On May 7, 2018 at 09:56:51, Otto Fowler (ottobackwa...@gmail.com) wrote:
>
> RE: Tracking v. users
>
> The submittal and tracking can associate the submitter with the yarn job
> and track that,
> regardless of the yarn credentials.
>
> IE> if all submittals and monitoring are by the same yarn user ( Metron )
> from a single or
> co-operative set of services, that service can maintain the mapping.
>
>
>
> On May 7, 2018 at 09:39:52, Ryan Merriman (merrim...@gmail.com) wrote:
>
> Otto, your use case makes sense to me. We'll have to think about how to
> manage the user to job relationships. I'm assuming YARN jobs will be
> submitted as the metron service user so YARN won't keep track of this for
> us. Is that assumption correct? Do you have any ideas for doing that?
>
> Mike, I can start a feature branch and experiment with merging metron-api
> into metron-rest. That should allow us to collaborate on any issues or
> challenges. Also, can you expand on your idea to manage external
> dependencies as a special module? That seems like a very attractive option
>
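Otto's "generic service" idea quoted above could be sketched as a small abstraction, with the pcap report as just one client. All names here are invented for illustration; a real implementation would submit to YARN and plug in storage/queue providers.

```python
import abc
import uuid


class JobService(abc.ABC):
    """Contract for a generic async-job service: submit, track, fetch.

    Illustrative sketch only. A YARN-backed implementation would
    submit the MR job and proxy status calls to the ResourceManager;
    storage of results would be a pluggable provider.
    """

    @abc.abstractmethod
    def submit(self, user, job_type, params):
        """Queue a job for *user* and return its job id."""

    @abc.abstractmethod
    def status(self, job_id):
        """Return the job state, e.g. RUNNING or FINISHED."""

    @abc.abstractmethod
    def results(self, job_id, page):
        """Return one page of results for a finished job."""


class InMemoryJobService(JobService):
    """Toy implementation that tracks the user-to-job mapping itself,
    independent of which principal the job actually runs as in YARN."""

    def __init__(self):
        self._jobs = {}

    def submit(self, user, job_type, params):
        job_id = str(uuid.uuid4())
        self._jobs[job_id] = {"user": user, "type": job_type,
                              "params": params, "state": "RUNNING",
                              "pages": []}
        return job_id

    def status(self, job_id):
        return self._jobs[job_id]["state"]

    def results(self, job_id, page):
        return self._jobs[job_id]["pages"][page]
```

Keeping the submitter in the service's own bookkeeping is one way to answer the earlier question about tracking users when every job is submitted as the metron service user.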

Re: [DISCUSS] Pcap panel architecture

2018-05-07 Thread Ryan Merriman
Otto, your use case makes sense to me.  We'll have to think about how to
manage the user to job relationships.  I'm assuming YARN jobs will be
submitted as the metron service user so YARN won't keep track of this for
us.  Is that assumption correct?  Do you have any ideas for doing that?

Mike, I can start a feature branch and experiment with merging metron-api
into metron-rest.  That should allow us to collaborate on any issues or
challenges.   Also, can you expand on your idea to manage external
dependencies as a special module?  That seems like a very attractive option
to me.

On Fri, May 4, 2018 at 8:39 AM, Otto Fowler <ottobackwa...@gmail.com> wrote:

> From my response on the other thread, but applicable to the backend stuff:
>
> "The PCAP Query seems more like PCAP Report to me.  You are generating a
> report based on parameters.
> That report is something that takes some time and external process to
> generate… ie you have to wait for it.
>
> I can almost imagine a flow where you:
>
> * Are in the AlertUI
> * Ask to generate a PCAP report based on some selected alerts/meta-alert,
> possibly picking from on or more report ‘templates’
> that have query options etc
> * The report request is ‘queued’, that is dispatched to be be
> executed/generated
> * You as a user have a ‘queue’ of your report results, and when the report
> is done it is queued there
> * We ‘monitor’ the report/queue press through the yarn rest ( report
> info/meta has the yarn details )
> * You can select the report from your queue and view it either in a new UI
> or custom component
> * You can then apply a different ‘view’ to the report or work with the
> report data
> * You can print / save etc
> * You can associate the report with the alerts ( again in the report info
> ) with…. a ‘case’ or ‘ticket’ or investigation something or other
>
>
> We can introduce extensibility into the report templates, report views (
> thinks that work with the json data of the report )
>
> Something like that.”
>
> Maybe we can do :
>
> template -> query parameters -> script => yarn info
> yarn info + query info + alert context + yarn status => report info ->
> stored in a user’s ‘report queue’
> report persistence added to report info
> metron-rest -> api to monitor the queue, read results ( page ), etc etc
>
>
> On May 4, 2018 at 09:23:39, Ryan Merriman (merrim...@gmail.com) wrote:
>
> I started a separate thread on Pcap UI considerations and user
> requirements
> at Otto's request. This should help us keep these two related but separate
> discussions focused.
>
> On Fri, May 4, 2018 at 7:19 AM, Michel Sumbul <michelsum...@gmail.com>
> wrote:
>
> > Hello,
> >
> >
> >
> > (Youhouuu my first reply on this kind of mail chain^^)
> >
> >
> >
> > If I may, I would like to share my view on the following 3 points.
> >
> > - Backend:
> >
> > The current metron-api is totally separate, it will be logic for me to
> have
> > it at the same place as the others rest api. Especially when more
> security
> > will be added, it will not be needed to do the job twice.
> > The current implementation send back a pcap object which still need to
> be
> > decoded. In OpenSOC, the decoding was done with tshark on the
> frontend.
> > It will be good to have this decoding happening directly on the backend
> to
> > not create a load on frontend. An option will be to install tshark on
> the
> > rest server and to use to convert the pcap to xml and then to a json
> that
> > will be send to the frontend.
> >
> > I tried to start directly the map/reduce job to search over all the pcap
> > data from the rest server and as Ryan mention it, we had trouble. I will
> > try to find back the error.
> >
> > Then in the POC, what we tried is to use the pcap_query script and this
> > work fine. I just modified it that he sends back directly the job_id of
> > yarn and not waiting that the job is finished. Then it will allow the UI
> > and the rest server to know what the status of the research by querying
> the
> > yarn rest api. This will allow the UI and the rest server to be async
> > without any blocking phase. What do you think about that?
> >
> >
> >
> > Having the job submitted directly from the code of the rest server will
> be
> > perfect, but it will need a lot of investigation I think (but I'm not
> the
> > expert so I might be completely wrong ^^).
> >
> > We know that the pcap_query script works fine so why not call it? Is
> it
> > that bad? (maybe stupid question, but I really don’t see a lot of

Re: [DISCUSS] Pcap panel architecture

2018-05-04 Thread Ryan Merriman
I started a separate thread on Pcap UI considerations and user requirements
at Otto's request.  This should help us keep these two related but separate
discussions focused.

On Fri, May 4, 2018 at 7:19 AM, Michel Sumbul 
wrote:

> Hello,
>
>
>
> (Youhouuu my first reply on this kind of mail chain^^)
>
>
>
> If I may, I would like to share my view on the following 3 points.
>
> - Backend:
>
> The current metron-api is totally separate; it would make sense to me to have
> it in the same place as the other REST APIs. Especially when more security
> is added, the work will not need to be done twice.
> The current implementation sends back a pcap object which still needs to be
> decoded. In OpenSOC, the decoding was done with tshark on the frontend.
> It would be good to have this decoding happen directly on the backend so as
> not to create load on the frontend. One option would be to install tshark on the
> rest server and use it to convert the pcap to XML and then to a JSON that
> is sent to the frontend.
>
> I tried to start the map/reduce job that searches over all the pcap
> data directly from the rest server and, as Ryan mentioned, we had trouble. I will
> try to find the error again.
>
> Then in the POC, what we tried is to use the pcap_query script, and this
> works fine. I just modified it so that it sends back the YARN job_id
> immediately instead of waiting for the job to finish. That allows the UI
> and the rest server to track the status of the search by querying the
> YARN REST API. This lets the UI and the rest server be async
> without any blocking phase. What do you think about that?
>
>
>
> Having the job submitted directly from the code of the rest server would be
> perfect, but I think it will need a lot of investigation (but I'm not the
> expert so I might be completely wrong ^^).
>
> We know that the pcap_query script works fine, so why not call it? Is it
> that bad? (Maybe a stupid question, but I really don’t see many drawbacks.)
>
>
>
> - Front end:
>
> Adding the pcap search to the alert UI is, I think, the easiest way to
> move forward. But indeed, it will then be the “Alert UI and pcapquery”.
> Maybe the name of the UI should just change to something like “Monitoring &
> Investigation UI” ?
>
>
>
> Is there any roadmap or plan for the different UIs? I mean, have you already
> had discussions on how you see the UI evolving with the new features that
> will come in the future?
>
>
>
> - Microservices:
>
>
>
> What do you mean exactly by microservices? Is it to separate all the
> features in different projects? Or something like having the different
> components in containers like Kubernetes? (again maybe a stupid question, but I
> don’t clearly understand what you mean :) )
>
>
>
> Michel
>
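Michel's return-the-job_id-then-poll flow maps directly onto the ResourceManager's REST API (`GET /ws/v1/cluster/apps/{appid}`). A minimal sketch, with the RM host as a placeholder and the HTTP call injectable so the flow is clear:

```python
import json
from urllib.request import urlopen


def yarn_app_state(app_id, rm="http://node1:8088", fetch=None):
    """Look up a YARN application's state (e.g. RUNNING, FINISHED).

    Uses the ResourceManager REST endpoint /ws/v1/cluster/apps/{appid}.
    The RM host here is a placeholder, and *fetch* can be stubbed so
    the logic is testable without a cluster.
    """
    url = f"{rm}/ws/v1/cluster/apps/{app_id}"
    fetch = fetch or (lambda u: urlopen(u).read())
    return json.loads(fetch(url))["app"]["state"]
```

The UI or metron-rest could poll this until the state reaches FINISHED, FAILED, or KILLED, keeping both sides async with no blocking phase.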


[DISCUSS] Pcap UI user requirements

2018-05-04 Thread Ryan Merriman
Continuing a discussion that started in a discuss thread about exposing
Pcap query capabilities in the back end.  How should we expose this feature
to users?  Should it be integrated into the Alerts UI or be separate
standalone UI?

To summarize the general points made in the other thread:

   - Adding this capability to the Alerts UI will make it more of a
   composite app.  Is that really what we want since we have separate UIs for
   Alerts and management?
   - Would it be better to bring it in on its own so it can be released
   with qualifiers and tested with the right expectations without affecting
   the Alerts UI?
   - There are some use cases that begin with an infosec analyst doing a
   search on alerts, followed by them querying pcap data corresponding to
   the threats they're investigating.  Would having these features in the
   same UI streamline this process?

There was also mention of some features we should consider:

   - Pcap queries should be made asynchronous via the UI
   - Take care that a user doesn't hit refresh or POST multiple times and kick
   off 50 mapreduce jobs
   - Options for managing the YARN queue that is used
   - Provide a "cancel" option that kills the MR job, or tell the user to
   go to the CLI to kill their job
   - Managing data if multiple users run queries
   - Strategy for cleaning up jobs and implementing a TTL (I think this one
   will be tricky and definitely needs discussion)
   - Date range or other query limits

A couple other features I would add:

   - Ability to paginate through results
   - Ability to download results through the UI
   - Realtime status of a running job in the UI

Let me know if I missed any points or did not correctly capture them here.
What other points do we need to consider?  What other features should be
required?  Nice to have?
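For the "cleaning up jobs and implementing a TTL" point above, one simple approach (an assumption, not a decided design) is age-based cleanup of per-user result directories; the layout used here is hypothetical:

```python
import shutil
import time
from pathlib import Path


def purge_expired_results(results_root, ttl_seconds, now=None):
    """Delete job-result directories older than *ttl_seconds*.

    Assumed (hypothetical) layout: <results_root>/<user>/<job_id>/...
    Returns the directories removed so the caller can log them. A real
    implementation would likely run this against HDFS instead of a
    local filesystem.
    """
    now = now if now is not None else time.time()
    removed = []
    for job_dir in Path(results_root).glob("*/*"):
        if job_dir.is_dir() and now - job_dir.stat().st_mtime > ttl_seconds:
            shutil.rmtree(job_dir)
            removed.append(job_dir)
    return removed
```

Scheduled periodically, this bounds the disk held by finished queries even when multiple users run them.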


Re: [DISCUSS] Pcap panel architecture

2018-05-03 Thread Ryan Merriman
I know, I was running with it :)

> On May 3, 2018, at 10:21 PM, Michael Miklavcic <michael.miklav...@gmail.com> 
> wrote:
> 
> Tabs vs spaces was a Silicon Valley joke, man :-)
> 
>> On Thu, May 3, 2018, 8:42 PM Ryan Merriman <merrim...@gmail.com> wrote:
>> 
>> Mike,
>> 
>> I never said there was anything problematic in metron-api, just that it was
>> inconsistent with the rest of Metron.  There is work involved in making it
>> consistent which is why I listed it as a downside.  I'm less concerned with
>> whether we use tabs or spaces but that we use one or the other.
>> 
>> I apologize for not making this clearer in my original message, but I did
>> not lead the POC development.  My involvement was helping troubleshoot
>> issues they ran into and answering questions about Metron in general.  I've
>> shared with you the information that I have which is my observations about
>> the types of issues they ran into.  I don't have a branch or pom file you
>> can experiment with.  I will reach out to that person and see if they are
>> able to share the exact errors they hit.  Also, the "trade-offs that you
>> seem to have already decided on" is not based on a specific issue or
>> challenge they faced in the POC.  It's based off of the past couple years
>> of working on our REST module and the reoccurring challenges and patterns I
>> see over a period of time.
>> 
>> Otto,
>> 
>> Makes sense to me.  I will start the other threads.
>> 
>> On Thu, May 3, 2018 at 8:50 PM, Otto Fowler <ottobackwa...@gmail.com>
>> wrote:
>> 
>>> I think my point is that maybe we should have a discuss about:
>>> 
>>> * PCAP UI, goals etc
>>> * Where it would live and why, what that would mean etc
>>> * Backend ( this original mail )
>>> 
>>> 
>>> 
>>> On May 3, 2018 at 18:34:00, Michael Miklavcic (
>> michael.miklav...@gmail.com)
>>> wrote:
>>> 
>>> Otto, what are you and your customers finding useful and/or difficult
>> from
>>> a split management/alerts UI perspective? It might help us to restate the
>>> original scope and intent around maintaining separate management and
>> alert
>>> UI's, to your point about "contrary to previous direction." I personally
>>> don't have a strong position on this other than 1) management is a
>>> different feature set from drilling into threat intel, yet many apps
>> still
>>> have their management UI combined with the end user experience and 2) we
>>> should probably consider pcap in context of a workflow with alerts.
>>> 
>>> On Thu, May 3, 2018 at 4:19 PM, Otto Fowler <ottobackwa...@gmail.com>
>>> wrote:
>>> 
>>>> If that UI becomes the Alerts _and_ the PCAP Query UI, then it isn’t
>> the
>>>> alerts ui anymore.
>>>> 
>>>> It is becoming more of a “composite” app, with multiple feature ui’s
>>>> together. I didn’t think that
>>>> was what we were going for, thus the config ui and the alert ui.
>>>> 
>>>> Just adding disparate thing as ‘new tabs’ to a ui may be expedient but
>>> it
>>>> seems contrary to
>>>> our previous direction.
>>>> 
>>>> There are a few things to consider if we are going to start moving
>>>> everything into Alerts Ui aren’t there?
>>>> 
>>>> It may be a better road to bring it in on it’s own like the alerts ui
>>>> effort, so it can be released with ‘qualifiers’ and tested with
>>>> the right expectations without effecting the Alerts UI.
>>>> 
>>>> 
>>>> 
>>>> On May 3, 2018 at 17:25:54, Ryan Merriman (merrim...@gmail.com) wrote:
>>>> 
>>>> Otto,
>>>> 
>>>> I'm assuming just adding it to the Alerts UI is less work but I
>> wouldn't
>>> be
>>>> strongly opposed to it being it's own UI. What are the reasons for
>> doing
>>>> that?
>>>> 
>>>> Mike,
>>>> 
>>>> On using metron-api:
>>>> 
>>>> 1. I'm making an assumption about it not being used much. Maybe it
>>>> still works without issue. I agree, we'll have to test anything we
>> build
>>>> so this is a minor issue.
>>>> 2. Updating metron-api to be asynchronous is a requirement in my
>> opinion
>>>> 3. The MPack work is the major drawback for me. We're essentially
>>>> creatin

Re: [DISCUSS] Pcap panel architecture

2018-05-03 Thread Ryan Merriman
Mike,

I never said there was anything problematic in metron-api, just that it was
inconsistent with the rest of Metron.  There is work involved in making it
consistent which is why I listed it as a downside.  I'm less concerned with
whether we use tabs or spaces but that we use one or the other.

I apologize for not making this clearer in my original message, but I did
not lead the POC development.  My involvement was helping troubleshoot
issues they ran into and answering questions about Metron in general.  I've
shared with you the information that I have which is my observations about
the types of issues they ran into.  I don't have a branch or pom file you
can experiment with.  I will reach out to that person and see if they are
able to share the exact errors they hit.  Also, the "trade-offs that you
seem to have already decided on" is not based on a specific issue or
challenge they faced in the POC.  It's based off of the past couple years
of working on our REST module and the reoccurring challenges and patterns I
see over a period of time.

Otto,

Makes sense to me.  I will start the other threads.

On Thu, May 3, 2018 at 8:50 PM, Otto Fowler <ottobackwa...@gmail.com> wrote:

> I think my point is that maybe we should have a discuss about:
>
> * PCAP UI, goals etc
> * Where it would live and why, what that would mean etc
> * Backend ( this original mail )
>
>
>
> On May 3, 2018 at 18:34:00, Michael Miklavcic (michael.miklav...@gmail.com)
> wrote:
>
> Otto, what are you and your customers finding useful and/or difficult from
> a split management/alerts UI perspective? It might help us to restate the
> original scope and intent around maintaining separate management and alert
> UI's, to your point about "contrary to previous direction." I personally
> don't have a strong position on this other than 1) management is a
> different feature set from drilling into threat intel, yet many apps still
> have their management UI combined with the end user experience and 2) we
> should probably consider pcap in context of a workflow with alerts.
>
> On Thu, May 3, 2018 at 4:19 PM, Otto Fowler <ottobackwa...@gmail.com>
> wrote:
>
> > If that UI becomes the Alerts _and_ the PCAP Query UI, then it isn’t the
> > alerts ui anymore.
> >
> > It is becoming more of a “composite” app, with multiple feature ui’s
> > together. I didn’t think that
> > was what we were going for, thus the config ui and the alert ui.
> >
> > Just adding disparate thing as ‘new tabs’ to a ui may be expedient but
> it
> > seems contrary to
> > our previous direction.
> >
> > There are a few things to consider if we are going to start moving
> > everything into Alerts Ui aren’t there?
> >
> > It may be a better road to bring it in on it’s own like the alerts ui
> > effort, so it can be released with ‘qualifiers’ and tested with
> > the right expectations without effecting the Alerts UI.
> >
> >
> >
> > On May 3, 2018 at 17:25:54, Ryan Merriman (merrim...@gmail.com) wrote:
> >
> > Otto,
> >
> > I'm assuming just adding it to the Alerts UI is less work but I wouldn't
> be
> > strongly opposed to it being it's own UI. What are the reasons for doing
> > that?
> >
> > Mike,
> >
> > On using metron-api:
> >
> > 1. I'm making an assumption about it not being used much. Maybe it
> > still works without issue. I agree, we'll have to test anything we build
> > so this is a minor issue.
> > 2. Updating metron-api to be asynchronous is a requirement in my opinion
> > 3. The MPack work is the major drawback for me. We're essentially
> > creating a brand new Metron component. There are a lot of examples we
> can
> > draw from but it's going to be a large chunk of new MPack code to
> maintain
> > and MPack development has been painful in the past. I think it will
> > include:
> > 1. Creating a start script
> > 2. Creating master.py and commands.py scripts for managing the
> > application lifecycle, service checks, etc
> > 3. Creating an -env.xml file for exposing properties in Ambari
> > 4. Adding the component to the various MPack files
> > (metron_theme.json, metainfo.xml, service_advisor.py, etc.)
> > 4. Our Storm topologies are completely different use cases and much more
> > complex so I don't understand the comparison. But if you prefer this
> > coding style then I think this is a minor issue as well.
> >
> > On micro-services:
> >
> > 1. Our REST service already includes a lot of dependencies and is
> > difficult to manage in it's current state. I just went through this on
> > https://github.com/apache/metron/pul

Re: [DISCUSS] Pcap panel architecture

2018-05-03 Thread Ryan Merriman
Otto,

I'm assuming just adding it to the Alerts UI is less work but I wouldn't be
strongly opposed to it being its own UI.  What are the reasons for doing
that?

Mike,

On using metron-api:

   1. I'm making an assumption about it not being used much.  Maybe it
   still works without issue.  I agree, we'll have to test anything we build
   so this is a minor issue.
   2. Updating metron-api to be asynchronous is a requirement in my opinion
   3. The MPack work is the major drawback for me.  We're essentially
   creating a brand new Metron component.  There are a lot of examples we can
   draw from but it's going to be a large chunk of new MPack code to maintain
   and MPack development has been painful in the past.  I think it will
   include:
  1. Creating a start script
  2. Creating master.py and commands.py scripts for managing the
  application lifecycle, service checks, etc
  3. Creating an -env.xml file for exposing properties in Ambari
  4. Adding the component to the various MPack files
  (metron_theme.json, metainfo.xml, service_advisor.py, etc.)
   4. Our Storm topologies are completely different use cases and much more
   complex so I don't understand the comparison.  But if you prefer this
   coding style then I think this is a minor issue as well.

On micro-services:

   1. Our REST service already includes a lot of dependencies and is
difficult to manage in its current state.  I just went through this on
   https://github.com/apache/metron/pull/1008.  It was painful.  When we
   tried to include mapreduce and yarn dependencies it became what seemed like
   an endless NoSuchMethod, NoClassDef and similar errors.  Even if we can get
   it to work it's going to make managing our REST service that much harder
   than it already is.  I think the shaded jars are the source of all this
   trouble and I agree it would be nice to improve our architecture in this
   area.  However I don't think it's a simple fix and now we're getting into
   the "will likely take a long time to plan and implement" concern.  If
   anyone has ideas on how to solve our shaded jar challenge I would be all
   for it.
   2. All the MPack work listed above would also be required here.  A
   micro-services pattern is a significant shift and I can't even give you
   concrete examples of what exactly we would have to do.  We would need to go
   through extensive design and planning to even get to that point.
   3. It would be a brand new component.  See above plus any new
   infrastructure we would need (web server/proxy, service discovery, etc)

On pcap-query:

   1. I don't recall any users or customers directly using metron-api but
   if you say so I believe you :)
   2. As I understand it the pcap topology and pcap query are somewhat
   decoupled.  Maybe location of pcap files would be shared?  MPack work here
   is likely to include adding a couple properties and moving some around so
   they can be shared.  Deciding between Ambari and global config would be
   similar to properties we add to any component.

I think you may be underestimating how difficult it's going to be to solve
our dependency problem.  Or maybe it's me that is overestimating it :)  It
could be something we experiment with before we start on the pcap work.
There is major upside and it would benefit the whole project.  But until
then we can't fit any more screwdrivers in the toolbox.  For me the
only reasonable options are to use the existing metron-api as its own
separate service or call out to the pcap_query.sh script from our existing
REST app.  I could go either way really.  I'm just not excited about all
the MPack code we have to write for a new component.  Maybe it won't be
that bad.

On Thu, May 3, 2018 at 2:50 PM, Otto Fowler <ottobackwa...@gmail.com> wrote:

> First thought is why the Alerts-UI and Not a dedicated  Query UI?
>
>
> On May 3, 2018 at 14:36:04, Ryan Merriman (merrim...@gmail.com) wrote:
>
> We are planning on adding the pcap query feature to the Alerts UI. Before
> we start this work, I think it is important to get community buy in on the
> architectural approach. There are a couple different options.
>
> One option is to leverage the existing metron-api module that exposes pcap
> queries through a REST service. The upsides are:
>
> - some work has already been done
> - it's part of our build so we know unit and integration tests pass
>
> The downsides are:
>
> - It hasn't been used in a while and will need some end to end testing
> to make sure it still functions properly
> - It is synchronous and will block the UI, using up the limited number
> of concurrent connections available in a browser
> - It will require significant MPack work to properly set it up on install
> - It is a legacy module from OpenSOC and coding style is significantly
> different
>
> Another option would be moving to a micro-s

[DISCUSS] Pcap panel architecture

2018-05-03 Thread Ryan Merriman
We are planning on adding the pcap query feature to the Alerts UI.  Before
we start this work, I think it is important to get community buy in on the
architectural approach.  There are a couple different options.

One option is to leverage the existing metron-api module that exposes pcap
queries through a REST service.  The upsides are:

   - some work has already been done
   - it's part of our build so we know unit and integration tests pass

The downsides are:

   - It hasn't been used in a while and will need some end to end testing
   to make sure it still functions properly
   - It is synchronous and will block the UI, using up the limited number
   of concurrent connections available in a browser
   - It will require significant MPack work to properly set it up on install
   - It is a legacy module from OpenSOC and coding style is significantly
   different

Another option would be moving to a micro-services architecture.  We have
experimented with a proof of concept and found it was too hard to add this
feature into our existing REST services because of all the dependencies
that must coexist in the same application.  The upsides are:

   - Would provide a platform for future Batch/MR/YARN type features
   - There would be fewer technical compromises since we are building it
   from the ground up

The downsides are:

   - Will require the most effort and will likely take a long time to plan
   and implement
   - Like the previous option, will require significant MPack work

A third option would be to add an endpoint to our existing REST service
that delegates to the pcap_query.sh script through the Java Process class.
The upsides to this approach are:

   - We know the pcap_query.sh script works and would require minimal
   changes
   - Minimal MPack work is required since our REST service is already
   included

The downsides are:

   - Does not set us up to easily add other batch-oriented features in the
   future
   - OS-level security becomes a concern since we are delegating to a
   script in a separate process
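
For the third option, the delegation itself is a small amount of code around java.lang.Process. The sketch below is hypothetical: the real pcap_query.sh arguments are not shown, and the command is parameterized so the example runs anywhere.

```java
import java.io.BufferedReader;
import java.io.InputStreamReader;
import java.util.stream.Collectors;

// Sketch of a REST endpoint delegating a query to an external script.
// In Metron this would invoke pcap_query.sh with the query arguments;
// here the command is generic so the example is self-contained.
public class ScriptDelegate {

  // Run the command, capture stdout, and fail loudly on a non-zero exit.
  public static String run(String... command) throws Exception {
    Process process = new ProcessBuilder(command)
        .redirectErrorStream(true)   // fold stderr into stdout for simplicity
        .start();
    String output;
    try (BufferedReader reader =
             new BufferedReader(new InputStreamReader(process.getInputStream()))) {
      output = reader.lines().collect(Collectors.joining("\n"));
    }
    int exitCode = process.waitFor();
    if (exitCode != 0) {
      throw new IllegalStateException("command failed with exit code " + exitCode);
    }
    return output;
  }

  public static void main(String[] args) throws Exception {
    // stand-in for something like: run("pcap_query.sh", "fixed", "--ip_src_addr", "10.0.0.1")
    System.out.println(run("echo", "query submitted"));
  }
}
```

Passing arguments as a list rather than a single shell string avoids shell interpolation, which helps with the OS-level security concern above, though the script still runs with whatever OS permissions the REST service has.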

I feel like ultimately we want to transition to a micro-services
architecture because it will provide more flexibility and make it easier to
grow our set of features.  But in the meantime, wrapping the pcap_query.sh
script would allow us to add this feature with less work and fewer lines of
code.  If and when we decide to deploy a separate REST application for
batch features, the UI portion would require minimal changes.

What does everyone think?


[DISCUSS] Development environment for UI work

2018-03-09 Thread Ryan Merriman
In anticipation of more UI work starting (Alerts UI specifically), I want
to kick off a discussion about how we can best provide a backend for UI
developers to code against.  I believe our full dev environment will not
work for this because:

   1. It has become more resource intensive over time and requires constant
   tuning to make it stable.
   2. Takes a long time to build.  I think this will eventually lead to UI
   developers building less often and not being current with master.
   3. UI developers will likely not have experience with big data projects
   and should not be required to manage a full metron VM install locally.

I think we need something more lightweight that only includes components
needed for UI development.  Here are what I think are the requirements for
this:

   1. Must include the REST component as this is how the UIs interact with
   Metron
   2. Should also include Search (ES or Solr), Kafka, HBase and Zookeeper
   since REST depends on these (am I missing any?).
   3. Should be portable such that it can be run locally or in a shared
   environment like AWS
   4. The environment should always contain the latest version of master

There is currently work being done to externalize our integration test
infrastructure (https://issues.apache.org/jira/browse/METRON-1352) that I
believe can also be leveraged here.  Are there other options or approaches
people can think of?  What's the best way to provide an environment that's
easy to spin up and always current with master?
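
Whatever form the lightweight environment takes, from a UI developer's perspective it only needs to expose a handful of ports. A hypothetical sanity check is sketched below; the service list and port numbers are assumed typical defaults, not something this thread pins down.

```java
import java.net.InetSocketAddress;
import java.net.Socket;
import java.util.LinkedHashMap;
import java.util.Map;

// Sketch of a check a UI developer could run against the lightweight
// environment before starting work, whether it is local or shared (AWS).
public class EnvCheck {
  public static Map<String, Boolean> check(Map<String, Integer> services, String host) {
    Map<String, Boolean> up = new LinkedHashMap<>();
    for (Map.Entry<String, Integer> e : services.entrySet()) {
      try (Socket s = new Socket()) {
        s.connect(new InetSocketAddress(host, e.getValue()), 500); // 500 ms timeout
        up.put(e.getKey(), true);
      } catch (Exception ex) {
        up.put(e.getKey(), false);   // report unreachable, don't blow up
      }
    }
    return up;
  }

  public static void main(String[] args) {
    Map<String, Integer> services = new LinkedHashMap<>();
    services.put("rest", 8082);          // assumed Metron REST port
    services.put("elasticsearch", 9200); // assumed ES HTTP port
    services.put("kafka", 9092);
    services.put("zookeeper", 2181);
    check(services, "localhost").forEach((name, ok) ->
        System.out.println(name + ": " + (ok ? "up" : "down")));
  }
}
```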


Re: [DISCUSS] Persistence store for user profile settings

2018-02-09 Thread Ryan Merriman
t don't need HA/DR, just use the DB that gets spun-up with Ambari.
> > >
> > >
> > >
> > >
> > >
> > > On Fri, Feb 2, 2018 at 7:17 AM Simon Elliston Ball <
> > > si...@simonellistonball.com> wrote:
> > >
> > >> Introducing a RDBMS to the stack seems unnecessary for this.
> > >>
> > >> If we consider the data access patterns for user profiles, we are
> > unlikely
> > >> to query into them, or indeed do anything other than look them up, or
> > write
> > >> them out by a username key. To that end, using an ORM to translate a a
> > >> nested config object into a load of tables seems to introduce
> complexity
> > >> and brittleness we then have to take away through relying on
> relational
> > >> consistency models. We would also end up with, as Mike points out, a
> > whole
> > >> new disk deployment patterns and a bunch of additional DBA ops process
> > >> requirements for every install.
> > >>
> > >> Since the access pattern is almost entirely key => value, hbase seems
> a
> > >> good option (because we already have it there, it would be kinda crazy
> > at
> > >> this scale if we didn’t already have it) or arguably zookeeper, but
> that
> > >> might be at the other end of the scale argument. I’d even go as far as
> > to
> > >> suggest files on HDFS to keep it simple.
> > >>
> > >> Simon
> > >>
> > >>> On 1 Feb 2018, at 23:24, Michael Miklavcic <
> > michael.miklav...@gmail.com>
> > >> wrote:
> > >>>
> > >>> Personally, I'd be in favor of something like Maria DB as an open
> > source
> > >>> repo. Or any other ansi sql store. On the positive side, it should
> mesh
> > >>> seamlessly with ORM tools. And the schema for this should be pretty
> > >>> vanilla, I'd imagine. I might even consider skipping ORM for straight
> > >> JDBC
> > >>> and simple command scripts in Java for something this small. I'm not
> > >>> worried so much about migrations of this sort. Large scale DBs can
> get
> > >>> involved with major schema changes, but thats usually when the
> > datastore
> > >> is
> > >>> a massive set of tables with complex relationships, at least in my
> > >>> experience.
> > >>>
> > >>> We could also use hbase, which probably wouldn't be that hard either,
> > but
> > >>> there may be more boilerplate to write for the client as compared to
> > >>> standard SQL. But I'm assuming we could reuse a fair amount of
> existing
> > >>> code from our enrichments. One additional reason in favor of hbase
> > might
> > >> be
> > >>> data replication. For a SQL instance we'd probably recommend a RAID
> > store
> > >>> or backup procedure, but we get that pretty easy with hbase too.
> > >>>
> > >>> On Feb 1, 2018 2:45 PM, "Casey Stella" <ceste...@gmail.com> wrote:
> > >>>
> > >>>> So, I'll answer your question with some questions:
> > >>>>
> > >>>>  - No matter the data store we use upgrading will take some care,
> > >> right?
> > >>>>  - Do we currently depend on a RDBMS anywhere?  I want to say that
> we
> > >> do
> > >>>>  in the REST layer already, right?
> > >>>>  - If we don't use a RDBMs, what's the other option?  What are the
> > pros
> > >>>>  and cons?
> > >>>>  - Have we considered non-server offline persistent solutions (e.g.
> > >>>>  https://www.html5rocks.com/en/features/storage)?
> > >>>>
> > >>>>
> > >>>>
> > >>>> On Thu, Feb 1, 2018 at 9:11 AM, Ryan Merriman <merrim...@gmail.com>
> > >> wrote:
> > >>>>
> > >>>>> There is currently a PR up for review that allows a user to
> configure
> > >> and
> > >>>>> save the list of facet fields that appear in the left column of the
> > >>>> Alerts
> > >>>>> UI:  https://github.com/apache/metron/pull/853.  The REST layer
> has
> > >> ORM
> > >>>>> support which means we can store those in a relational database.
> > >>>>>
> > >>>>> However I'm not 100% sure this is the best place to keep this.  As
> we
> > >> add
> > >>>>> more use cases like this the backing tables in the RDBMS will need
> to
> > >> be
> > >>>>> managed.  This could make upgrading more tedious and error-prone.
> Is
> > >>>> there
> > >>>>> are a better way to store this, assuming we can leverage a
> component
> > >>>> that's
> > >>>>> already included in our stack?
> > >>>>>
> > >>>>> Ryan
> > >>>>>
> > >>>>
> > >>
> > >>
> >
> >
>


[DISCUSS] Persistence store for user profile settings

2018-02-01 Thread Ryan Merriman
There is currently a PR up for review that allows a user to configure and
save the list of facet fields that appear in the left column of the Alerts
UI:  https://github.com/apache/metron/pull/853.  The REST layer has ORM
support which means we can store those in a relational database.

However I'm not 100% sure this is the best place to keep this.  As we add
more use cases like this the backing tables in the RDBMS will need to be
managed.  This could make upgrading more tedious and error-prone.  Is there
a better way to store this, assuming we can leverage a component that's
already included in our stack?

Ryan
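
As Simon's reply in this thread argues, the access pattern here is almost entirely key => value: settings are written and looked up by username, never queried into. A minimal sketch of that pattern follows, with an in-memory map standing in for the HBase table a real implementation would use; all names are illustrative.

```java
import java.nio.charset.StandardCharsets;
import java.util.Map;
import java.util.Optional;
import java.util.concurrent.ConcurrentHashMap;

// Sketch of key => value storage for user profile settings. Everything is
// keyed by username, so a put/get interface is enough; a ConcurrentHashMap
// stands in for an HBase table here, and the real version would swap in
// the HBase client without changing the interface.
public class UserSettingsStore {
  private final Map<String, byte[]> table = new ConcurrentHashMap<>();

  // Save the user's settings blob (e.g. a JSON list of facet fields).
  public void save(String user, String settingsJson) {
    table.put(user, settingsJson.getBytes(StandardCharsets.UTF_8));
  }

  // Look up by username; no query into the blob is ever needed.
  public Optional<String> findByUser(String user) {
    return Optional.ofNullable(table.get(user))
        .map(b -> new String(b, StandardCharsets.UTF_8));
  }
}
```

Because the interface is this narrow, switching the backing store later (HBase, ZooKeeper, files on HDFS) would not ripple into the REST layer the way an ORM schema change could.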


[DISCUSS] Overcoming developer inertia when spinning up new environments

2017-12-18 Thread Ryan Merriman
I want to revisit the idea of providing an alternative container-based
approach (Docker, Kubernetes, etc) to spinning up Metron that is faster and
uses less resources (a "Metron light").  This would provide a way for
reviewers to more quickly review and test out changes.  Full dev with
ansible will still serve its purpose, this would just be another tool for
cases where full dev is not the best fit.

This would be a new non-trivial module that will need to be maintained.
There have been discussions in the past that resulted in the community not
wanting to maintain another installation path.  However it has been a while
since we had those discussions and Metron is now more mature.  We would
also be able to leverage the work already being done in
https://issues.apache.org/jira/browse/METRON-1352 to unify the integration
testing infrastructure.

There are other potential use cases for this too.  It could be expanded to
provide a demo environment for exploring the UIs and Metron API.  Providing
container support for Metron could also be the beginning of a broader cloud
deployment strategy.

Is this something we want to explore?  What would the requirements be?


Re: [DISCUSS] Integration/e2e test infrastructure requirements

2017-12-13 Thread Ryan Merriman
I took a first pass at adding tasks and will continue adding more as I
think of them.  I will wait for feedback on which modules to include before
I add all those (only added metron-elasticsearch for now).  I left all but
a couple unassigned so that anyone can pick up a task if they want.

On Wed, Dec 13, 2017 at 4:41 PM, Ryan Merriman <merrim...@gmail.com> wrote:

> Jira is here:  https://issues.apache.org/jira/browse/METRON-1352.  I am
> starting to create sub-tasks based on the requirements outlined above and
> included in that Jira description.
>
> I am compiling a list of modules that we'll need to convert to the testing
> infrastructure.  Based on imports of ComponentRunner, I get these modules:
>
>- metron-elasticsearch
>- metron-enrichment
>- metron-indexing
>- metron-integration-test
>- metron-maas-service
>- metron-management
>- metron-pcap-backend
>- metron-profiler
>- metron-rest
>- metron-solr
>
> I am planning on creating sub-tasks for each of these.  I know that
> metron-common should also be converted because it uses the Zookeeper in
> memory server but doesn't use ComponentRunner to manage it.  Are there
> other modules like this that you know of?
>
> On Wed, Dec 13, 2017 at 2:44 PM, Otto Fowler <ottobackwa...@gmail.com>
> wrote:
>
>> Same as the feature branch name?  I just want to find it and set a watch
>> on it ;)
>>
>>
>> On December 13, 2017 at 15:29:00, Ryan Merriman (merrim...@gmail.com)
>> wrote:
>>
>> I'm open to ideas. What do you think the title should be?
>>
>> On Wed, Dec 13, 2017 at 2:13 PM, Otto Fowler <ottobackwa...@gmail.com>
>> wrote:
>>
>> > What is the Master Jira going to be?
>> >
>> >
>> >
>> > On December 13, 2017 at 14:36:50, Ryan Merriman (merrim...@gmail.com)
>> > wrote:
>> >
>> > I am going to start the process of creating Jiras out of these initial
>> > requirements. I agree with them and think they are a good starting
>> point.
>> > Feel free to join in at anytime and add/change/remove requirements as
>> > needed. I will update the thread once I have the initial Jiras created
>> and
>> > we can go from there.
>> >
>> > On Mon, Dec 11, 2017 at 4:10 PM, Ryan Merriman <merrim...@gmail.com>
>> > wrote:
>> >
>> > > The purpose of this discussion is map out what is required to get the
>> > POC
>> > > started with https://github.com/apache/metron/pull/858 into master.
>> > >
>> > > The following features were added in the previously mentioned PR:
>> > >
>> > > - Dockerfile for Metron REST
>> > > - Dockerfile for Metron UIs
>> > > - Docker Compose application including Metron images, Elasticsearch,
>> > > Kafka, Zookeeper
>> > > - Modified travis file that manages the Docker environment and runs
>> > > the e2e tests as part of the build
>> > > - Maven pom.xml that installs all the required assets into the Docker
>> > > e2e module
>> > > - Modified metron-alerts pom.xml that allows e2e tests to be run
>> > > through Maven
>> > > - An example integration test that has been converted to use the new
>> > > infrastructure
>> > >
>> > > Here are the initial features proposed for acceptance into master:
>> > >
>> > > - All e2e and integration tests run on common infrastructure.
>> > > - All e2e and integration tests are run automatically in the Travis
>> > > build.
>> > > - All e2e and integration tests run repeatably and reliably in the
>> > > Travis build.
>> > > - Debugging options are available and documented.
>> > > - The new infra and how to interact with it is documented.
>> > > - Old infrastructure removed (anything unused or commented out is
>> > > deleted, instead of staying).
>> > >
>> > > Are there other requirements people want to add to this list?
>> > >
>> > >
>> > >
>> > >
>> >
>> >
>>
>>
>


Re: [DISCUSS] Integration/e2e test infrastructure requirements

2017-12-13 Thread Ryan Merriman
I'm open to ideas.  What do you think the title should be?

On Wed, Dec 13, 2017 at 2:13 PM, Otto Fowler <ottobackwa...@gmail.com>
wrote:

> What is the Master Jira going to be?
>
>
>
> On December 13, 2017 at 14:36:50, Ryan Merriman (merrim...@gmail.com)
> wrote:
>
> I am going to start the process of creating Jiras out of these initial
> requirements. I agree with them and think they are a good starting point.
> Feel free to join in at anytime and add/change/remove requirements as
> needed. I will update the thread once I have the initial Jiras created and
> we can go from there.
>
> On Mon, Dec 11, 2017 at 4:10 PM, Ryan Merriman <merrim...@gmail.com>
> wrote:
>
> > The purpose of this discussion is map out what is required to get the
> POC
> > started with https://github.com/apache/metron/pull/858 into master.
> >
> > The following features were added in the previously mentioned PR:
> >
> > - Dockerfile for Metron REST
> > - Dockerfile for Metron UIs
> > - Docker Compose application including Metron images, Elasticsearch,
> > Kafka, Zookeeper
> > - Modified travis file that manages the Docker environment and runs
> > the e2e tests as part of the build
> > - Maven pom.xml that installs all the required assets into the Docker
> > e2e module
> > - Modified metron-alerts pom.xml that allows e2e tests to be run
> > through Maven
> > - An example integration test that has been converted to use the new
> > infrastructure
> >
> > Here are the initial features proposed for acceptance into master:
> >
> > - All e2e and integration tests run on common infrastructure.
> > - All e2e and integration tests are run automatically in the Travis
> > build.
> > - All e2e and integration tests run repeatably and reliably in the
> > Travis build.
> > - Debugging options are available and documented.
> > - The new infra and how to interact with it is documented.
> > - Old infrastructure removed (anything unused or commented out is
> > deleted, instead of staying).
> >
> > Are there other requirements people want to add to this list?
> >
> >
> >
> >
>
>


Re: [DISCUSS] Integration/e2e test infrastructure requirements

2017-12-13 Thread Ryan Merriman
I am going to start the process of creating Jiras out of these initial
requirements.  I agree with them and think they are a good starting point.
Feel free to join in at any time and add/change/remove requirements as
needed.  I will update the thread once I have the initial Jiras created and
we can go from there.

On Mon, Dec 11, 2017 at 4:10 PM, Ryan Merriman <merrim...@gmail.com> wrote:

> The purpose of this discussion is map out what is required to get the POC
> started with https://github.com/apache/metron/pull/858 into master.
>
> The following features were added in the previously mentioned PR:
>
>- Dockerfile for Metron REST
>- Dockerfile for Metron UIs
>- Docker Compose application including Metron images, Elasticsearch,
>Kafka, Zookeeper
>- Modified travis file that manages the Docker environment and runs
>the e2e tests as part of the build
>- Maven pom.xml that installs all the required assets into the Docker
>e2e module
>- Modified metron-alerts pom.xml that allows e2e tests to be run
>through Maven
>- An example integration test that has been converted to use the new
>infrastructure
>
> Here are the initial features proposed for acceptance into master:
>
>- All e2e and integration tests run on common infrastructure.
>- All e2e and integration tests are run automatically in the Travis
>build.
>- All e2e and integration tests run repeatably and reliably in the
>Travis build.
>- Debugging options are available and documented.
>- The new infra and how to interact with it is documented.
>- Old infrastructure removed (anything unused or commented out is
>deleted, instead of staying).
>
> Are there other requirements people want to add to this list?
>
>
>
>


[DISCUSS] Integration/e2e test infrastructure requirements

2017-12-11 Thread Ryan Merriman
The purpose of this discussion is to map out what is required to get the POC
started with https://github.com/apache/metron/pull/858 into master.

The following features were added in the previously mentioned PR:

   - Dockerfile for Metron REST
   - Dockerfile for Metron UIs
   - Docker Compose application including Metron images, Elasticsearch,
   Kafka, Zookeeper
   - Modified travis file that manages the Docker environment and runs the
   e2e tests as part of the build
   - Maven pom.xml that installs all the required assets into the Docker
   e2e module
   - Modified metron-alerts pom.xml that allows e2e tests to be run through
   Maven
   - An example integration test that has been converted to use the new
   infrastructure

Here are the initial features proposed for acceptance into master:

   - All e2e and integration tests run on common infrastructure.
   - All e2e and integration tests are run automatically in the Travis
   build.
   - All e2e and integration tests run repeatably and reliably in the
   Travis build.
   - Debugging options are available and documented.
   - The new infra and how to interact with it is documented.
   - Old infrastructure removed (anything unused or commented out is
   deleted, instead of staying).

Are there other requirements people want to add to this list?
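To make the proposed acceptance criteria concrete, a Travis stage wiring these pieces together might look roughly like the sketch below. This is a hedged illustration only: the compose file location, service ports, and the `metron-interface/metron-alerts` Maven invocation are assumptions about the PR's layout, not its actual contents.

```shell
#!/usr/bin/env bash
# Hedged sketch of a Travis e2e stage: bring the Docker environment up,
# wait for the backend ports, run the e2e suite, always tear down.
# Ports and the Maven module path are assumptions, not PR 858's layout.

# Wait until host:port accepts TCP connections, or give up after N tries.
wait_for() {
  local host=$1 port=$2 tries=${3:-30}
  local i
  for i in $(seq "$tries"); do
    # bash's /dev/tcp pseudo-device; the subshell closes the fd for us
    if (exec 3<>"/dev/tcp/${host}/${port}") 2>/dev/null; then
      return 0
    fi
    sleep 1
  done
  return 1
}

# Only attempt orchestration where a compose file and binary exist.
if [ -f docker-compose.yml ] && command -v docker-compose >/dev/null 2>&1; then
  docker-compose up -d                     # ES, Kafka, ZK, REST, UIs
  wait_for localhost 9200 60 || exit 1     # Elasticsearch
  wait_for localhost 9092 60 || exit 1     # Kafka
  mvn -pl metron-interface/metron-alerts verify   # run the e2e tests
  rc=$?
  docker-compose down -v                   # tear down either way
  exit "$rc"
fi
```

The `wait_for` helper is the part worth standardizing: repeatable Travis runs (the third acceptance bullet) mostly come down to not racing the containers.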


Re: [DISCUSS] e2e test infrastructure

2017-11-29 Thread Ryan Merriman
We will also need an ES and Metron REST container for the e2e tests, but
yeah you get the idea.  I think having the tests be responsible for setup
will work.

Maybe the next step is to build a simple example and let everyone try it
out.  If we don't like it, we can throw it away.

On Wed, Nov 29, 2017 at 1:28 PM, Otto Fowler <ottobackwa...@gmail.com>
wrote:

> So we will just have a :
>
> ZK container
> Kafka Container
> HDFS Container
>
> and not deploy any metron stuff to them in the docker setup, the test
> itself will deploy what it needs and cleanup?
>
>
> On November 29, 2017 at 11:53:46, Ryan Merriman (merrim...@gmail.com)
> wrote:
>
> “I would feel better using docker if each docker container only had the
> base services, and did not require a separate but parallel deployment path
> to ambari”
>
> This is exactly how it works. There is a container for each base service,
> just like we now have an in-memory component for each base service. There
> is also no deployment path to Ambari. Ambari is not involved at all.
>
> From a client perspective (our e2e/integration tests in this case) there
> really is not much of a difference. At the end of the day services are up
> and running and available on various ports.
>
> Also there is going to be maintenance required no matter what approach we
> decide on. If we add another ES template that needs to be loaded by the
> MPack, our e2e/integration test infrastructure will also have to load that
> template. I have had to do this with our current integration tests.
>
> > On Nov 29, 2017, at 9:38 AM, Otto Fowler <ottobackwa...@gmail.com>
> wrote:
> >
> > So the issue with metron-docker is that it is all custom setup for
> metron components, and understanding how to maintain it when you make
> changes to the system is difficult for the developers.
> > This is a particular issue to me, because I would have to re-write a big
> chunk of it to accommodate 777.
> >
> > I would feel better using docker if each docker container only had the
> base services, and did not require a separate but parallel deployment path
> to ambari. That is to say if the docker components
> were functionally equivalent and limited to the in-memory components
> functionality and usage. I apologize if that is in fact what you are
> getting at.
> >
> > Then we could move the integrations and e2e to them.
> >
> >
> >
> >> On November 29, 2017 at 10:00:20, Ryan Merriman (merrim...@gmail.com)
> wrote:
> >>
> >> Thanks for the feedback so far everyone. All good points.
> >>
> >> Otto, if we did decide to go down the Docker route, we could
> >> use /master/metron-contrib/metron-docker as a starting point. The
> reason I
> >> initially create that module was to support Management UI testing
> because
> >> full dev was unusable for that purpose at that time. This is the same
> use
> >> case. A lot of the work has already been done but we would need to
> review
> >> it and bring it up to date with the current state of master. Once we
> get
> >> it to a point where we can manually spin up the Docker environment and
> get
> >> the e2e tests to pass, we would then need to add it into our Travis
> >> workflow.
> >>
> >> Mike, yes this is one of the options I listed at the start of the
> discuss
> >> thread although I'm not sure I agree with the Docker disadvantages you
> >> list. We could use a similar approach for HDFS in Docker by setting it
> to
> >> local FS and creating a shared volume that all the containers have
> access
> >> to. I've also found that Docker Compose makes the networking part much
> >> easier. What other advantages would in-memory components in separate
> >> process offer us that you can think of? Are there other disadvantages
> with
> >> using Docker?
> >>
> >> Justin, I think that's a really good point and I would be on board with
> >> it. I see this use case (e2e testing infrastructure) as a good way to
> >> evaluate our options without making major changes across our codebase.
> I
> >> would agree that standardizing on an approach would be ideal and
> something
> >> we should work towards. The debugging request is also something that
> would
> >> be extremely helpful. The only issue I see is debugging a Storm
> topology,
> >> this would still need to be run locally using LocalCluster because
> remote
> >> debugging does not work well in Storm (per previous comments from Storm
> >> committers). At one point I was able to get this to work with Docker
> &

Re: [DISCUSS] e2e test infrastructure

2017-11-29 Thread Ryan Merriman
“I would feel better using docker if each docker container only had the base 
services, and did not require a separate but parallel deployment path to ambari”

This is exactly how it works.  There is a container for each base service, just 
like we now have an in-memory component for each base service.  There is also 
no deployment path to Ambari.  Ambari is not involved at all.

From a client perspective (our e2e/integration tests in this case) there really 
is not much of a difference.  At the end of the day services are up and running 
and available on various ports.

Also there is going to be maintenance required no matter what approach we 
decide on.  If we add another ES template that needs to be loaded by the MPack, 
our e2e/integration test infrastructure will also have to load that template.  
I have had to do this with our current integration tests.

> On Nov 29, 2017, at 9:38 AM, Otto Fowler <ottobackwa...@gmail.com> wrote:
> 
> So the issue with metron-docker is that it is all custom setup for metron 
> components, and understanding how to maintain it when you make changes to the 
> system is difficult for the developers.
> This is a particular issue to me, because I would have to re-write a big 
> chunk of it to accommodate 777.
> 
> I would feel better using docker if each docker container only had the base 
> services, and did not require a separate but parallel deployment path to 
> ambari.  That is to say if the docker components
> were functionally equivalent and limited to the in-memory components 
> functionality and usage.  I apologize if that is in fact what you are getting 
> at.
> 
> Then we could move the integrations and e2e to them.
>  
> 
> 
>> On November 29, 2017 at 10:00:20, Ryan Merriman (merrim...@gmail.com) wrote:
>> 
>> Thanks for the feedback so far everyone. All good points. 
>> 
>> Otto, if we did decide to go down the Docker route, we could 
>> use /master/metron-contrib/metron-docker as a starting point. The reason I 
>> initially created that module was to support Management UI testing because 
>> full dev was unusable for that purpose at that time. This is the same use 
>> case. A lot of the work has already been done but we would need to review 
>> it and bring it up to date with the current state of master. Once we get 
>> it to a point where we can manually spin up the Docker environment and get 
>> the e2e tests to pass, we would then need to add it into our Travis 
>> workflow. 
>> 
>> Mike, yes this is one of the options I listed at the start of the discuss 
>> thread although I'm not sure I agree with the Docker disadvantages you 
>> list. We could use a similar approach for HDFS in Docker by setting it to 
>> local FS and creating a shared volume that all the containers have access 
>> to. I've also found that Docker Compose makes the networking part much 
>> easier. What other advantages would in-memory components in separate 
>> process offer us that you can think of? Are there other disadvantages with 
>> using Docker? 
>> 
>> Justin, I think that's a really good point and I would be on board with 
>> it. I see this use case (e2e testing infrastructure) as a good way to 
>> evaluate our options without making major changes across our codebase. I 
>> would agree that standardizing on an approach would be ideal and something 
>> we should work towards. The debugging request is also something that would 
>> be extremely helpful. The only issue I see is debugging a Storm topology, 
>> this would still need to be run locally using LocalCluster because remote 
>> debugging does not work well in Storm (per previous comments from Storm 
>> committers). At one point I was able to get this to work with Docker 
>> containers but we would definitely need to revisit it and create tooling 
>> around it. 
>> 
>> So in summary, I think we agree on these points so far: 
>> 
>> - no one seems to be in favor of mocking our backend so I'll take that 
>> option off the table 
>> - everyone seems to be in favor of moving to a strategy where we spin up 
>> backend services at the beginning of all tests and spin down at the end, 
>> rather than spinning up/down for each class or suite of tests 
>> - the ability to debug our code locally is important and something to 
>> keep in mind as we evaluate our options 
>> 
>> I think the next step is to decide whether we pursue in-memory/separate 
>> process vs Docker. Having used both, there are a couple disadvantages I 
>> see with the in-memory approach: 
>> 
>> - The in-memory components are different than real installations and 
>> come with separate issues. There hav

Re: [DISCUSS] e2e test infrastructure

2017-11-29 Thread Ryan Merriman
Mike, I think we are in agreement:  any solution involving in-memory components 
would have them running in separate processes vs. a single process like they do 
now.
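That process-per-component idea could be sketched roughly as follows — a minimal lifecycle manager that starts each backend component as a separate OS process and guarantees teardown. The `sleep` commands are placeholders; real launch commands for ZooKeeper, Kafka, etc. are not reproduced here.

```shell
#!/usr/bin/env bash
# Hedged sketch of "each component in its own process": every backend
# component gets its own OS process (no shared JVM, no classpath
# collisions), with teardown guaranteed via an EXIT trap.

PIDS=()

start_component() {
  "$@" &                       # launch the component in its own process
  PIDS+=("$!")
  echo "started '$*' as pid $!"
}

stop_all() {
  local pid
  for pid in "${PIDS[@]}"; do
    kill "$pid" 2>/dev/null || true
  done
  wait 2>/dev/null
}
trap stop_all EXIT             # teardown even if a test run aborts

# Placeholders standing in for the real services.
start_component sleep 60
start_component sleep 60
```

The point of the sketch is the isolation boundary: each entry in `PIDS` is an independent process that can be killed or restarted without touching the others.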

> On Nov 29, 2017, at 9:14 AM, Michael Miklavcic <michael.miklav...@gmail.com> 
> wrote:
> 
> I understood the item on "in-memory components" to be similar to what we're
> currently doing in the integration tests, which we cannot and should not
> do. They are spun up in a single jvm process which causes major problems
> with classpath isolation. My main point here is to be sure each component
> is separate from one another, and that they can be utilized for both the
> e2e and integration tests.
> 
>> On Nov 29, 2017 8:00 AM, "Ryan Merriman" <merrim...@gmail.com> wrote:
>> 
>> Thanks for the feedback so far everyone.  All good points.
>> 
>> Otto, if we did decide to go down the Docker route, we could
>> use /master/metron-contrib/metron-docker as a starting point.  The reason
>> I
>> initially created that module was to support Management UI testing because
>> full dev was unusable for that purpose at that time.  This is the same use
>> case.  A lot of the work has already been done but we would need to review
>> it and bring it up to date with the current state of master.  Once we get
>> it to a point where we can manually spin up the Docker environment and get
>> the e2e tests to pass, we would then need to add it into our Travis
>> workflow.
>> 
>> Mike, yes this is one of the options I listed at the start of the discuss
>> thread although I'm not sure I agree with the Docker disadvantages you
>> list.  We could use a similar approach for HDFS in Docker by setting it to
>> local FS and creating a shared volume that all the containers have access
>> to.  I've also found that Docker Compose makes the networking part much
>> easier.  What other advantages would in-memory components in separate
>> process offer us that you can think of?  Are there other disadvantages with
>> using Docker?
>> 
>> Justin, I think that's a really good point and I would be on board with
>> it.  I see this use case (e2e testing infrastructure) as a good way to
>> evaluate our options without making major changes across our codebase.  I
>> would agree that standardizing on an approach would be ideal and something
>> we should work towards.  The debugging request is also something that would
>> be extremely helpful.  The only issue I see is debugging a Storm topology,
>> this would still need to be run locally using LocalCluster because remote
>> debugging does not work well in Storm (per previous comments from Storm
>> committers).  At one point I was able to get this to work with Docker
>> containers but we would definitely need to revisit it and create tooling
>> around it.
>> 
>> So in summary, I think we agree on these points so far:
>> 
>>   - no one seems to be in favor of mocking our backend so I'll take that
>>   option off the table
>>   - everyone seems to be in favor of moving to a strategy where we spin up
>>   backend services at the beginning of all tests and spin down at the end,
>>   rather than spinning up/down for each class or suite of tests
>>   - the ability to debug our code locally is important and something to
>>   keep in mind as we evaluate our options
>> 
>> I think the next step is to decide whether we pursue in-memory/separate
>> process vs Docker.  Having used both, there are a couple disadvantages I
>> see with the in-memory approach:
>> 
>>   - The in-memory components are different than real installations and
>>   come with separate issues.  There have been cases where an in-memory
>>   component had a bug (looking at you Kafka) that a normal installation
>>   wouldn't have and required effort to put workarounds in place.
>>   - Spinning up the in-memory components in separate processes and
>>   managing their life cycles is not a trivial task.  In Otto's words, I
>>   believe this will inevitably become a "large chunk of custom development
>>   that has to be maintained".  Docker Compose exposes a declarative
>> interface
>>   that is much simpler in my opinion (check out
>>   https://github.com/apache/metron/blob/master/metron-
>> contrib/metron-docker/compose/docker-compose.yml
>>   as an example).  I also think our testing infrastructure will be more
>>   accessible to outside contributors because Docker is a common skill in
>> the
>>   industry.  Otherwise a contributor would have to come up to speed with
>> our
>>   custom in-memory process module before

Re: [DISCUSS] e2e test infrastructure

2017-11-29 Thread Ryan Merriman
 against
> them). I'm worried that's quick vs full dev all over again.  But without us
> being able to easily kill one because half of tests depend on one and half
> on the other.
>
> On Wed, Nov 29, 2017 at 1:22 AM, Michael Miklavcic <
> michael.miklav...@gmail.com> wrote:
>
> > What about just spinning up each of the components in their own process?
> > It's even lighter weight, doesn't have the complications for HDFS (you
> can
> > use the local FS easily, for example), and doesn't have any issues around
> > ports and port mapping with the containers.
> >
> > On Tue, Nov 28, 2017 at 3:48 PM, Otto Fowler <ottobackwa...@gmail.com>
> > wrote:
> >
> > > As long as there is not a large chunk of custom deployment that has to
> be
> > > maintained docker sounds ideal.
> > > I would like to understand what it would take to create the docker e2e
> > env.
> > >
> > >
> > >
> > > On November 28, 2017 at 17:27:13, Ryan Merriman (merrim...@gmail.com)
> > > wrote:
> > >
> > > Currently the e2e tests for our Alerts UI depend on full dev being up
> > and
> > > running. This is not a good long term solution because it forces a
> > > contributor/reviewer to run the tests manually with full dev running.
> It
> > > would be better if the backend services could be made available to the
> > e2e
> > > tests while running in Travis. This would allow us to add the e2e tests
> > to
> > > our automated build process.
> > >
> > > What is the right approach? Here are some options I can think of:
> > >
> > > - Use the in-memory components we use for the backend integration tests
> > > - Use a Docker approach
> > > - Use mock components designed for the e2e tests
> > >
> > > Mocking the backend would be my least favorite option because it would
> > > introduce a complex module of code that we have to maintain.
> > >
> > > The in-memory approach has some shortcomings but we may be able to
> solve
> > > some of those by moving components to their own process and spinning
> them
> > > up/down at the beginning/end of tests. Plus we are already using them.
> > >
> > > My preference would be Docker because it most closely mimics a real
> > > installation and gives you isolation, networking and dependency
> > management
> > > features OOTB. In many cases Dockerfiles are maintained and published
> by
> > a
> > > third party and require no work other than some setup like loading data
> > or
> > > templates/schemas. Elasticsearch is a good example.
> > >
> > > I believe we could make any of these approaches work in Travis. What
> does
> > > everyone think?
> > >
> > > Ryan
> > >
> >
>


Re: [GitHub] metron issue #852: METRON-1239 Drop extra dev environments

2017-11-29 Thread Ryan Merriman
I wrote the ReadMeUtils class a long time ago as a way to make documenting
the REST endpoints easier.  The Controller class methods are annotated so
that endpoint documentation is displayed in Swagger but it is also
duplicated in the README.   It seemed like a good idea at the time to
provide a utility to make this easier so that you only had to document in
one place.  It was actually helpful (to me anyways) when we first
introduced a large number of REST endpoints and saved some tedious
copy/pasting.

In hindsight, there was no way of enforcing that we use the utility along
with the `README.vm` template. People intuitively edited the README.md
instead, and the template quickly became stale.  Eventually I got tired of
keeping the template in sync so I stopped using it as well.  This class can
(and should) be safely removed.

On Wed, Nov 29, 2017 at 7:16 AM, justinleet  wrote:

> Github user justinleet commented on the issue:
>
> https://github.com/apache/metron/pull/852
>
> Glancing briefly, it looks like `ReadMeUtils` uses it as a template
> for the metron-rest README.md.  Just running the main in there overwrites
> the metron-rest README.md.  Which seems very odd, given that `ReadMeUtils`
> is in the test package.
>
> There seems to be no documentation of this class, or its purpose, and
> I didn't dig enough into the code to figure it out. Even not knowing the
> details and assuming I'm not misreading what's happening, I don't like that
> there's an expectation of editing a `README.vm` file, then running a
> program to produce the final output `README.md`.  `README.md` can vary
> independently of `README.vm`.  And it already has.
>
> It's outside the scope of this ticket, but at minimum, that class
> needs to be moved out of test, it needs to be actually documented what the
> purpose of it is, the steps to use it, etc. Right now, though, unless
> someone comes up with a compelling reason not to, I'm in favor of killing
> it entirely. I don't ever see that being properly managed, even if it does
> have some utility built in.
>
>
> ---
>


[DISCUSS] e2e test infrastructure

2017-11-28 Thread Ryan Merriman
Currently the e2e tests for our Alerts UI depend on full dev being up and
running.  This is not a good long term solution because it forces a
contributor/reviewer to run the tests manually with full dev running.  It
would be better if the backend services could be made available to the e2e
tests while running in Travis.  This would allow us to add the e2e tests to
our automated build process.

What is the right approach?  Here are some options I can think of:

   - Use the in-memory components we use for the backend integration tests
   - Use a Docker approach
   - Use mock components designed for the e2e tests

Mocking the backend would be my least favorite option because it would
introduce a complex module of code that we have to maintain.

The in-memory approach has some shortcomings but we may be able to solve
some of those by moving components to their own process and spinning them
up/down at the beginning/end of tests.  Plus we are already using them.

My preference would be Docker because it most closely mimics a real
installation and gives you isolation, networking and dependency management
features OOTB.  In many cases Dockerfiles are maintained and published by a
third party and require no work other than some setup like loading data or
templates/schemas.  Elasticsearch is a good example.

I believe we could make any of these approaches work in Travis.  What does
everyone think?

Ryan
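As a concrete illustration of the Docker option, a minimal compose file for this kind of backend might look like the following. The image names and versions are illustrative guesses, not anything the project actually pinned:

```shell
# Hedged illustration of the Docker Compose option discussed above.
# Image names/versions are placeholders, not Metron's real choices.
cat > docker-compose.yml <<'EOF'
version: '2'
services:
  zookeeper:
    image: zookeeper:3.4
    ports: ["2181:2181"]
  kafka:
    image: wurstmeister/kafka:0.10.2.1
    depends_on: [zookeeper]
    ports: ["9092:9092"]
    environment:
      KAFKA_ZOOKEEPER_CONNECT: zookeeper:2181
  elasticsearch:
    image: elasticsearch:2.3
    ports: ["9200:9200"]
EOF
# `docker-compose up -d` would then start the whole backend, with the
# isolation, networking, and dependency ordering handled OOTB.
echo "docker-compose.yml written"   # prints "docker-compose.yml written"
```

This declarative shape is the contrast with a hand-rolled in-memory process manager: adding a service is a few lines of config rather than new lifecycle code.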


Re: [DISCUSS] Upcoming Release

2017-11-17 Thread Ryan Merriman
Makes sense now.  Thanks Matt.

> On Nov 17, 2017, at 4:25 PM, Matt Foley <mfo...@hortonworks.com> wrote:
> 
> Hi Ryan,
> Yes and no.  The last release (see 
> https://dist.apache.org/repos/dist/release/metron/ ) was 0.4.1, announced on 
> 9/19.
> Immediately after that we bumped the version of builds from master branch, 
> per https://issues.apache.org/jira/browse/METRON-1196 .  This is consistent 
> with the Release Process “Clean up” phase: “It is good practice to increment 
> the build version in master immediately after a Feature Release, so that dev 
> builds with new stuff from master cannot be mistaken for builds of the 
> release version. So, immediately after a release, increment the MINOR version 
> number (eg, with the 0.4.0 just released, set the new version number to 
> 0.4.1)” 
> (https://cwiki.apache.org/confluence/display/METRON/Release+Process#ReleaseProcess-Step15-Cleanup
>  )
> 
> So you’re correct that the version of development builds is currently set to 
> 0.4.2.  After we actually make a release of 0.4.2, we’ll change dev build 
> version to 0.4.3 (regardless of whether we expect the following release to 
> be 0.4.3 or 0.5.0).
> 
> Hope this clarifies,
> --Matt
> 
> On 11/17/17, 1:59 PM, "Ryan Merriman" <merrim...@gmail.com> wrote:
> 
>Matt,
> 
>I think we are currently on version 0.4.2.  If that is the case would the
>next version be 0.4.3?
> 
>Ryan
> 
>>On Fri, Nov 17, 2017 at 3:31 PM, Matt Foley <ma...@apache.org> wrote:
>> 
>> (With release manager hat on)
>> 
>> The community has proposed a release of Metron in the near future,
>> focusing on Meta-alerts running in Elasticsearch.
>> Congrats on getting so many of the below already done.  At this point,
>> only METRON-1252, and the discussion of how to handle joint release of the
>> Metron bro plugin, remain as gating items for the release.  I project these
>> will be resolved next week, so let’s propose the following:
>> 
>> Sometime next week, after the last bits are done, I’ll start the release
>> process and create the release branch.
>> 
>> The proposed new version will be 0.4.2, unless there are backward
>> incompatible changes that support making it 0.5.0.
>> Currently there are NO included Jiras labeled ‘backward-incompatible’, nor
>> having Docs Text indicating so.
>> ***If anyone knows that some of the commits included since 0.4.1 introduce
>> backward incompatibility, please say so now on this thread, and mark the
>> Jira as such.***
>> 
>> The 90 or so jiras/commits already in master branch since 0.4.1 are listed
>> below.
>> Thanks,
>> --Matt
>> 
>>METRON-1301 Alerts UI - Sorting on Triage Score Unexpectedly Filters
>> Some Records (nickwallen) closes apache/metron#832
>>METRON-1294 IP addresses are not formatted correctly in facet and
>> group results (merrimanr) closes apache/metron#827
>>METRON-1291 Kafka produce REST endpoint does not work in a Kerberized
>> cluster (merrimanr) closes apache/metron#826
>>METRON-1290 Only first 10 alerts are update when a MetaAlert status is
>> changed to inactive (justinleet) closes apache/metron#842
>>METRON-1311 Service Check Should Check Elasticsearch Index Templates
>> (nickwallen) closes apache/metron#839
>>METRON-1289 Alert fields are lost when a MetaAlert is created
>> (merrimanr) closes apache/metron#824
>>METRON-1309 Change metron-deployment to pull the plugin from
>> apache/metron-bro-plugin-kafka (JonZeolla) closes apache/metron#837
>>METRON-1310 Template Delete Action Deletes Search Indices (nickwallen)
>> closes apache/metron#838
>>METRON-1275: Fix Metron Documentation closes
>> apache/incubator-metron#833
>>METRON-1295 Unable to Configure Logging for REST API (nickwallen)
>> closes apache/metron#828
>>METRON-1307 Force install of java8 since java9 does not appear to work
>> with the scripts (brianhurley via ottobackwards) closes apache/metron#835
>>METRON-1296 Full Dev Fails to Deploy Index Templates (nickwallen via
>> cestella) closes apache/incubator-metron#829
>>METRON-1281 Remove hard-coded indices from the Alerts UI (merrimanr)
>> closes apache/metron#821
>>METRON-1287 Full Dev Fails When Installing EPEL Repository
>> (nickwallen) closes apache/metron#820
>>METRON-1267 Alerts UI returns a 404 when refreshing the alerts-list
>> page (iraghumitra via merrimanr) closes apache/metron#819
>>METRON-1283 Install Elasticsearch template as a part of the mpack
>> startup scripts (anandsubbu via nickwallen) closes apache

Re: [DISCUSS] Upcoming Release

2017-11-17 Thread Ryan Merriman
Matt,

I think we are currently on version 0.4.2.  If that is the case would the
next version be 0.4.3?

Ryan

On Fri, Nov 17, 2017 at 3:31 PM, Matt Foley  wrote:

> (With release manager hat on)
>
> The community has proposed a release of Metron in the near future,
> focusing on Meta-alerts running in Elasticsearch.
> Congrats on getting so many of the below already done.  At this point,
> only METRON-1252, and the discussion of how to handle joint release of the
> Metron bro plugin, remain as gating items for the release.  I project these
> will be resolved next week, so let’s propose the following:
>
> Sometime next week, after the last bits are done, I’ll start the release
> process and create the release branch.
>
> The proposed new version will be 0.4.2, unless there are backward
> incompatible changes that support making it 0.5.0.
> Currently there are NO included Jiras labeled ‘backward-incompatible’, nor
> having Docs Text indicating so.
> ***If anyone knows that some of the commits included since 0.4.1 introduce
> backward incompatibility, please say so now on this thread, and mark the
> Jira as such.***
>
> The 90 or so jiras/commits already in master branch since 0.4.1 are listed
> below.
> Thanks,
> --Matt
>
> METRON-1301 Alerts UI - Sorting on Triage Score Unexpectedly Filters
> Some Records (nickwallen) closes apache/metron#832
> METRON-1294 IP addresses are not formatted correctly in facet and
> group results (merrimanr) closes apache/metron#827
> METRON-1291 Kafka produce REST endpoint does not work in a Kerberized
> cluster (merrimanr) closes apache/metron#826
> METRON-1290 Only first 10 alerts are update when a MetaAlert status is
> changed to inactive (justinleet) closes apache/metron#842
> METRON-1311 Service Check Should Check Elasticsearch Index Templates
> (nickwallen) closes apache/metron#839
> METRON-1289 Alert fields are lost when a MetaAlert is created
> (merrimanr) closes apache/metron#824
> METRON-1309 Change metron-deployment to pull the plugin from
> apache/metron-bro-plugin-kafka (JonZeolla) closes apache/metron#837
> METRON-1310 Template Delete Action Deletes Search Indices (nickwallen)
> closes apache/metron#838
> METRON-1275: Fix Metron Documentation closes
> apache/incubator-metron#833
> METRON-1295 Unable to Configure Logging for REST API (nickwallen)
> closes apache/metron#828
> METRON-1307 Force install of java8 since java9 does not appear to work
> with the scripts (brianhurley via ottobackwards) closes apache/metron#835
> METRON-1296 Full Dev Fails to Deploy Index Templates (nickwallen via
> cestella) closes apache/incubator-metron#829
> METRON-1281 Remove hard-coded indices from the Alerts UI (merrimanr)
> closes apache/metron#821
> METRON-1287 Full Dev Fails When Installing EPEL Repository
> (nickwallen) closes apache/metron#820
> METRON-1267 Alerts UI returns a 404 when refreshing the alerts-list
> page (iraghumitra via merrimanr) closes apache/metron#819
> METRON-1283 Install Elasticsearch template as a part of the mpack
> startup scripts (anandsubbu via nickwallen) closes apache/metron#817
> METRON-1254: Conditionals as map keys do not function in Stellar
> closes apache/incubator-metron#801
> METRON-1261 Apply bro security patch (JonZeolla via ottobackwards)
> closes apache/metron#805
> METRON-1284 Remove extraneous dead query in ElasticsearchDao
> (justinleet) closes apache/metron#818
> METRON-1270: fix for warnings missing @return tag argument in
> metron-analytics/metron-profiler-common and metron-profiler-client closes
> apache/incubator-metron#810
> METRON-1272 Hide child alerts from searches and grouping if they
> belong to meta alerts (justinleet) closes apache/metron#811
> METRON-1224 Add time range selection to search control (iraghumitra
> via james-sirota) closes apache/metron#796
> METRON-1280 0.4.1 -> 0.4.2 missed a couple of projects (cestella via
> justinleet) closes apache/metron#816
> METRON-1243: Add a REST endpoint which allows us to get a list of all
> indice closes apache/incubator-metron#797
> METRON-1196 Increment master version number to 0.4.2 for on-going
> development (mattf-horton) closes apache/metron#767
> METRON-1278 Strip Build Status widget from root README.md
> in site-book build (mattf-horton) closes apache/metron#815
> METRON-1274 Master has failure in StormControllerIntegrationTest
> (merrimanr) closes apache/metron#813
> METRON-1266 Profiler - SASL Authentication Failed (nickwallen) closes
> apache/metron#809
> METRON-1260 Include Alerts UI in Ambari Service Check (nickwallen)
> closes apache/metron#804
> METRON-1251: Typo and formatting fixes for metron-rest README closes
> apache/incubator-metron#800
> METRON-1241: Enable the REST API to use a cache for the zookeeper
> config similar to the Bolts closes apache/incubator-metron#795
> METRON-1267 Alerts UI returns a 404 when 

Re: [GitHub] metron issue #823: METRON-1286 Add MIN & MAX Stellar functions

2017-11-01 Thread Ryan Merriman
Which 2 files are in /home/travis/build/apache/metron/metron-stellar/stellar-common/target/rat.txt?
Do you get this same
error when you run mvn apache-rat:check locally?  You will need to either
add licenses to those 2 files or add exclusions in /metron/pom.xml if they
should in fact be excluded (test data for example).

Ryan

On Wed, Nov 1, 2017 at 4:56 PM, jasper-k  wrote:

> Github user jasper-k commented on the issue:
>
> https://github.com/apache/metron/pull/823
>
> The Travis build failed with an error relating to licensing and Rat. I
> don't know how to tackle this problem. The error points to the new files I
> added in this PR. Locally I could suppress the error with the extra MVN
> switch "-Drat.numUnapprovedLicenses=100"
>
> Error:
> Too many files with unapproved license: 2 See RAT report in:
> /home/travis/build/apache/metron/metron-stellar/stellar-
> common/target/rat.txt
>
> Other then this the build and the tests are fine.
> How can I solve this?
>
>
> ---
>
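The two remedies described above (add license headers, or add exclusions) might be sketched as follows. This is a hedged sketch: the module path is taken from the error message, but the exclusion pattern is a made-up placeholder for whichever files RAT actually flagged.

```shell
# Hedged sketch of the two remedies Ryan describes above.

# 1. Reproduce locally and see exactly which files RAT flagged:
#      mvn apache-rat:check -pl metron-stellar/stellar-common
#      less metron-stellar/stellar-common/target/rat.txt

# 2. Either add an Apache license header to each flagged file, or -- if
#    they are test data that legitimately cannot carry a header -- add
#    exclusions to the apache-rat-plugin block of the top-level pom.xml:
cat > rat-excludes-snippet.xml <<'EOF'
<plugin>
  <groupId>org.apache.rat</groupId>
  <artifactId>apache-rat-plugin</artifactId>
  <configuration>
    <excludes>
      <!-- placeholder pattern: replace with the files RAT reported -->
      <exclude>**/src/test/resources/**.json</exclude>
    </excludes>
  </configuration>
</plugin>
EOF
echo "snippet written to rat-excludes-snippet.xml"
```

Suppressing with `-Drat.numUnapprovedLicenses=100` only hides the problem locally; Travis will still fail, so one of the two fixes above is needed.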


Re: [DISCUSS] Build broken due to transitive dependencies

2017-10-19 Thread Ryan Merriman
Haha you're too kind Laurens.  Glad that was it.

On Thu, Oct 19, 2017 at 8:30 AM, Nick Allen <n...@nickallen.org> wrote:

> Thanks for testing the suggested fix, Laurens.  I created METRON-1265 so we
> can get this dependency doc'd.
>
> https://issues.apache.org/jira/browse/METRON-1265
>
> On Wed, Oct 18, 2017 at 11:55 AM, Laurens Vets <laur...@daemon.be> wrote:
>
> > I was hesitant to believe Ryan that this was a compiler issue, but I
> > upgraded my compiler on CentOS 6 to 4.9.2 and the build worked on the
> first
> > try... Lesson learned: Never question Ryan again!
> >
> > How to upgrade compiler on CentOS 6:
> >
> > $ sudo yum install centos-release-scl
> > $ sudo yum install devtoolset-3-toolchain
> > $ scl enable devtoolset-3 bash
> > $ 
> >
> > On 2017-10-13 11:12, Ryan Merriman wrote:
> >
> >> We recently ran into this and the cause was an old C++ compiler version.
> >> It wants a compiler that has support for C++11:
> >> https://gcc.gnu.org/projects/cxx-status.html#cxx11.
> >>
> >> On Fri, Oct 13, 2017 at 1:00 PM, Laurens Vets <laur...@daemon.be>
> wrote:
> >>
> >> ...
> >>> [INFO] --- frontend-maven-plugin:1.3:npm (ng build) @ metron-config ---
> >>> [DEBUG] Configuring mojo com.github.eirslett:frontend-m
> >>> aven-plugin:1.3:npm
> >>> from plugin realm ClassRealm[plugin>com.github.e
> >>> irslett:frontend-maven-plugin:1.3, parent:
> >>> sun.misc.Launcher$AppClassLoad
> >>> er@70dea4e]
> >>> [DEBUG] Configuring mojo 'com.github.eirslett:frontend-
> >>> maven-plugin:1.3:npm'
> >>> with basic configurator -->
> >>> [DEBUG]   (f) arguments = run build
> >>> [DEBUG]   (f) npmInheritsProxyConfigFromMaven = false
> >>> [DEBUG]   (f) project = MavenProject: org.apache.metron:metron-confi
> >>> g:0.4.1
> >>> @ /root/metron/metron-interface/metron-config/pom.xml
> >>> [DEBUG]   (f) repositorySystemSession = org.eclipse.aether.DefaultRepo
> >>> sitorySystemSession@e883a51
> >>> [DEBUG]   (f) session = org.apache.maven.execution.
> MavenSession@2aaefbd
> >>> [DEBUG]   (f) skip = false
> >>> [DEBUG]   (f) skipTests = true
> >>> [DEBUG]   (f) workingDirectory = /root/metron/metron-interface/
> >>> metron-config
> >>> [DEBUG]   (f) execution = com.github.eirslett:frontend-m
> >>> aven-plugin:1.3:npm
> >>> {execution: ng build}
> >>> [DEBUG] -- end configuration --
> >>> [INFO] npm not inheriting proxy config from Maven
> >>> [INFO] Running 'npm run build' in /root/metron/metron-interface/
> >>> metron-config
> >>> [INFO]
> >>> [INFO] > metron-management-ui@0.4.1 build
> /root/metron/metron-interface/
> >>> metron-config
> >>> [INFO] > ./node_modules/angular-cli/bin/ng build -prod
> >>> [INFO]
> >>> [INFO] Cannot find module 'tough-cookie'
> >>> [INFO] Error: Cannot find module 'tough-cookie'
> >>> [INFO] at Function.Module._resolveFilename (module.js:440:15)
> >>> [INFO] at Function.Module._load (module.js:388:25)
> >>> [INFO] at Module.require (module.js:468:17)
> >>> [INFO] at require (internal/module.js:20:19)
> >>> [INFO] at Object.<anonymous> (/root/metron/metron-interface
> >>> /metron-config/node_modules/request/lib/cookies.js:3:13)
> >>> [INFO] at Module._compile (module.js:541:32)
> >>> [INFO] at Object.Module._extensions..js (module.js:550:10)
> >>> [INFO] at Module.load (module.js:458:32)
> >>> [INFO] at tryModuleLoad (module.js:417:12)
> >>> [INFO] at Function.Module._load (module.js:409:3)
> >>> [INFO] at Module.require (module.js:468:17)
> >>> [INFO] at require (internal/module.js:20:19)
> >>> [INFO] at Object.<anonymous> (/root/metron/metron-interface
> >>> /metron-config/node_modules/request/index.js:18:15)
> >>> [INFO] at Module._compile (module.js:541:32)
> >>> [INFO] at Object.Module._extensions..js (module.js:550:10)
> >>> [INFO] at Module.load (module.js:458:32)
> >>> [INFO] at tryModuleLoad (module.js:417:12)
> >>> [INFO] at Function.Module._load (module.js:409:3)
> >>> [INFO] at Module.require (module.js:468:17)
> >>> [INFO] at require (internal/module.js:20:19)
> >>> [INFO] at Leek._enqueue (/root/metron/metron-interface

Re: [DISCUSS] Build broken due to transitive dependencies

2017-10-13 Thread Ryan Merriman
We recently ran into this and the cause was an old C++ compiler version.
It wants a compiler that has support for C++11:
https://gcc.gnu.org/projects/cxx-status.html#cxx11.
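A quick way to check whether a box needs the devtoolset upgrade is to compare the compiler version against the first GCC release with full C++11 support (4.8.1, per the GCC status page linked above). The helper below is a hypothetical sketch using only `printf`/`sort -V`:

```shell
# Hypothetical helper: true if version $1 >= version $2.
# GCC gained full C++11 support in 4.8.1, so anything older
# needs an upgrade (e.g. the devtoolset approach shown in this thread).
version_ge() {
  [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n1)" = "$2" ]
}

# Fall back to "0" if g++ is not installed at all.
if version_ge "$(g++ -dumpversion 2>/dev/null || echo 0)" "4.8.1"; then
  echo "C++11 OK"
else
  echo "compiler too old for C++11"
fi
```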

On Fri, Oct 13, 2017 at 1:00 PM, Laurens Vets  wrote:

> ...
> [INFO] --- frontend-maven-plugin:1.3:npm (ng build) @ metron-config ---
> [DEBUG] Configuring mojo com.github.eirslett:frontend-maven-plugin:1.3:npm
> from plugin realm ClassRealm[plugin>com.github.e
> irslett:frontend-maven-plugin:1.3, parent: sun.misc.Launcher$AppClassLoad
> er@70dea4e]
> [DEBUG] Configuring mojo 'com.github.eirslett:frontend-maven-plugin:1.3:npm'
> with basic configurator -->
> [DEBUG]   (f) arguments = run build
> [DEBUG]   (f) npmInheritsProxyConfigFromMaven = false
> [DEBUG]   (f) project = MavenProject: org.apache.metron:metron-config:0.4.1
> @ /root/metron/metron-interface/metron-config/pom.xml
> [DEBUG]   (f) repositorySystemSession = org.eclipse.aether.DefaultRepo
> sitorySystemSession@e883a51
> [DEBUG]   (f) session = org.apache.maven.execution.MavenSession@2aaefbd
> [DEBUG]   (f) skip = false
> [DEBUG]   (f) skipTests = true
> [DEBUG]   (f) workingDirectory = /root/metron/metron-interface/
> metron-config
> [DEBUG]   (f) execution = com.github.eirslett:frontend-maven-plugin:1.3:npm
> {execution: ng build}
> [DEBUG] -- end configuration --
> [INFO] npm not inheriting proxy config from Maven
> [INFO] Running 'npm run build' in /root/metron/metron-interface/
> metron-config
> [INFO]
> [INFO] > metron-management-ui@0.4.1 build /root/metron/metron-interface/
> metron-config
> [INFO] > ./node_modules/angular-cli/bin/ng build -prod
> [INFO]
> [INFO] Cannot find module 'tough-cookie'
> [INFO] Error: Cannot find module 'tough-cookie'
> [INFO] at Function.Module._resolveFilename (module.js:440:15)
> [INFO] at Function.Module._load (module.js:388:25)
> [INFO] at Module.require (module.js:468:17)
> [INFO] at require (internal/module.js:20:19)
> [INFO] at Object.<anonymous> (/root/metron/metron-interface
> /metron-config/node_modules/request/lib/cookies.js:3:13)
> [INFO] at Module._compile (module.js:541:32)
> [INFO] at Object.Module._extensions..js (module.js:550:10)
> [INFO] at Module.load (module.js:458:32)
> [INFO] at tryModuleLoad (module.js:417:12)
> [INFO] at Function.Module._load (module.js:409:3)
> [INFO] at Module.require (module.js:468:17)
> [INFO] at require (internal/module.js:20:19)
> [INFO] at Object.<anonymous> (/root/metron/metron-interface
> /metron-config/node_modules/request/index.js:18:15)
> [INFO] at Module._compile (module.js:541:32)
> [INFO] at Object.Module._extensions..js (module.js:550:10)
> [INFO] at Module.load (module.js:458:32)
> [INFO] at tryModuleLoad (module.js:417:12)
> [INFO] at Function.Module._load (module.js:409:3)
> [INFO] at Module.require (module.js:468:17)
> [INFO] at require (internal/module.js:20:19)
> [INFO] at Leek._enqueue (/root/metron/metron-interface
> /metron-config/node_modules/leek/lib/leek.js:60:30)
> [INFO] at Leek.track (/root/metron/metron-interface
> /metron-config/node_modules/leek/lib/leek.js:87:15)
> [INFO] at Class.Command.validateAndRun (/root/metron/metron-interface
> /metron-config/node_modules/angular-cli/lib/models/command.js:119:18)
> [INFO] at /root/metron/metron-interface/metron-config/node_modules/ang
> ular-cli/lib/cli/cli.js:86:22
> [INFO] at tryCatch (/root/metron/metron-interface
> /metron-config/node_modules/rsvp/dist/lib/rsvp/-internal.js:198:12)
> [INFO] at invokeCallback (/root/metron/metron-interface
> /metron-config/node_modules/rsvp/dist/lib/rsvp/-internal.js:211:13)
> [INFO] at /root/metron/metron-interface/metron-config/node_modules/rsv
> p/dist/lib/rsvp/then.js:26:14
> [INFO] at flush (/root/metron/metron-interface
> /metron-config/node_modules/rsvp/dist/lib/rsvp/asap.js:80:5)
> [INFO] at _combinedTickCallback (internal/process/next_tick.js:67:7)
> [INFO] at process._tickCallback (internal/process/next_tick.js:98:9)
> [ERROR]
> [ERROR] npm ERR! Linux 2.6.32-696.13.2.el6.x86_64
> [ERROR] npm ERR! argv "/root/metron/metron-interface/metron-config/node/node"
> "/root/metron/metron-interface/metron-config/node/node_modules/npm/bin/npm-cli.js"
> "run" "build"
> [ERROR] npm ERR! node v6.2.0
> [ERROR] npm ERR! npm  v3.8.9
> [ERROR] npm ERR! code ELIFECYCLE
> [ERROR] npm ERR! metron-management-ui@0.4.1 build:
> `./node_modules/angular-cli/bin/ng build -prod`
> [ERROR] npm ERR! Exit status 1
> [ERROR] npm ERR!
> [ERROR] npm ERR! Failed at the metron-management-ui@0.4.1 build script
> './node_modules/angular-cli/bin/ng build -prod'.
> [ERROR] npm ERR! Make sure you have the latest version of node.js and npm
> installed.
> [ERROR] npm ERR! If you do, this is most likely a problem with the
> metron-management-ui package,
> [ERROR] npm ERR! not with npm itself.
> [ERROR] npm ERR! Tell the author that this fails on your system:
> [ERROR] npm ERR! 

[DISCUSS] How should Management UI save changes?

2017-09-20 Thread Ryan Merriman
Recently @nickwallen brought up some good points about the usability of the
Management UI here:
https://github.com/apache/metron/pull/737#issuecomment-330632113.  The
issues he brings up apply to all child panels so I think it makes sense to
agree on a common approach and apply it to all of them.

Most child panels have a save button that saves changes to the local
(browser) copy of the config.  The save button on the primary panel
persists the changes to zookeeper and closes all panels.  Should we change
the buttons or button text?  What should the different buttons do?  One
idea could be to just skip saving to a local copy, meaning hitting the save
button persists changes in that panel to zookeeper.  Another idea could be
to get rid of the save buttons on child panels and changes to the form
would immediately update the local copy.  In this case we would likely need
an indicator that there are changes to be saved (or should we have that no
matter what?).  Other ideas?

There is also the issue of being able to discard changes and go back to
what they were before.  Right now you can close a child or primary panel, but
doing so discards all changes in that panel (and all changes, period, in the
case of the primary panel).  One option could be to expose a revert link or
button for each form input (probably a lot of work).  Other ideas?

Ryan


Re: [DISUCUSS] [CALL FOR COMMENT] Metron parsers as actual extensions

2017-09-20 Thread Ryan Merriman
I will attempt to clarify parsers vs sensors.  Parsers refer to concrete
parser classes and sensors refer to configuration + one of the parser
classes (with parser class being defined in the configuration).  The
architecture was designed so that a parser class can be made dynamic and
behave differently depending on configuration.  This allows multiple use
cases to be handled by a single parser class with different
configurations.  A good example of this (and the primary reason this
pattern was chosen) is the GrokParser.  This parser can handle multiple
sensors even though it is the same parser class in each instance.
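As a concrete illustration of that pattern, a sensor is essentially a config blob that names the parser class plus sensor-specific settings, so two sensors can reuse the same class with different grok settings. The field names below follow the Metron sensor parser config format and the squid example, but treat the exact values as illustrative:

```json
{
  "parserClassName": "org.apache.metron.parsers.GrokParser",
  "sensorTopic": "squid",
  "parserConfig": {
    "grokPath": "/patterns/squid",
    "patternLabel": "SQUID_DELIMITED"
  }
}
```

A second sensor would keep the same `parserClassName` and change only `sensorTopic`, `grokPath`, and `patternLabel`.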

What are some reasons we would want to use the bundle system to deploy a
configuration only sensor?  From what I understand creating a bundle is not
a simple process and requires manually editing configuration files (correct
me if I am wrong).  The whole reason we created the management UI was to
allow people to create sensors without having to go through this difficult,
error-prone process that initially made Metron hard to use.  You can create
sensors right now with the management UI as long as the necessary parser
class is available.  You mention test coverage but I would argue the unit
and integration tests for that parser class should cover the edge cases of
configuration.  What happens if I change the configuration?  Do I have to
also go update the unit/integration tests for that parser?  Do I have to
rebuild and reinstall/update the parser?

This bundle system is absolutely necessary for parser classes but for me it
feels a little too tightly coupled to configuration.  Not saying
configuration shouldn't be involved at all, there is definitely value in
providing a default/bootstrap configuration for a parser class for someone
to start with.  Creating a parser class or a bundle is a complex task
suited for an engineer (with the help of the community).  A SOC analyst
should only have to understand the various configuration options to create
and maintain a sensor and be able to make these changes through a user
interface.

This feature branch moves pretty fast so it's entirely possible my
knowledge is incorrect or outdated.  Don't hesitate to correct anything I'm
missing or have incorrect.  My main concern is that adding this feature
should make Metron easier to use and not harder.  Is it possible these 2
options could coexist?

Ryan



On Wed, Sep 20, 2017 at 9:21 AM, Otto Fowler 
wrote:

> Simon, I’m sorry, I may not have answered your question.
> I use parser and sensors as the same thing, but from what you say I think I
> mean parser.
>
>
> On September 20, 2017 at 10:08:25, Otto Fowler (ottobackwa...@gmail.com)
> wrote:
>
> So,
> The distinction between ‘sensor’ and ‘parser’ is not clear to me either, if
> it is defined somewhere and I have missed it, please point me in the right
> direction.
> While I don’t want to fork the discussion, the question is where you find
> it so to speak, so about bundles and configurations.
>
> With the extension system you have two options for ‘config’ only
> parser/sensor installs.
>
> 1.  The previous mechanism of creating the configurations and manually
> pushing them zookeeper and possibly patterns still works, although they
> will not be managed as extensions.  This I believe is a feature gap that
> already exists and I did not address it as part of this effort.
> 2. Create a new parser from the archetype, but remove all the src/main code
> and make it configuration only.  This I think should be the recommended way
> to create configuration only parsers, since such parsers will still have
> the unit and integration tests.  Flows where you start with the archetype
> and refine through tests or manually deploy, refine and then build from the
> archetype can be imagined.
>
>
> The main question for this thread however, is should metron parsers and
>  3rd party parsers be treated the same -> they are all extensions,
> manageable through the extensions ui.
>
> I can demo for you whenever you are free if you don’t want to wait for the
> community demo.
>
>
> On September 20, 2017 at 09:52:56, Simon Elliston Ball (
> si...@simonellistonball.com) wrote:
>
> Otto,
>
> Can you just clarify what you mean by parsers in this instance. To my mind
> parsers in metron are be classes, and do not have any configuration
> settings. Instances of parsers are referred to in the ui as sensors, and
> are essentially concrete instances of parsers and as such do have config.
>
> The parser vs sensor distinction feels like a valuable one to make here.
> I'd really like a clearer understanding of the role of bundles in config,
> and how we maintain the ability to control config without the need for
> files in the bundle.
>
> Maybe some samples / a demo would really clear this up, but to be honest
> right now I'm a little confused about how this would work for an average
> SOC team who do not have maven available.
>
> Thanks,
> Simon
>
> Sent from my 

Re: [DISCUSS] Metron release 0.4.1

2017-09-08 Thread Ryan Merriman
Matt,

This was committed a few hours ago. I think you saw it but just wanted to make sure.

Ryan

> On Sep 7, 2017, at 11:26 AM, Matt Foley <ma...@apache.org> wrote:
> 
> Okay.  Please ping when committed.
> Also, any input on https://issues.apache.org/jira/browse/METRON-1163 ?
> 
> On 9/7/17, 7:39 AM, "Ryan Merriman" <merrim...@gmail.com> wrote:
> 
>Matt,
> 
>We recently found a bug that's breaking certain features in the management
>UI.  The fixes are still in review (
>https://github.com/apache/metron/pull/729 and
>https://github.com/apache/metron/pull/730) and should make it in soon.  It
>would be good to include these if possible.
> 
>Ryan
> 
>>On Thu, Sep 7, 2017 at 12:23 AM, Matt Foley <ma...@apache.org> wrote:
>> 
>> I’ve got a blocker bug, https://issues.apache.org/jira/browse/METRON-1163
>> , RAT failures for metron-interface/metron-alerts.  A comment in the jira
>> suggests a way to address it, but someone familiar with the code should
>> look at it.
>> 
>> 
>> 
>> Raghu, would you be able to take a look?
>> 
>> 
>> 
>> Thanks,
>> 
>> --Matt
>> 
>> 
>> 
>> From: Matt Foley <mfo...@hortonworks.com> on behalf of Matt Foley <
>> ma...@apache.org>
>> Date: Tuesday, September 5, 2017 at 10:01 AM
>> To: "dev@metron.apache.org" <dev@metron.apache.org>
>> Subject: Re: [DISCUSS] Metron release 0.4.1
>> 
>> 
>> 
>> Great, working on it!
>> 
>> 
>> 
>> From: Nick Allen <n...@nickallen.org>
>> Date: Tuesday, September 5, 2017 at 8:00 AM
>> To: Casey Stella <ceste...@gmail.com>, "zeo...@gmail.com" <
>> zeo...@gmail.com>
>> Cc: Anand Subramanian <asubraman...@hortonworks.com>, Matt Foley <
>> mfo...@hortonworks.com>, "dev@metron.apache.org" <dev@metron.apache.org>
>> Subject: Re: [DISCUSS] Metron release 0.4.1
>> 
>> 
>> 
>> All set here.  Let's get this shipped!
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> On Tue, Sep 5, 2017 at 10:44 AM Casey Stella <ceste...@gmail.com> wrote:
>> 
>> me too
>> 
>> 
>> 
>> On Tue, Sep 5, 2017 at 10:43 AM, zeo...@gmail.com <zeo...@gmail.com>
>> wrote:
>> 
>> All set from my perspective.
>> 
>> 
>> 
>> Jon
>> 
>> 
>> 
>> On Sat, Sep 2, 2017 at 4:38 AM Anand Subramanian <
>> asubraman...@hortonworks.com> wrote:
>> 
>> Hello Matt,
>> 
>> Simon's pull request supersedes mine (METRON-1139 /
>> https://github.com/apache/metron/pull/722) and is already merged into
>> master.
>> 
>> Thanks,
>> Anand
>> 
>> 
>> 
>>> On 9/1/17, 12:41 AM, "Matt Foley" <mfo...@hortonworks.com> wrote:
>>> 
>>> Please mark them 0.4.1, as that’s what the community says we want to call
>> the upcoming release, and everything that’s there when I throw the switch
>> will be included.
>>> 
>>> Jon and Anand, will they be in by end/day Friday?
>>> Thanks,
>>> --Matt
>>> 
>>> On 8/31/17, 7:45 AM, "Nick Allen" <n...@nickallen.org> wrote:
>>> 
>>>   Matt, et al - For JIRAs that are going into master, should we be
>> marking
>>>   these as "Next + 1" or "0.4.1" ?
>>> 
>>>>   On Thu, Aug 31, 2017 at 8:17 AM zeo...@gmail.com <zeo...@gmail.com>
>>> wrote:
>>> 
>>>> Can I advocate to get METRON-1129 in the RC, and throw in a second
>> vote for
>>>> METRON-1134?  Both in an attempt to better support of prod/offline
>> use.
>>>> Happy to provide testing cycles for the former.
>>>> 
>>>> Jon
>>>> 
>>>> On Wed, Aug 30, 2017 at 11:41 AM Anand Subramanian <
>>>> asubraman...@hortonworks.com> wrote:
>>>> 
>>>>> Hi Matt,
>>>>> 
>>>>> This defect needs to be included as a follow-on to METRON-1122:
>>>>> * METRON-1141 (https://github.com/apache/metron/pull/723)
>>>>> 
>>>>> 
>>>>> Thanks,
>>>>> Anand
>>>>> 
>>>>> 
>>>>> 
>>>>> On 8/30/17, 8:57 PM, "Michael Miklavcic" <
>> michael.miklav...@gmail.com>
>>>>> wrote:
>>>>> 
>>>>>> I have some work around fixing how we handle config

Re: [DISCUSS] Metron release 0.4.1

2017-09-07 Thread Ryan Merriman
Matt,

We recently found a bug that's breaking certain features in the management
UI.  The fixes are still in review (
https://github.com/apache/metron/pull/729 and
https://github.com/apache/metron/pull/730) and should make it in soon.  It
would be good to include these if possible.

Ryan

On Thu, Sep 7, 2017 at 12:23 AM, Matt Foley  wrote:

> I’ve got a blocker bug, https://issues.apache.org/jira/browse/METRON-1163
> , RAT failures for metron-interface/metron-alerts.  A comment in the jira
> suggests a way to address it, but someone familiar with the code should
> look at it.
>
>
>
> Raghu, would you be able to take a look?
>
>
>
> Thanks,
>
> --Matt
>
>
>
> From: Matt Foley  on behalf of Matt Foley <
> ma...@apache.org>
> Date: Tuesday, September 5, 2017 at 10:01 AM
> To: "dev@metron.apache.org" 
> Subject: Re: [DISCUSS] Metron release 0.4.1
>
>
>
> Great, working on it!
>
>
>
> From: Nick Allen 
> Date: Tuesday, September 5, 2017 at 8:00 AM
> To: Casey Stella , "zeo...@gmail.com" <
> zeo...@gmail.com>
> Cc: Anand Subramanian , Matt Foley <
> mfo...@hortonworks.com>, "dev@metron.apache.org" 
> Subject: Re: [DISCUSS] Metron release 0.4.1
>
>
>
> All set here.  Let's get this shipped!
>
>
>
>
>
>
>
> On Tue, Sep 5, 2017 at 10:44 AM Casey Stella  wrote:
>
> me too
>
>
>
> On Tue, Sep 5, 2017 at 10:43 AM, zeo...@gmail.com 
> wrote:
>
> All set from my perspective.
>
>
>
> Jon
>
>
>
> On Sat, Sep 2, 2017 at 4:38 AM Anand Subramanian <
> asubraman...@hortonworks.com> wrote:
>
> Hello Matt,
>
> Simon's pull request supersedes mine (METRON-1139 /
> https://github.com/apache/metron/pull/722) and is already merged into
> master.
>
> Thanks,
> Anand
>
>
>
> On 9/1/17, 12:41 AM, "Matt Foley"  wrote:
>
> >Please mark them 0.4.1, as that’s what the community says we want to call
> the upcoming release, and everything that’s there when I throw the switch
> will be included.
> >
> >Jon and Anand, will they be in by end/day Friday?
> >Thanks,
> >--Matt
> >
> >On 8/31/17, 7:45 AM, "Nick Allen"  wrote:
> >
> >Matt, et al - For JIRAs that are going into master, should we be
> marking
> >these as "Next + 1" or "0.4.1" ?
> >
> >On Thu, Aug 31, 2017 at 8:17 AM zeo...@gmail.com 
> wrote:
> >
> >> Can I advocate to get METRON-1129 in the RC, and throw in a second
> vote for
> >> METRON-1134?  Both in an attempt to better support of prod/offline
> use.
> >> Happy to provide testing cycles for the former.
> >>
> >> Jon
> >>
> >> On Wed, Aug 30, 2017 at 11:41 AM Anand Subramanian <
> >> asubraman...@hortonworks.com> wrote:
> >>
> >> > Hi Matt,
> >> >
> >> > This defect needs to be included as a follow-on to METRON-1122:
> >> > * METRON-1141 (https://github.com/apache/metron/pull/723)
> >> >
> >> >
> >> > Thanks,
> >> > Anand
> >> >
> >> >
> >> >
> >> > On 8/30/17, 8:57 PM, "Michael Miklavcic" <
> michael.miklav...@gmail.com>
> >> > wrote:
> >> >
> >> > >I have some work around fixing how we handle config with Ambari
> that I'd
> >> > >like to see go in. No PR yet, but coming soon. I expect to have
> this by
> >> > the
> >> > >RC deadline.
> >> > >
> >> > >Mike
> >> > >
> >> > >On Wed, Aug 30, 2017 at 8:57 AM, Nick Allen 
> wrote:
> >> > >
> >> > >> The following PRs are usability enhancements for the
> Profiler.  They
> >> are
> >> > >> fairly simple and I think are very helpful for
> troubleshooting.  I
> >> don't
> >> > >> want to hold up the release, but it would be a "nice to have"
> to get
> >> > these
> >> > >> in.
> >> > >>
> >> > >> If anyone has cycles, I would appreciate some reviews of these
> PRs.
> >> > >> https://github.com/apache/metron/pull/721
> >> > >> https://github.com/apache/metron/pull/716
> >> > >>
> >> > >>
> >> > >>
> >> > >> On Tue, Aug 29, 2017 at 8:59 PM Matt Foley 
> wrote:
> >> > >>
> >> > >> > Okay, just to be clear, you’re requesting we wait until
> Friday, if
> >> > >> > necessary, to cut an RC with 717 in it?
> >> > >> > Thanks,
> >> > >> > --Matt
> >> > >> >
> >> > >> > On 8/29/17, 11:45 AM, "Casey Stella" 
> wrote:
> >> > >> >
> >> > >> > 709 is in and 717 is under concerted review by Otto.
> I'd like
> >> to
> >> > see
> >> > >> > it in
> >> > >> > by Friday.
> >> > >> >
> >> > >> > On Tue, Aug 29, 2017 at 1:35 PM, Matt Foley <
> ma...@apache.org>
> >> > >> wrote:
> >> > >> >
> >> > >> > > Hi all,
> >> > >> > > Thanks for your inputs.  The three PRs Nick mentioned
> have
> >> been
> >> 

Re: REST and ManagementUI : Grok

2017-08-17 Thread Ryan Merriman
Check out line 336 in
https://github.com/apache/metron/blob/master/metron-interface/metron-config/src/app/sensors/sensor-parser-config/sensor-parser-config.component.ts.
That's where the check is done to determine if a grok parser has been
selected.  I would also check the browser console for errors.

On Thu, Aug 17, 2017 at 10:51 AM, Otto Fowler <ottobackwa...@gmail.com>
wrote:

> That is where the “I have a branch where I have broken that” part comes in ;)
>
> Actually I’m working on making HDFS actually work, because it doesn’t from
> the
> > rule loading perspective.  We ( mmiklavcic and I ) think everything has
> been loading
> from class path because the grokpath doesn’t resolve on hdfs, it is
> missing the /apps/metron part.
>
> I have fixed all of that, and made it so the parsers load grok from hdfs
> correctly now, but the REST
> and thus the GUI are not working now.
>
> When I create a new parser config in the ui (+) the Grok pattern
> label/button disappear the moment i select a parser type, even grok.
>
>
>
>
> On August 17, 2017 at 11:36:24, Ryan Merriman (merrim...@gmail.com) wrote:
>
> The UI shows the Grok editor when you select the GrokParser as the parser
> type. Did the package of the GrokParser change by chance? Sounds like
> this is a bug to me.
>
> You can view, edit, test and save grok patterns so I would say you can do
> everything in the UI . The fact that a pattern can be retrieved from the
> classpath makes it tricky though. For example, if you are looking at a
> parser that stores it's pattern in the classpath (Yaf for example) the
> path
> would be "/patterns/yaf". Once you go to save it you have to change the
> path to a path that exists in HDFS (/apps/metron/patterns/yaf) because you
> obviously can't update the pattern in the jar classpath. I regret that we
> added this (I think it was me actually) because it's caused nothing but
> confusion and doesn't add any value. I would be in favor of keeping
> patterns exclusively in HDFS.
>
> Ryan
>
> On Thu, Aug 17, 2017 at 10:25 AM, Otto Fowler <ottobackwa...@gmail.com>
> wrote:
>
> > Question,
> >
> > How does the UI decide, when you edit a parser, to show the grok pattern
> > field? I have branch where I have broken that, for example if I add a
> new
> > configuration, I see the grok pattern field, until I actually select
> GROK
> > as the parser type, then it goes away.
> >
> > Also, just to confirm, we don’t *do* anything with the grok patterns in
> the
> > ui, we only enter and test them or test data against them?
> >
>
>


Re: [DISCUSS] Using Yarn package manager for metron-alerts

2017-08-17 Thread Ryan Merriman
Thanks for this Raghu.  You make a pretty compelling argument.  I'm +1 on
moving to yarn.

Ryan

On Wed, Aug 16, 2017 at 3:51 PM, Nick Allen  wrote:

> It is also my understanding that there is no hard cut-over to yarn.  After we
> introduce the yarn.lock, as a developer you can choose to continue to use npm
> or switch to yarn.
>
> Other developers on the project can keep using npm, so you don’t need to
> > get everyone on your project to convert at the same time. The developers
> > using yarn will all get exactly the same configuration as each other, and
> > the developers using npm may get slightly different configurations, which
> > is the intended behavior of npm.
>
>
> https://yarnpkg.com/lang/en/docs/migrating-from-npm/
>
>
> Oh, and I just switched the metron-alerts project to yarn (as a test) and
> performed an offline install.  It was stupid simple.
>
>
>
>
> On Wed, Aug 16, 2017 at 4:12 PM Nick Allen  wrote:
>
> > Thanks for laying this all out for us, Raghu.  Based on the built-in
> > support for offline installs and version locking, I think this is a great
> > suggestion. (However unfortunate the namespace collision might be.)
> >
> >
> >
> >
> >
> >
> >
> >
> > On Wed, Aug 16, 2017 at 8:51 AM RaghuMitra Kandikonda <
> > raghumitra@gmail.com> wrote:
> >
> >> I would like to start a discussion around using 'yarn' for managing
> >> dependencies for metron-alerts instead of 'npm'.
> >>
> >> This article beautifully summarizes the need of yarn and npm.
> >> (https://code.facebook.com/posts/1840075619545360)
> >>
> >> If you have read the above article you can skip the next two sections
> >> and jump to 'Additional advantages of Yarn'
> >>
> >> 
> >> 
> >> ===
> >> Why do we need a new package manager ?.
> >>
> >> While 'npm' does a good job of downloading all the required
> >> dependencies, it always tries to download the latest and greatest
> >> versions of all these dependencies. This would create a problem in
> >> replicating the same build every time we build. Having hard coded
> >> versions in the package.json seems like a possible solution but this
> >> will prevent us from knowing that a library has been updated. In JS
> >> world the version updates are very frequent and we might be missing on
> >> some of the latest updates and some of these updates might be related
> >> to security or a cool feature we would like to have in our code base.
> >> Ex: Angular made 10 releases in last two months, bootstrap made 2
> >> releases in last two months.
> >>
> >> 
> >> 
> >> ===
> >> What is Yarn  ?.
> >>
> >> Yarn is a new age package manager that can (needs to) be installed
> >> over npm (or bower). Yarn resolves issues around versioning and
> >> non-determinism of JS dependencies by using lock files and an install
> >> algorithm that is deterministic and reliable. These lock files lock
> >> the installed dependencies to a specific version and ensure that every
> >> install results in the exact same file structure in node_modules
> >> across all machines. This kind of a locking mechanism is not available
> >> with vanilla node.
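For illustration, this is what the lock file's determinism looks like on disk: each yarn.lock entry pins a semver range to one exact resolved artifact. The entry below is hypothetical; the package, version, and checksum fragment are placeholders:

```
# yarn lockfile v1

rxjs@^5.0.1:
  version "5.0.3"
  resolved "https://registry.yarnpkg.com/rxjs/-/rxjs-5.0.3.tgz#<sha1>"
```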
> >>
> >> 
> >> 
> >> ===
> >> Additional advantages of Yarn ?.
> >>
> >> 1.Yarn helps us to check licenses of all the frameworks we are using.
> >> (This feature is built in)
> >> 2.It will reduce the build time of UI for dev as well as in Travis as
> >> all the dependencies are cached inside '~/.config/yarn/global'
> >> 3.We can do an offline install of UI as we can zip the dependencies
> >> and supply it to Yarn instead of downloading from the internet
> >> 4.Yarn is already integrated with Travis
> >> (https://blog.travis-ci.com/2016-11-21-travis-ci-now-supports-yarn)
> >>
> >> 
> >> 
> >> ===
> >> How to migrate ?.
> >>
> >> A yarn.lock file can be created from existing package.json file and
> >> this file would be checked in.
> >>
> >> 
> >> 
> >> ===
> >> How does the process change ?.
> >>
> >> 1.All the developers would use 'npm install' so that they can get the
> >> latest versions of the dependencies.
> >> 2.The build would use 'yarn install'. ( This change would be made in
> >> metron-alerts pom.xml file )
> >> 3.When the dev notices that a new version of the library is available
> >> we can test it thoroughly and update yarn.lock file
> >>
> >> 

Re: [DISCUSS] Persisting user data

2017-08-03 Thread Ryan Merriman
Spring is JDBC-generic so I think we're good there.  Improving our docs on
this topic is being discussed in https://github.com/apache/metron/pull/646
so hopefully this will be clear once that's worked out.

Simon is correct, I found out the hard way that Hibernate is not an option
because of its license.  I think EclipseLink would be a good alternative.
I've seen it used in other open source projects (Ambari for example) and I
was able to get it working in a POC without much effort.

On Thu, Aug 3, 2017 at 5:26 AM, Simon Elliston Ball <
si...@simonellistonball.com> wrote:

> Anything spring based is likely multi-db by definition as long as a we
> pick a good friendly ORM (not hibernate because licensing problems with
> apache, eclipselink?) But I suspect we should pick a good default and that
> that default should be postgres.
>
> > On 3 Aug 2017, at 10:24, Casey Stella <ceste...@gmail.com> wrote:
> >
> > I'd vote for a DB-based solution, but I'd argue that any solution
> shouldn't
> > be database specific (i.e. postgres), but JDBC-generic.  People and
> > organizations have very strong views regarding databases and I'd prefer
> to
> > side-step those holy wars by being agnostic.
> >
> > On Wed, Aug 2, 2017 at 9:36 PM, Ryan Merriman <merrim...@gmail.com>
> wrote:
> >
> >> Spring supports a variety of databases including Postgres.  I have no
> >> problem with using Postgres instead of MySQL.
> >>
> >> On Wed, Aug 2, 2017 at 3:32 PM, Simon Elliston Ball <
> >> si...@simonellistonball.com> wrote:
> >>
> >>> Agreed on Postgres. It's a lot easier to work with license-wise in
> apache
> >>> projects, and has a lot of the capability we need here, especially if
> we
> >>> can find a sensible ORM. Anyone got any thoughts on what would work
> >> there?
> >>>
> >>> Simon
> >>>
> >>>> On 2 Aug 2017, at 21:21, Matt Foley <ma...@apache.org> wrote:
> >>>>
> >>>> Hi Ryan,
> >>>> Zookeeper has a default (and seldom changed) max znode size of 1MB,
> but
> >>> it is “designed to store data on the order of kilobytes in size.”[1]
> And
> >>> it’s not really intended for frequently-changing data, which is okay
> >> here.
> >>> But I just included it for completeness, I’m not advocating for its use
> >>> here.
> >>>>
> >>>> I agree with you that the problem, especially because it includes
> >> shared
> >>> config, would fit well in a db.  I’d suggest you consider PostgreSQL
> >> rather
> >>> than MySQL, as postgres is built into Redhat 6 and 7, and Ambari now
> uses
> >>> it by default, so an available server might be conveniently at hand in
> >> most
> >>> deployments.  Definitely assume the user will want to use an external
> db
> >>> instance, rather than one dedicated to this use.  Conveniently Postgres
> >>> also has a native REST interface, with the usual authorization options.
> >>>>
> >>>> Never mind about Ambari Views for now.  It’s just a way to get GUI
> >>> dashboards without writing all the infrastructure for it, which as you
> >> say
> >>> is somewhat water under the bridge.
> >>>> Cheers,
> >>>> --Matt
> >>>>
> >>>> [1] https://zookeeper.apache.org/doc/r3.1.2/zookeeperAdmin.html
> >>>>
> >>>>
> >>>>
> >>>> On 8/2/17, 12:34 PM, "Ryan Merriman" <merrim...@gmail.com> wrote:
> >>>>
> >>>>   Matt,
> >>>>
> >>>>   Thank you for the suggestions.  I forgot to include Zookeeper.  Are
> >>> there
> >>>>   any tradeoffs we should be aware of if we decide to use Zookeeper?
> >>> Are
> >>>>   there guidelines for how much data can be stored in Zookeeper?
> >>>>
> >>>>   To answer your questions:
> >>>>
> >>>>   1.  I think both use cases make sense so a combination of shared and
> >>>>   personal.
> >>>>   2.  I was planning on managing authorization in the REST layer.  For
> >>> now
> >>>>   viewer login auth (which is really REST auth) will suffice but we
> >>> might
> >>>>   consider other methods since authentication is pluggable here.
> >>>>   3.  I had not considered Ambari Views since this will support an
> >>> existing
> >>>>   

Re: [DISCUSS] Persisting user data

2017-08-02 Thread Ryan Merriman
Spring supports a variety of databases including Postgres.  I have no
problem with using Postgres instead of MySQL.

On Wed, Aug 2, 2017 at 3:32 PM, Simon Elliston Ball <
si...@simonellistonball.com> wrote:

> Agreed on Postgres. It's a lot easier to work with license-wise in apache
> projects, and has a lot of the capability we need here, especially if we
> can find a sensible ORM. Anyone got any thoughts on what would work there?
>
> Simon
>
> > On 2 Aug 2017, at 21:21, Matt Foley <ma...@apache.org> wrote:
> >
> > Hi Ryan,
> > Zookeeper has a default (and seldom changed) max znode size of 1MB, but
> it is “designed to store data on the order of kilobytes in size.”[1]  And
> it’s not really intended for frequently-changing data, which is okay here.
> But I just included it for completeness, I’m not advocating for its use
> here.
> >
> > I agree with you that the problem, especially because it includes shared
> config, would fit well in a db.  I’d suggest you consider PostgreSQL rather
> than MySQL, as postgres is built into Redhat 6 and 7, and Ambari now uses
> it by default, so an available server might be conveniently at hand in most
> deployments.  Definitely assume the user will want to use an external db
> instance, rather than one dedicated to this use.  Conveniently Postgres
> also has a native REST interface, with the usual authorization options.
> >
> > Never mind about Ambari Views for now.  It’s just a way to get GUI
> dashboards without writing all the infrastructure for it, which as you say
> is somewhat water under the bridge.
> > Cheers,
> > --Matt
> >
> > [1] https://zookeeper.apache.org/doc/r3.1.2/zookeeperAdmin.html
> >
> >
> >
> > On 8/2/17, 12:34 PM, "Ryan Merriman" <merrim...@gmail.com> wrote:
> >
> >Matt,
> >
> >Thank you for the suggestions.  I forgot to include Zookeeper.  Are
> there
> >any tradeoffs we should be aware of if we decide to use Zookeeper?
> Are
> >there guidelines for how much data can be stored in Zookeeper?
> >
> >To answer your questions:
> >
> >1.  I think both use cases make sense so a combination of shared and
> >personal.
> >2.  I was planning on managing authorization in the REST layer.  For
> now
> >viewer login auth (which is really REST auth) will suffice but we
> might
> >consider other methods since authentication is pluggable here.
> >3.  I had not considered Ambari Views since this will support an
> existing
> >UI.  How would Ambari Views help us here?
> >
> >I will proceed initially with a saved search POC using a relational
> >database unless you think that is a bad idea or there are other better
> >options.  Hopefully an example will further the discussion.
> >
> >Ryan
> >
> >>On Wed, Jul 26, 2017 at 6:31 PM, Matt Foley <ma...@apache.org>
> wrote:
> >>
> >> There’s a couple other places you could put config info (but maybe not
> >> saved searches):
> >> -  Zookeeper
> >> -  metron-alerts-ui/config.xml or config.json  file
> >> -  the Ambari database, whichever it happens to be
> >>
> >> Questions that influence the decision include:
> >> 1. Should there be one configuration shared among users, or strictly
> >> per-user config?  Or a combination of shared and personal?
> >> 2. What security do you wish to maintain on changing those settings,
> both
> >> shared and personal?  What authentication/authorization scheme will you
> >> use?  Is viewer login auth sufficient for this?
> >> 3. Will you assume Ambari exists?  Did you consider using Ambari Views
> as
> >> the basis? (https://cwiki.apache.org/confluence/display/AMBARI/Views )
> >>
> >> On 7/26/17, 2:54 PM, "Ryan Merriman" <merrim...@gmail.com> wrote:
> >>
> >>In anticipation of METRON-988 being merged into master, there will
> be a
> >>need to persist user preferences such as UI layout, saved searches,
> >> search
> >>history, etc.  I think where and how we persist this data should be
> >>discussed in order to facilitate a design.  This data won't be large
> in
> >>scale and may or may not be relational.  The initial features I am
> >> aware of
> >>don't require a relational model but I'm sure there will be some that
> >> do in
> >>the future.  I'm also assuming this code will live in the REST
> >> application
> >>but someone correct me if there is a reason to

Re: [DISCUSS] Persisting user data

2017-08-02 Thread Ryan Merriman
Matt,

Thank you for the suggestions.  I forgot to include Zookeeper.  Are there
any tradeoffs we should be aware of if we decide to use Zookeeper?  Are
there guidelines for how much data can be stored in Zookeeper?

To answer your questions:

1.  I think both use cases make sense so a combination of shared and
personal.
2.  I was planning on managing authorization in the REST layer.  For now
viewer login auth (which is really REST auth) will suffice but we might
consider other methods since authentication is pluggable here.
3.  I had not considered Ambari Views since this will support an existing
UI.  How would Ambari Views help us here?

I will proceed initially with a saved search POC using a relational
database unless you think that is a bad idea or there are other better
options.  Hopefully an example will further the discussion.

Ryan

On Wed, Jul 26, 2017 at 6:31 PM, Matt Foley <ma...@apache.org> wrote:

> There’s a couple other places you could put config info (but maybe not
> saved searches):
> -  Zookeeper
> -  metron-alerts-ui/config.xml or config.json  file
> -  the Ambari database, whichever it happens to be
>
> Questions that influence the decision include:
> 1. Should there be one configuration shared among users, or strictly
> per-user config?  Or a combination of shared and personal?
> 2. What security do you wish to maintain on changing those settings, both
> shared and personal?  What authentication/authorization scheme will you
> use?  Is viewer login auth sufficient for this?
> 3. Will you assume Ambari exists?  Did you consider using Ambari Views as
> the basis? (https://cwiki.apache.org/confluence/display/AMBARI/Views )
>
> On 7/26/17, 2:54 PM, "Ryan Merriman" <merrim...@gmail.com> wrote:
>
> In anticipation of METRON-988 being merged into master, there will be a
> need to persist user preferences such as UI layout, saved searches,
> search
> history, etc.  I think where and how we persist this data should be
> discussed in order to facilitate a design.  This data won't be large in
> scale and may or may not be relational.  The initial features I am
> aware of
> don't require a relational model but I'm sure there will be some that
> do in
> the future.  I'm also assuming this code will live in the REST
> application
> but someone correct me if there is a reason to keep it somewhere else.
>
> I think it would be preferable to leverage something that is already
> in our
> stack and available as a dependency.  However I would not be against
> adding
> something if it really were the right tool for the job.  Assuming
> others
agree we should stick with our current stack, I see these options:
>
>- MySQL (or other relational database)
>   - good fit for the size of data
>   - relational capabilities
>   - an ORM framework will be necessary which will increase our
>   dependencies and complexity
>- HBase
>   - client setup and code will likely be simpler and less complex
>   - limited data model
>- Elasticsearch
>   - json is a convenient data model
>   - we already store user preferences here (Kibana dashboards)
>   - we have abstracted our search engine interactions in several
> places
>   and would have to here too
>
> Elasticsearch is out for me because we view search engines as
> pluggable.  I
> think HBase would be the easiest to implement and get working but I'm
> worried we'll have similar use cases that won't be a good fit for
> HBase.
> In that case we would need to come up with an alternative persistence
> solution anyways.  I think MySQL is a good fit long term but I'm
> concerned
> about adding a heavy ORM framework.  Also, we can't use Hibernate
> because
> it is not license friendly.
>
> Does anyone have any thoughts on these options or other ideas?
>
> This requirement also brings up another topic that is outside of this
> discussion.  Should we reevaluate our authentication strategy?
> Currently
> the REST application uses JDBC for this but if we decide a different
> mechanism is better then we no longer need a relational database.  This
> might affect our decision to use MySQL for this kind of data
> persistence.
>
> Ryan
>
>
>
>


Re: [DISCUSSION] METRON-1046 -> Stellar Files for multiple statement execution

2017-07-14 Thread Ryan Merriman
A couple things I would like to point out.  You can test Stellar statements
without having to send data through parser/enrichment topologies.  There is
a REST endpoint that allows you to pass in a sample message and parser
config and returns a message with Stellar statements applied.  This could
easily be expanded to enrichment configs or testing generic stellar
statements against test messages.

Moving statements to a separate file is going to require a lot of work and
will make our mechanism for managing configuration in bolts more complex.
We would have to also listen for changes in these files and reconcile which
parser/enrichment configs are affected.



On Fri, Jul 14, 2017 at 12:42 PM, Otto Fowler 
wrote:

> I think the ‘files’ should be stored in zk, and updated with the same
> mechanism.
>
> On July 14, 2017 at 13:27:36, Matt Foley (mfo...@hortonworks.com) wrote:
>
> In the abstract, this is a good idea. I see it as related to METRON-987,
> which was the first step in allowing sequences of Stellar statements (aka
> "programs" :-) ) instead of just unrelated groups of single statements.
> Your proposal lets us really work with programs as first-class entities.
>
> However, some concerns need to be resolved:
>
> 1. Syntax.
>
> Currently Stellar syntax and JSON fit neatly together. Where would be the
> cut line for file substitutions? Referencing METRON-987, would you only
> allow a file substitution where we currently allow square-bracketed Stellar
> string sequences? What about Profile config syntax, where several chunks of
> code are intimately related (hence want to be located in the same file),
> but don't all get executed at the same time? (This is not a showstopper
> question because Profile configs are usually simple and don't really need
> file substitution. The need is much greater in Enrichment.)
>
> 2. Config Updates.
>
> Currently Metron configuration is stored in ZK, but managed through Curator
> libraries. In return for considerable complexity, this gives instant update
> whenever a config changes, without effort in the BI part of the
> application. This differs sharply from file-based configuration, where
> updates in response to config changes require either a restart, an explicit
> reload command from the user, or frequent state-checking in the
> application.
>
> So currently people trying to develop a new enrichment can update the
> config, and immediately test the result, without restarting and without any
> explicit reload command. We probably want to not lose this.
>
> Rather than roll our own file pointer model, can we use JSON Pointers? Will
> they work with Curator? Both of those get into some fairly obscure
> features, that would need to be studied. It also actually relates to the
> syntax question presented above.
>
>
> On 7/14/17, 6:17 AM, "Otto Fowler"  wrote:
>
> https://issues.apache.org/jira/browse/METRON-1046
>
> I was thinking this morning that managing stellar statements in the config
> json could become, and maybe is kind of unwieldy.
> To that end, if in say a parser configuration I can refer to a ‘file’ in
> zookeeper as an alternative, we would add the capability to execute and
> manage more complex statements, and even chain multiple statements
> together.
>
> These files could be shared as well.
>
> This could be a Bad Idea™, so I thought I’d throw it out to the list.
>
> Please check out the jira, give some thought, and comment there or on the
> list or both.
>
> O
>


Re: Metron REST - Logging Config

2017-07-14 Thread Ryan Merriman
The only way I know of is to change log4j.properties.  Did you ever figure
out a better way?

On Tue, Jul 11, 2017 at 2:10 PM, Nick Allen  wrote:

> How do I configure logging for Metron REST on a deployed host?
>
> Right now a log4j.properties file gets packaged into the metron-rest JAR
> itself.  Is there is an easy way that I am missing?
>


[DISCUSS] Search in REST

2017-07-10 Thread Ryan Merriman
This discussion is an attempt to clarify some questions and discuss design
decisions related to METRON-1022.

The primary purpose of METRON-1022 is to provide a foundation for building
Metron-specific Elasticsearch (or other search engine implementations)
functions in our REST application.  This translates into 3 features
provided by METRON-1022:  a common approach to setting up a
TransportClient, a search abstraction layer and a simple Elasticsearch
implementation consisting of a single function.  I believe the setup part
is fairly straightforward and doesn't require a detailed discussion.
Please chime in if I'm wrong there.

The first order of business is to all agree on an architectural approach.
How and where should we query Elasticsearch?  METRON-1022 duplicates some
functionality in METRON-990 but is architecturally different.  Instead of
the client-side code interacting directly with Elasticsearch through it's
REST api, this PR interacts with Elasticsearch through the Java api in a
Metron REST service.  I believe there are a couple of advantages to doing
it this way:

   - A Metron-specific search service in REST can be reused by other UIs
   and clients.  It would be possible to make an angular service reusable but
   that would take some work and it would only be reusable in javascript as an
   imported library and not as flexible as a service available over http.
   - Metron provides an integration testing framework for Java-based
   classes.  METRON-1022 leverages this without much additional effort.  It
   would take a lot more work to enable javascript modules to use this.
   - In my experience, the Metron community is much more comfortable with
   developing and reviewing features written in Java as opposed to
   javascript.  I think that is important for foundational pieces like this.

Some arguments to consider for keeping Elasticsearch functions in a
javascript service:

   - More efficient since there is no proxy in the middle (Metron REST
   being the proxy)
   - Eliminates the task of resolving version conflicts that comes with
   adding the Elasticsearch dependency to a Maven module although there are
   ways to make this easier

The second topic to discuss is the search abstraction.  This has been
requested several times and I think there is consensus that we need it.
METRON-1022 attempts to do this by:

   - creating model classes that represent search requests/responses
   - creating a search interface that accepts these model classes as input
   and return parameters
   - creating a controller that exposes this interface over REST
   - using Spring's IOC framework to select the correct implementation
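The abstraction described above can be sketched roughly as follows. The class names and fields here are illustrative simplifications, not the actual METRON-1022 interfaces, and the in-memory implementation stands in for ElasticsearchServiceImpl purely to show the pattern:

```java
import java.util.List;
import java.util.Map;

// Hypothetical model class representing a search request.
class SearchRequest {
    List<String> indices;
    String query;
    int size;
    SearchRequest(List<String> indices, String query, int size) {
        this.indices = indices; this.query = query; this.size = size;
    }
}

// Hypothetical model class representing a search response.
class SearchResponse {
    long total;
    List<Map<String, Object>> results;
    SearchResponse(long total, List<Map<String, Object>> results) {
        this.total = total; this.results = results;
    }
}

// Engine-agnostic search interface; in the REST application, Spring's IOC
// container would select the concrete implementation (e.g. Elasticsearch
// vs. another engine) at startup.
interface SearchService {
    SearchResponse search(SearchRequest request);
}

// Trivial in-memory implementation standing in for ElasticsearchServiceImpl,
// just to demonstrate the contract.
class InMemorySearchService implements SearchService {
    private final List<Map<String, Object>> docs;
    InMemorySearchService(List<Map<String, Object>> docs) { this.docs = docs; }
    public SearchResponse search(SearchRequest request) {
        List<Map<String, Object>> hits = docs.stream()
            .filter(d -> d.values().stream()
                .anyMatch(v -> String.valueOf(v).contains(request.query)))
            .limit(request.size)
            .collect(java.util.stream.Collectors.toList());
        return new SearchResponse(hits.size(), hits);
    }
}
```

Because both UIs and other clients code against SearchService and the model classes only, replacing the backing engine means swapping one implementation class rather than touching every caller.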

An implementation of a search function was included in METRON-1022 as an
example.  ElasticsearchServiceImpl implements SearchService and is selected
as the implementation by default.  This could have been a separate PR but I
felt having it in context would help reviewers understand the design
pattern.

How does this relate to METRON-990?  Currently they overlap with
METRON-1022 offering a subset of the functionality in the
METRON-990 Elasticsearch service.  The idea is to first ensure METRON-990
and METRON-1022 both conform to the same search abstraction (which has been
discussed in METRON-990 feedback).  The next step would be to replace the
search service in METRON-990 to one that queries the Metron REST service
instead.  Ideally this only involves changing one class since the
abstraction is used in all the other components of METRON-990 and is
trivial since the complexity is now in Metron REST and not javascript.
Next other services (getting index metadata for example) would be converted
using the same process in incremental PRs.  Then, moving forward, all
Elasticsearch interactions would instead be developed as Metron REST
endpoints using the foundation established in METRON-1022.

This is a lot to digest so I'm happy to provide more detail as needed.  Interested
to hear others' thoughts and reactions.

Ryan


Re: [VOTE] Apache Metron 0.4.0 release

2017-06-30 Thread Ryan Merriman
+1 (binding)

   - Verified Keys
   - Verified mvn clean install
   - Ran full dev and performed several smoke tests including:
  - tested several REST endpoints
  - verified data in ES
  - tested Management UI
  - verified Storm topologies in Storm UI
  - verified data in Kibana


On Fri, Jun 30, 2017 at 12:24 PM, Matt Foley  wrote:

> Hey all, we need more votes!
>
> So far we have 6 +1’s (including mine) and no 0’s or -1’s.
> BUT, only two are binding, ie from PMC members.
> Rules require 3 or more PMC member +1’s.  Could one more
> member please step up and vote?
>
> Thanks!
> --Matt
>
> On 6/29/17, 3:00 PM, "James Sirota"  wrote:
>
> +1 (Binding)
>
> * Verified Keys
> * Verified mvn clean install completed successfully
> * Verified AWS install of core via Mpack
>
>
> 29.06.2017, 09:14, "Justin Leet" :
> > +1 (Non-binding)
> >
> > * Verified Keys
> > * Verified mvn clean install completed successfully
> > * Ran full dev: saw data flow through, ran a couple of the REST
> APIs, and
> > opened up and clicked through a bit of the Management API.
> > * Examined site-book and didn't see any issues
> >
> > On Thu, Jun 29, 2017 at 11:46 AM, Casey Stella 
> wrote:
> >
> >>  +1 (binding)
> >>  * Verified keys
> >>  * Verified mvn build
> >>  * Verified unit and integration tests run
> >>  * Verified license check runs
> >>  * Verified fulldev spun up with smoketest
> >>
> >>  On Wed, Jun 28, 2017 at 8:10 PM, Anand Subramanian <
> >>  asubraman...@hortonworks.com> wrote:
> >>
> >>  > +1 (non-binding)
> >>  >
> >>  > * Brought up Metron stack on 12-node CentOS7 openstack cluster
> >>  > * Verify all services come up fine [PASS]
> >>  > * Bro, YAF and snort - ingest into respective kafka topics and
> write
> >>  > indices [PASS]
> >>  > * Add squid telemetry, ingest into kafka topic and write indices
> [PASS]
> >>  > * Metron YAF Zeppelin dashboard with sample ingested YAF data
> [PASS]
> >>  > * Management UI and REST Swagger UI sanity check [PASS]
> >>  >
> >>  >
> >>  > -Anand
> >>  >
> >>  >
> >>  >
> >>  >
> >>  >
> >>  > On 6/28/17, 12:06 AM, "Matt Foley"  wrote:
> >>  >
> >>  > >This is a call to vote on releasing this rc4 as “Apache Metron
> 0.4.0”.
> >>  > >(Note: this is rc4 because the release candidate needed to be
> modified
> >>  > with another commit after the rc3 tag was pushed to public.)
> >>  > >
> >>  > >Full list of changes in this release:
> >>  > >https://dist.apache.org/repos/dist/dev/metron/0.4.0-
> RC4/RELEASE_NOTES
> >>  > >
> >>  > >The tag/commit to be voted upon is:
> >>  > >d52f574f8294e453ecad3871526858a0c3c2033d (tag
> apache-metron-0.4.0-rc4)
> >>  > >
> >>  > >The source archive being voted upon can be found here:
> >>  > >https://dist.apache.org/repos/dist/dev/metron/0.4.0-
> >>  > RC4/apache-metron-0.4.0-rc4.tar.gz
> >>  > >and in github at:
> >>  > >https://github.com/apache/metron/tree/Metron_0.4.0
> >>  > >
> >>  > >Other release files, signatures and digests can be found here:
> >>  > >https://dist.apache.org/repos/dist/dev/metron/0.4.0-RC4/KEYS
> >>  > >
> >>  > >The release artifacts are signed with the following key:
> >>  > >https://dist.apache.org/repos/dist/dev/metron/0.4.0-RC4/KEYS
> >>  > >pub rsa4096/4169AA27ECB31663 2011-07-31 [SCEA]
> >>  > >Key fingerprint = 7854 36A7 8258 6B71 829C 67A0 4169 AA27 ECB3
> 1663
> >>  > >uid = Matthew Foley (CODE SIGNING KEY) 
> >>  > >
> >>  > >Please vote on releasing this package as Apache Metron 0.4.0.
> >>  > >When voting, please list the actions taken to verify the
> release.
> >>  > >
> >>  > >Recommended build validation and verification instructions are
> posted
> >>  > here:
> >>  > >https://cwiki.apache.org/confluence/display/METRON/
> Verifying+Builds
> >>  > >
> >>  > >This vote will be open for at least 72 hours. Please vote one
> of the
> >>  > following responses:
> >>  > >+1 Release this package as Apache Metron 0.4.0-RC4
> >>  > >0 No opinion
> >>  > >-1 Do not release this package because...
> >>  > >
> >>  > >Thank you,
> >>  > >--Matt
> >>  > >(your friendly release manager)
> >>  > >
> >>  > >
> >>  > >
> >>  >
>
> ---
> Thank you,
>
> James Sirota
> PPMC- Apache Metron (Incubating)
> jsirota AT apache DOT org
>
>
>
>


Re: Build failures

2017-06-28 Thread Ryan Merriman
Can you confirm you're on the master branch?  I see "metron-streaming" in
your path to RestTestingUtil and that was changed a LONG time ago.  You're
likely on a really old branch.

Ryan

On Wed, Jun 28, 2017 at 3:27 PM, Vasco Yordanov 
wrote:

> Hello , I just forked from github and it seems that " Metron-Pcap_Service"
> is failing with following errors:
>
>
> [ERROR] /home/vasko/metron2/incubator-metron-fork/metron-streaming/
> Metron-Pcap_Service/src/main/java/org/apache/metron/
> pcapservice/RestTestingUtil.java:[212,5] cannot find symbol
> [ERROR] symbol:   class ResponseEntity
> [ERROR] location: class org.apache.metron.pcapservice.RestTestingUtil
> [ERROR] /home/vasko/metron2/incubator-metron-fork/metron-streaming/
> Metron-Pcap_Service/src/main/java/org/apache/metron/
> pcapservice/RestTestingUtil.java:[212,63] cannot find symbol
> [ERROR] symbol:   variable HttpMethod
> [ERROR] location: class org.apache.metron.pcapservice.RestTestingUtil
>
> Please advise ? Before I start changing pom files ,I 'd like to run this
> through you in case this is known issue. Thank you
>   From: merrimanr 
>  To: dev@metron.apache.org
>  Sent: Wednesday, June 28, 2017 4:05 PM
>  Subject: [GitHub] metron issue #620: Metron-988: UI for viewing alerts
> generated by Metron
>
> Github user merrimanr commented on the issue:
>
> https://github.com/apache/metron/pull/620
>
> Just tested again and I am able to now remove the first filter and
> properly filter on values with special characters (referrer field for
> example).  I did another pass and found some trivial issues as well as a
> few non-trivial issues and have made comments.
>
> I think more thought needs to be put into the AlertService.search and
> AlertService.pollSearch functions.  The AlertService.getAlert function is
> very clear to me:  it requires a couple of clearly named parameters and I
> expect to get an 'Alert' type object back.  The other functions in this
> service are not as clear.  The search function for example takes in a
> QueryBuilder object which provides a generic javascript object as the body
> for the post request.  Then in return the post returns an Observable with a
> generic javascript object.  So essentially Typescript isn't being used here
> when it should because it would make the search interface clearer.
>
> For example, I would prefer this function signature:
> `public search(searchRequest: SearchRequest):
> Observable`
>
> where SearchRequest and SearchResponse are model objects.  The way it
> is now it's not easy to understand what is being sent and what is expected
> back unless you've spent time tracing the search calls to where requests
> are built/response are processed and know all the source code well OR
> already has a lot of experience with the ES query syntax.
>
> The result of all this is that not having a clear contract between the
> search client/server will make developing a middle-tier more tedious.
>
>
> ---
> If your project is set up for it, you can reply to this email and have your
> reply appear on GitHub as well. If your project does not have this feature
> enabled and wishes so, or if the feature is enabled but not working, please
> contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
> with INFRA.
> ---
>
>
>


Re: Trying to spin up Metron in EC2: Failed

2017-05-17 Thread Ryan Merriman
That happens when you don't have the zookeeper url configured correctly.
Can you check the contents of the /etc/sysconfig/metron file on the Metron
host?

On Wed, May 17, 2017 at 1:36 PM, Laurens Vets  wrote:

> For testing purposes, I decided to spin up the default Metron AWS config.
> This resulted in a hang from ansible here:
>
> TASK [librdkafka : include] **
> **
> task path: /home/laurens/SAPSource/metron/metron-deployment/roles/
> librdkafka/tasks/main.yml:18
> included: /home/laurens/SAPSource/metron/metron-deployment/roles/
> librdkafka/tasks/dependencies.yml for ec2-34-210-194-189.us-west-2.c
> ompute.amazonaws.com
>
> TASK [librdkafka : Install prerequisites] **
> 
> task path: /home/laurens/SAPSource/metron/metron-deployment/roles/
> librdkafka/tasks/dependencies.yml:18
>  ESTABLISH CONNECTION
> FOR USER: centos on PORT 22 TO ec2-34-210-194-189.us-west-2.c
> ompute.amazonaws.com
> /usr/lib/python2.7/dist-packages/Crypto/Cipher/blockalgo.py:141:
> FutureWarning: CTR mode needs counter parameter, not IV
>   self._cipher = factory.new(key, *args, **kwargs)
>  EXEC ( umask 22 &&
> mkdir -p "$( echo $HOME/.ansible/tmp/ansible-tmp-1495041091.74-92163853889508
> )" && echo "$( echo 
> $HOME/.ansible/tmp/ansible-tmp-1495041091.74-92163853889508
> )" )
>  PUT /tmp/tmpwnH61y
> TO /home/centos/.ansible/tmp/ansible-tmp-1495041091.74-92163853889508/yum
>  EXEC /bin/sh -c
> 'sudo -H -S -n -u root /bin/sh -c '"'"'echo 
> BECOME-SUCCESS-rmswjjyhfdywqvwtvqwcmbsqpsbohvxh;
> LANG=en_CA.UTF-8 LC_ALL=en_CA.UTF-8 LC_MESSAGES=en_CA.UTF-8 /usr/bin/python
> -tt /home/centos/.ansible/tmp/ansible-tmp-1495041091.74-92163853889508/yum;
> rm -rf "/home/centos/.ansible/tmp/ansible-tmp-1495041091.74-92163853889508/"
> > /dev/null 2>&1'"'"''
>
> Looking in the machine logs, I see the following for Kafka and Metron REST:
>
> Kafka:
> [2017-05-17 17:03:14,831] INFO KafkaConfig values:
> advertised.host.name = null
> metric.reporters = []
> quota.producer.default = 9223372036854775807
> offsets.topic.num.partitions = 50
> log.flush.interval.messages = 9223372036854775807
> auto.create.topics.enable = true
> controller.socket.timeout.ms = 3
> log.flush.interval.ms = null
> principal.builder.class = class org.apache.kafka.common.securi
> ty.auth.DefaultPrincipalBuilder
> replica.socket.receive.buffer.bytes = 65536
> min.insync.replicas = 1
> replica.fetch.wait.max.ms = 500
> num.recovery.threads.per.data.dir = 1
> ssl.keystore.type = JKS
> sasl.mechanism.inter.broker.protocol = GSSAPI
> default.replication.factor = 1
> ssl.truststore.password = null
> log.preallocate = false
> sasl.kerberos.principal.to.local.rules = [DEFAULT]
> fetch.purgatory.purge.interval.requests = 1
> ssl.endpoint.identification.algorithm = null
> replica.socket.timeout.ms = 3
> message.max.bytes = 100
> num.io.threads = 8
> offsets.commit.required.acks = -1
> log.flush.offset.checkpoint.interval.ms = 6
> delete.topic.enable = false
> quota.window.size.seconds = 1
> ssl.truststore.type = JKS
> offsets.commit.timeout.ms = 5000
> quota.window.num = 11
> zookeeper.connect = ec2-34-223-200-113.us-west-2.c
> ompute.amazonaws.com:2181
> authorizer.class.name =
> num.replica.fetchers = 1
> log.retention.ms = null
> log.roll.jitter.hours = 0
> log.cleaner.enable = true
> offsets.load.buffer.size = 5242880
> log.cleaner.delete.retention.ms = 8640
> ssl.client.auth = none
> controlled.shutdown.max.retries = 3
> queued.max.requests = 500
> offsets.topic.replication.factor = 3
> log.cleaner.threads = 1
> sasl.kerberos.service.name = null
> sasl.kerberos.ticket.renew.jitter = 0.05
> socket.request.max.bytes = 104857600
> ssl.trustmanager.algorithm = PKIX
> zookeeper.session.timeout.ms = 3
> log.retention.bytes = -1
> log.message.timestamp.type = CreateTime
> sasl.kerberos.min.time.before.relogin = 6
> zookeeper.set.acl = false
> connections.max.idle.ms = 60
> offsets.retention.minutes = 8640
> replica.fetch.backoff.ms = 1000
> inter.broker.protocol.version = 0.10.0-IV1
> log.retention.hours = 168
> num.partitions = 1
> broker.id.generation.enable = true
> listeners = PLAINTEXT://ec2-34-209-53-166.
> us-west-2.compute.amazonaws.com:6667
> ssl.provider = null
> ssl.enabled.protocols = [TLSv1.2, TLSv1.1, TLSv1]
> log.roll.ms = null
> log.flush.scheduler.interval.ms = 9223372036854775807
> ssl.cipher.suites = null
>  

Re: mvn building errors with 0.3.1

2017-05-15 Thread Ryan Merriman
Kevin, I think your error is related to npm.  Can you attach the full log
file?

On Mon, May 15, 2017 at 5:23 PM, Kevin Waterson 
wrote:

> I am getting something similar..
>
> [INFO] BUILD FAILURE
> [INFO]
> 
> [INFO] Total time: 08:08 min
> [INFO] Finished at: 2017-05-15T13:04:00+10:00
> [INFO] Final Memory: 220M/3873M
> [INFO]
> 
> [ERROR] Failed to execute goal
> com.github.eirslett:frontend-maven-plugin:1.3:npm (ng build) on project
> metron-config: Failed to run task: 'npm run build' failed. (error code 1)
> -> [Help 1]
>
> Looking back through the code I can see
> [ERROR] at
> /home/kevin/metron/docs/incubator-metron/metron-
> interface/metron-config/node_modules/resolve/lib/async.js:24:24
> [ERROR] at FSReqWrap.oncomplete (fs.js:117:15)
> [ERROR]
> [ERROR] npm ERR! Linux 4.4.0-75-generic
> [ERROR] npm ERR! argv
> "/home/kevin/metron/docs/incubator-metron/metron-
> interface/metron-config/node/node"
> "/home/kevin/metron/docs/incubator-metron/metron-
> interface/metron-config/node/node_modules/npm/bin/npm-cli.js"
> "run" "build"
> [ERROR] npm ERR! node v6.2.0
> [ERROR] npm ERR! npm  v3.8.9
> [ERROR] npm ERR! code ELIFECYCLE
> [ERROR] npm ERR! metron-management-ui@0.4.0 build:
> `./node_modules/angular-cli/bin/ng build -prod`
> [ERROR] npm ERR! Exit status 1
> [ERROR] npm ERR!
> [ERROR] npm ERR! Failed at the metron-management-ui@0.4.0 build script
> './node_modules/angular-cli/bin/ng build -prod'.
> [ERROR] npm ERR! Make sure you have the latest version of node.js and npm
> installed.
> [ERROR] npm ERR! If you do, this is most likely a problem with the
> metron-management-ui package,
> [ERROR] npm ERR! not with npm itself.
> [ERROR] npm ERR! Tell the author that this fails on your system:
> [ERROR] npm ERR! ./node_modules/angular-cli/bin/ng build -prod
> [ERROR] npm ERR! You can get information on how to open an issue for this
> project with:
> [ERROR] npm ERR! npm bugs metron-management-ui
> [ERROR] npm ERR! Or if that isn't available, you can get their info via:
> [ERROR] npm ERR! npm owner ls metron-management-ui
> [ERROR] npm ERR! There is likely additional logging output above.
> [ERROR]
>
>
> Kevin
>
> On Tue, May 16, 2017 at 4:32 AM, Dima Kovalyov 
> wrote:
>
> > Hello Metron devs,
> >
> > I am trying to merge the Apache Metron branch Metron_0.3.1 into our Metron
> > fork. I resolved a few conflicts in regards to the pom and travis xml files.
> > When I build it, I receive the following error:
> > [INFO] Compiling 55 source files to
> > /home/redacted/sst-metron/metron-platform/metron-
> enrichment/target/classes
> > /home/redacted/sst-metron/metron-platform/metron-
> > enrichment/src/main/java/org/apache/metron/enrichment/
> > adapters/simplehbase/SimpleHBaseAdapter.java:90:
> > warning: [unchecked] unchecked call to put(K,V) as a member of the raw
> > type HashMap
> >   enriched.put(kv.getKey().type + "." + values.getKey(),
> > values.getValue());
> >   ^
> >   where K,V are type-variables:
> > K extends Object declared in class HashMap
> > V extends Object declared in class HashMap
> > ...
> > [ERROR] Failed to execute goal
> > org.apache.maven.plugins:maven-compiler-plugin:3.5.1:compile
> > (default-compile) on project metron-enrichment: Compilation failure ->
> > [Help 1]
> >
> > Attached full file for check.
> >
> > Both repos building fine separately.
> >
> > Can you please advise where I can find file that causes this?
> > Thank you.
> >
> > - Dima
> >
>


Re: [MASTER BROKEN] Possible Issue in METRON.SPEC in master

2017-05-09 Thread Ryan Merriman
You are correct.  I just finished fixing this to get the RPMs to build.
Should be an easy fix.

On Tue, May 9, 2017 at 3:51 PM, Otto Fowler  wrote:

> I just tried to merge in master, and got conflicts in metron.spec.  Which
> is not unusual since spec files are awesome.
> But I was seeing a head >> commit >> set of conflict markers, which was weird.
>
> When I look at the actual code file in GitHub on the web, the conflict markers
> are actually there.  It's as if we committed the conflicted file:
>
> <<<<<<< HEAD
> %{metron_home}/config/zeppelin/metron/metron-connection-report.json
> =======
> %{metron_home}/config/zeppelin/metron/metron-ip-report.json
> >>>>>>> 1cace9ff29f31301d74fa6a7b2630d471452e985
>
> So I don’t think the RPMs are going to build off of this.
>
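A hedged sketch of a guard against this happening again: scan the spec files for committed git conflict markers before building RPMs (the pattern and file glob are assumptions; adjust the glob if other file types need checking):

```shell
# Hedged sketch: fail fast if git conflict markers were committed to any spec file.
# git conflict markers are seven repeated characters followed by a space (or EOL).
markers='^(<{7} |={7}$|>{7} )'
if grep -rEl "${markers}" --include='*.spec' . 2>/dev/null | grep -q .; then
  echo "conflict markers found in a spec file" >&2
else
  echo "no committed conflict markers in spec files"
fi
```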


Re: [DISCUSS] REST + ambari

2017-05-08 Thread Ryan Merriman
It already works the way Simon describes.

On Mon, May 8, 2017 at 8:08 AM, Simon Elliston Ball <
si...@simonellistonball.com> wrote:

> > My proposal would be that the REST API use its application.yml for all
> the parameters, with any settings it needs included there. E.g. the metron
> directory you need is set as a property of that yml, which is then
> accessible through the Spring config auto-wiring -
> @Value(“${metron.directory}”) private String path; in your service endpoint.
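The idea above can be illustrated without a full Spring context. A minimal sketch, assuming plain `java.util.Properties` stands in for Spring's yml binding and the `metron.directory` key mirrors the `@Value` expression quoted above:

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.Properties;

// Sketch: the REST service reads its settings from a config file that Ambari
// regenerates on every service (re)start, rather than querying Ambari directly.
public class ConfigSketch {
    // Stand-in for @Value("${metron.directory}") injection against the
    // Ambari-generated config content.
    static String readMetronDirectory(String generatedConfig) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(generatedConfig));
        } catch (IOException e) {  // StringReader won't actually throw here
            throw new IllegalStateException(e);
        }
        return props.getProperty("metron.directory");
    }

    public static void main(String[] args) {
        // Stand-in for content Ambari writes into application.yml on restart.
        System.out.println(readMetronDirectory("metron.directory=/apps/metron\n"));
        // prints /apps/metron
    }
}
```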
>
> The application.yml file is managed (generated) by Ambari on start of the
> rest api service (also managed through ambari now).
>
> All the parameters are built from a combination of user specified settings
> in Ambari, or service locations. "Where is storm?” is not something a user
> should have to specify; Ambari already knows. However "where would you like
> the metron directory?" is an input box somewhere in Ambari. Both are
> exposed to the rest api through the application.yml file.
>
> It’s more that Ambari pushes the config to the REST layer than the REST
> layer pulling from Ambari. This way we’re using Ambari the way it is
> intended rather than trying to make it be something it is not (e.g. etcd).
>
> Simon
>
>
>
> > On 8 May 2017, at 14:00, Otto Fowler  wrote:
> >
> > I’m not sure I understand.  Could you go on a bit?
> >
> > Right now, we have env parameters in Ambari that REST should honor.  I
> don’t understand how moving REST config into Ambari gets my REST service
> access to those parameters.
> >
> >
> >
> > On May 8, 2017 at 08:56:21, Simon Elliston Ball (
> si...@simonellistonball.com ) wrote:
> >
> >> Perhaps a better way of doing this would be to push the configuration
> of the REST api (the application.yml) into ambari control and expose
> parameters like that. That means the REST api doesn’t have to couple
> directly to Ambari, and doesn’t have to reinvent the Ambari paradigm
> (Ambari is not a configuration server, it’s a configuration file manager at
> heart). If the relevant parameters changed in Ambari, a restart of the REST
> service (and consequently a re-write of its application.yml) would be
> forced. The REST API itself would then just use a simple spring config
> expression to pick up the value.
> >>
> >> Simon
> >>
> >> > On 8 May 2017, at 13:52, Otto Fowler  wrote:
> >> >
> >> > The issue is that we have services that need to access the ambari
> rest api from
> >> > inside the rest server.
> >> >
> >> > Technically, if someone changes the defaults for the metron directory
> >> > etc., the REST service won’t pick it up.
> >> >
> >> > I have a PR coming as a follow-on to METRON-777 for installing
> >> > extensions, and I write to HDFS with a hard-coded directory, even though
> >> > we allow it to be configured.
> >> >
> >> > Other rest services write to hdfs as well.
> >> >
> >> >
> >> > On May 8, 2017 at 08:45:32, Nick Allen (n...@nickallen.org) wrote:
> >> >
> >> > As opposed to using the Ambari REST API to get this information?
> >> >
> >> > On Mon, May 8, 2017 at 8:06 AM, Otto Fowler 
> wrote:
> >> >
> >> >> I was thinking about having an Ambari ‘service’ in the REST API.
> >> >> The initial purpose would be to be able to retrieve ambari
> configuration
> >> >> variables for the metron service and components.
> >> >>
> >> >> The api would be
> >> >>
> >> >> PUT: login to ambari -> ambari credentials for use with ambari,
> session
> >> >> variable
> >> >> GET: /component | service if login provided, returns the
> configuration
> >> >> GET:/component | service | property name if login provided get the
> >> >> property value for property name
> >> >>
> >> >> So, the caller would put the credentials, and after that point the
> caller
> >> >> or other services can use ambari service configurations,
> >> >> for example to get the configured metron hdfs root directory.
> >> >>
> >> >> This is opposed to configuring credentials in the rest configuration.
> >> >>
> >> >> Thoughts?
> >> >>
>
>


Re: Error building in Travis after taking master

2017-05-03 Thread Ryan Merriman
You might want to try clearing the Maven cache.  I've had Travis get into a
bad state before because of corrupt Maven artifacts.
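A hedged sketch of what clearing the cache can look like, assuming the default local repository location under `~/.m2` and that only the jacoco artifacts need purging:

```shell
# Hedged sketch: purge the suspect artifacts so Maven re-downloads them.
M2_REPO="${HOME}/.m2/repository"
rm -rf "${M2_REPO}/org/jacoco"    # targeted: only the jacoco plugin artifacts
echo "cleared ${M2_REPO}/org/jacoco"
# Heavier-handed alternatives:
# rm -rf "${M2_REPO}"             # wipe the entire local cache
# mvn -U clean install            # force re-check of remote repositories
```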

On Wed, May 3, 2017 at 10:26 AM, Otto Fowler 
wrote:

> https://travis-ci.org/ottobackwards/incubator-metron/builds/228364894?utm_
> source=email_medium=notification
>
> On May 3, 2017 at 11:12:15, Justin Leet (justinjl...@gmail.com) wrote:
>
> > Jacoco is introduced from
> > https://github.com/apache/incubator-metron/pull/459.
> >
> > I'm not sure why you'd be getting a plugin error for it though, because
> > it's available in the standard repos and I was able to build off a
> > completely clean maven cache (and Travis has been able to build it as
> > well).
> >
> > What exactly you were running?
> >
> > On Wed, May 3, 2017 at 11:03 AM, Otto Fowler 
> > wrote:
> >
> > Anyone know what this is?
> >
> > [ERROR] No plugin found for prefix 'jacoco' in the current project and in
> > the plugin groups [org.apache.maven.plugins, org.codehaus.mojo] available
> > from the repositories [local (/home/travis/.m2/repository), central (
> > https://repo.maven.apache.org/maven2)] -> [Help 1]
> >
> >
>


Re: [GitHub] incubator-metron issue #562: METRON-915 add node and npm to platform_info.sh

2017-05-03 Thread Ryan Merriman
We are using a Maven plugin that automatically installs the correct versions of
node and npm locally, at least for the management UI.

Are there other parts of the project that depend on these tools?   Is it 
desirable to include this even if they aren't a prerequisite for building?  I 
don't think it's a bad idea to include it either way, just want people to be 
aware of the plugin.
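For reference, the plugin in question is configured roughly like this in the module's pom.xml. This is a sketch: the plugin version and the node/npm pins are taken from build logs elsewhere on this list and may not match what metron-config currently uses.

```xml
<plugin>
  <groupId>com.github.eirslett</groupId>
  <artifactId>frontend-maven-plugin</artifactId>
  <version>1.3</version>  <!-- version seen in earlier build output; may differ -->
  <executions>
    <execution>
      <id>install node and npm</id>
      <goals><goal>install-node-and-npm</goal></goals>
      <configuration>
        <nodeVersion>v6.2.0</nodeVersion>  <!-- assumed pin; check the module's pom -->
        <npmVersion>3.8.9</npmVersion>
      </configuration>
    </execution>
    <execution>
      <id>ng build</id>
      <goals><goal>npm</goal></goals>
      <configuration>
        <arguments>run build</arguments>
      </configuration>
    </execution>
  </executions>
</plugin>
```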

> On May 3, 2017, at 7:19 AM, nickwallen  wrote:
> 
> Github user nickwallen commented on the issue:
> 
>https://github.com/apache/incubator-metron/pull/562
> 
>+1 Thanks, Otto!  Tested on OSX and CentOS.  Worked as expected.
> 
> 
> ---
> If your project is set up for it, you can reply to this email and have your
> reply appear on GitHub as well. If your project does not have this feature
> enabled and wishes so, or if the feature is enabled but not working, please
> contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
> with INFRA.
> ---


Re: [Discussion] Access to Ambari Variables

2017-05-01 Thread Ryan Merriman
You would supply Ambari credentials when you make the Ambari (not ours)
REST call.

On Mon, May 1, 2017 at 3:14 PM, Otto Fowler  wrote:

> But we don’t have SSO, so that would require the Ambari credentials to be
> available to the REST API
>
>
> On May 1, 2017 at 16:10:03, Simon Elliston Ball (
> si...@simonellistonball.com)
> wrote:
>
> Ambari provides rest APIs to pull the values as long as you know (or can
> write a chain of calls to find) things like the name of the cluster and
> variable. Of course you’re writing an Ambari service, in which case there
> are various ways. A bad, though arguably viable way would be to grab it
> from the config files ambari writes and hope the restart had happened since
> the last change.
>
> Simon
>
> > On 1 May 2017, at 21:05, Otto Fowler  wrote:
> >
> > To be more specific, from our REST Service.
> >
> > On May 1, 2017 at 16:03:07, Otto Fowler (ottobackwa...@gmail.com) wrote:
> >
> > The question is: How would one correctly access the values of the
> > variables for metron that are set in Ambari?
> > Say, to get the patterns dir in hdfs?
> >
> > Asking for a friend ;)
>
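A hedged sketch of the chain of calls Simon describes: Ambari's v1 REST API exposes the cluster's desired config tags, and a second call fetches the config type that holds the value. The endpoint paths below follow Ambari's public API; the host, cluster, and config names are placeholders.

```java
// Sketch: build the two Ambari REST calls needed to read one config property.
// Step 1 discovers the current tag for a config type (via desired_configs);
// step 2 fetches that versioned config, which contains the property values.
public class AmbariUrls {
    static String desiredConfigsUrl(String ambari, String cluster) {
        return ambari + "/api/v1/clusters/" + cluster
             + "?fields=Clusters/desired_configs";
    }

    static String configUrl(String ambari, String cluster, String type, String tag) {
        return ambari + "/api/v1/clusters/" + cluster
             + "/configurations?type=" + type + "&tag=" + tag;
    }

    public static void main(String[] args) {
        String base = "http://ambari:8080";   // placeholder Ambari host
        System.out.println(desiredConfigsUrl(base, "metron_cluster"));
        System.out.println(configUrl(base, "metron_cluster", "metron-env", "version1"));
    }
}
```

The calls would be made with the supplied Ambari credentials via HTTP basic auth, as Ryan notes above.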


Re: [DISCUSS] Metron Rest to Install Parser (METRON-258)

2017-04-27 Thread Ryan Merriman
I think leveraging the REST application would work for this use case.
Services already exist for most of the functions listed in your pseudocode
(HDFS read/write, Zookeeper read/write).  Asynchronous functions are also
supported so no issues there.

On Thu, Apr 27, 2017 at 9:09 AM, Otto Fowler 
wrote:

> Also, if you want to help that would be great ;)
>
>
> On April 27, 2017 at 09:30:59, Otto Fowler (ottobackwa...@gmail.com)
> wrote:
>
> So, assuming (I know, I know) that METRON-777 eventually lands, it will
> have laid out the framework for the parser extension system, including
> the capability to create parsers outside the metron tree.
>
> The next step is METRON-258, side-loading of parsers.  This will be the
> effort to actually install 3rd-party parsers and other extensions (stellar
> libs, for example) into metron.
>
> My first inclination is to add new capabilities to the REST services to
> accomplish this.  We would use the current services, or orchestrate them to
> accomplish this.
>
> The ‘process’ required would be the following:
>
> I’ll pseudo code it out:
>
> [rest endpoint]
> installExtension(FILE extensionTgz, string extensionType)
>
> workingDir = unpackExtensionAssembly(tmpDir, extensionTgz)
>
> deployExtensionBundleToHdfs(workingDir)
>
> if (parser == extensionType)
> pushConfigurationToZK(workingDir)
> pushESTemplate(workingDir) *possibly depending on 777 review
> setupLogRotate(workingDIr) * possibly depending on 777 review
> saveExtensionTgzSomewhere(extensionTgz)
>
>
> Then, the configuration ui would be extended to front the new api.
>
> * There is still a question of how we register a parser with the Ambari
> configuration, such that when Ambari starts the parsers it starts this one
> too - unless that already happens and I don’t see it
>
>
> I would like some feedback on this approach.
>
> * Is REST the right way?  Should we do an Ambari view instead?
> * Is this too much to do in a REST call?  Will it time out, etc.?
> ???
>
>
> Any ideas would be appreciated.
>
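The orchestration outlined in the pseudocode above can be sketched as plain Java. The method names (`unpack`, `deployBundleToHdfs`, and so on) are hypothetical stand-ins for the existing REST services; the real flow would run asynchronously and report status rather than collecting steps in a list.

```java
import java.util.ArrayList;
import java.util.List;

// Sketch of the installExtension flow: unpack, deploy to HDFS, then the
// parser-specific steps, then archive the original tarball.
public class InstallExtensionSketch {
    final List<String> steps = new ArrayList<>();

    void installExtension(String extensionTgz, String extensionType) {
        String workingDir = unpack(extensionTgz);
        steps.add("deployBundleToHdfs:" + workingDir);
        if ("parser".equals(extensionType)) {
            steps.add("pushConfigurationToZookeeper");
            steps.add("pushEsTemplate");   // possibly, depending on 777 review
            steps.add("setupLogRotate");   // possibly, depending on 777 review
        }
        steps.add("archiveOriginal:" + extensionTgz);
    }

    String unpack(String tgz) {
        steps.add("unpack:" + tgz);
        return "/tmp/extension-work";      // placeholder working directory
    }

    public static void main(String[] args) {
        InstallExtensionSketch s = new InstallExtensionSketch();
        s.installExtension("my-parser.tar.gz", "parser");
        System.out.println(s.steps);
    }
}
```

If a synchronous REST call risks timing out, the same flow fits the async service pattern Ryan mentions: kick off the install, return an id, and poll for status.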

