METRON-831, PR @ https://github.com/apache/incubator-metron/pull/517
Just so we're clear, let's assume the following:

- Enrichment table called 'enrichments'
- Enrichment CF called 't'
- A message field called user_ids that is a list of user IDs
- The enrichment type for this HBase enrichment is 'et'
- The indicator in the HBase enrichment is a user ID
- You want a certain field out of the HBase enrichment data per user ID. Let's call that field 'login_time'

In order to do that with METRON-831, you'd do the following to get the login time fields for the list of users. For the sake of simplicity, I'll break it into temporary variables:

- enriched_users := MAP( user_ids, &( user_id : ENRICHMENT_GET('et', user_id, 'enrichments', 't') ) )
- login_times := MAP(enriched_users, &( enrichment : MAP_GET(enrichment, 'login_time') ) )
  - MAP_GET here retrieves the value associated with the key 'login_time', which is the name of the field we want.

Since you probably don't want intermediate values out there, you might want to smash that into one big statement (we need a way to remove temporary variables in Stellar enrichments, btw):

- MAP(MAP( user_ids, &( user_id : ENRICHMENT_GET('et', user_id, 'enrichments', 't') ) ), &( enrichment : MAP_GET(enrichment, 'login_time') ) )

On a side note, it might be nice to have an optional arg to ENRICHMENT_GET that lets you specify just the fields to return. That would simplify the call to:

- MAP( user_ids, &( user_id : ENRICHMENT_GET('et', user_id, 'enrichments', 't', ['login_time']) ) )

On Thu, Apr 6, 2017 at 8:10 PM, Casey Stella <ceste...@gmail.com> wrote:

> There'll be a JIRA and a PR tonight ;) It sprung from the keyboard. I've
> been waiting for a good reason for some time. heh
>
> On Thu, Apr 6, 2017 at 8:08 PM, Otto Fowler <ottobackwa...@gmail.com>
> wrote:
>
>> Is there a Jira for the MAP Casey?
>>
>>
>> On April 6, 2017 at 14:07:15, Casey Stella (ceste...@gmail.com) wrote:
>>
>> Ok, so yeah, you've hit upon a limitation currently.
>> Right now, via Stellar you can use ENRICHMENT_GET which takes the
>> following parameters:
>>
>> - enrichment_type - The enrichment type
>> - indicator - The string indicator to look up
>> - hbase_table - The HBase Table to use
>> - column_family - The Column Family to use
>>
>> Right now we only accept a string for the indicator (which likely would
>> be your user_id). You'd probably like to call ENRICHMENT_GET for each id
>> in the user_id variable. We can't quite do that yet. There has been some
>> talk about a MAP function created where you can apply a stellar function
>> across a list of values. i.e. MAP( user_id, @ENRICHMENT_GET('et', $,
>> 'enrichments', 't')) which would return a list containing the output of
>> ENRICHMENT_GET for each call.
>>
>> There is another, more immediate change that could be made for this
>> specific case. We could enable ENRICHMENT_GET to take a list of indicators
>> as the second argument.
>>
>> Sorry, that doesn't exactly solve your problem in the immediate-case, but
>> it provides some context for future fixes. ;) I don't suppose you know the
>> length of the list beforehand, right? Even the maximum size?
>>
>> Casey
>>
>>
>> On Sun, Apr 2, 2017 at 10:26 AM, Ali Nazemian <alinazem...@gmail.com>
>> wrote:
>>
>>> Hi all,
>>>
>>>
>>> I was wondering how I can achieve the following use case in the current
>>> version of Metron?
>>>
>>>
>>>
>>> I want to have attributes in the Metron JSON object that are an array.
>>> For example, if a threat is impacting multiple users, they are all
>>> contained in an attribute (e.g. user_id:[id1, id2, id3]). Now if I want
>>> to enrich the event with data that requires the user_id as a key in
>>> enrichment stored in HBASE, how would I do this?
>>>
>>>
>>> Cheers,
>>> Ali
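
For anyone who wants to see what the nested MAP at the top of this thread is expected to produce, here is a rough Python model of it. This is only a sketch and not Metron code: the ENRICHMENTS dict is a hypothetical stand-in for the 'enrichments' HBase table, the sample rows are made up, and enrichment_get and login_times are illustrative names. It also assumes a lookup miss comes back as an empty map.

```python
# Toy model of the Stellar expression; none of this is Metron code.

# Hypothetical stand-in for the HBase table 'enrichments', column family 't'.
# Keys are (enrichment_type, indicator); values are the enrichment's fields.
# The rows below are invented sample data.
ENRICHMENTS = {
    ("et", "id1"): {"login_time": "2017-04-06T20:08:00Z", "dept": "ops"},
    ("et", "id2"): {"login_time": "2017-04-06T20:10:00Z", "dept": "dev"},
}


def enrichment_get(enrichment_type, indicator, table, column_family):
    """Model of ENRICHMENT_GET: one string indicator in, one map out.

    Assumption: a miss yields an empty map. table and column_family are
    accepted only to mirror the Stellar call; this sketch has a single table.
    """
    return ENRICHMENTS.get((enrichment_type, indicator), {})


def login_times(user_ids):
    # Inner MAP: one enrichment map per user ID.
    enriched_users = [
        enrichment_get("et", user_id, "enrichments", "t") for user_id in user_ids
    ]
    # Outer MAP with MAP_GET: pull one field out of each enrichment map.
    return [enrichment.get("login_time") for enrichment in enriched_users]
```

In this model, login_times(["id1", "id3", "id2"]) gives ["2017-04-06T20:08:00Z", None, "2017-04-06T20:10:00Z"]: the result lines up position by position with user_ids, and a user with no enrichment shows up as a null entry instead of being dropped.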