[ 
https://issues.apache.org/jira/browse/METRON-735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15878691#comment-15878691
 ] 

Otto Fowler commented on METRON-735:
------------------------------------

I think having a configurable strategy of changing the data to suite the 
indexer should be a base feature for all indexer's, and just turned on by 
default ( or forced on ) for Solr if it will not work without it.

I think this ties into our conversations on the list around 'deriving' indexing 
templates from parsers+enrichments automatically as well.  In other words, 
would that system exist, then it may have to tie together with this

> Support Complex Data Types in Telemetry Messages
> ------------------------------------------------
>
>                 Key: METRON-735
>                 URL: https://issues.apache.org/jira/browse/METRON-735
>             Project: Metron
>          Issue Type: Improvement
>            Reporter: Nick Allen
>
> Up until now, we have been working under the assumption that all telemetry 
> messages in Metron must not contain complex data types like lists or maps.  
> Complex data types were 'flattened' to remove any complex data types from the 
> telemetry messages. 
> Most parsers today produce flat telemetry messages.  That is messages with no 
> complex data types.  The problem is that even if I use a parser that 
> generates only flattened data, I could create an enrichment (like using 
> GEO_GET) that appends non-flat data to a message. Thus we have non-flat data 
> coursing through the veins of Metron. ;)
> I think the original idea of flattening data was because one of our Indexers 
> could not handle non-flat, complex data types. At the time, we just decided, 
> well don't create any non-flat data.
> But now, since we have a completely 'programmable' system, I don't think it 
> is safe to assume that the data will always be non-flat. A user could create 
> their own Stellar function to use during enrichment. Should we force on them 
> the burden of flattening the data?
> It makes way more sense in my mind, to make the indexer transform the data 
> however it needs to , to correctly index the data. If the current issue is 
> with the Solr Indexer, then we should fix that to flatten any data that it 
> needs to. There would be one touch point to address this issue rather than 
> many.
> This is a blanket JIRA that might result in multiple sub-tasks of changes 
> needed to allow telemetry messages to contain complex data types like lists 
> and maps.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to