Hi, let me ask a suggestion on how to proceed: I already have flow data stored HDFS in Parquet files from an existing netflow receiver system, but with different columns/schema than Spot. I'd like to patch spot-ml and spot-oa to have them run directly on that data without having to store everything twice.
I'm still figuring out the parsing code, any hints on how I should do this? Or, even better, how to do it in a sane/modular way that can be useful for everyone? Thanks a lot! Giacomo
