Github user nickwallen commented on the issue:

    https://github.com/apache/metron/pull/622
  
    There was a lot of good discussion on this, but I find it hard to summarize 
completely the positions of everyone including @mattf-horton and @cestella.  
Here is my attempt in trying to do that.  Please correct anything that I have 
misstated.
    
    1. Everyone agreed that a ToC (table of contents) is a useful additional 
feature for the Profiler.  The decodable row key would be needed in addition 
to, not instead of, a ToC.
    
    1. In implementing a decodable row key, we do need to plan for future 
changes in row key format.  This was handled in this PR, but can be improved.
    
    1. The decodable row key feature should be completed **before** a ToC so 
that the row keys can be used to generate (or regenerate) a ToC on-demand.
    
    1. There were various suggestions made on how to shorten up the row key 
format.  Some of those I completed on this PR (like using a murmur hash) and 
others (like using shorts instead of ints) I would need to incorporate in a 
future PR for a decodable row key.
    
    1. There is a need for a migration tool.  A tool that can read the existing 
row key format and rewrite the same data using a new format.  This tool is 
necessary even if it cannot be implemented deterministically with the current 
row key format.  The tool may not hints from the user like the names of known 
profiles.
    
    
    Once I compile a summary of these changes, I will close this PR.  All 
enhancements around this will be implemented on new PRs.


---

Reply via email to