EandrewJones opened a new issue, #30:
URL: https://github.com/apache/flagon-distill/issues/30

   > This indeed fixes the problem. However, it's unclear why we need to 
construct an id from the individual field values in the log.
   > 
   > Questions:
   > 
   >     * How is this function used elsewhere in the codebase?
   > 
   >     * Do we want/need to be able to parse the id to retrieve those values 
in some way?
   > 
   >     * Do we want our ids to have a partial-logical ordering?
   > 
   > 
   > If the only requirement is to generate a uuid, then why not just use 
`str(uuid.uuid4())` and call it a day? We'll never get a collision.
   > 
   > Tagging @Jyyjy or @amirmghaemi
   
   As the package is written, `getUUID` has to return the same value when the 
same log is passed in. `str(uuid.uuid4())` generates a new random UUID on every 
call, so the same log would get a different id each time. `hash()` might be a 
better option.
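
   A minimal sketch of what a deterministic id could look like, not how 
`getUUID` is actually implemented. The function names, the namespace value, and 
the sample log fields are made up for illustration. One caveat: Python's 
built-in `hash()` is salted per process for strings (unless `PYTHONHASHSEED` is 
fixed), so it is not stable across runs; `uuid.uuid5` or `hashlib` over a 
canonical serialization of the log avoids that.

```python
import hashlib
import json
import uuid

# Hypothetical namespace, chosen only for this example; any fixed UUID works.
LOG_NAMESPACE = uuid.UUID("12345678-1234-5678-1234-567812345678")


def canonical_form(log: dict) -> str:
    """Serialize a log so that key order does not affect the result."""
    return json.dumps(log, sort_keys=True, default=str)


def stable_log_id(log: dict) -> str:
    """Name-based UUID: the same log content always yields the same id."""
    return str(uuid.uuid5(LOG_NAMESPACE, canonical_form(log)))


def content_hash_id(log: dict) -> str:
    """Alternative: a SHA-256 hex digest of the canonical serialization."""
    return hashlib.sha256(canonical_form(log).encode("utf-8")).hexdigest()


# Made-up log for illustration; real logs carry many more fields.
example_log = {"sessionID": "s1", "clientTime": 1700000000000, "type": "click"}

same_id_twice = stable_log_id(example_log) == stable_log_id(dict(example_log))
random_ids_differ = str(uuid.uuid4()) != str(uuid.uuid4())
```

   Either function is repeatable across processes, which a random 
`uuid.uuid4()` is not. The trade-off is that two logs with identical content 
get the same id.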
   
   1. `getUUID` isn't really used within the distill package. Users are 
expected to use it to create a dictionary mapping UUIDs to logs. That 
dictionary is then what gets passed to the segmentation functions. This is one 
of the biggest pains of working with distill: you have to manage the UUIDs and 
the dictionary of logs yourself.
   2. No, all that info is in the log, which the UUID (assuming the user set 
things up correctly) maps to.
   3. Not sure exactly what you mean. One of the assumptions of the 
segmentation functions is that the user sorts the log dictionary by clienttime.
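
   To make the workflow in points 1 and 3 concrete, here is a rough sketch of 
the bookkeeping a user ends up doing by hand. `make_log_id` is a hypothetical 
stand-in for `getUUID` (its real signature is not shown here), and the 
`clientTime` key and sample logs are assumptions for illustration.

```python
import hashlib
import json


def make_log_id(log: dict) -> str:
    """Hypothetical stand-in for getUUID: a repeatable id from log content."""
    canonical = json.dumps(log, sort_keys=True, default=str)
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def build_sorted_log_dict(logs: list, time_key: str = "clientTime") -> dict:
    """Map id -> log, inserted in ascending order of the timestamp field.

    Python dicts preserve insertion order (3.7+), so iterating over the
    result visits the logs from earliest to latest.
    """
    ordered = sorted(logs, key=lambda log: log[time_key])
    return {make_log_id(log): log for log in ordered}


# Made-up logs, deliberately out of order.
raw_logs = [
    {"sessionID": "s1", "clientTime": 30, "type": "click"},
    {"sessionID": "s1", "clientTime": 10, "type": "load"},
    {"sessionID": "s1", "clientTime": 20, "type": "keydown"},
]

sorted_logs = build_sorted_log_dict(raw_logs)
times_in_order = [log["clientTime"] for log in sorted_logs.values()]
```

   Note that with a content-derived id, exact duplicate logs collapse into a 
single dictionary entry.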
   
   Also, the reason @mdiep-cese ran into this issue is that interval logs have 
some inconsistencies in UserALE, and we have historically filtered out all 
interval logs. I'm not sure about the details, but that's been Josh's guidance. 
[This](https://github.com/apache/flagon-useralejs/issues/13) may be the 
relevant ticket. My point is that nothing in this package is built to deal 
with interval logs.
   
   Distill is even less mature than UserALE. The upside is that we can change 
things a lot without really affecting anyone. I'm team fresh rewrite.
   
   Edit: I found an old PR that sparked a discussion about this last year:
   https://github.com/UMD-ARLIS/incubator-flagon-distill/pull/18
   
   _Originally posted by @Jyyjy in 
https://github.com/apache/flagon-distill/issues/29#issuecomment-1940018667_
               


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: notifications-unsubscr...@flagon.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


