fzoepffel opened a new pull request, #2063:
URL: https://github.com/apache/systemds/pull/2063

   This PR further develops incremental SliceLine. The following 
functionality was added:
   
   - a params list parameter ensures consistent parameters over all runs
   - further parameters were added that are necessary for pruning
   - a function was added to determine the unchanged slices on each level
   - several pruning strategies were added or adjusted, e.g. score pruning 
based on the lowest (updated) score of the previous top k was added (both basic 
slices and normal slices are pruned), and unchanged slices below minSup are 
removed right away
   - consistent one-hot-encoded matrices are ensured
   - encoding and decoding functions with domain size scaling were added. If 
the parameter encodeLat is set to true, the lattice is stored in an encoded 
form to reduce its size
   - prevStats now only includes the slice sizes, as the other columns are not 
needed for pruning
   - the code was cleaned up, comments were added, larger code snippets were 
replaced by functions
   - first experiments indicate a strong decrease in execution time, 
especially if the number of added tuples is significantly smaller than the 
total number of slices in an incremental run. The reason is that a larger 
number of added tuples tends to change a larger portion of slices, and only 
unchanged slices can be skipped, which is where the efficiency gain comes from.
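
   The two pruning rules named in the list above (immediate removal of 
unchanged slices below minSup, and score pruning against the lowest updated 
score of the previous top k) can be sketched roughly as follows. This is a 
minimal Python illustration, not the DML code of this PR: the Slice class, its 
fields (size, score_ub, unchanged) and the prune function are hypothetical 
names chosen for the example, and the use of an upper-bound score as the value 
compared against the threshold is an assumption.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Slice:
    """Hypothetical candidate slice; not a structure from the PR."""
    name: str
    size: int          # number of tuples covered by the slice
    score_ub: float    # assumed upper bound on the slice score
    unchanged: bool    # True if the added tuples did not touch this slice


def prune(candidates: List[Slice], min_sup: int,
          prev_topk_scores: List[float]) -> List[Slice]:
    """Apply the two pruning rules to a list of candidate slices."""
    # Threshold: lowest (updated) score among the previous top k.
    # With no previous top k, nothing is pruned by score.
    threshold = min(prev_topk_scores) if prev_topk_scores else float("-inf")

    kept = []
    for s in candidates:
        # Rule 1: an unchanged slice below the minimum support is dropped
        # right away, since its size cannot have grown.
        if s.unchanged and s.size < min_sup:
            continue
        # Rule 2: a slice whose (assumed) score upper bound is below the
        # threshold cannot enter the top k.
        if s.score_ub < threshold:
            continue
        kept.append(s)
    return kept


candidates = [
    Slice("A", size=50, score_ub=0.9, unchanged=True),
    Slice("B", size=5, score_ub=0.8, unchanged=True),    # dropped by rule 1
    Slice("C", size=40, score_ub=0.1, unchanged=False),  # dropped by rule 2
    Slice("D", size=8, score_ub=0.7, unchanged=False),   # changed: re-evaluated later
]
kept = prune(candidates, min_sup=10, prev_topk_scores=[0.6, 0.4, 0.3])
print([s.name for s in kept])
```

   In this sketch a changed slice such as D is kept even though it is small, 
because its statistics have to be recomputed before a support check is 
meaningful.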
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@systemds.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org
