Baunsgaard commented on PR #2171:
URL: https://github.com/apache/systemds/pull/2171#issuecomment-2593418762

   This commit merge the BWARE optimizations to transform encode. Attached are 
the full logs for various transformations.
   Most transformations improve in performance. 
   
   Notable results include : 
   
   passthough compressed 3x faster.
   ```txt
   before
   Transform Encode Perf: rows: 10000000 schema:[UINT4, UINT4, UINT4, UINT4, 
UINT4, UINT4, UINT4, UINT4, UINT4, UINT4]
   {}
                                Normal,  207.217+- 11.819 ms,           
                            Compressed,  336.523+- 26.097 ms,   
   
   after:
   Transform Encode Perf: rows: 10000000 schema:[UINT4, UINT4, UINT4, UINT4, 
UINT4, UINT4, UINT4, UINT4, UINT4, UINT4]
   {}
                                Normal,  212.838+-  4.184 ms,           
                            Compressed,  106.941+- 43.943 ms,  
   ```
   
   
   hash to dummy encode 3 x faster uncompressed.
   ```txt
   Before:
   
   After:
   Transform Encode Perf: rows: 10000000 schema:[INT32, INT32, INT32, INT32, 
INT32, INT32, INT32, INT32, INT32, INT32]
   {ids:true, hash:[1,2,3,4,5,6,7,8,9,10], K:10, 
dummycode:[1,2,3,4,5,6,7,8,9,10]}
                                Normal, 2435.954+-267.029 ms,   
                            Compressed,  472.872+- 35.932 ms,    
   ```
   
   Full logs:
   
[BeforeSU1.md](https://github.com/user-attachments/files/18427619/BeforeSU1.md)
   [afterSU1.md](https://github.com/user-attachments/files/18427621/afterSU1.md)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@systemds.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to