phaniarnab commented on PR #1650:
URL: https://github.com/apache/systemds/pull/1650#issuecomment-1197059845

   Thanks for the PR @BACtaki. I had an initial look at the code changes today. 
Before I get into detailed comments, I'd like to clarify a few things.
   - Are the optimizations only improve the naive cases where the input 
dimension is less than 1024 for the given direction (row/col)? 
   - I see that you are now iterating the dense and sparse inputs in a more 
cache-conscious manner (reducing CPU cache misses). Are there any other 
optimizations you are employing (e.g. reducing the number of intermediates)?
   
   Sorry for the delay. I will add my comments and suggestions tomorrow. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@systemds.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to