janniklinde opened a new pull request, #2352: URL: https://github.com/apache/systemds/pull/2352
This patch adds an out-of-core CSV reblock instruction. It supports reading single or multiple row partitioned CSV files into dense matrix blocks. Reads are currently performed by a single thread and thus performance is comparable to (slightly slower than) non-parallel dense CSV reads if all blocks per row can be held in cache. The number of maximum `blen` x `blen` matrix blocks that are constructed in memory simultaneously can be specified by `MAX_BLOCKS_IN_CACHE`. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
