Hi, I have a 100 x 1,000,000 matrix of double values, and I want to run a distributed computation over a sliding 100 x 50 window, with one window starting at each column. That is, each task must have access to columns j through j+49.
The Spark examples I have found only show each task accessing a single row. Is it possible for a task to access a small block of the matrix instead?
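To make the access pattern concrete, here is a minimal plain-Python sketch (no Spark involved) of the windows I would like each task to receive. The function name `column_windows` and the tiny test matrix are made up for illustration, and I am assuming only full-width windows are needed, so the last few columns do not start a window of their own.

```python
def column_windows(matrix, width):
    """Yield (j, window) for every start column j that has a full window.

    matrix is a list of rows (lists of floats); window keeps every row
    but only columns j .. j + width - 1.
    """
    n_cols = len(matrix[0])
    for j in range(n_cols - width + 1):
        yield j, [row[j:j + width] for row in matrix]


# Tiny stand-in for the real 100 x 1,000,000 matrix: 2 rows, 5 columns.
small = [
    [0.0, 1.0, 2.0, 3.0, 4.0],
    [10.0, 11.0, 12.0, 13.0, 14.0],
]

# With width 3 there are 5 - 3 + 1 = 3 windows, starting at columns 0, 1, 2.
result = list(column_windows(small, 3))
```

In the real problem `width` would be 50 and each `(j, window)` pair would be the input to one task.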
