[
https://issues.apache.org/jira/browse/MAHOUT-1884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15546112#comment-15546112
]
Sebastian Schelter commented on MAHOUT-1884:
--------------------------------------------
I know that this is already supported internally, I want to expose it as
optional parameters to drmDfsRead. I disagree that caching an input matrix to
read is always intended by the users, at least I want to be able to retain
control over what is cached and what not.
> Allow specification of dimensions of a DRM
> ------------------------------------------
>
> Key: MAHOUT-1884
> URL: https://issues.apache.org/jira/browse/MAHOUT-1884
> Project: Mahout
> Issue Type: Improvement
> Affects Versions: 0.12.2
> Reporter: Sebastian Schelter
> Assignee: Sebastian Schelter
> Priority: Minor
>
> Currently, in many cases, a DRM must be read to compute its dimensions when a
> user calls nrow or ncol. This also implicitly caches the corresponding DRM.
> In some cases, the user actually knows the matrix dimensions (e.g., when the
> matrices are synthetically generated, or when some metadata about them is
> known). In such cases, the user should be able to specify the dimensions upon
> creating the DRM and the caching should be avoided.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)