[
https://issues.apache.org/jira/browse/MAHOUT-1674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Pat Ferrel resolved MAHOUT-1674.
--------------------------------
Resolution: Fixed
Assignee: Pat Ferrel (was: Dmitriy Lyubimov)
Made change to blas that catch this case, passes one user's test that I was
able to reporduce.
> A'A fails getting with an index out of range for a row vector
> -------------------------------------------------------------
>
> Key: MAHOUT-1674
> URL: https://issues.apache.org/jira/browse/MAHOUT-1674
> Project: Mahout
> Issue Type: Bug
> Components: s
> Affects Versions: 0.10.0
> Reporter: Pat Ferrel
> Assignee: Pat Ferrel
> Priority: Critical
> Fix For: 0.10.0
>
>
> A'A and possibly A'B can fail with an index out of bounds on the row vector.
> This seems related to partitioning where some partitions may be empty.
> This can be reproduce with the attached data as input into
> spark-itemsimilarity. This is only A data and the one large csv will complete
> correctly but passing in the directory of part files will exhibit the error.
> The data is identical except in the number of files that are used to contain
> the data.
> The error occurs using the local raw filesystem and with master = local and
> is pretty fast to reach.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)