[
https://issues.apache.org/jira/browse/MAHOUT-172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Isabel Drost updated MAHOUT-172:
--------------------------------
Attachment: lda.patch
The patch extends the url pattern to not match everything in the output
directory but only stuff that starts with part* - since the lda job seems to
run fine for me.
> When running on a Hadoop cluster LDA fails with Caused by:
> java.io.IOException: Cannot open filename /user/*/output/state-*/_logs
> ---------------------------------------------------------------------------------------------------------------------------------
>
> Key: MAHOUT-172
> URL: https://issues.apache.org/jira/browse/MAHOUT-172
> Project: Mahout
> Issue Type: Bug
> Components: Clustering
> Affects Versions: 0.1
> Reporter: Isabel Drost
> Fix For: 0.2
>
> Attachments: lda.patch
>
>
> I tried running the reuters example of lda on a hadoop cluster today. Seems
> like the implementation tries to read all files in output/state-* which fails
> if in that directory "_logs" is found.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.