DisplaySpectralKMeans Example Surfaces FileNotFoundException in
DistributedRowMatrix.times() Usage/Implementation
-----------------------------------------------------------------------------------------------------------------
Key: MAHOUT-712
URL: https://issues.apache.org/jira/browse/MAHOUT-712
Project: Mahout
Issue Type: Bug
Components: Clustering
Affects Versions: 0.5
Reporter: Jeff Eastman
Fix For: 0.6
Not clear if this is due to SpectralKMeansDriver usage or an implementation
issue deeper inside. The error is repeatable:
Exception in thread "main" java.lang.IllegalStateException:
java.io.FileNotFoundException: File
file:/home/dev/workspace/mahout/output/calculations/laplacian-3/tmp/data does
not exist.
at
org.apache.mahout.math.hadoop.DistributedRowMatrix.times(DistributedRowMatrix.java:217)
at
org.apache.mahout.math.decomposer.lanczos.LanczosSolver.solve(LanczosSolver.java:104)
at
org.apache.mahout.math.hadoop.decomposer.DistributedLanczosSolver.runJob(DistributedLanczosSolver.java:70)
at
org.apache.mahout.clustering.spectral.kmeans.SpectralKMeansDriver.run(SpectralKMeansDriver.java:155)
at
org.apache.mahout.clustering.display.DisplaySpectralKMeans.main(DisplaySpectralKMeans.java:71)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at
org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:187)
Caused by: java.io.FileNotFoundException: File
file:/home/dev/workspace/mahout/output/calculations/laplacian-3/tmp/data does
not exist.
at
org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:361)
at
org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:245)
at
org.apache.hadoop.mapred.SequenceFileInputFormat.listStatus(SequenceFileInputFormat.java:51)
at
org.apache.hadoop.mapred.FileInputFormat.getSplits(FileInputFormat.java:201)
at org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:810)
at
org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:781)
at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:730)
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1249)
at
org.apache.mahout.math.hadoop.DistributedRowMatrix.times(DistributedRowMatrix.java:209)
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira