Hi!

I’ve been experimenting with the SSVDSolver and unfortunately, during runtime, 
I encounter this error:

10648576 [Thread-13] WARN org.apache.hadoop.mapred.LocalJobRunner  - 
job_local1958711697_0001
java.lang.NoClassDefFoundError: org/apache/commons/httpclient/HttpMethod
        at 
org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:546)
Caused by: java.lang.ClassNotFoundException: 
org.apache.commons.httpclient.HttpMethod
        at java.net.URLClassLoader$1.run(URLClassLoader.java:372)
        at java.net.URLClassLoader$1.run(URLClassLoader.java:361)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:360)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
        at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
        ... 1 more

Exception in thread "Thread-13" java.lang.NoClassDefFoundError: 
org/apache/commons/httpclient/HttpMethod
        at 
org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:562)
Caused by: java.lang.ClassNotFoundException: 
org.apache.commons.httpclient.HttpMethod
        at java.net.URLClassLoader$1.run(URLClassLoader.java:372)
        at java.net.URLClassLoader$1.run(URLClassLoader.java:361)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:360)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
        at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
        ... 1 more 

The actual invocation is:

public static void runSSVDOnSparseVectors(String inputPath,
                        String outputPath, int rank, int oversampling, int 
blocks,
                        int reduceTasks, int powerIterations, boolean halfSigma)
                        throws IOException {
        Configuration conf = new Configuration();
        SSVDSolver solver = new SSVDSolver(conf, new Path[] { new Path(
                        inputPath) }, new Path(outputPath), blocks, rank, 
oversampling,
                        reduceTasks);
        solver.setQ(powerIterations);
        if (halfSigma) {
                solver.setcUHalfSigma(true);
                solver.setcVHalfSigma(true);
        }
        solver.run();
}

while being invoked with (input.getParent() + “/" + TERM_DOC_MATRIX_NAME, 
input.getParent() + “/" + SVD_FOLDER_NAME, k, 2 * k, Math.min(200000, (int) (3 
* k * 0.01 * Math.max(lsaTraining.getNoDocuments(),lsaTraining.getNoWords()))), 
5, 2, true);

I’m using Mahout 0.10 with httpclient-4.4.1.jar (I tried also 4.2.5 from the 
package archive) on a 48k words X 53k docs matrix.

Any ideas? It works fine with the similar variables if I run the job in command 
line.

Also, how should I tweak the input variables?


Thanks in advance!
Mihai

Reply via email to