See <https://builds.apache.org/job/Mahout-Examples-Classify-20News/64/changes>

Changes:

[ssc] RegressionResultAnalyzer must use US locale

------------------------------------------
[...truncated 6207 lines...]
888 441
889 440
890 440
891 440
892 440
893 440
894 440
895 440
896 439
897 439
898 439
899 438
900 437
901 437
902 437
903 434
904 434
905 434
906 434
907 434
908 433
909 433
910 433
911 432
912 431
913 431
914 430
915 430
916 430
917 428
918 428
919 427
920 426
921 426
922 425
923 425
924 425
925 424
926 424
927 424
928 423
929 423
930 423
931 422
932 422
933 422
934 422
935 421
936 420
937 420
938 419
939 419
940 419
941 419
942 419
943 419
944 418
945 417
946 416
947 415
948 415
949 415
950 414
951 413
952 412
953 411
954 410
955 410
956 409
957 408
958 408
959 408
960 407
961 407
962 407
963 407
964 407
965 406
966 406
967 405
968 405
969 404
970 403
971 402
972 402
973 402
974 401
975 400
976 400
977 399
978 398
979 398
980 397
981 396
982 396
983 396
984 396
985 395
986 394
987 394
988 394
989 393
990 393
991 393
992 392
993 392
994 392
995 391
996 391
997 391
998 391
999 390
1000 390
12/06/29 20:14:45 INFO driver.MahoutDriver: Program took 454126 ms (Minutes: 7.568783333333333)
Testing on /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/ with model: /tmp/news-group.model
hadoop binary is not in PATH,HADOOP_HOME/bin,HADOOP_PREFIX/bin, running locally
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/mahout-examples-0.8-SNAPSHOT-job.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: Found binding in [jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/dependency/slf4j-jcl-1.6.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: Found binding in [jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/dependency/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
12/06/29 20:14:46 WARN driver.MahoutDriver: No org.apache.mahout.classifier.sgd.TestNewsGroups.props found on classpath, will use command-line arguments only
7532 test files
=======================================================
Summary
-------------------------------------------------------
Correctly Classified Instances          :       5520    73.2873%
Incorrectly Classified Instances        :       2012    26.7127%
Total Classified Instances              :       7532

=======================================================
Confusion Matrix
-------------------------------------------------------
a    b    c    d    e    f    g    h    i    j    k    l    m    n    o    p    q    r    s    t    <--Classified as
37   23   0    4    38   82   17   32   1    2    5    0    0    3    10   11   18   3    4    95   |  385  a = comp.sys.mac.hardware
2    290  0    10   28   21   7    4    2    5    2    1    0    2    4    1    5    1    3    6    |  394  b = comp.os.ms-windows.misc
0    1    285  1    2    0    3    1    3    7    6    16   3    16   1    2    3    6    2    6    |  364  c = talk.politics.guns
1    30   0    287  39   3    5    12   2    2    3    0    0    0    1    0    2    2    1    5    |  395  d = comp.windows.x
4    18   1    14   292  7    9    8    0    2    7    0    0    0    1    2    3    4    5    12   |  389  e = comp.graphics
1    52   0    4    18   229  2    14   0    1    1    1    0    1    4    2    1    1    0    60   |  392  f = comp.sys.ibm.pc.hardware
0    0    1    2    9    0    352  6    2    0    3    0    3    2    0    0    3    0    0    11   |  394  g = sci.space
0    2    1    1    2    12   5    344  0    0    2    1    0    0    3    6    3    0    1    7    |  390  h = misc.forsale
0    3    1    1    6    1    3    3    312  25   2    21   0    2    2    0    10   1    1    4    |  398  i = soc.religion.christian
0    1    1    0    4    0    11   2    24   220  3    29   4    1    3    0    9    2    2    3    |  319  j = alt.atheism
0    2    0    1    3    0    0    5    3    1    356  0    0    1    1    1    0    0    15   8    |  397  k = rec.sport.baseball
0    1    12   0    2    2    11   2    34   41   5    122  2    3    1    1    8    0    2    2    |  251  l = talk.religion.misc
0    0    3    1    1    0    2    0    9    29   4    1    300  11   2    3    2    2    3    3    |  376  m = talk.politics.mideast
0    1    95   0    4    0    7    1    5    8    0    8    3    160  2    1    9    4    2    0    |  310  n = talk.politics.misc
0    1    0    2    1    0    0    5    2    1    2    0    0    2    365  6    6    0    0    5    |  398  o = rec.motorcycles
0    2    1    0    5    2    7    11   3    2    8    0    0    1    18   293  5    2    3    33   |  396  p = rec.autos
0    2    4    2    15   1    10   10   13   7    6    2    1    1    3    7    283  0    2    27   |  396  q = sci.med
3    3    2    1    6    1    5    2    4    3    3    2    0    0    2    1    4    339  0    15   |  396  r = sci.crypt
1    0    0    0    0    1    1    1    4    2    17   1    0    0    1    0    2    0    365  3    |  399  s = rec.sport.hockey
0    3    1    2    20   14   16   9    2    2    3    0    1    1    4    6    6    13   1    289  |  393  t = sci.electronics

Avg. Log-likelihood: -1.1212616906335722 25%-ile: -1.6696006521059248 75%-ile: -0.5413681618725742
12/06/29 20:15:05 INFO driver.MahoutDriver: Program took 18648 ms (Minutes: 0.3108)
+ echo 2
+ ./examples/bin/classify-20newsgroups.sh
Please select a number to choose the corresponding task to run
1. cnaivebayes
2. naivebayes
3. sgd
4. clean -- cleans up the work area in /tmp/mahout-work-jenkins
ok.
You chose 2 and we'll use naivebayes
creating work directory at /tmp/mahout-work-jenkins
+ echo 'Preparing 20newsgroups data'
Preparing 20newsgroups data
+ rm -rf /tmp/mahout-work-jenkins/20news-all
+ mkdir /tmp/mahout-work-jenkins/20news-all
+ cp -R /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/alt.atheism /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/comp.graphics /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/comp.os.ms-windows.misc /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/comp.sys.ibm.pc.hardware /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/comp.sys.mac.hardware /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/comp.windows.x /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/misc.forsale /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/rec.autos /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/rec.motorcycles /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/rec.sport.baseball /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/rec.sport.hockey /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/sci.crypt /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/sci.electronics /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/sci.med /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/sci.space /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/soc.religion.christian /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/talk.politics.guns /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/talk.politics.mideast /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/talk.politics.misc /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/talk.religion.misc /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/alt.atheism /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/comp.graphics /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/comp.os.ms-windows.misc /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/comp.sys.ibm.pc.hardware /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/comp.sys.mac.hardware /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/comp.windows.x /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/misc.forsale /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/rec.autos /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/rec.motorcycles /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/rec.sport.baseball /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/rec.sport.hockey /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/sci.crypt /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/sci.electronics /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/sci.med /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/sci.space /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/soc.religion.christian /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/talk.politics.guns /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/talk.politics.mideast /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/talk.politics.misc /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/talk.religion.misc /tmp/mahout-work-jenkins/20news-all
+ echo 'Creating sequence files from 20newsgroups data'
Creating sequence files from 20newsgroups data
+ ./bin/mahout seqdirectory -i /tmp/mahout-work-jenkins/20news-all -o /tmp/mahout-work-jenkins/20news-seq
hadoop binary is not in PATH,HADOOP_HOME/bin,HADOOP_PREFIX/bin, running locally
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/mahout-examples-0.8-SNAPSHOT-job.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: Found binding in [jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/dependency/slf4j-jcl-1.6.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: Found binding in [jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/dependency/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
12/06/29 20:15:07 INFO common.AbstractJob: Command line arguments: {--charset=[UTF-8], --chunkSize=[64], --endPhase=[2147483647], --fileFilterClass=[org.apache.mahout.text.PrefixAdditionFilter], --input=[/tmp/mahout-work-jenkins/20news-all], --keyPrefix=[], --output=[/tmp/mahout-work-jenkins/20news-seq], --startPhase=[0], --tempDir=[temp]}
12/06/29 20:15:14 INFO driver.MahoutDriver: Program took 6972 ms (Minutes: 0.1162)
+ echo 'Converting sequence files to vectors'
Converting sequence files to vectors
+ ./bin/mahout seq2sparse -i /tmp/mahout-work-jenkins/20news-seq -o /tmp/mahout-work-jenkins/20news-vectors -lnorm -nv -wt tfidf
hadoop binary is not in PATH,HADOOP_HOME/bin,HADOOP_PREFIX/bin, running locally
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/mahout-examples-0.8-SNAPSHOT-job.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: Found binding in [jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/dependency/slf4j-jcl-1.6.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: Found binding in [jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/dependency/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
12/06/29 20:15:15 INFO vectorizer.SparseVectorsFromSequenceFiles: Maximum n-gram size is: 1
12/06/29 20:15:15 INFO vectorizer.SparseVectorsFromSequenceFiles: Minimum LLR value: 1.0
12/06/29 20:15:15 INFO vectorizer.SparseVectorsFromSequenceFiles: Number of reduce tasks: 1
12/06/29 20:15:15 INFO input.FileInputFormat: Total input paths to process : 1
12/06/29 20:15:16 INFO mapred.JobClient: Running job: job_local_0001
12/06/29 20:15:17 INFO mapred.JobClient: map 0% reduce 0%
12/06/29 20:15:21 INFO mapred.Task: Task:attempt_local_0001_m_000000_0 is done. And is in the process of commiting
12/06/29 20:15:21 INFO mapred.LocalJobRunner:
12/06/29 20:15:21 INFO mapred.Task: Task attempt_local_0001_m_000000_0 is allowed to commit now
12/06/29 20:15:21 INFO output.FileOutputCommitter: Saved output of task 'attempt_local_0001_m_000000_0' to /tmp/mahout-work-jenkins/20news-vectors/tokenized-documents
12/06/29 20:15:22 INFO mapred.LocalJobRunner:
12/06/29 20:15:22 INFO mapred.LocalJobRunner:
12/06/29 20:15:22 INFO mapred.Task: Task 'attempt_local_0001_m_000000_0' done.
12/06/29 20:15:23 INFO mapred.JobClient: map 100% reduce 0%
12/06/29 20:15:23 INFO mapred.JobClient: Job complete: job_local_0001
12/06/29 20:15:23 INFO mapred.JobClient: Counters: 8
12/06/29 20:15:23 INFO mapred.JobClient: File Output Format Counters
12/06/29 20:15:23 INFO mapred.JobClient: Bytes Written=27717956
12/06/29 20:15:23 INFO mapred.JobClient: File Input Format Counters
12/06/29 20:15:23 INFO mapred.JobClient: Bytes Read=36979301
12/06/29 20:15:23 INFO mapred.JobClient: FileSystemCounters
12/06/29 20:15:23 INFO mapred.JobClient: FILE_BYTES_READ=67766861
12/06/29 20:15:23 INFO mapred.JobClient: FILE_BYTES_WRITTEN=58778031
12/06/29 20:15:23 INFO mapred.JobClient: Map-Reduce Framework
12/06/29 20:15:23 INFO mapred.JobClient: Map input records=18846
12/06/29 20:15:23 INFO mapred.JobClient: Spilled Records=0
12/06/29 20:15:23 INFO mapred.JobClient: SPLIT_RAW_BYTES=113
12/06/29 20:15:23 INFO mapred.JobClient: Map output records=18846
12/06/29 20:15:23 INFO input.FileInputFormat: Total input paths to process : 1
12/06/29 20:15:23 INFO mapred.JobClient: Running job: job_local_0002
12/06/29 20:15:24 WARN mapred.LocalJobRunner: job_local_0002
java.lang.ClassCastException: org.apache.hadoop.mapreduce.lib.input.FileSplit cannot be cast to org.apache.hadoop.mapred.InputSplit
    at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:412)
    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:372)
    at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)
12/06/29 20:15:24 INFO mapred.JobClient: map 0% reduce 0%
12/06/29 20:15:24 INFO mapred.JobClient: Job complete: job_local_0002
12/06/29 20:15:24 INFO mapred.JobClient: Counters: 0
Exception in thread "main" java.lang.IllegalStateException: Job failed!
    at org.apache.mahout.vectorizer.DictionaryVectorizer.startWordCounting(DictionaryVectorizer.java:360)
    at org.apache.mahout.vectorizer.DictionaryVectorizer.createTermFrequencyVectors(DictionaryVectorizer.java:171)
    at org.apache.mahout.vectorizer.SparseVectorsFromSequenceFiles.run(SparseVectorsFromSequenceFiles.java:272)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)
    at org.apache.mahout.vectorizer.SparseVectorsFromSequenceFiles.main(SparseVectorsFromSequenceFiles.java:55)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
    at java.lang.reflect.Method.invoke(Method.java:597)
    at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
    at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
    at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:195)
Build step 'Execute shell' marked build as failure
