See <https://builds.apache.org/job/Mahout-Examples-Classify-20News/64/changes>

Changes:

[ssc] RegressionResultAnalyzer must use US locale

------------------------------------------
[...truncated 6207 lines...]
888     441
889     440
890     440
891     440
892     440
893     440
894     440
895     440
896     439
897     439
898     439
899     438
900     437
901     437
902     437
903     434
904     434
905     434
906     434
907     434
908     433
909     433
910     433
911     432
912     431
913     431
914     430
915     430
916     430
917     428
918     428
919     427
920     426
921     426
922     425
923     425
924     425
925     424
926     424
927     424
928     423
929     423
930     423
931     422
932     422
933     422
934     422
935     421
936     420
937     420
938     419
939     419
940     419
941     419
942     419
943     419
944     418
945     417
946     416
947     415
948     415
949     415
950     414
951     413
952     412
953     411
954     410
955     410
956     409
957     408
958     408
959     408
960     407
961     407
962     407
963     407
964     407
965     406
966     406
967     405
968     405
969     404
970     403
971     402
972     402
973     402
974     401
975     400
976     400
977     399
978     398
979     398
980     397
981     396
982     396
983     396
984     396
985     395
986     394
987     394
988     394
989     393
990     393
991     393
992     392
993     392
994     392
995     391
996     391
997     391
998     391
999     390
1000    390
12/06/29 20:14:45 INFO driver.MahoutDriver: Program took 454126 ms (Minutes: 
7.568783333333333)
Testing on /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/ with 
model: /tmp/news-group.model
hadoop binary is not in PATH,HADOOP_HOME/bin,HADOOP_PREFIX/bin, running locally
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in 
[jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/mahout-examples-0.8-SNAPSHOT-job.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: Found binding in 
[jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/dependency/slf4j-jcl-1.6.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: Found binding in 
[jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/dependency/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
12/06/29 20:14:46 WARN driver.MahoutDriver: No 
org.apache.mahout.classifier.sgd.TestNewsGroups.props found on classpath, will 
use command-line arguments only
7532 test files
=======================================================
Summary
-------------------------------------------------------
Correctly Classified Instances          :       5520       73.2873%
Incorrectly Classified Instances        :       2012       26.7127%
Total Classified Instances              :       7532

=======================================================
Confusion Matrix
-------------------------------------------------------
a    b    c    d    e    f    g    h    i    j    k    l    m    n    o    p    q    r    s    t    <--Classified as
37   23   0    4    38   82   17   32   1    2    5    0    0    3    10   11   18   3    4    95   |  385  a = comp.sys.mac.hardware
2    290  0    10   28   21   7    4    2    5    2    1    0    2    4    1    5    1    3    6    |  394  b = comp.os.ms-windows.misc
0    1    285  1    2    0    3    1    3    7    6    16   3    16   1    2    3    6    2    6    |  364  c = talk.politics.guns
1    30   0    287  39   3    5    12   2    2    3    0    0    0    1    0    2    2    1    5    |  395  d = comp.windows.x
4    18   1    14   292  7    9    8    0    2    7    0    0    0    1    2    3    4    5    12   |  389  e = comp.graphics
1    52   0    4    18   229  2    14   0    1    1    1    0    1    4    2    1    1    0    60   |  392  f = comp.sys.ibm.pc.hardware
0    0    1    2    9    0    352  6    2    0    3    0    3    2    0    0    3    0    0    11   |  394  g = sci.space
0    2    1    1    2    12   5    344  0    0    2    1    0    0    3    6    3    0    1    7    |  390  h = misc.forsale
0    3    1    1    6    1    3    3    312  25   2    21   0    2    2    0    10   1    1    4    |  398  i = soc.religion.christian
0    1    1    0    4    0    11   2    24   220  3    29   4    1    3    0    9    2    2    3    |  319  j = alt.atheism
0    2    0    1    3    0    0    5    3    1    356  0    0    1    1    1    0    0    15   8    |  397  k = rec.sport.baseball
0    1    12   0    2    2    11   2    34   41   5    122  2    3    1    1    8    0    2    2    |  251  l = talk.religion.misc
0    0    3    1    1    0    2    0    9    29   4    1    300  11   2    3    2    2    3    3    |  376  m = talk.politics.mideast
0    1    95   0    4    0    7    1    5    8    0    8    3    160  2    1    9    4    2    0    |  310  n = talk.politics.misc
0    1    0    2    1    0    0    5    2    1    2    0    0    2    365  6    6    0    0    5    |  398  o = rec.motorcycles
0    2    1    0    5    2    7    11   3    2    8    0    0    1    18   293  5    2    3    33   |  396  p = rec.autos
0    2    4    2    15   1    10   10   13   7    6    2    1    1    3    7    283  0    2    27   |  396  q = sci.med
3    3    2    1    6    1    5    2    4    3    3    2    0    0    2    1    4    339  0    15   |  396  r = sci.crypt
1    0    0    0    0    1    1    1    4    2    17   1    0    0    1    0    2    0    365  3    |  399  s = rec.sport.hockey
0    3    1    2    20   14   16   9    2    2    3    0    1    1    4    6    6    13   1    289  |  393  t = sci.electronics
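
The summary figures can be cross-checked against the confusion matrix. The sketch below is not part of the build output; it copies the diagonal counts and row totals from the matrix above and assumes that each row holds the documents of one true class, so that diagonal / row total is that class's recall. The variable names are illustrative only.

```python
# Diagonal (correctly classified) counts and row totals, copied from the
# confusion matrix in the log, in row order a..t.
labels = [
    "comp.sys.mac.hardware", "comp.os.ms-windows.misc", "talk.politics.guns",
    "comp.windows.x", "comp.graphics", "comp.sys.ibm.pc.hardware", "sci.space",
    "misc.forsale", "soc.religion.christian", "alt.atheism",
    "rec.sport.baseball", "talk.religion.misc", "talk.politics.mideast",
    "talk.politics.misc", "rec.motorcycles", "rec.autos", "sci.med",
    "sci.crypt", "rec.sport.hockey", "sci.electronics",
]
correct = [37, 290, 285, 287, 292, 229, 352, 344, 312, 220,
           356, 122, 300, 160, 365, 293, 283, 339, 365, 289]
totals = [385, 394, 364, 395, 389, 392, 394, 390, 398, 319,
          397, 251, 376, 310, 398, 396, 396, 396, 399, 393]

# Overall accuracy: sum of the diagonal over the number of test documents.
accuracy = sum(correct) / sum(totals)

# Per-class recall, assuming rows correspond to true labels.
recall = {name: c / t for name, c, t in zip(labels, correct, totals)}
worst = min(recall, key=recall.get)

print(f"correct={sum(correct)} total={sum(totals)} accuracy={100 * accuracy:.4f}%")
print(f"lowest recall: {worst} ({100 * recall[worst]:.1f}%)")
```

The diagonal sums to the 5520 correctly classified instances reported in the summary, and the row totals sum to the 7532 test files.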



Avg. Log-likelihood: -1.1212616906335722 25%-ile: -1.6696006521059248 75%-ile: 
-0.5413681618725742
12/06/29 20:15:05 INFO driver.MahoutDriver: Program took 18648 ms (Minutes: 
0.3108)
+ echo 2
+ ./examples/bin/classify-20newsgroups.sh
Please select a number to choose the corresponding task to run
1. cnaivebayes
2. naivebayes
3. sgd
4. clean -- cleans up the work area in /tmp/mahout-work-jenkins
ok. You chose 2 and we'll use naivebayes
creating work directory at /tmp/mahout-work-jenkins
+ echo 'Preparing 20newsgroups data'
Preparing 20newsgroups data
+ rm -rf /tmp/mahout-work-jenkins/20news-all
+ mkdir /tmp/mahout-work-jenkins/20news-all
+ cp -R /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/alt.atheism 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/comp.graphics 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/comp.os.ms-windows.misc
 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/comp.sys.ibm.pc.hardware
 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/comp.sys.mac.hardware 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/comp.windows.x 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/misc.forsale 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/rec.autos 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/rec.motorcycles 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/rec.sport.baseball 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/rec.sport.hockey 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/sci.crypt 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/sci.electronics 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/sci.med 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/sci.space 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/soc.religion.christian
 /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/talk.politics.guns 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/talk.politics.mideast 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/talk.politics.misc 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/talk.religion.misc 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/alt.atheism 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/comp.graphics 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/comp.os.ms-windows.misc
 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/comp.sys.ibm.pc.hardware
 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/comp.sys.mac.hardware
 /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/comp.windows.x 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/misc.forsale 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/rec.autos 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/rec.motorcycles 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/rec.sport.baseball 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/rec.sport.hockey 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/sci.crypt 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/sci.electronics 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/sci.med 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/sci.space 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/soc.religion.christian
 /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/talk.politics.guns 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/talk.politics.mideast
 /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/talk.politics.misc 
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/talk.religion.misc 
/tmp/mahout-work-jenkins/20news-all
+ echo 'Creating sequence files from 20newsgroups data'
Creating sequence files from 20newsgroups data
+ ./bin/mahout seqdirectory -i /tmp/mahout-work-jenkins/20news-all -o 
/tmp/mahout-work-jenkins/20news-seq
hadoop binary is not in PATH,HADOOP_HOME/bin,HADOOP_PREFIX/bin, running locally
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in 
[jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/mahout-examples-0.8-SNAPSHOT-job.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: Found binding in 
[jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/dependency/slf4j-jcl-1.6.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: Found binding in 
[jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/dependency/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
12/06/29 20:15:07 INFO common.AbstractJob: Command line arguments: 
{--charset=[UTF-8], --chunkSize=[64], --endPhase=[2147483647], 
--fileFilterClass=[org.apache.mahout.text.PrefixAdditionFilter], 
--input=[/tmp/mahout-work-jenkins/20news-all], --keyPrefix=[], 
--output=[/tmp/mahout-work-jenkins/20news-seq], --startPhase=[0], 
--tempDir=[temp]}
12/06/29 20:15:14 INFO driver.MahoutDriver: Program took 6972 ms (Minutes: 
0.1162)
+ echo 'Converting sequence files to vectors'
Converting sequence files to vectors
+ ./bin/mahout seq2sparse -i /tmp/mahout-work-jenkins/20news-seq -o 
/tmp/mahout-work-jenkins/20news-vectors -lnorm -nv -wt tfidf
hadoop binary is not in PATH,HADOOP_HOME/bin,HADOOP_PREFIX/bin, running locally
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in 
[jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/mahout-examples-0.8-SNAPSHOT-job.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: Found binding in 
[jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/dependency/slf4j-jcl-1.6.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: Found binding in 
[jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/dependency/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
12/06/29 20:15:15 INFO vectorizer.SparseVectorsFromSequenceFiles: Maximum 
n-gram size is: 1
12/06/29 20:15:15 INFO vectorizer.SparseVectorsFromSequenceFiles: Minimum LLR 
value: 1.0
12/06/29 20:15:15 INFO vectorizer.SparseVectorsFromSequenceFiles: Number of 
reduce tasks: 1
12/06/29 20:15:15 INFO input.FileInputFormat: Total input paths to process : 1
12/06/29 20:15:16 INFO mapred.JobClient: Running job: job_local_0001
12/06/29 20:15:17 INFO mapred.JobClient:  map 0% reduce 0%
12/06/29 20:15:21 INFO mapred.Task: Task:attempt_local_0001_m_000000_0 is done. 
And is in the process of commiting
12/06/29 20:15:21 INFO mapred.LocalJobRunner: 
12/06/29 20:15:21 INFO mapred.Task: Task attempt_local_0001_m_000000_0 is 
allowed to commit now
12/06/29 20:15:21 INFO output.FileOutputCommitter: Saved output of task 
'attempt_local_0001_m_000000_0' to 
/tmp/mahout-work-jenkins/20news-vectors/tokenized-documents
12/06/29 20:15:22 INFO mapred.LocalJobRunner: 
12/06/29 20:15:22 INFO mapred.LocalJobRunner: 
12/06/29 20:15:22 INFO mapred.Task: Task 'attempt_local_0001_m_000000_0' done.
12/06/29 20:15:23 INFO mapred.JobClient:  map 100% reduce 0%
12/06/29 20:15:23 INFO mapred.JobClient: Job complete: job_local_0001
12/06/29 20:15:23 INFO mapred.JobClient: Counters: 8
12/06/29 20:15:23 INFO mapred.JobClient:   File Output Format Counters 
12/06/29 20:15:23 INFO mapred.JobClient:     Bytes Written=27717956
12/06/29 20:15:23 INFO mapred.JobClient:   File Input Format Counters 
12/06/29 20:15:23 INFO mapred.JobClient:     Bytes Read=36979301
12/06/29 20:15:23 INFO mapred.JobClient:   FileSystemCounters
12/06/29 20:15:23 INFO mapred.JobClient:     FILE_BYTES_READ=67766861
12/06/29 20:15:23 INFO mapred.JobClient:     FILE_BYTES_WRITTEN=58778031
12/06/29 20:15:23 INFO mapred.JobClient:   Map-Reduce Framework
12/06/29 20:15:23 INFO mapred.JobClient:     Map input records=18846
12/06/29 20:15:23 INFO mapred.JobClient:     Spilled Records=0
12/06/29 20:15:23 INFO mapred.JobClient:     SPLIT_RAW_BYTES=113
12/06/29 20:15:23 INFO mapred.JobClient:     Map output records=18846
12/06/29 20:15:23 INFO input.FileInputFormat: Total input paths to process : 1
12/06/29 20:15:23 INFO mapred.JobClient: Running job: job_local_0002
12/06/29 20:15:24 WARN mapred.LocalJobRunner: job_local_0002
java.lang.ClassCastException: org.apache.hadoop.mapreduce.lib.input.FileSplit 
cannot be cast to org.apache.hadoop.mapred.InputSplit
        at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:412)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:372)
        at 
org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)
12/06/29 20:15:24 INFO mapred.JobClient:  map 0% reduce 0%
12/06/29 20:15:24 INFO mapred.JobClient: Job complete: job_local_0002
12/06/29 20:15:24 INFO mapred.JobClient: Counters: 0
Exception in thread "main" java.lang.IllegalStateException: Job failed!
        at 
org.apache.mahout.vectorizer.DictionaryVectorizer.startWordCounting(DictionaryVectorizer.java:360)
        at 
org.apache.mahout.vectorizer.DictionaryVectorizer.createTermFrequencyVectors(DictionaryVectorizer.java:171)
        at 
org.apache.mahout.vectorizer.SparseVectorsFromSequenceFiles.run(SparseVectorsFromSequenceFiles.java:272)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)
        at 
org.apache.mahout.vectorizer.SparseVectorsFromSequenceFiles.main(SparseVectorsFromSequenceFiles.java:55)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
        at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:597)
        at 
org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
        at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
        at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:195)
Build step 'Execute shell' marked build as failure
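
In the trace above, the ClassCastException reported for job_local_0002 comes first; the IllegalStateException ("Job failed!") in the main thread follows from it. The sketch below is not part of the build output. It shows one way to list the exception lines of a saved console log in order, so that the earliest one stands out. The function name `find_exceptions` and the regular expression are assumptions made for illustration, and the sample text is a shortened copy of lines from this log.

```python
import re

# Matches a fully qualified Java exception or error class at the start of a
# line, optionally preceded by 'Exception in thread "<name>" '.
EXC = re.compile(
    r'^(?:Exception in thread "[^"]*" )?'
    r'((?:[\w$]+\.)+[\w$]*(?:Exception|Error))\b'
)

def find_exceptions(log_text):
    """Return (line_number, class_name) for each line that starts an exception."""
    hits = []
    for number, line in enumerate(log_text.splitlines(), start=1):
        match = EXC.match(line.strip())
        if match:
            hits.append((number, match.group(1)))
    return hits

# Shortened excerpt of the console output above.
sample = """\
12/06/29 20:15:24 WARN mapred.LocalJobRunner: job_local_0002
java.lang.ClassCastException: org.apache.hadoop.mapreduce.lib.input.FileSplit
cannot be cast to org.apache.hadoop.mapred.InputSplit
        at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:412)
Exception in thread "main" java.lang.IllegalStateException: Job failed!
"""

for number, name in find_exceptions(sample):
    print(number, name)
```

Stack frames (`at ...`) and timestamped log lines are skipped because they do not begin with a dotted class name ending in `Exception` or `Error`.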
