[
https://issues.apache.org/jira/browse/MAHOUT-1876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15411149#comment-15411149
]
ASF GitHub Bot commented on MAHOUT-1876:
----------------------------------------
GitHub user bond- opened a pull request:
https://github.com/apache/mahout/pull/248
MAHOUT-1876: Upgrade lucene to 5.5.2 and fix compilation failures
Consulted the Lucene migration guides and earlier deprecation warnings to find
alternatives to the removed features. This PR is compatible with Java 7 and above.
All tests successful:
https://gist.github.com/bond-/6f7872cd9557fce5f09cdc3d9915b996
The following examples were also run successfully on a CDH 5.5 cluster
with Java 7:
- classify-wikipedia.sh
  - Option 2
- cluster-reuters.sh
  - Options 1 and 2
- classify-20newsgroups.sh
  - Option 1
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/bond-/mahout mahout-1876-java7
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/mahout/pull/248.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #248
----
commit ebfba1c0d9f47df1d93c3494e8c932f4ef9b59dd
Author: Raviteja Lokineni
Date: 2016-08-07T17:51:50Z
MAHOUT-1876: Upgrade lucene to 5.5.2 and fix compilation failures
Looked at the lucene migrate guides and past deprecation warnings to find
alternatives to removed features
commit 984f7c4101e00b3ca911a663c15492117564c906
Author: Raviteja Lokineni
Date: 2016-08-07T18:21:44Z
Merge remote-tracking branch 'upstream/master' into mahout-1876-java7
----
> Mahout fails to read from lucene index of solr-6.1.0
> ----------------------------------------------------
>
> Key: MAHOUT-1876
> URL: https://issues.apache.org/jira/browse/MAHOUT-1876
> Project: Mahout
> Issue Type: Bug
> Affects Versions: 0.12.2
> Environment: Solr: 6.1.0
> JDK: 1.8.0_92
> Mahout: 0.12.2
> OS: Linux
> Reporter: Raviteja Lokineni
>
> Command: {noformat}bin/mahout lucene.vector --dir ~/softwares/solr-6.1.0/server/solr/nlp-core/data/index --output /tmp/solr-nlp-core/out.vec --field rspns_val --dictOut /tmp/solr-nlp-core/dictionary.txt --norm 2{noformat}
> Stacktrace:
> {noformat}
> hadoop binary is not in PATH,HADOOP_HOME/bin,HADOOP_PREFIX/bin, running locally
> SLF4J: Class path contains multiple SLF4J bindings.
> SLF4J: Found binding in [jar:file:/home/lok268/softwares/apache-mahout-distribution-0.12.2/mahout-examples-0.12.2-job.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: Found binding in [jar:file:/home/lok268/softwares/apache-mahout-distribution-0.12.2/mahout-mr-0.12.2-job.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: Found binding in [jar:file:/home/lok268/softwares/apache-mahout-distribution-0.12.2/lib/slf4j-log4j12-1.7.19.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
> SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
> Exception in thread "main" org.apache.lucene.index.IndexFormatTooNewException: Format version is not supported (resource: ChecksumIndexInput(MMapIndexInput(path="/home/lok268/softwares/solr-6.1.0/server/solr/nlp-core/data/index/segments_2"))): 6 (needs to be between 0 and 1)
>     at org.apache.lucene.codecs.CodecUtil.checkHeaderNoMagic(CodecUtil.java:148)
>     at org.apache.lucene.index.SegmentInfos.read(SegmentInfos.java:329)
>     at org.apache.lucene.index.StandardDirectoryReader$1.doBody(StandardDirectoryReader.java:56)
>     at org.apache.lucene.index.SegmentInfos$FindSegmentsFile.run(SegmentInfos.java:843)
>     at org.apache.lucene.index.StandardDirectoryReader.open(StandardDirectoryReader.java:52)
>     at org.apache.lucene.index.DirectoryReader.open(DirectoryReader.java:66)
>     at org.apache.mahout.utils.vectors.lucene.Driver.dumpVectors(Driver.java:89)
>     at org.apache.mahout.utils.vectors.lucene.Driver.main(Driver.java:277)
>     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>     at java.lang.reflect.Method.invoke(Method.java:498)
>     at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:72)
>     at org.apache.hadoop.util.ProgramDriver.run(ProgramDriver.java:145)
>     at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:153)
>     at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:195)
> {noformat}
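The exception message in the trace above ends with the format version found
on disk and the range the bundled Lucene reader accepts ("6 (needs to be
between 0 and 1)"). As a rough diagnostic aid, the sketch below pulls those
three numbers out of such a message using only the JDK. It assumes the message
layout seen in this one trace; the class FormatVersionMessage and its methods
are hypothetical and are not part of Lucene or Mahout, and the resource path
in main is abbreviated for the example.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper for illustration; not a Lucene or Mahout API. */
final class FormatVersionMessage {
    // Tail of the message as it appears in the trace above:
    // "<found> (needs to be between <min> and <max>)"
    private static final Pattern TAIL =
        Pattern.compile("(-?\\d+) \\(needs to be between (-?\\d+) and (-?\\d+)\\)");

    /** Returns {found, min, max}, or null if the message has a different shape. */
    static int[] parse(String message) {
        Matcher m = TAIL.matcher(message);
        if (!m.find()) {
            return null;
        }
        return new int[] {
            Integer.parseInt(m.group(1)),
            Integer.parseInt(m.group(2)),
            Integer.parseInt(m.group(3))
        };
    }

    /** True when the on-disk format is newer than the reader's supported maximum. */
    static boolean indexIsNewer(int[] versions) {
        return versions != null && versions[0] > versions[2];
    }

    public static void main(String[] args) {
        // Message shaped like the one in the trace; the path is shortened here.
        String msg = "Format version is not supported (resource: "
            + "ChecksumIndexInput(MMapIndexInput(path=\"segments_2\"))): "
            + "6 (needs to be between 0 and 1)";
        int[] v = parse(msg);
        System.out.println("found=" + v[0] + " supported=" + v[1] + ".." + v[2]
            + " indexIsNewer=" + indexIsNewer(v));
    }
}
```

A found version above the supported maximum means the index was written by a
newer Lucene than the one on the classpath, which matches the situation
reported in this issue.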
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)