[ 
https://issues.apache.org/jira/browse/MAHOUT-459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12896151#action_12896151
 ] 

Stephen McGill commented on MAHOUT-459:
---------------------------------------

One *important* fix needed right now is the ability to grab all documents 
from Lucene.  Line 171 of the diff reads:

+               String a = new String("press");

which grabs only the documents containing the word "press", not all documents 
as intended.  I have some commented-out code that might fix this, but I am 
unable to try it today.
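
To make the difference concrete, here is a minimal sketch in plain Java.  It 
is a toy model, not Lucene code and not the commented-out code mentioned 
above: the class MatchAllSketch, its two methods, and the sample documents 
are all hypothetical.  It only shows why a hard-coded term narrows the result 
set.  In Lucene itself, org.apache.lucene.search.MatchAllDocsQuery is one 
existing way to select every document; whether it suits this patch is 
untested.

```java
import java.util.ArrayList;
import java.util.List;

// Toy model only: plain strings stand in for indexed documents.
// None of these names come from Lucene or from the attached diff.
class MatchAllSketch {

    // Mirrors the problem: only documents containing the term
    // (as a substring) are returned.
    static List<String> selectByTerm(List<String> docs, String term) {
        List<String> hits = new ArrayList<>();
        for (String doc : docs) {
            if (doc.contains(term)) {
                hits.add(doc);
            }
        }
        return hits;
    }

    // Intended behavior: every document is returned, with no term filter.
    static List<String> selectAll(List<String> docs) {
        return new ArrayList<>(docs);
    }

    public static void main(String[] args) {
        List<String> docs = List.of(
                "press release", "meeting notes", "press kit", "changelog");
        System.out.println(selectByTerm(docs, "press").size()); // prints 2
        System.out.println(selectAll(docs).size());             // prints 4
    }
}
```

With the hard-coded term, two of the four sample documents are silently 
dropped, which is the behavior described above.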

Also, the DefaultAnalyzer class should be deleted.  I do not think that 
deletion is included in the posted diff.

> Reading an Index from Lucene/Solr 4.0-dev
> -----------------------------------------
>
>                 Key: MAHOUT-459
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-459
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Utils
>    Affects Versions: 0.4
>         Environment: Windows Server 2008 R2 Standard, Cygwin, Solr-trunk, 
> Mahout-trunk
>            Reporter: Stephen McGill
>            Priority: Minor
>             Fix For: 0.4
>
>         Attachments: Mahout-Importing-Vectors-Lucene-Solr-4-dev.diff
>
>
> It is not possible to read indexes created by Lucene/Solr 4.0-dev (the trunk 
> development) with the Lucene libraries that are included with Mahout-dev.  
> Swapping in the new Lucene/Solr 4.0-dev libraries does not work as-is, 
> because their API changes prevent Mahout from compiling.
> By adapting mahout-utils to fit Lucene/Solr 4.0-dev's API changes, it is 
> possible to read such an index.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
