[ 
https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12900791#action_12900791
 ] 

Robert Muir commented on SOLR-1804:
-----------------------------------

Stanislaw: I looked at the ILanguageModel interface along with your other 
comments, and had these thoughts:

For these carrot2-specific things (isCommonWord, isStopLabel), the carrot2 
integration could have TokenFilters that set Attributes. So for example a 
Carrot2CommonWordFilter would not remove tokens from the stream, it would 
simply mark a boolean attribute such as Carrot2CommonWordAttribute. Stop label 
processing could be done the same way.

So, carrot2 processing could be based upon a schema.xml declaration, but the 
configuration would use these special carrot2 filters to mark attributes that 
carrot2 needs.


> Upgrade Carrot2 to 3.2.0
> ------------------------
>
>                 Key: SOLR-1804
>                 URL: https://issues.apache.org/jira/browse/SOLR-1804
>             Project: Solr
>          Issue Type: Improvement
>          Components: contrib - Clustering
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>         Attachments: SOLR-1804-carrot2-3.4.0-dev-trunk.patch, 
> SOLR-1804-carrot2-3.4.0-dev.patch, SOLR-1804-carrot2-3.4.0-libs.zip, 
> SOLR-1804.patch
>
>
> http://project.carrot2.org/release-3.2.0-notes.html
> Carrot2 is now LGPL free, which means we should be able to bundle the binary!

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to