[
https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12900791#action_12900791
]
Robert Muir commented on SOLR-1804:
-----------------------------------
Stanislaw: I looked at the ILanguageModel interface along with your other
comments, and had these thoughts:
For these carrot2-specific things (isCommonWord, isStopLabel), the carrot2
integration could have TokenFilters that set Attributes. So for example a
Carrot2CommonWordFilter would not remove tokens from the stream, it would
simply mark a boolean attribute such as Carrot2CommonWordAttribute. Stop label
processing could be done the same way.
So, carrot2 processing could be based upon a schema.xml declaration, but the
configuration would use these special carrot2 filters to mark attributes that
carrot2 needs.
> Upgrade Carrot2 to 3.2.0
> ------------------------
>
> Key: SOLR-1804
> URL: https://issues.apache.org/jira/browse/SOLR-1804
> Project: Solr
> Issue Type: Improvement
> Components: contrib - Clustering
> Reporter: Grant Ingersoll
> Assignee: Grant Ingersoll
> Attachments: SOLR-1804-carrot2-3.4.0-dev-trunk.patch,
> SOLR-1804-carrot2-3.4.0-dev.patch, SOLR-1804-carrot2-3.4.0-libs.zip,
> SOLR-1804.patch
>
>
> http://project.carrot2.org/release-3.2.0-notes.html
> Carrot2 is now LGPL free, which means we should be able to bundle the binary!
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]