[ 
https://issues.apache.org/jira/browse/LUCENE-2899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13641968#comment-13641968
 ] 

Lance Norskog commented on LUCENE-2899:
---------------------------------------

Maciej- This is a good point. This package needs changes in a lot of places and 
it might be easier to package it the way you say. 

Zack- The "churn" in the APIs is a major problem in the Lucene code management. 
The original patch worked in the 4.x branch and trunk when it was posted. What 
Em fixed is in an area which is very very basic to Lucene. The API changed with 
no notice and no change in versions or method names. 

Everyone- It's great that this has gained some interest. Please create a new 
master patch with whatever changes are needed for the current code base.

Lucene grand masters- Please don't say "hey kids, write plugins, they're cool!" 
and then make subtle incompatible changes in APIs. 
                
> Add OpenNLP Analysis capabilities as a module
> ---------------------------------------------
>
>                 Key: LUCENE-2899
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2899
>             Project: Lucene - Core
>          Issue Type: New Feature
>          Components: modules/analysis
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 4.3
>
>         Attachments: LUCENE-2899.patch, LUCENE-2899.patch, LUCENE-2899.patch, 
> LUCENE-2899.patch, LUCENE-2899.patch, LUCENE-2899.patch, 
> LUCENE-2899-RJN.patch, OpenNLPFilter.java, OpenNLPTokenizer.java, 
> opennlp_trunk.patch
>
>
> Now that OpenNLP is an ASF project and has a nice license, it would be nice 
> to have a submodule (under analysis) that exposed capabilities for it. Drew 
> Farris, Tom Morton and I have code that does:
> * Sentence Detection as a Tokenizer (could also be a TokenFilter, although it 
> would have to change slightly to buffer tokens)
> * NamedEntity recognition as a TokenFilter
> We are also planning a Tokenizer/TokenFilter that can put parts of speech as 
> either payloads (PartOfSpeechAttribute?) on a token or at the same position.
> I'd propose it go under:
> modules/analysis/opennlp

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to