[
https://issues.apache.org/jira/browse/SOLR-12789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16625774#comment-16625774
]
Aaron LaBella commented on SOLR-12789:
--------------------------------------
Hi Alexandre, thank for you the additional detail and background. While I
understand the goal here, I don't agree with how the end result was achieved.
I think the real "issue" here was that the examples and documentation are
stale. Likewise, UIMA core can (and should) be upgraded to the latest 2.10.2,
and the additional unnecessary dependencies should absolutely be removed from
the dist. I'm attaching a simple patch (*SOLR-12789-4.patch*) that does just
this. I would like to propose that we re-instate the contrib/uima project and
apply my patch instead. I think this is a fair compromise since 6 Java classes
doesn't quite compromise as "dead weight", especially if those 6 classes
provide direct end-user value. While I would certain agree, UIMA has a steep
learning curve, there are folks out there that are using it, and removing it
entirely from the Solr dist is likely to do a disservice to those folks who are
in-fact doing text analytics using it.
All that being said, I think the only thing that really remains is a clean-up
of the documentation and examples. I'm happy to do that over the next couple
weeks if we agree to this strategy.
Thanks so much.
> UIMA enhancements to allow for dynamic AE detection
> ---------------------------------------------------
>
> Key: SOLR-12789
> URL: https://issues.apache.org/jira/browse/SOLR-12789
> Project: Solr
> Issue Type: Improvement
> Security Level: Public(Default Security Level. Issues are Public)
> Components: contrib - UIMA
> Affects Versions: 6.0
> Reporter: Aaron LaBella
> Priority: Major
> Labels: ready-to-commit
> Attachments: SOLR-12789-1.patch, SOLR-12789-2.patch,
> SOLR-12789-3.patch, SOLR-12789-4.patch
>
>
> I've been sitting on this patch for over 2 years (and likewise it's been
> running IN production for the same) ... finally got around to contributing it
> back to the community. This change prepares the UIMAUpdateRequestProcessor
> to allow subclasses to have additional control over how the analysis engine
> is selected. In my case, I wrote a sub-class that allows for *dynamic*
> detection of the UIMA analysis engine based on the document fields. ie: a
> field in the document can be used to select different UIMA configurations and
> rules.
>
> Can someone please commit this as soon as possible. I don't necessarily need
> it to be back-ported, having in 7.4.1 would suffice.
> Thanks!
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]