[ 
https://issues.apache.org/jira/browse/CONNECTORS-1270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15118228#comment-15118228
 ] 

Karl Wright commented on CONNECTORS-1270:
-----------------------------------------

"OpenNLP is at the end a tool for training your own Annotation models, so 
providing only URL based access to the models would prevent someone to use 
their own trained ones for a custom annotation job."

Yes, I understand this, which is why I was thinking of including models as 
resources in the actual connector jar.  Presumably users even in a distributed 
cluster environment have to deploy ManifoldCF jars in some distributed way.  Of 
course, if they're using Docker or puppet or a similar technology, they can 
reasonably distribute models into the cluster file system by packaging their 
own RPMs.

The UI for this connector is currently not one thing nor another; if we assume 
file system compatibility, then we probably should assume there's one specific 
directory out there with all the models in it (settable via properties.xml), 
and allow the user to browse and select from that; if we don't assume file 
system compatibility, then we should assume that the models live as resources 
in the jar.


> Import OpenNLP connector into trunk
> -----------------------------------
>
>                 Key: CONNECTORS-1270
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1270
>             Project: ManifoldCF
>          Issue Type: Task
>            Reporter: Karl Wright
>            Assignee: Rafa Haro
>             Fix For: ManifoldCF 2.4
>
>
> An OpenNLP connector has been contributed on github.  Need to import it into 
> MCF, first to a branch, then to trunk.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to