Moving those to ctakes-resources on Sourceforge sounds like the way to go to 
me. 
I was hoping to take a stab at it tomorrow but that is looking unlikely.

I am hoping that, to keep install process for our end users relatively simple, 
we can still have a convenience binary with the resources (jars, models, 
dictionaries) except the UMLS ones (which need to be separate for licensing)
Otherwise I will be greatly concerned about the step back we are taking from an 
end user (non-programmer) install perspective.

- James

> -----Original Message-----
> From: ctakes-dev-return-1080-Masanz.James=mayo....@incubator.apache.org
> [mailto:ctakes-dev-return-1080-Masanz.James=mayo....@incubator.apache.org]
> On Behalf Of Chen, Pei
> Sent: Tuesday, January 22, 2013 8:03 AM
> To: <[email protected]>
> Cc: [email protected]
> Subject: Re: [DISCUSS] What should we do with cTAKES resources?
> 
> James,
> I was under the pretense that we could include the models, but it sounds
> like it is not the case. We can move every single bin/model to ctakes-
> resources in Source forge and do a MVN deploy to push it to maven central;
> like what we did for umls/lvg. I can take a stab at it later this week if
> no one gets to it (and if there's an agreement).
> 
> 
> Sent from my iPhone
> 
> On Jan 22, 2013, at 5:35 AM, "Jörn Kottmann" <[email protected]> wrote:
> 
> > On 01/22/2013 04:00 AM, Masanz, James J. wrote:
> >> Jörn,
> >>
> >> Today Benson wrote the following in this post to incubator
> >> http://s.apache.org/Gz5 "I fear that cTakes needs to have an
> interaction with LEGAL to adopt the SpamAssassin model, since, from a
> strict constructionist perspective, the source of the models is precisely
> what you cannot release."
> >>
> >> Is he just unaware of some discussion you already had with LEGAL for
> >> OpenNLP - I ask because in the discussion below you indicated it
> >> would be OK to release models at Apache without releasing the data
> >> the models were built from. Is there some previous post we can point
> >> to or should I open a discussion with LEGAL about cTAKES models
> >
> > I was under the assumption that it is ok the just release the model
> > and not the training data under AL 2.0 here at Apache, over at UIMA we
> > had a similar discussion for French POS Tagger (UIMA-2146). There the
> concern was that its very cumbersome to train again on the data, but not
> that it can't be released.
> >
> > To circumvent this particular issue it should be possible to release
> > the models outside of Apache and then just redistribute them as class A
> dependency in the cTAKES binary distribution.
> >
> > Jörn

Reply via email to