Moving those to ctakes-resources on Sourceforge sounds like the way to go to me. I was hoping to take a stab at it tomorrow but that is looking unlikely.
I am hoping that, to keep install process for our end users relatively simple, we can still have a convenience binary with the resources (jars, models, dictionaries) except the UMLS ones (which need to be separate for licensing) Otherwise I will be greatly concerned about the step back we are taking from an end user (non-programmer) install perspective. - James > -----Original Message----- > From: ctakes-dev-return-1080-Masanz.James=mayo....@incubator.apache.org > [mailto:ctakes-dev-return-1080-Masanz.James=mayo....@incubator.apache.org] > On Behalf Of Chen, Pei > Sent: Tuesday, January 22, 2013 8:03 AM > To: <[email protected]> > Cc: [email protected] > Subject: Re: [DISCUSS] What should we do with cTAKES resources? > > James, > I was under the pretense that we could include the models, but it sounds > like it is not the case. We can move every single bin/model to ctakes- > resources in Source forge and do a MVN deploy to push it to maven central; > like what we did for umls/lvg. I can take a stab at it later this week if > no one gets to it (and if there's an agreement). > > > Sent from my iPhone > > On Jan 22, 2013, at 5:35 AM, "Jörn Kottmann" <[email protected]> wrote: > > > On 01/22/2013 04:00 AM, Masanz, James J. wrote: > >> Jörn, > >> > >> Today Benson wrote the following in this post to incubator > >> http://s.apache.org/Gz5 "I fear that cTakes needs to have an > interaction with LEGAL to adopt the SpamAssassin model, since, from a > strict constructionist perspective, the source of the models is precisely > what you cannot release." > >> > >> Is he just unaware of some discussion you already had with LEGAL for > >> OpenNLP - I ask because in the discussion below you indicated it > >> would be OK to release models at Apache without releasing the data > >> the models were built from. Is there some previous post we can point > >> to or should I open a discussion with LEGAL about cTAKES models > > > > I was under the assumption that it is ok the just release the model > > and not the training data under AL 2.0 here at Apache, over at UIMA we > > had a similar discussion for French POS Tagger (UIMA-2146). There the > concern was that its very cumbersome to train again on the data, but not > that it can't be released. > > > > To circumvent this particular issue it should be possible to release > > the models outside of Apache and then just redistribute them as class A > dependency in the cTAKES binary distribution. > > > > Jörn
