Hello Nicolas, did you already prepare the IP clearance template? We usually maintain those files under our SVN website, see [1] as an example of a former IP Clearance (see also the related thread [2]). If the IP is ready you can add a patch to the website in the same Jira issue [3] so that we can review it. Regards, Tommaso
[1] : http://svn.apache.org/repos/asf/uima/site/trunk/uima-website/xdocs/ip-clearances/ [2] : http://markmail.org/message/gbtph456u3445yfn [3] : https://issues.apache.org/jira/browse/UIMA-2146 2011/6/16 Nicolas Hernandez <[email protected]> > Please tell me what can I do for it > > On Wed, Jun 15, 2011 at 5:58 PM, Tommaso Teofili > <[email protected]> wrote: > > 2011/6/15 Tommaso Teofili <[email protected]> > >> > >> Nicolas, > >> your post on opennlp-user@ made me realize we didn't take care of > helping > >> you here yet. > >> Did you get the ACK for your SGA? > > > > I see it's been recorded, so I think we can proceed. > > Tommaso > > > >> > >> Regards, > >> Tommaso > >> 2011/5/26 Nicolas Hernandez <[email protected]> > >>> > >>> Hi > >>> > >>> French data models for the Apache UIMA Sandbox HMM Tagger have been > >>> submitted via the jira issue > >>> https://issues.apache.org/jira/browse/UIMA-2146 > >>> > >>> Documentation on the procedure to build the models from the French > >>> Treebank can be found here (accidentally it is in French...) > >>> > >>> > http://enicolashernandez.blogspot.com/2011/05/construire-des-modelisations-du-french.html > >>> > >>> The SLA has been sent and we are waiting for receiving the ack. > >>> > >>> I have prepared an IP form but have not right to commit it... > >>> > >>> Finaly is there an "appropriate volunter" for executing the IP > >>> Clearance processing? > >>> > >>> I hope I have nothing forgotten. > >>> > >>> Best regards > >>> > >>> /Nicolas > >>> > >>> On Thu, May 19, 2011 at 3:47 PM, Thilo Götz <[email protected]> wrote: > >>> > On 5/19/2011 15:04, Nicolas Hernandez wrote: > >>> >> Hello Everyone > >>> >> > >>> >> Jörn, yes it (training MaxEnt models for OpenNLP from the French > >>> >> Treebank) is actually part of our plan (building a French-Speaking > >>> >> UIMA Community). We wanted also to contribute to the OpenNLP project > >>> >> since no models was available for French processing! > >>> >> > >>> >> About the right to train models on this data set and then > distribute > >>> >> them under Apache License 2: It took time for us to get the right to > >>> >> do it, but I think it was because we were the first to ask for. Now > >>> >> they know about it. I know that the maltparser team > >>> >> (http://maltparser.org/) would be also interested by the grant. You > >>> >> may ask for the French Treebank authors. I can also ask them for > >>> >> letting an explicit mention about the right to do it on their web > >>> >> site. > >>> >> > >>> >> As far as I know, the data training set for the English and German > POS > >>> >> models are not freely available, are they ? > >>> > > >>> > The English model was trained on the Brown corpus, which is free. > >>> > The German model was trained on a non-free corpus. > >>> > > >>> >> > >>> >> Eventually, Jörn, I m not sure to understand. Do you think the IP > >>> >> clearance process is not adapted for submitting our contribution ? > >>> >> > >>> >> Tommaso, I will blog post the procedure I used to train the models. > >>> >> There is nothing really special. I used some freely available (under > >>> >> AL2) AE components. The HMM learner is already present in the HMM > >>> >> Tagger addon. The few other UIMA components I used are also > available > >>> >> on some google forges (uima-common, uima-connectors, > >>> >> uima-type-mapper). > >>> >> > >>> >> Regards > >>> >> > >>> >> /Nicolas > >>> >> > >>> >> On Thu, May 19, 2011 at 9:57 AM, Jörn Kottmann <[email protected]> > >>> >> wrote: > >>> >>> On 5/19/11 9:00 AM, Tommaso Teofili wrote: > >>> >>>> > >>> >>>> If you also plan to donate the models I think the IP clearance is > >>> >>>> the > >>> >>>> right > >>> >>>> way both for UIMA and for you as a researcher. > >>> >>>> > >>> >>> > >>> >>> In my opinion it is very important that we have the possibility > >>> >>> to retrain the models on the data set, otherwise it will block > >>> >>> code changes and bug fixes. > >>> >>> > >>> >>> Therefore I think we need the right to train models on this > >>> >>> data set and then distribute them under AL 2.0. > >>> >>> > >>> >>> Jörn > >>> >>> > >>> >> > >>> >> > >>> >> > >>> > > >>> > >>> > >>> > >>> -- > >>> [email protected] > >>> # > >>> http://enicolashernandez.blogspot.com > >>> http://www.univ-nantes.fr/hernandez-n > >>> # > >>> Laboratoire LINA-TALN CNRS UMR 6241 > >>> tel. +33 (0)2 51 12 58 55 > >>> # > >>> Université de Nantes - Institut Universitaire de Technologie - > >>> Département Informatique > >>> tel. +33 (0)2 40 30 60 67 > >> > > > > > > > > -- > [email protected] > # > http://enicolashernandez.blogspot.com > http://www.univ-nantes.fr/hernandez-n > # > Laboratoire LINA-TALN CNRS UMR 6241 > tel. +33 (0)2 51 12 58 55 > # > Université de Nantes - Institut Universitaire de Technologie - > Département Informatique > tel. +33 (0)2 40 30 60 67 >
