Patch submitted. Let me know if I have to be more precise. Regards,
/Nicolas

On Fri, Jun 17, 2011 at 8:38 AM, Tommaso Teofili <[email protected]> wrote:
> Hello Nicolas,
> did you already prepare the IP clearance template?
> We usually maintain those files under our SVN website, see [1] as an example
> of a former IP Clearance (see also the related thread [2]).
> If the IP is ready you can add a patch to the website in the same Jira issue
> [3] so that we can review it.
> Regards,
> Tommaso
> [1] : http://svn.apache.org/repos/asf/uima/site/trunk/uima-website/xdocs/ip-clearances/
> [2] : http://markmail.org/message/gbtph456u3445yfn
> [3] : https://issues.apache.org/jira/browse/UIMA-2146
>
> 2011/6/16 Nicolas Hernandez <[email protected]>
>>
>> Please tell me what I can do for it
>>
>> On Wed, Jun 15, 2011 at 5:58 PM, Tommaso Teofili
>> <[email protected]> wrote:
>> > 2011/6/15 Tommaso Teofili <[email protected]>
>> >>
>> >> Nicolas,
>> >> your post on opennlp-user@ made me realize we didn't take care of
>> >> helping you here yet.
>> >> Did you get the ACK for your SGA?
>> >
>> > I see it's been recorded, so I think we can proceed.
>> > Tommaso
>> >
>> >>
>> >> Regards,
>> >> Tommaso
>> >> 2011/5/26 Nicolas Hernandez <[email protected]>
>> >>>
>> >>> Hi
>> >>>
>> >>> French data models for the Apache UIMA Sandbox HMM Tagger have been
>> >>> submitted via the Jira issue
>> >>> https://issues.apache.org/jira/browse/UIMA-2146
>> >>>
>> >>> Documentation on the procedure to build the models from the French
>> >>> Treebank can be found here (incidentally, it is in French...)
>> >>>
>> >>> http://enicolashernandez.blogspot.com/2011/05/construire-des-modelisations-du-french.html
>> >>>
>> >>> The SLA has been sent and we are waiting to receive the ack.
>> >>>
>> >>> I have prepared an IP form but do not have the right to commit it...
>> >>>
>> >>> Finally, is there an "appropriate volunteer" for executing the IP
>> >>> Clearance process?
>> >>>
>> >>> I hope I have not forgotten anything.
>> >>>
>> >>> Best regards
>> >>>
>> >>> /Nicolas
>> >>>
>> >>> On Thu, May 19, 2011 at 3:47 PM, Thilo Götz <[email protected]> wrote:
>> >>> > On 5/19/2011 15:04, Nicolas Hernandez wrote:
>> >>> >> Hello Everyone
>> >>> >>
>> >>> >> Jörn, yes it (training MaxEnt models for OpenNLP from the French
>> >>> >> Treebank) is actually part of our plan (building a French-Speaking
>> >>> >> UIMA Community). We also wanted to contribute to the OpenNLP
>> >>> >> project since no models were available for French processing!
>> >>> >>
>> >>> >> About the right to train models on this data set and then
>> >>> >> distribute them under Apache License 2: It took time for us to get
>> >>> >> the right to do it, but I think it was because we were the first
>> >>> >> to ask for it. Now they know about it. I know that the maltparser
>> >>> >> team (http://maltparser.org/) would also be interested in the
>> >>> >> grant. You may ask the French Treebank authors. I can also ask
>> >>> >> them to put an explicit mention of the right to do it on their
>> >>> >> web site.
>> >>> >>
>> >>> >> As far as I know, the training data sets for the English and
>> >>> >> German POS models are not freely available, are they?
>> >>> >
>> >>> > The English model was trained on the Brown corpus, which is free.
>> >>> > The German model was trained on a non-free corpus.
>> >>> >
>> >>> >>
>> >>> >> Eventually, Jörn, I'm not sure I understand. Do you think the IP
>> >>> >> clearance process is not suited to submitting our contribution?
>> >>> >>
>> >>> >> Tommaso, I will post the procedure I used to train the models on
>> >>> >> my blog. There is nothing really special. I used some freely
>> >>> >> available (under AL2) AE components. The HMM learner is already
>> >>> >> present in the HMM Tagger addon. The few other UIMA components I
>> >>> >> used are also available on some Google forges (uima-common,
>> >>> >> uima-connectors, uima-type-mapper).
>> >>> >>
>> >>> >> Regards
>> >>> >>
>> >>> >> /Nicolas
>> >>> >>
>> >>> >> On Thu, May 19, 2011 at 9:57 AM, Jörn Kottmann <[email protected]>
>> >>> >> wrote:
>> >>> >>> On 5/19/11 9:00 AM, Tommaso Teofili wrote:
>> >>> >>>>
>> >>> >>>> If you also plan to donate the models I think the IP clearance is
>> >>> >>>> the right way both for UIMA and for you as a researcher.
>> >>> >>>>
>> >>> >>>
>> >>> >>> In my opinion it is very important that we have the possibility
>> >>> >>> to retrain the models on the data set, otherwise it will block
>> >>> >>> code changes and bug fixes.
>> >>> >>>
>> >>> >>> Therefore I think we need the right to train models on this
>> >>> >>> data set and then distribute them under AL 2.0.
>> >>> >>>
>> >>> >>> Jörn
>> >>>
>> >>> --
>> >>> [email protected]
>> >>> #
>> >>> http://enicolashernandez.blogspot.com
>> >>> http://www.univ-nantes.fr/hernandez-n
>> >>> #
>> >>> Laboratoire LINA-TALN CNRS UMR 6241
>> >>> tel. +33 (0)2 51 12 58 55
>> >>> #
>> >>> Université de Nantes - Institut Universitaire de Technologie -
>> >>> Département Informatique
>> >>> tel. +33 (0)2 40 30 60 67
