That sounds good. I'm currently arranging with distributors of various corpora to allow models trained on their materials to be used with OpenNLP under the ASL, but it would be great to have annotated texts that are fully open, both for expansion/modification and for use as training material.
On Sun, Dec 5, 2010 at 5:11 AM, Tommaso Teofili <[email protected]> wrote:

> Hi Thilo,
>
> 2010/12/5 Thilo Goetz <[email protected]>
>
> > I've been wondering if there is something we could do
> > to enhance collaboration between all the text related
> > projects at Apache. For example, I used to be subscribed
> > to both the Solr and the Mahout lists. Eventually though,
> > I just couldn't cope with the traffic. Maybe some sort
> > of [email protected] mailing list? At some point I thought
> > the Lucene umbrella could be such a focal point, but the
> > ASF is moving away from umbrellas like that. A low
> > overhead mailing list could be the ticket, it's not a
> > lot of effort, and if it doesn't take off, we just shelve
> > it. WDYT?
> >
> >
> I think that makes a lot of sense and I also wonder what can be the right
> and useful way to build such a community.
> As you say umbrella projects are not the way ASF is going to, maybe
> text-interest ML can be a good starting point.
> Surely I am +1 for that.
> Cheers,
> Tommaso
>
> [1] : http://incubator.markmail.org/thread/qpzfmztgahwwa6fr
>

--
Jason Baldridge
Assistant Professor, Department of Linguistics
The University of Texas at Austin
http://comp.ling.utexas.edu/people/jason_baldridge
