On Aug 1, 2012, at 11:57 AM, Mattmann, Chris A (388J) wrote:
> +1 to making OpenNLP better and eating the ASF dogfood, great response Jörn.

Also +1 on implementing the Berkeley parsing model in OpenNLP, but practically 
speaking, that's a *ton* of work and I don't think anyone is going to do that 
any time soon. Jörn, please correct me if I'm wrong.

What would Apache think about setting things up so that by default the OpenNLP 
parser was used, but making it easy to substitute the Berkeley parser if a 
downstream user wants to (and can accept the license requirements, and can add 
the dependency, etc.)?

Steve

> 
> Cheers,
> Chris
> 
> On Aug 1, 2012, at 5:00 AM, Jörn Kottmann wrote:
> 
>> On 08/01/2012 01:01 PM, Miller, Timothy wrote:
>>> There was some chatter last week about resources potentially being 
>>> downloaded via maven for license compatibility reasons.  Just wondering if 
>>> that brings about the possibility of using external libraries that are not 
>>> apache-licensed that would also be auto-downloaded under certain maven 
>>> build commands.  Specifically I was thinking of the GPL-licensed berkeley 
>>> parser which I've used to get significantly higher accuracy than the 
>>> opennlp parser we currently wrap in our constituency parser module.
>> 
>> Making scripts or maven build commands which download stuff is fine, but it 
>> might
>> turn out to be quit limiting for your users which need the freedom of the 
>> AL. That will be
>> a problem if Berkeley is the only option.
>> 
>> The HBase people for example have an optional dependency on LZO which is GPL,
>> and people there just need to install and download it themselves.
>> See here:
>> http://hbase.apache.org/book/lzo.compression.html
>> 
>> Speaking as an OpenNLP committer now, it would of course be nice to make our 
>> parser better.
>> If you want to work on that we will be happy to get some patches.
>> 
>> Jörn
> 
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: [email protected]
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 

Reply via email to