Thanks, Jörn.  Was it trained on the whole Penn Treebank?  And do you
happen to know if this means there are licensing restrictions on the
use of the parser that, say, would need to be resolved via the LDC?

jds

On Mon, Nov 7, 2011 at 4:58 PM, Jörn Kottmann <kottm...@gmail.com> wrote:
> On 11/7/11 5:20 PM, John Stewart wrote:
>>
>> I'm a new user of OpenNLP.  I'm evaluating the parser against others,
>> and I enjoy the ease of using OpenNLP with Clojure, so it would be great
>> to be able to settle on OpenNLP's parser.  Unfortunately I'm unable to
>> find details online of what the parser was trained on or how it works:
>> is it lexicalized?  Was it trained with the MaxEnt package?
>>
>> For this reason I'm unable to guess at its coverage.  Any technical
>> details about it would be very welcome.
>>
>> I should say that in informal tests, while it has exhibited lower
>> coverage than, say, the Stanford parser, the coverage doesn't appear
>> *that* much lower.  So I'm optimistic :)
>
> Parser documentation is still very sparse.
> The parser itself is based on a paper by Adwait Ratnaparkhi.
>
> You can find a link to it, and to other papers
> OpenNLP is based on, in our wiki:
> https://cwiki.apache.org/OPENNLP/nlp-papers.html
>
> The models from the website are trained on the Penn Treebank.
>
> Let me know if you need more information.
>
> Hope this helps,
> Jörn
>
