[
https://issues.apache.org/jira/browse/OPENNLP-701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14029059#comment-14029059
]
Chris Krol / IBM edited comment on OPENNLP-701 at 6/12/14 1:35 PM:
-------------------------------------------------------------------
Thanks for your response.
Ok, I get it. Sure, I'm willing to contribute such format support, then.
was (Author: kris.chris):
Thanks for your response.
Ok, I get it. Sure, I'm willing to contribute such format support, then.
I would be still contributing at least sentence detection and tokenizer
binaries, because they were created using a huge plaintext data set that's free
to use and that doesn't require any pre-processing.
> Polish language support - Maxent binaries
> -----------------------------------------
>
> Key: OPENNLP-701
> URL: https://issues.apache.org/jira/browse/OPENNLP-701
> Project: OpenNLP
> Issue Type: New Feature
> Reporter: Chris Krol / IBM
> Priority: Minor
>
> Hi,
> Currently I'm working at IBM Poland and my manager approved the idea of
> contributing various Maxent binaries for Polish language (sentence split,
> sentence detection, POS tagging and morphological analysis, NER).
> You could possibly put them on your download page.
> We trained them using the Golden Standard human-annotated Polish National
> Corpus (GPL 3.0).
> Would this be also possible to give some credit (or any) to the fact that the
> job's been done at IBM?
> I've already sent a mail to the devs, but haven't seen any response for two
> weeks now.
--
This message was sent by Atlassian JIRA
(v6.2#6252)