Re: [VOTE] Apache OpenNLP 2.0.0 Release Candidate

2022-06-01 Thread Joern Kottmann
+1 binding Thanks for all the work on this Jeff! Cheers, Jörn On Wed, Jun 1, 2022 at 9:57 PM Suneel Marthi wrote: > +1 binding > > On Wed, Jun 1, 2022 at 3:12 PM Jeff Zemerick wrote: > > > Just pinging folks on the thread about the active vote. The project has a > > board report due in a

Re: [VOTE] Apache OpenNLP 1.9.3 Release Candidate

2020-07-29 Thread Joern Kottmann
+1 Release the packages as Apache OpenNLP 1.9.3 Jörn On Wed, Jul 29, 2020 at 1:08 PM Tommaso Teofili wrote: > > +1 from me, build, sigs, tag look good. > > Regards, > Tommaso > > On Tue, 28 Jul 2020 at 10:48, Bruno P. Kinoshita wrote: > > > It worked after I imported keys from > >

Re: license for opennlp 1.5 pre-trained models

2019-12-30 Thread Joern Kottmann
Hello, The Apache OpenNLP project only distributes models that are licensed under the AL 2.0 license, or models that comply with the strict licensing requirements at Apache. So far we only release a language detection model at the Apache OpenNLP project. The OpenNLP project was hosted in the

Re: OpenNLP 1.9.2 and Java 8/11

2019-12-15 Thread Joern Kottmann
+1 lgtm, it would be nice to track down the exact cause of the changes on accuracy caused by the JDK update. We had similar issues in the past e.g through things like the undefined iteration order of Sets. I am happy to help with this. Jörn On Sat, Dec 14, 2019 at 3:48 PM Tommaso Teofili

Re: KStem support?

2019-02-19 Thread Joern Kottmann
Hello, we don't have it, but it would be nice to get a contribution for it. Jörn On Thu, Feb 7, 2019 at 3:03 PM Benedict Holland wrote: > > Hello all, > > I just happened to read a Solr message about using KStem. Is there any > support for this particular stemmer or would you like there to be?

Re: [VOTE] Apache OpenNLP 1.9.0 Release Candidate 2

2018-06-29 Thread Joern Kottmann
+1 Jörn On Fri, Jun 29, 2018 at 1:45 PM, Jeff Zemerick wrote: > Hi folks, > > I have posted a 2nd release candidate for the Apache OpenNLP 1.9.0 release > and it is ready for testing. > > The distributables can be downloaded from: >

Re: Custom models (for Ukrainian and Russian languages)

2018-06-28 Thread Joern Kottmann
Hello, we would be happy to hear about your experience. Did the language detector perform well enough on Russian/Ukrainian texts? To reproduce the models we train you should download the data via svn: svn co https://svn.apache.org/repos/bigdata/opennlp/trunk opennlp-corpus Note the corpus is

Re: OPENNLP-912 : Add a rule based sentence detector

2018-04-06 Thread Joern Kottmann
Hello, could you elaborate a bit on the approach? Jörn On Tue, Apr 3, 2018 at 5:24 PM, Isuranga Perera wrote: > Hi All, > > I would like to contribute $subject feature. Appreciate if anyone can guide > me through the process. > > Best Regards > Isuranga Perera

Re: [VOTE] Apache OpenNLP 1.8.4 Release Candidate

2017-12-23 Thread Joern Kottmann
+1 Jörn On Dec 21, 2017 15:44, "Jeff Zemerick" wrote: > Hi Folks, > > I have posted a first release candidate for the Apache OpenNLP 1.8.4 > release and it is ready for testing. > > The RC1 distributables can be downloaded from here: >

Re: [VOTE] Language Detector model for Apache OpenNLP 1.8.3 Release Candidate 3

2017-10-30 Thread Joern Kottmann
+1 Jörn On Mon, Oct 30, 2017 at 2:30 PM, William Colen wrote: > The Apache OpenNLP PMC would like to call for a Vote on the Language > Detector model for Apache OpenNLP 1.8.3 Release Candidate 3. > > The Release artifacts can be downloaded from: > >

Re: [VOTE] Apache OpenNLP 1.8.3 Release Candidate

2017-10-26 Thread Joern Kottmann
+1 Jörn On Thu, Oct 26, 2017 at 10:18 AM, Rodrigo Agerri wrote: > +1 (binding) > > -eval and unit tests OK > > On Wed, Oct 25, 2017 at 7:01 PM, William Colen > wrote: >> +1 binding >> >> - eval tests ok >> - unit test ok >> - build from tag ok

[ANNOUNCE] CVE-2017-12620: Apache OpenNLP XXE vulnerability

2017-10-02 Thread Joern Kottmann
files, either a dictionary or embedded inside a model package, to demonstrate this vulnerability. Credit: This issue was discovered by Nishil Shah of Salesforce. Regards, Jörn Kottmann

Re: [VOTE] Apache OpenNLP 1.8.2 Release Candidate 2

2017-09-15 Thread Joern Kottmann
The vote passed, only +1 votes were received. The following voted +1: Tommaso Teofili Suneel Marthi Jörn Kottmann Daniel Russ Jeff Zemerick Richard Eckart de Castilho William Colen Peter Thygesen Thanks for voting! Jörn On Wed, Sep 13, 2017 at 8:31 PM, Peter Thygesen <peter.thyge...@gmail.

[VOTE] Apache OpenNLP 1.8.2 Release Candidate 2

2017-09-11 Thread Joern Kottmann
Hi Folks, I have posted a second release candidate for the Apache OpenNLP 1.8.2 release and it is ready for testing. The RC 2 distributables can be downloaded from here: https://repository.apache.org/content/repositories/orgapacheopennlp-1018/org/apache/opennlp/opennlp-distr/1.8.2/ The

Re: [VOTE] Apache OpenNLP 1.8.2 Release Candidate

2017-09-07 Thread Joern Kottmann
t;> >> > +1 binding >> > >> > /PEter Thygesen >> > >> > > On 4 Sep 2017, at 23.41, Joern Kottmann <jo...@apache.org> wrote: >> > > >> > > Hi Folks, >> > > >> > > >> > > I

[VOTE] Apache OpenNLP 1.8.2 Release Candidate

2017-09-04 Thread Joern Kottmann
Hi Folks, I have posted a first release candidate for the Apache OpenNLP 1.8.2 release and it is ready for testing. The RC 1 distributables can be downloaded from here: https://repository.apache.org/content/repositories/orgapacheopennlp-1017/org/apache/opennlp/opennlp-distr/1.8.2/ The

Re: Early stopping NameFinderME

2017-08-29 Thread Joern Kottmann
get to it later tonight. > Daniel > >> On Aug 29, 2017, at 10:32 AM, Joern Kottmann <kottm...@gmail.com> wrote: >> >> Hi Daniel, >> >> do you see any issue if we expose LLThreshold and allow the user to >> change it via training parameters? >&g

Re: Early stopping NameFinderME

2017-08-29 Thread Joern Kottmann
? > > Daniel > >> On Aug 24, 2017, at 4:48 AM, Joern Kottmann <kottm...@gmail.com> wrote: >> >> You are the first one who ever asked this question. I think we have this as >> an option already on the gis trainer but it is not exposed all the way >> through. &

Re: Early stopping NameFinderME

2017-08-24 Thread Joern Kottmann
You are the first one who ever asked this question. I think we have this as an option already on the gis trainer but it is not exposed all the way through. Please open a jira and I can look at it next week. Jörn On Aug 21, 2017 5:11 PM, "Saurabh Jain" wrote: > Hi

Re: Problem of POSTaggerCrossValidator

2017-07-20 Thread Joern Kottmann
Hello, attachments are not allowed on this list. Could you please copy the error you got and the command you used into a mail? Jörn On Thu, Jul 20, 2017 at 6:31 AM, Santipong Thaiprayoon wrote: > To whom it may concern. > > > I used OpenNLP version 1.8.1 for

Re: Releasing a Language Detection Model

2017-07-11 Thread Joern Kottmann
t; the metadata would have the algorithm information ? > > 2. Do we publish multiple models for the same task, each trained on > different algorithms ? > > > > On Tue, Jul 11, 2017 at 9:30 AM, Joern Kottmann <kottm...@gmail.com> wrote: > >> Hello, >> &g

Re: Releasing a Language Detection Model

2017-07-11 Thread Joern Kottmann
ols – lots in my day – that can load from the CLI to >> override an >> internal classpath dependency. This is for people in environments who want >> a sensible >> / delivered internal classpath default and the ability for run-time, non >> zipped up/messing >> wi

Re: Releasing a Language Detection Model

2017-07-11 Thread Joern Kottmann
g model or always download from the > original provider? We can't guarantee that the corpus will be there > forever, not only because it changed license, but simple because the > provider is not keeping the server up anymore. > > William > > > > 2017-07-10 14:52 GMT-03:00

Re: Releasing a Language Detection Model

2017-07-11 Thread Joern Kottmann
gt;>> If the user does not provide a model to use on the CLI, can the CLI tools >>> look on the classpath for a model whose name fits the needed model (like >>> en-ner-person.bin) and if found use it automatically? >>> >>> Jeff >>> >>> >>&

Re: Releasing a Language Detection Model

2017-07-10 Thread Joern Kottmann
t; provider is not keeping the server up anymore. > > William > > > > 2017-07-10 14:52 GMT-03:00 Joern Kottmann <kottm...@gmail.com>: > >> Hello all, >> >> since Apache OpenNLP 1.8.1 we have a new language detection component >> which like all our compone

Releasing a Language Detection Model

2017-07-10 Thread Joern Kottmann
Hello all, since Apache OpenNLP 1.8.1 we have a new language detection component which like all our components has to be trained. I think we should release a pre-build model for it trained on the Leipzig corpus. This will allow the majority of our users to get started very quickly with language

Re: [VOTE] Apache OpenNLP 1.8.1 Release Candidate 3

2017-07-07 Thread Joern Kottmann
+1 i did run the eval the tests and they passed Jörn On Fri, Jul 7, 2017 at 1:06 PM, Bruno P. Kinoshita wrote: > Build passing OK with the following environment: > Apache Maven 3.3.9 (bb52d8502b132ec0a5a3f4c09453c07478323dc5; > 2015-11-11T05:41:47+13:00) >

Re: Document Categorizer based on Glove + LSTM (powered by DL4J)

2017-07-05 Thread Joern Kottmann
d, Jul 5, 2017 at 8:26 AM, Chris Mattmann <mattm...@apache.org> > wrote: > > > Thamme, great job! > > > > (proud academic dad) > > > > Cheers, > > Chris > > > > > > > > >

Re: Title: [VOTE] Apache OpenNLP 1.8.1 Release Candidate 2

2017-07-05 Thread Joern Kottmann
hs and sigs > 2. clean build from {src} * {tar, zip} and all tests pass > > > On Tue, Jul 4, 2017 at 9:16 AM, Joern Kottmann <kottm...@gmail.com> wrote: > >> Hi Folks, >> >> >> I have posted a 2nd release candidate for the Apache OpenNLP 1.8.1 >> rel

Re: [VOTE] Apache OpenNLP 1.8.1 Release Candidate

2017-07-04 Thread Joern Kottmann
Thank you very much for that info. We reverted the change we did to the sentence detector and will do this in a release after 1.8.1. RC 2 is now available. Jörn On Sun, Jul 2, 2017 at 9:25 PM, Richard Eckart de Castilho <r...@apache.org> wrote: > On 02.07.2017, at 19:13, Joern Kottma

[GitHub] opennlp pull request #242: OPENNLP-1108: Set default eos char to null

2017-07-03 Thread kottmann
Github user kottmann closed the pull request at: https://github.com/apache/opennlp/pull/242 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] opennlp pull request #243: Revert "OPENNLP-1082: Add EOS to SDEventStream if...

2017-07-03 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/243 Revert "OPENNLP-1082: Add EOS to SDEventStream if missing" This reverts commit b5b6d5c27443e1837b80b089206aad480852cd1c. Thank you for contributing to Apache OpenNLP.

[GitHub] opennlp pull request #242: OPENNLP-1108: Set default eos char to null

2017-07-03 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/242 OPENNLP-1108: Set default eos char to null Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you to ensure the following steps

Re: [VOTE] Apache OpenNLP 1.8.1 Release Candidate

2017-07-02 Thread Joern Kottmann
Hello, one question, did you retrain or use existing models? Jörn On Sat, Jul 1, 2017 at 10:20 PM, Richard Eckart de Castilho wrote: > Hi all, > > I ran a DKPro Core build against the RC. Looks mostly fine. No code changes > are required after switching from 1.8.0 to 1.8.1.

1.8.1 release

2017-07-01 Thread Joern Kottmann
Dear all, We will be making a 1.8.1 release of OpenNLP in the next days. All issues in jira are closed now. Jörn

[GitHub] opennlp pull request #239: OPENNLP-1102: Adjust test for contraction change

2017-06-30 Thread kottmann
Github user kottmann closed the pull request at: https://github.com/apache/opennlp/pull/239 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] opennlp pull request #240: OPENNLP-1105: Add a profile and category for high...

2017-06-30 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/240 OPENNLP-1105: Add a profile and category for high mem tests Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you to ensure

[GitHub] opennlp pull request #239: OPENNLP-1102: Adjust test for contraction change

2017-06-29 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/239 OPENNLP-1102: Adjust test for contraction change Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you to ensure the following

[GitHub] opennlp pull request #238: Revert merging of sentiment work, no consent to m...

2017-06-29 Thread kottmann
Github user kottmann closed the pull request at: https://github.com/apache/opennlp/pull/238 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

Re: [GitHub] opennlp pull request #238: Revert merging of sentiment work, no consent to m...

2017-06-29 Thread Joern Kottmann
One more thing, in case we check in models for unit tests we need to be able to train them again, we might not support those models forever and then it would be bad if we can't use the tests anymore or need to repair them by hand. Jörn On Thu, Jun 29, 2017 at 7:18 PM, Joern Kottmann <ko

Re: Missing serializer for postagger.bin

2017-06-29 Thread Joern Kottmann
This is fixed now in the master branch, would you mind to try it again? Jörn On Wed, Jun 14, 2017 at 4:31 PM, Joern Kottmann <kottm...@gmail.com> wrote: > We have to fix this, William wrote a unit test to reproduce it. > > Jörn > > On Fri, Jun 9, 2017 at 4:31 PM, Dam

Re: [GitHub] opennlp pull request #238: Revert merging of sentiment work, no consent to m...

2017-06-29 Thread Joern Kottmann
ennlp/tools/sentiment/sample_train_categ2 > (for categorical/multi-class) > > We can also do similar files where instead of multi-class, we just use > pos/neg as the label. > > Cheers, > Chris > > > > > > On 6/29/17, 2:35 AM, "Joern Kottmann" <kottm...

Re: [VOTE] Migrate our main repositories to GitHub

2017-06-29 Thread Joern Kottmann
Is there some rush here? > > Cheers, > Chris > > > > > On 6/28/17, 3:57 AM, "Joern Kottmann" <kottm...@gmail.com> wrote: > > The vote passes, only +1 votes have been received: > +1 Mark G > +1 Rodrigo Agerri > +1 Jeff Zemerick >

Re: [GitHub] opennlp pull request #238: Revert merging of sentiment work, no consent to m...

2017-06-29 Thread Joern Kottmann
e”. >> It was “The PR was merged wrong in ways 1-4 and I’m going to revert it.” >> >> That’s pretty off-putting to someone who is semi-new like me and like Ana. >> >> Anyways, go ahead and revert it. Sorry to have caused any issues. >> >> Chris >> &

Re: [VOTE] Migrate our main repositories to GitHub

2017-06-28 Thread Joern Kottmann
The vote passes, only +1 votes have been received: +1 Mark G +1 Rodrigo Agerri +1 Jeff Zemerick +1 Suneel Marthi +1 Jörn Kottmann +1 William Colen +1 Dan Russ +1 Anthony Beylerian +1 Chris Mattmann +1 Oleg Tikhonov +1 Tommaso Teofili Jörn On Wed, Jun 28, 2017 at 10:27 AM, Tommaso Teofili

[GitHub] opennlp-site issue #21: OPENNLP-1045: Add Git development page (adapted from...

2017-06-27 Thread kottmann
Github user kottmann commented on the issue: https://github.com/apache/opennlp-site/pull/21 @kinow this will now change a bit again due to the migration to GitHub, I propose we hold it off another week. --- If your project is set up for it, you can reply to this email and have your

[GitHub] opennlp-site pull request #21: OPENNLP-1045: Add Git development page (adapt...

2017-06-27 Thread kottmann
Github user kottmann commented on a diff in the pull request: https://github.com/apache/opennlp-site/pull/21#discussion_r124306989 --- Diff: src/main/jbake/content/using-git.ad --- @@ -0,0 +1,178 @@ + + Licensed to the Apache Software Foundation (ASF) under one

[VOTE] Migrate our main repositories to GitHub

2017-06-27 Thread Joern Kottmann
Hello all, lets decide here if we want to move our main repository, currently hosted at Apache to GitHub instead. This will make our process a bit easier because we can eliminate one remote from our workflow. [ ] +1 Migrate all repositories to GitHub [ ] -1 Do not migrate, because...

Re: [VOTE] Migrate our main repositories to GitHub

2017-06-27 Thread Joern Kottmann
+1 Jörn On Tue, Jun 27, 2017 at 12:30 PM, Joern Kottmann <kottm...@gmail.com> wrote: > Hello all, > > lets decide here if we want to move our main repository, currently > hosted at Apache to GitHub instead. This will make our process a bit > easier because we can eliminat

[GitHub] opennlp pull request #238: Revert merging of sentiment work, no consent to m...

2017-06-27 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/238 Revert merging of sentiment work, no consent to merge it Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you to ensure

[GitHub] opennlp pull request #237: OPENNLP-1092: Fix pos model serialization bug

2017-06-26 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/237 OPENNLP-1092: Fix pos model serialization bug Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you to ensure the following

[GitHub] opennlp pull request #236: OPENNLP-1097: Enable the normalizers by default i...

2017-06-22 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/236 OPENNLP-1097: Enable the normalizers by default in langdetect Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you to ensure

[GitHub] opennlp pull request #235: OPENNLP-1096: Swap for loops in ngram generation ...

2017-06-22 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/235 OPENNLP-1096: Swap for loops in ngram generation to be cache friendly Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you

[GitHub] opennlp-site pull request #11: OPENNLP-1045: Git documentation for developer...

2017-06-22 Thread kottmann
Github user kottmann commented on a diff in the pull request: https://github.com/apache/opennlp-site/pull/11#discussion_r123445407 --- Diff: src/main/jbake/content/using-git.ad --- @@ -0,0 +1,113 @@ + + Licensed to the Apache Software Foundation (ASF) under one

[GitHub] opennlp-site pull request #11: OPENNLP-1045: Git documentation for developer...

2017-06-22 Thread kottmann
Github user kottmann commented on a diff in the pull request: https://github.com/apache/opennlp-site/pull/11#discussion_r123445203 --- Diff: src/main/jbake/content/using-git.ad --- @@ -0,0 +1,113 @@ + + Licensed to the Apache Software Foundation (ASF) under one

[GitHub] opennlp-site pull request #11: OPENNLP-1045: Git documentation for developer...

2017-06-22 Thread kottmann
Github user kottmann commented on a diff in the pull request: https://github.com/apache/opennlp-site/pull/11#discussion_r123445064 --- Diff: src/main/jbake/content/using-git.ad --- @@ -0,0 +1,113 @@ + + Licensed to the Apache Software Foundation (ASF) under one

[GitHub] opennlp-site pull request #21: OPENNLP-1045: Add Git development page (adapt...

2017-06-22 Thread kottmann
Github user kottmann commented on a diff in the pull request: https://github.com/apache/opennlp-site/pull/21#discussion_r123442070 --- Diff: src/main/jbake/content/using-git.ad --- @@ -0,0 +1,178 @@ + + Licensed to the Apache Software Foundation (ASF) under one

[GitHub] opennlp-site pull request #21: OPENNLP-1045: Add Git development page (adapt...

2017-06-22 Thread kottmann
Github user kottmann commented on a diff in the pull request: https://github.com/apache/opennlp-site/pull/21#discussion_r123438951 --- Diff: src/main/jbake/content/using-git.ad --- @@ -0,0 +1,178 @@ + + Licensed to the Apache Software Foundation (ASF) under one

[GitHub] opennlp pull request #232: Remove pmap indirection

2017-06-16 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/232 Remove pmap indirection Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you to ensure the following steps have been taken

Re: Missing serializer for postagger.bin

2017-06-14 Thread Joern Kottmann
tackTrace(); > >> } > >> > >> } > >> } > >> } > >> > >> public static POSModel loadPosTagger (String modelName) { > >> > >> try (InputStream modelIn = new

Re: Missing serializer for postagger.bin

2017-06-07 Thread Joern Kottmann
opennlp-tools > 1.8.0 > > > Do i need others dependencies too? > > > > 2017-06-07 14:53 GMT+02:00 Joern Kottmann <kottm...@gmail.com>: > > > This should be working. Did you test with 1.8.0? > > > > Jörn > > > &

Re: Missing serializer for postagger.bin

2017-06-07 Thread Joern Kottmann
This should be working. Did you test with 1.8.0? Jörn On Mon, Jun 5, 2017 at 3:43 PM, Damiano Porta wrote: > Hello, > i am using the POSTaggerFeatureGenerator via generators.xml > > > > during the training i add this model in the resources doing: > >

[GitHub] opennlp pull request #226: OPENNLP-1088: Reduce fork count for eval tests

2017-06-06 Thread kottmann
Github user kottmann closed the pull request at: https://github.com/apache/opennlp/pull/226 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] opennlp pull request #225: OPENNLP-1087: Add convenience methods to load fro...

2017-05-31 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/225 OPENNLP-1087: Add convenience methods to load from Path Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you to ensure

[GitHub] opennlp pull request #224: OPENNLP-1085: Add methods to write model to File ...

2017-05-31 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/224 OPENNLP-1085: Add methods to write model to File or Path Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you to ensure

[GitHub] opennlp pull request #223: OPENNLP-1086: Refactor the Data Indexers

2017-05-30 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/223 OPENNLP-1086: Refactor the Data Indexers The following has been done: - Use Java 8 streams where it makes sense - Deduplicate the index method and have one common one - Avoid having

[GitHub] opennlp pull request #216: OPENNLP-1076: Add validation of spans to Sentence...

2017-05-24 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/216 OPENNLP-1076: Add validation of spans to SentenceSample Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you to ensure

[GitHub] opennlp pull request #215: OPENNLP-1075 Add streams for sentence and token s...

2017-05-23 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/215 OPENNLP-1075 Add streams for sentence and token samples for conllu Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you

Re: opennlp.tools.coref.mention.JWNLDictionary;

2017-05-23 Thread Joern Kottmann
The coref component was removed from OpenNLP quite some time ago because we didn't have a maintainer anymore for it. The JWNLDictionary class was part of that removal, you can still find the code in the OpenNLP Sandbox:

[GitHub] opennlp pull request #214: OPENNLP-1074: Reduce visibility of eval methods

2017-05-22 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/214 OPENNLP-1074: Reduce visibility of eval methods Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you to ensure the following

[GitHub] opennlp-site pull request #14: OPENNLP-1067: Use variables in jbake.properti...

2017-05-19 Thread kottmann
Github user kottmann commented on a diff in the pull request: https://github.com/apache/opennlp-site/pull/14#discussion_r117455933 --- Diff: src/main/jbake/content/maven-dependency.ad --- @@ -35,59 +34,75 @@ all transient dependencies are resolved automatically. To use

[GitHub] opennlp-site pull request #14: OPENNLP-1067: Use variables in jbake.properti...

2017-05-19 Thread kottmann
Github user kottmann commented on a diff in the pull request: https://github.com/apache/opennlp-site/pull/14#discussion_r117455817 --- Diff: src/main/jbake/content/docs/index.ad --- @@ -25,12 +25,12 @@ There exists a manual and Javadoc API documentation for Apache OpenNLP

[ANNOUNCE] Apache OpenNLP 1.8.0 Release

2017-05-19 Thread Joern Kottmann
The Apache OpenNLP team is pleased to announce the release of version 1.8.0 of Apache OpenNLP. The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation,

[GitHub] opennlp pull request #196: [1.8.1] OPENNLP-1054: Remove deprecated Heap and ...

2017-05-19 Thread kottmann
Github user kottmann closed the pull request at: https://github.com/apache/opennlp/pull/196 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] opennlp pull request #212: OPENNLP-1068: Use current version to generate cha...

2017-05-19 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/212 OPENNLP-1068: Use current version to generate changes list Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you to ensure

[GitHub] opennlp-site pull request #12: Update site for 1.8.0 release

2017-05-18 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp-site/pull/12 Update site for 1.8.0 release You can merge this pull request into a Git repository by running: $ git pull https://github.com/kottmann/opennlp-site 180_release Alternatively you can

Re: [VOTE] Apache OpenNLP 1.8.0 Release Candidate 3

2017-05-18 Thread Joern Kottmann
The vote passes, only +1 votes were receive: +1 Bruno +1 Tommaso +1 William +1 Jörn +1 Jeff +1 Daniel +1 Richard +1 Joey +1 Suneel +1 Rodrigo Thanks for voting! Jörn On Wed, 2017-05-17 at 23:48 +0200, Joern Kottmann wrote: > The Apache OpenNLP PMC would like to call for a Vote on Apa

Re: [VOTE] Apache OpenNLP 1.8.0 Release Candidate 3

2017-05-18 Thread Joern Kottmann
nlp/blob/73c8e5b9d8e055fefb53f7f3c2487d > > 05c9788c6a/opennlp-tools/src/main/java/opennlp/tools/util/featuregen/ > > POSTaggerNameFeatureGenerator.java#L59 > > > Plus other NullPointerException's that can be prevented, and other > minor > > > issues. Not blockers for the release though, IMO. >

[GitHub] opennlp pull request #211: Add support to train on leipzig

2017-05-18 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/211 Add support to train on leipzig Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you to ensure the following steps have been

Re: CoReference

2017-05-18 Thread Joern Kottmann
"Damiano Porta" <damianopo...@gmail.com> ha scritto: > > > Oh my wrong. Pardon. > > Do we have accuracy statistics? > > > > Il 18 mag 2017 14:59, "Joern Kottmann" <kottm...@gmail.com> ha scritto: > > > >> This is for linking

Re: CoReference

2017-05-18 Thread Joern Kottmann
This is for linking entities in one document, e.g. first name mention to a full name mention, or to he, she, it. Jörn On Thu, May 18, 2017 at 1:27 PM, Damiano Porta wrote: > Hi, thanks but I need to link entities to each others . I do not need to > link entities to

[VOTE] Apache OpenNLP 1.8.0 Release Candidate 3

2017-05-17 Thread Joern Kottmann
The Apache OpenNLP PMC would like to call for a Vote on Apache OpenNLP 1.8.0 Release Candidate 3.  The RC 3 distributables can be downloaded from here: https://repository.apache.org/content/repositories/orgapacheopennlp-101 3/org/apache/opennlp/opennlp-distr/1.8.0/ The release was made from the

[GitHub] opennlp pull request #205: OPENNLP-1064: Disable evalDutchMaxentQn test

2017-05-17 Thread kottmann
Github user kottmann closed the pull request at: https://github.com/apache/opennlp/pull/205 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] opennlp pull request #206: [WIP] OPENNLP-1065: Use ISO-639-3 in test code

2017-05-17 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/206 [WIP] OPENNLP-1065: Use ISO-639-3 in test code Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you to ensure the following

[GitHub] opennlp pull request #205: OPENNLP-1064: Disable evalDutchMaxentQn test

2017-05-17 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/205 OPENNLP-1064: Disable evalDutchMaxentQn test Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you to ensure the following

[GitHub] opennlp pull request #201: Opennlp 1060

2017-05-15 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/201 Opennlp 1060 Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you to ensure the following steps have been taken

Re: [VOTE] Apache OpenNLP 1.8.0 Release Candidate 2

2017-05-15 Thread Joern Kottmann
15, 2017 at 6:21 PM, Richard Eckart de Castilho <r...@apache.org> wrote: > > On 15.05.2017, at 16:35, Joern Kottmann <kottm...@gmail.com> wrote: > > > > Richard, I believe I found the problem with the parser, would you mind to > > take a look? > > > &g

Re: [VOTE] Apache OpenNLP 1.8.0 Release Candidate 2

2017-05-15 Thread Joern Kottmann
Richard, I believe I found the problem with the parser, would you mind to take a look? This PR should fix it: https://github.com/apache/opennlp/pull/199 Jörn On Mon, May 15, 2017 at 4:14 PM, Richard Eckart de Castilho wrote: > Hi Rodrigo, > > On 15.05.2017, at 15:36, Rodrigo

[GitHub] opennlp pull request #199: OPENNLP-1059 Set model version before creating th...

2017-05-15 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/199 OPENNLP-1059 Set model version before creating the POS Model Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you to ensure

Re: [VOTE] Apache OpenNLP 1.8.0 Release Candidate 2

2017-05-15 Thread Joern Kottmann
://github.com/apache/opennlp/blob/3df659b9bfb02084e782f1e8b6ec716f56e0611c/opennlp-tools/src/test/java/opennlp/tools/eval/OntoNotes4ParserEval.java#L70 On Sat, May 13, 2017 at 10:35 PM, Richard Eckart de Castilho <r...@apache.org > wrote: > Hi all, > > > On 11.05.2017, at 18:

Re: Error when processing doap file http://opennlp.apache.org/doap_opennlp.rdf:

2017-05-12 Thread Joern Kottmann
Thanks for forwarding this to the dev list. The file is now available again. Jörn On Fri, May 12, 2017 at 10:46 AM, sebb wrote: > -- Forwarded message -- > From: Projects > Date: 12 May 2017 at 03:00 > Subject: Error when processing doap

[VOTE] Apache OpenNLP 1.8.0 Release Candidate 2

2017-05-11 Thread Joern Kottmann
The Apache OpenNLP PMC would like to call for a Vote on Apache OpenNLP 1.8.0 Release Candidate 2.  The RC 2 distributables can be downloaded from here: https://repository.apache.org/content/repositories/orgapacheopennlp-101 2/org/apache/opennlp/opennlp-distr/1.8.0/ The release was made from the

[ANNOUNCE] New website for Apache OpenNLP

2017-05-11 Thread Joern Kottmann
Hello all, we launched a redesigned new web site for Apache OpenNLP with a new logo - check it out at https://opennlp.apache.org Regards, The Apache OpenNLP Team

Re: [VOTE] Apache OpenNLP 1.8.0 Release Candidate

2017-05-11 Thread Joern Kottmann
I am canceling the vote due to the above mentioned bug. Lets prepare another RC which has this issue fixed. Jörn On Thu, May 11, 2017 at 9:51 AM, Joern Kottmann <kottm...@gmail.com> wrote: > I am changing my vote to -1 due to a bug i the DictionaryLemmatizer, in > case the word and

Re: [VOTE] Apache OpenNLP 1.8.0 Release Candidate

2017-05-11 Thread Joern Kottmann
<jzemer...@apache.org> wrote: > +1 non-binding > > Built and tested on Ubuntu 16.04 and Amazon Linux 2017.03.0 with OpenJDK8. > NOTICE and LICENSE files look good. > Created and tested a token name finder model. > > Jeff > > > On Tue, May 9, 2017 at 2:41 PM, Joern

[GitHub] opennlp pull request #196: [1.8.1] OPENNLP-1054: Remove deprecated Heap and ...

2017-05-10 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/196 [1.8.1] OPENNLP-1054: Remove deprecated Heap and HeapList Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you to ensure

[VOTE] Apache OpenNLP 1.8.0 Release Candidate

2017-05-09 Thread Joern Kottmann
The Apache OpenNLP PMC would like to call for a Vote on Apache OpenNLP 1.8.0 Release Candidate 1.  The RC 1 distributables can be downloaded from here: https://repository.apache.org/content/repositories/orgapacheopennlp-101 1/org/apache/opennlp/opennlp-distr/1.8.0/ The release was made from the

[GitHub] opennlp pull request #185: OPENNLP-1046: Correctly join tokens to text strin...

2017-04-26 Thread kottmann
Github user kottmann closed the pull request at: https://github.com/apache/opennlp/pull/185 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] opennlp pull request #185: OPENNLP-1046: Correctly join tokens to text strin...

2017-04-26 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/185 OPENNLP-1046: Correctly join tokens to text string The text was one space too long which results in a different parse tree if the method is used to reproduce an existing parse tree

[GitHub] opennlp pull request #184: [WIP] OPENNLP-1021: Change xv folds from 10 to 5 ...

2017-04-24 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/184 [WIP] OPENNLP-1021: Change xv folds from 10 to 5 to reduce runtime Thank you for contributing to Apache OpenNLP. In order to streamline the review of the contribution we ask you

  1   2   3   4   >