Re: Multiple models and String.intern

2017-02-08 Thread Jeffrey Zemerick
I did not know that about StringTableSize. I thought it was more of a hard limit. That's good to know. Thanks On Wed, Feb 8, 2017 at 2:16 PM, Joern Kottmann wrote: > The StringTableSize doesn't limit the amount of Strings that can be stored > in the pool, if the size is too

Re: Multiple models and String.intern

2017-02-08 Thread Joern Kottmann
The StringTableSize doesn't limit the amount of Strings that can be stored in the pool, if the size is too small it just gets slower. This would only be done for loading models, querying the model wouldn't be affected. The predicate / feature strings would be interned. Jörn On Wed, Feb 8, 2017

Re: Multiple models and String.intern

2017-02-08 Thread Jeffrey Zemerick
Would it be possible to have an option or setting somewhere that determines if string pooling is used? The option would provide backward compatibility in case someone has to adjust the -XX:StringTableSize because their existing models exceed the default JVM limit, and an option would also be

[GitHub] opennlp pull request #109: [WIP] Deprecate low-level feature constructors an...

2017-02-08 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/opennlp/pull/109 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] opennlp pull request #113: OPENNLP-964: Ignore LICENSE, NOTICE and README fi...

2017-02-08 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/opennlp/pull/113 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

Re: Help Required in Code

2017-02-08 Thread ABHISHEK MAITI
I get it now. But shouldn't it be changed to char [] so it is more widely understood by new contributors? I found it in use a bit too frequently. On Feb 8, 2017 20:51, "Suneel Marthi" wrote: > In Java both are valid syntaxes to represent an array of chars, the > preferred

Re: Help Required in Code

2017-02-08 Thread Suneel Marthi
In Java both are valid syntaxes to represent an array of chars, the preferred syntax should have been char[] eosCharacters. The getter method actually returns a char[] public char[] getEndOfSentenceCharacters() { return eosCharacters; } On Wed, Feb 8, 2017 at 10:01 AM, ABHISHEK MAITI

Re: Help Required in Code

2017-02-08 Thread Russ, Daniel (NIH/CIT) [E]
I am not an expert on this part of the code, but I believe the idea is that there are multiple characters that can end a sentence (in English, think .!?). So it might be looking if any of the characters in the text match any of the end of sentence characters. Daniel On 2/8/17, 10:01 AM,

FINAL REMINDER: CFP for ApacheCon closes February 11th

2017-02-08 Thread Rich Bowen
Dear Apache Enthusiast, This is your FINAL reminder that the Call for Papers (CFP) for ApacheCon Miami is closing this weekend - February 11th. This is your final opportunity to submit a talk for consideration at this event. This year, we are running several mini conferences in conjunction with

[GitHub] opennlp pull request #117: OPENNLP-978: Set name finder defaults to perceptr...

2017-02-08 Thread kottmann
GitHub user kottmann opened a pull request: https://github.com/apache/opennlp/pull/117 OPENNLP-978: Set name finder defaults to perceptron and cutoff zero You can merge this pull request into a Git repository by running: $ git pull https://github.com/kottmann/opennlp