1.8.1 release

2017-07-01 Thread Joern Kottmann
Dear all, we will be making a 1.8.1 release of OpenNLP in the next few days. All issues in Jira are now closed. Jörn

Re: [VOTE] Apache OpenNLP 1.8.1 Release Candidate

2017-07-01 Thread Suneel Marthi
Here's my +1 binding
1. Verified the sigs and hashsums
2. Ran a clean build of {src} * {zip, tar} and all unit tests pass
3. Verified RAT check
On Sat, Jul 1, 2017 at 11:20 AM, Suneel Marthi wrote:
> The Apache OpenNLP PMC would like to call for a Vote on Apache OpenNLP

[VOTE] Apache OpenNLP 1.8.1 Release Candidate

2017-07-01 Thread Suneel Marthi
The Apache OpenNLP PMC would like to call for a Vote on Apache OpenNLP 1.8.1 Release Candidate. The Release artifacts can be downloaded from: https://repository.apache.org/content/repositories/orgapacheopennlp-1014/org/apache/opennlp/opennlp-distr/1.8.1/ The release was made from the Apache

Re: Spelling correction

2017-07-01 Thread Damiano Porta
I also read about the noisy channel model. I could work on this if you think it is a good idea. Damiano On 1 Jul 2017 20:16, "Suneel Marthi" wrote: > 'Spelling Correction' has been the most popular ask from audience at my > recent NLP talks, it would be great to have this feature

Spelling correction

2017-07-01 Thread Damiano Porta
Hello everybody, I am working on data normalization for very noisy sentences with many spelling errors. Do you know of a good paper that explains how to build a model to fix this kind of problem? I can share the code without problems if you are interested in integrating it into OpenNLP.

Re: Spelling correction

2017-07-01 Thread Daniel Russ
Damiano, There is a lot of research on spelling correction. Here is a paper from a group out of the National Library of Medicine https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2137159/ . They also have a product called GSpell

[GitHub] opennlp-sandbox pull request #3: text sequence classification using Glove an...

2017-07-01 Thread thammegowda
GitHub user thammegowda opened a pull request: https://github.com/apache/opennlp-sandbox/pull/3 text sequence classification using Glove and RNN/LSTMs Summary: + Added a dataset reader for feeding mini batches to DL4J's network + GloVe embeddings to vectorize text using