This is an automated email from the ASF dual-hosted git repository.
krickert pushed a change to branch OPENNLP-1850-2a-tokenizer
in repository https://gitbox.apache.org/repos/asf/opennlp.git
discard f2d1d8cca OPENNLP-1850 Review nits: ExtendedPictographic fail-loud
parity + doc; WordType heuristic note (tokenizer)
discard 57b77648e OPENNLP-1850 Fail loud on a Word_Break line missing its ';'
(tokenizer)
discard c19c4fc11 OPENNLP-1850 UAX #29 word tokenizer: WordSegmenter,
WordTokenizer, WordType (2a)
discard 702acc52f OPENNLP-1850 Review nits: soften DL forward-link; fix
LineBreakPreserving opener (alignment)
discard 1d8f582c0 OPENNLP-1850 Offset/alignment layer: Alignment, AlignedText,
buildAligned, *Aligned (1b)
add b2a17a1fc OPENNLP-1850 Review nits: clearer accent-fold javadoc; drop
Lucene/BM25 framing; rename searchDefault->defaultChain
add 8bfa60ced OPENNLP-1850 Offset/alignment layer: Alignment, AlignedText,
buildAligned, *Aligned (1b)
add 2bed55526 OPENNLP-1850 Review nits: soften DL forward-link; fix
LineBreakPreserving opener (alignment)
add 6b4b32637 OPENNLP-1850 UAX #29 word tokenizer: WordSegmenter,
WordTokenizer, WordType (2a)
add 3fa5fa6e1 OPENNLP-1850 Fail loud on a Word_Break line missing its ';'
(tokenizer)
add 9c8e3fc91 OPENNLP-1850 Review nits: ExtendedPictographic fail-loud
parity + doc; WordType heuristic note (tokenizer)
This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version. This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:
* -- * -- B -- O -- O -- O (f2d1d8cca)
\
N -- N -- N refs/heads/OPENNLP-1850-2a-tokenizer (9c8e3fc91)
You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.
Any revisions marked "omit" are not gone; other references still
refer to them. Any revisions marked "discard" are gone forever.
No new revisions were added by this update.
Summary of changes:
.../tools/util/normalizer/AccentFoldCharSequenceNormalizer.java | 6 +++---
.../tools/util/normalizer/CaseFoldCharSequenceNormalizer.java | 3 +--
.../src/main/java/opennlp/tools/util/normalizer/Confusables.java | 4 +---
.../main/java/opennlp/tools/util/normalizer/TextNormalizer.java | 8 ++++----
.../java/opennlp/tools/util/normalizer/TextNormalizerTest.java | 4 ++--
5 files changed, 11 insertions(+), 14 deletions(-)