This is an automated email from the ASF dual-hosted git repository.

krickert pushed a change to branch OPENNLP-1850-4-docs
in repository https://gitbox.apache.org/repos/asf/opennlp.git


 discard 121c25885 OPENNLP-1850 Docs review nits: declare xmlns:xlink; populate 
second ids2Labels example
 discard 0b644851d OPENNLP-1850 Docs review nits: populate ids2Labels example; 
rule-based hyphenation
 discard 71ce84efa OPENNLP-1850 Mirror the Extended_Pictographic emoji caveat 
in the tokenizer manual
 discard 0ddea4657 OPENNLP-1850 Tighten normalizer manual wording (review nits)
 discard e659b3958 OPENNLP-1850 Document the supplementary-dash offset shift in 
the DL fold options
 discard 2309357ac OPENNLP-1850 Document the offset-aware substitution folds 
(quotes, digits, ellipsis, bullets, umlaut)
 discard 46387efff OPENNLP-1850 Name the OffsetMappingNameFinder capability 
interface in the manual
 discard e838ab792 OPENNLP-1850 Document the offset-aware normalization 
pipeline (buildAligned)
 discard e3c1ff98e OPENNLP-1850 Document Unicode normalization, the UAX #29 
tokenizer, and DL handling
 discard ed5c7777d OPENNLP-1850 Review nits: extract testable DL guards; 
merge-copy; capitalize msgs; migration note
 discard b9d6972bc OPENNLP-1850 Make mergeOverlappingSpans O(n log n) (dl)
 discard c799b80bf OPENNLP-1850 Reject non-finite logits in softmax, not just 
NaN (dl)
 discard ee0e294fc OPENNLP-1850 Fully-qualify TokenNameFinder javadoc links in 
NameFinderDL
 discard 74f3c6425 OPENNLP-1850 Fail loud on corrupt document-classification 
model output
 discard 498539b5c OPENNLP-1850 Fail fast on null finder input; fix the GPU 
eval test options
 discard 4e4713263 OPENNLP-1850 Harden fail-loud paths in the DL components
 discard a4444192e OPENNLP-1850 Add real-model chunk-boundary eval tests; drop 
dead label constants
 discard 76003e15a OPENNLP-1850 Resolve overlapping chunk spans and compose the 
input alignment
 discard 9d28db088 OPENNLP-1850 Add OffsetMappingNameFinder capability 
interface and a findInOriginal end-to-end test
 discard 0ec30f676 OPENNLP-1850 Offset-safe, Unicode-aware input normalization 
in the DL components
 discard 13e46418b OPENNLP-1850 Review nits: add Turkish profile; derive 
coverage from the enum (profiles)
 discard 859146c9c OPENNLP-1850 Resolve Norwegian nb/nn to the Norwegian 
profile (profiles)
 discard 8d32dbac9 OPENNLP-1850 Per-language NormalizationProfile registry (2c)
 discard a23a51358 OPENNLP-1850 Review nits: rename dashes()->dash(); LEMMA 
doc+test; soften forward-link (Term)
 discard 58cff0120 OPENNLP-1850 Layered Term model: Term, TermAnalyzer (2b)
 discard f2d1d8cca OPENNLP-1850 Review nits: ExtendedPictographic fail-loud 
parity + doc; WordType heuristic note (tokenizer)
 discard 57b77648e OPENNLP-1850 Fail loud on a Word_Break line missing its ';' 
(tokenizer)
 discard c19c4fc11 OPENNLP-1850 UAX #29 word tokenizer: WordSegmenter, 
WordTokenizer, WordType (2a)
 discard 702acc52f OPENNLP-1850 Review nits: soften DL forward-link; fix 
LineBreakPreserving opener (alignment)
 discard 1d8f582c0 OPENNLP-1850 Offset/alignment layer: Alignment, AlignedText, 
buildAligned, *Aligned (1b)
     add b2a17a1fc OPENNLP-1850 Review nits: clearer accent-fold javadoc; drop 
Lucene/BM25 framing; rename searchDefault->defaultChain
     add 8bfa60ced OPENNLP-1850 Offset/alignment layer: Alignment, AlignedText, 
buildAligned, *Aligned (1b)
     add 2bed55526 OPENNLP-1850 Review nits: soften DL forward-link; fix 
LineBreakPreserving opener (alignment)
     add 6b4b32637 OPENNLP-1850 UAX #29 word tokenizer: WordSegmenter, 
WordTokenizer, WordType (2a)
     add 3fa5fa6e1 OPENNLP-1850 Fail loud on a Word_Break line missing its ';' 
(tokenizer)
     add 9c8e3fc91 OPENNLP-1850 Review nits: ExtendedPictographic fail-loud 
parity + doc; WordType heuristic note (tokenizer)
     add 7ba196d79 OPENNLP-1850 Layered Term model: Term, TermAnalyzer (2b)
     add db9aaa65d OPENNLP-1850 Review nits: rename dashes()->dash(); LEMMA 
doc+test; soften forward-link (Term)
     add f811ef1bb OPENNLP-1850 Review nits: TermAnalyzer javadoc references 
matchingAnalyzer()
     add 1cedb0608 OPENNLP-1850 Per-language NormalizationProfile registry (2c)
     add 7db9aced6 OPENNLP-1850 Resolve Norwegian nb/nn to the Norwegian 
profile (profiles)
     add 9b10fe3ce OPENNLP-1850 Review nits: add Turkish profile; derive 
coverage from the enum (profiles)
     add 087c4779d OPENNLP-1850 Review nits: rename 
searchAnalyzer->matchingAnalyzer; drop 'search' framing in profile docs
     add ac0b30fdf OPENNLP-1850 Offset-safe, Unicode-aware input normalization 
in the DL components
     add b78550661 OPENNLP-1850 Add OffsetMappingNameFinder capability 
interface and a findInOriginal end-to-end test
     add 282b1f49c OPENNLP-1850 Resolve overlapping chunk spans and compose the 
input alignment
     add 80c4e48b4 OPENNLP-1850 Add real-model chunk-boundary eval tests; drop 
dead label constants
     add 665758511 OPENNLP-1850 Harden fail-loud paths in the DL components
     add dbc733be8 OPENNLP-1850 Fail fast on null finder input; fix the GPU 
eval test options
     add c9f65483a OPENNLP-1850 Fail loud on corrupt document-classification 
model output
     add 6605361a8 OPENNLP-1850 Fully-qualify TokenNameFinder javadoc links in 
NameFinderDL
     add fd1195676 OPENNLP-1850 Reject non-finite logits in softmax, not just 
NaN (dl)
     add fabf88583 OPENNLP-1850 Make mergeOverlappingSpans O(n log n) (dl)
     add 105d8a17c OPENNLP-1850 Review nits: extract testable DL guards; 
merge-copy; capitalize msgs; migration note
     add 87b308a61 OPENNLP-1850 Document Unicode normalization, the UAX #29 
tokenizer, and DL handling
     add 7a44b8be3 OPENNLP-1850 Document the offset-aware normalization 
pipeline (buildAligned)
     add 62b224032 OPENNLP-1850 Name the OffsetMappingNameFinder capability 
interface in the manual
     add 49118068d OPENNLP-1850 Document the offset-aware substitution folds 
(quotes, digits, ellipsis, bullets, umlaut)
     add d8b72aa61 OPENNLP-1850 Document the supplementary-dash offset shift in 
the DL fold options
     add 26ec5ad1f OPENNLP-1850 Tighten normalizer manual wording (review nits)
     add c83eee1c7 OPENNLP-1850 Mirror the Extended_Pictographic emoji caveat 
in the tokenizer manual
     add 5000fae25 OPENNLP-1850 Docs review nits: populate ids2Labels example; 
rule-based hyphenation
     add 3dce858da OPENNLP-1850 Docs review nits: declare xmlns:xlink; populate 
second ids2Labels example
     add af56b2446 OPENNLP-1850 Review nits: manual uses 
defaultChain()/matchingAnalyzer(); drop BM25/search framing

This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version.  This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:

 * -- * -- B -- O -- O -- O   (121c25885)
            \
             N -- N -- N   refs/heads/OPENNLP-1850-4-docs (af56b2446)

You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.

Any revisions marked "omit" are not gone; other references still
refer to them.  Any revisions marked "discard" are gone forever.

No new revisions were added by this update.

Summary of changes:
 .../util/normalizer/AccentFoldCharSequenceNormalizer.java    |  6 +++---
 .../util/normalizer/CaseFoldCharSequenceNormalizer.java      |  3 +--
 .../main/java/opennlp/tools/util/normalizer/Confusables.java |  4 +---
 .../opennlp/tools/util/normalizer/NormalizationProfile.java  |  8 ++++----
 .../opennlp/tools/util/normalizer/NormalizationProfiles.java |  4 ++--
 .../java/opennlp/tools/util/normalizer/TermAnalyzer.java     |  2 +-
 .../java/opennlp/tools/util/normalizer/TextNormalizer.java   |  8 ++++----
 .../tools/util/normalizer/NormalizationProfilesTest.java     | 10 +++++-----
 .../opennlp/tools/util/normalizer/TextNormalizerTest.java    |  4 ++--
 opennlp-docs/src/docbkx/normalizer.xml                       | 12 ++++++------
 10 files changed, 29 insertions(+), 32 deletions(-)

Reply via email to