[
https://issues.apache.org/jira/browse/CODEC-187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14036585#comment-14036585
]
michael tobias commented on CODEC-187:
--------------------------------------
Thanks guys
if the codes match Steve Morse' tool then we are ok.
Yes the Ashkenazi codes will be very different but it is a specialised usage of
BMPM and is likely not used by many sites (I was thinking of using it but will
likely use generic now). I would guess that the most used will be Generic
Approx Auto and that was only slightly wrong previously.
I am way too new to this to try to apply a patch and rebuild the jar. How do I
obtain the next build when ready incorporating the updates? I will then try to
do a little language specific testing (that might have implications for some
sites).
Thomas you may have seen my post on dev re Daitch-Mokotoff soundex. Would you
be interested in porting the php code to java for inclusion in the Commons
Codec? If so contact me privately and we can discuss further including
remuneration.
regards
M
> Beider Morse Phonetic Matching producing incorrect tokens
> ---------------------------------------------------------
>
> Key: CODEC-187
> URL: https://issues.apache.org/jira/browse/CODEC-187
> Project: Commons Codec
> Issue Type: Bug
> Affects Versions: 1.9
> Reporter: michael tobias
> Priority: Minor
> Fix For: 1.10
>
> Attachments: CODEC-187.patch, CODEC-187_ashkenazi_approx_any.patch,
> CODEC-187_ashkenazi_approx_any_v2.patch
>
>
> I believe the Beider Morse Phonetic Matching algorithm was added in Commons
> Codec 1.6
> The BMPM algorithm is an EVOLVING algorithm that is currently on version 3.02
> though it had been static since version 3.01 dated 19 Dec 2011 (it was first
> available as opensource as version 1.00 on 6 May 2009).
> I can see nothing in the Commons Codec Docs to say which version of BMPM was
> implemented so I am not sure if the problem with the algorithm as coded in
> the Codec is simply an old version or whether there are more basic problems
> with the implementation.
> How do I determine the version of the algorithm that was implemented in the
> Commons Codec?
> How do we ensure that the algorithm is updated if/when the BMPM algorithm
> changes?
> How do we ensure that the algorithm as coded in the Commons Codec is accurate
> and working as expected?
--
This message was sent by Atlassian JIRA
(v6.2#6252)