> On Feb 1, 2022, at 10:44 PM, David Carlson <[email protected]>
> wrote:
>
> I am currently running GnuCash 3.8 in Ubuntu 20.04. This release has the
> 'New' generic importer with the revised Bayesian matching. I have found
> that sometimes it is good at assigning accounts to incoming transactions
> and other times it is awful. One example is transactions where Starbucks
> recharges my wife's online app with $25.00 two or three times a month,
> GnuCash has seen this transaction 130 times according to the Import Map
> Editor, yet it cannot assign this transaction to the correct expense
> account. There are several other common transactions at certain businesses
> that do not get assigned even though they have appeared many times. It
> used to work better in the old version that was in the 2.6.x releases. Is
> this a bug?
Maybe, depending on why the matcher gets it wrong. The matcher uses a space as
the separator to break the description into tokens. Every time you match a
transfer account to a transaction, each token in the transaction is added to
that account or, if it is already there, its count is incremented. The Import
Map Editor shows those token/account pairings with their counts.
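
As a rough illustration of that bookkeeping (not GnuCash's actual code), here is
a minimal Python sketch. The names import_map, tokenize and record_match, the
account names, and the sample descriptions are all made up for the example, and
splitting on any run of whitespace is an assumption.

```python
from collections import defaultdict

# Hypothetical in-memory stand-in for the import map:
# token -> account -> number of times that pairing was confirmed.
import_map = defaultdict(lambda: defaultdict(int))


def tokenize(description):
    """Split a description into tokens (assumption: any run of whitespace separates tokens)."""
    return description.split()


def record_match(description, account):
    """Add each token to the account, or increment its count if already present."""
    for token in tokenize(description):
        import_map[token][account] += 1


# Made-up descriptions, purely for illustration.
record_match("STARBUCKS 800-782-7282 WA", "Expenses:Dining")
record_match("STARBUCKS 800-782-7282 WA", "Expenses:Dining")
record_match("STARBUCKS STORE 1234", "Expenses:Coffee")

for token, per_account in sorted(import_map.items()):
    print(token, dict(per_account))
```

Note that a token such as a store number or a date that differs in every
download gets its own entry each time, so it never builds up a useful count.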
The match algorithm is a variation on the "naive Bayes filter" commonly used in
spam detectors like SpamAssassin; see
https://en.wikipedia.org/wiki/Naive_Bayes_spam_filtering for an explanation.
The modification is that a spam filter makes a single yes-no decision, while
the match algorithm has to pick the most likely account from two or more
candidates.
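
To make the "most likely among several accounts" idea concrete, here is a
textbook multi-class naive Bayes scorer in Python. It is only a sketch: the
add-one smoothing, the prior derived from token totals, the function name
most_likely_account, and the sample counts are assumptions for illustration,
and GnuCash's own formula may well differ in its details.

```python
import math


def most_likely_account(tokens, counts):
    """Pick the account with the highest naive Bayes log score.

    counts maps token -> {account: count}, like the pairings shown in the
    Import Map Editor. Returns (best_account, scores_by_account).
    """
    totals = {}
    for per_account in counts.values():
        for account, n in per_account.items():
            totals[account] = totals.get(account, 0) + n
    # Ignore accounts that have no recorded tokens at all.
    totals = {a: n for a, n in totals.items() if n > 0}
    grand_total = sum(totals.values())
    vocab_size = len(counts)

    scores = {}
    for account, total in totals.items():
        # Prior: share of all recorded tokens that belong to this account
        # (a simplification chosen for this sketch).
        score = math.log(total / grand_total)
        for token in tokens:
            n = counts.get(token, {}).get(account, 0)
            # Add-one smoothing so an unseen token does not zero the score.
            score += math.log((n + 1) / (total + vocab_size))
        scores[account] = score
    return max(scores, key=scores.get), scores


# Made-up counts: one distinctive token and two generic ones.
counts = {
    "STARBUCKS": {"Expenses:Dining": 130, "Expenses:Groceries": 2},
    "CARD": {"Expenses:Dining": 130, "Expenses:Groceries": 400},
    "PURCHASE": {"Expenses:Dining": 130, "Expenses:Groceries": 400},
}

best, _ = most_likely_account(["STARBUCKS", "CARD", "PURCHASE"], counts)
generic_only, _ = most_likely_account(["CARD", "PURCHASE"], counts)
print(best, generic_only)
```

In this sketch the distinctive token decides the first case, while a description
made up only of generic tokens goes to whichever account has seen those tokens
most often. That is the kind of effect the per-token counts can reveal.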
Now that you know what to look for, you can analyze the tokens in your
Starbucks recharge transactions, using the counts in the Import Map Editor, to
figure out why the matching algorithm might be getting it wrong.
Regards,
John Ralls