Re: Are there existing_entries for plugins, like with importers?

Daniele Nicolodi Tue, 13 Sep 2022 15:43:31 -0700

On 12/09/2022 11:54, John Koala wrote:

Hi,
Yes, sorry, in the context of V2 still I'm afraid...
Perhaps you already know of a "fuzzy string matcher" for transactionnarrations/payees?

I am not sure I grok the question: how could a fuzzy string matcher bespecialized for transaction narrations or payees?

I didn't have much luck with "smart_importer" and decided thescipy/numpy/etc dependency was a PITA so am (or was) thinking to knockup a plugin to complete my imported transactions.

Fuzzy matching strings is not all there is to write a machine learningclassifiers. I think that 'pip install scikit-learn' is immensely easierthan rolling your own algorithms.

Maybe if you provide more details on how smart_importer does not workfor you, someone can help you in making it work.

Is a plugin the correct idea?


I don' think so.

A plugin operates on the transactions read from a ledger after beancountafter booking (the process for which all the postings in all thetransactions are balanced, padding amounts are calculated, lots arecomputed, etc...). The transactions processed in this phase already needto have all postings completed.

Also, a plugin does not have a way to serialize the completedtransactions into a ledger. Unless you hack something together, yourplugin would run every time you load your ledger and will have to do itsjob again. This would make fixing any mistake the automaticcategorization algorithms does rather cumbersome.


Why do you thing a plugin is a better approach?

I noted that the importer is provided with an `existing_entries` list oftransactions, which seems a very useful suite of items to matchagainst. But can I reach that from the plugin?

That what? A plugin as access to all the transactions in the ledger onwhich Beancount is operating. In this context there isn't the notion ofanother ledger to which a batch of transactions will be added to.

Where/how? and is that even a good idea? (its not going to re-read theentire history for every imported transaction is it? Hmm, I'd toleratethat nonetheless :-)) I'm assuming a `beancount.loader.load_file`inside the plugin would create some recursive sillyness?

The beancount parser and loader are capable of loading more than onefile in the same process. However, there is no protection from a pluginthat recursively tries to load the Beancount ledger from which it hasbeen invoked. If you want to try the ledger filename is available as the"filename" entry to the "options_map" passed to the plugin entry point.

The way I would approach this, if you want a solution independent fromthe import framework, is to use beancount.parser.parse_file() to parsethe transactions from a ledger, use the technique you like the most tocomplete or rewrite the transactions, and write them back withbeancount.parser.printer.print_entries().


Cheers,
Dan

--
You received this message because you are subscribed to the Google Groups 
"Beancount" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/beancount/bc3179fa-4e93-c325-172b-a92a8c4e7cce%40grinta.net.

Re: Are there existing_entries for plugins, like with importers?

Reply via email to