Hi Riccardo,

On 10/06/2020 5:51 pm, Riccardo Ghetta wrote:
Hi,
as a user, the "lua use case" is exactly what I need at work.
I realize that for Python this is a niche case, and most users don't need any of this, but I hope it will be useful to understand why having multiple independent interpreters in a single process can be an essential feature.

The company I work for develops and sells a big C++ financial system with Python embedded, providing critical flexibility to our customers. Python is used as a scripting language, with most cases having C++ call a Python script that itself calls other C++ functions. Most of the time those scripts run in workloads that are I/O bound or where the time spent in Python is negligible.

But some workloads are really CPU bound, and those tend to become GIL-bound even with massive use of C++ helpers; some to the point that GIL contention makes up over 80% of running time, instead of 1-5%. And every time our customers upgrade their servers, they buy machines with more cores, and the contention problem worsens.

Different interpreters need to operate in their own isolated address space, or there will be horrible race conditions. Regardless of whether that separation is done in software or hardware, it has to be done.

Whenever data contained in a Python object is passed to C/C++ code, there are two ways to do it: pass the whole object, or pass a reference to the underlying data. If you pass the underlying data, you can release the GIL, and your problem is solved, or at least alleviated. If you can't do that and must pass the object, then every access to that object must be protected by a per-interpreter lock, because operations within an interpreter need to be serialized or you'll get the same horrible race conditions.
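To illustrate the "pass the underlying data" route from pure Python (using only standard-library calls; the payload and sizes are invented for the example), a memoryview hands C code a zero-copy reference to the raw buffer, and functions like hashlib.sha256 and zlib.crc32 read it via the buffer protocol, with CPython releasing the GIL for large inputs:

```python
import hashlib
import zlib

data = b"x" * (1 << 20)      # 1 MiB payload (arbitrary example data)
view = memoryview(data)      # zero-copy reference to the underlying buffer

# Both calls accept any object supporting the buffer protocol, so the C
# code reads the raw bytes directly; CPython releases the GIL while
# hashing buffers larger than a couple of kilobytes.
digest = hashlib.sha256(view).hexdigest()
crc = zlib.crc32(view[:1024])  # slicing a memoryview copies nothing
```

The same pattern applies from the C side: once C++ holds a reference to the raw buffer rather than the Python object, it can drop the GIL while working on it.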

If you need to share objects across threads, then there will be contention, regardless of how many interpreters there are, or which processes they are in.

Obviously, our use case calls for separate per-thread interpreters: server processes run continuously and already consume gigabytes of RAM, so startup time and increased memory consumption are not issues. Shared state is not needed either; in fact, we try to avoid it as much as possible.
In the end, removing process-global state is extremely interesting for us.

If the additional resource consumption is irrelevant, what's the objection to spinning up new processes?

Cheers,
Mark.

P.S.
Do try passing the underlying data, not the whole object, and dropping the GIL when calling back into C++. It can be effective. CPython already drops the GIL for some computational workloads implemented in C, like compression.
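That effect is easy to observe from pure Python. In this sketch (chunk contents and sizes chosen arbitrarily), plain threads compress separate buffers with zlib.compress, which is implemented in C and drops the GIL for the duration of the call, so the threads genuinely overlap:

```python
import threading
import zlib

chunks = [bytes([i]) * 500_000 for i in range(4)]
results = [None] * len(chunks)

def compress(i):
    # zlib.compress releases the GIL while the C compression runs,
    # so these threads can execute concurrently on multiple cores.
    results[i] = zlib.compress(chunks[i])

threads = [threading.Thread(target=compress, args=(i,)) for i in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

# Round-trip check: every chunk decompresses back to the original.
assert all(zlib.decompress(results[i]) == chunks[i] for i in range(4))
```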
_______________________________________________
Python-Dev mailing list -- python-dev@python.org
To unsubscribe send an email to python-dev-le...@python.org
https://mail.python.org/mailman3/lists/python-dev.python.org/
Message archived at 
https://mail.python.org/archives/list/python-dev@python.org/message/6KYRUABTLNYNGNRBS5KRKPHKLKS2AI7U/
Code of Conduct: http://python.org/psf/codeofconduct/
