Re: [PHP-DEV] Native decimal scalar support and object types in BcMath - do we want both?

Rowan Tommins [IMSoP] Mon, 08 Apr 2024 12:23:10 -0700

On 07/04/2024 23:50, Jordan LeDoux wrote:

By a "scalar" value I mean a value that has the same semantics forreading, writing, copying, passing-by-value, passing-by-reference, andpassing-by-pointer (how objects behave) as the integer, float, orboolean types.

Right, in that case, it might be more accurate to talk about "valuetypes", since arrays are not generally considered "scalar", but havethose same behaviours. And Ilija recently posted a draft proposal for"data classes", which would be object, but also value types:https://externals.io/message/122845

As I mentioned in the discussion about a "scalar arbitrary precisiontype", the idea of a scalar in this meaning is a non-trivialchallenge, as the zval can only store a value that is treated in thisway of 64 bits or smaller.

Fortunately, that's not true. If you think about it, that would rule outnot only arrays, but any string longer than 8 bytes long!

The way PHP handles this is called "copy-on-write" (COW), where multiplevariables can point to the same zval until one of them needs to write toit, at which point a copy is transparently created.

The pointer for this value would fit in the 64 bits, which is howobjects work, but that's also why objects have different semantics forscope than integers. Objects are potentially very large in memory, sowe refcount them and pass the pointer into child scopes, instead ofcopying the value like is done with integers.

Objects are not the only thing that is refcounted. In fact, in PHP 4.xand 5.x, *every* zval used a refcount and COW approach; changing sometypes to be eagerly copied instead was one of the major performanceimprovements in the "PHP NG" project which formed the basis of PHP 7.0.You can actually see this in action here: https://3v4l.org/oPgr4

This is all completely transparent to the user, as are a bunch of othermemory/speed optimisations, like interned string literals, packedarrays, etc.

So, there may be performance gains if we can squeeze values into thezval memory, but it doesn't need to affect the semantics of the new type.

In general I would say that libbcmath is different enough from otherbackends that we should not expect any work on a BCMath implementationto be utilized in other implementations. It *could* be that we areable to do that, but it should not be something people *expect* tohappen because of the technical differences.
Some of the broader language design choices would be transferablethough. For instance, the standard names of various calculationfunctions/methods are something that would remain independent, evenwith the differences in the implementation.

Yes, that makes sense. Even if we don't have an interface, it would beannoying if one class provided $foo->div($bar), and another provided$foo->dividedBy($bar)

For money calculations, scale is always likely to be a more usefulconfiguration. For mathematical calculations (such as machine learningapplications, which I would say is the other very large use case forthis kind of capability), precision is likely to be the more usefulconfiguration. Other applications that I have personally encounteredinclude: simulation and modeling, statistical distributions, and dataanalysis. Most of these can be done with fair accuracy withoutarbitrary precision, but there are certainly types of applicationsthat would benefit from or even require arbitrary precision in thesespaces.

This probably relates quite closely to Arvid's point that for a lot ofuses, we don't actually need arbitrary precision, just something thatcan represent small-to-medium decimal numbers without the inaccuraciesof binary floating point. That some libraries can be used for bothpurposes is not necessarily evidence that we could ever "bless" one forboth use cases and make it a single native type.

My intuition at the moment is that a single number-handling API wouldbe challenging to do without an actual proposed implementation on thetable for MPDec/MPFR.

I think it would certainly be wise to experiment with how each librarycan interface to the language as an extension, before spending the extratime needed to integrate it as a new zval type.

But even with these extensions available in PHP, they are barely usedby developers at all because (at least in part) of the enormousdifference between PECL and PIP. For PHP, I do not think thatextensions are an adequate substitute like PIP modules are for Python.

Yes, this is something of a problem. On the plus side, a library doesn'tneed to be incorporated into the language to be widely installed,because we have the concept of "bundled" extensions; and in practice,Linux distributions add a few "popular" PECL extensions to their list ofinstallable binary packages. On the minus side, even making it into the"bundled" list doesn't mean it's installed by default everywhere, anduserland libraries spend a lot of effort polyfilling things which wouldideally be available by default.

This is, essentially, the thesis of the research and work that I havedone in the space since joining the internals mailing list.



Thanks, there's some really useful perspective there.

Regards,

--
Rowan Tommins
[IMSoP]

Re: [PHP-DEV] Native decimal scalar support and object types in BcMath - do we want both?

Reply via email to