Re: [Python-Dev] Decimal <-> float comparisons in py3k.

Glenn Linderman Thu, 18 Mar 2010 11:35:38 -0700

On 3/18/2010 5:23 AM, Steven D'Aprano wrote:

On Thu, 18 Mar 2010 08:58:25 am Raymond Hettinger wrote:

On Mar 17, 2010, at 1:59 PM, Steven D'Aprano wrote:

On Thu, 18 Mar 2010 07:44:21 am Raymond Hettinger wrote:

The spectrum of options from worst to best is
1) compare but give the wrong answer
2) compare but give the right answer
3) refuse to compare.

Why is 3 the best? If there is a right answer to give, surely
giving the right answer is better than not?

From the early days of the decimal module,
we've thought that mixed float-decimal operations
1) are a bit perilous and 2) have few, if any, good
use cases.

When it comes to *arithmetic* operations, I agree. Is there anyone on
python-dev willing to argue the case for allowing implicit mixed
float/Decimal arithmetic operations? The arguments in the PEP seem
pretty convincing to me, and I'm not suggesting we change that.


But comparison operations are different. For starters, you don't need to
worry about whether to return a float or a Decimal, because you always
get a bool. In theory, both Decimals and floats are representations of
the same underlying thing, namely real numbers, and it seems strange to
me that I can't ask whether two such real numbers are equal just
because their storage implementation is different.

I can see three reasonable reasons for avoiding mixed comparisons:

(1) To avoid confusing float-naive users (but they're confused by pure
float comparisons too).

(2) To avoid mixed arithmetic operations (but comparisons aren't
arithmetic).

(3) If Decimals and floats compare equal, they must hash equal, and
currently they don't (but Mark Dickinson thinks he has a solution for
that).
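The constraint in point (3) can be illustrated with types that already compare equal across representations. This is only a sketch using int and float, not Decimal: values that compare equal need equal hashes, or dict and set lookups stop agreeing with ==.

```python
# If two numbers compare equal, they must hash equal; otherwise
# dict and set lookups would disagree with ==.
# int and float already obey this rule.
assert 1 == 1.0
assert hash(1) == hash(1.0)

lookup = {1: 'one'}
found = lookup[1.0]  # the float key finds the entry stored under the int key
```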

Accordingly, any mixed operations should be explicit
rather than implicit:

      Decimal('1.1') + Decimal.from_float(2.2)

is better than:

      Decimal('1.1') + 2.2

Agreed. The user should explicitly choose whether they want a float
answer or a Decimal answer.

To help the user avoid confusion, we flag the latter with a
TypeError: unsupported operand type(s) for +: 'Decimal' and 'float'.
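A minimal sketch of the two forms, assuming an interpreter where Decimal.from_float is available. The variable names are illustrative only. The explicit form produces a Decimal built from the exact binary value of the float; the implicit form is refused.

```python
from decimal import Decimal

# Explicit conversion: from_float() captures the exact binary value of 2.2,
# so the float's representation error is carried into the Decimal result.
explicit_sum = Decimal('1.1') + Decimal.from_float(2.2)

# Implicit mixing of Decimal and float in arithmetic raises TypeError.
try:
    Decimal('1.1') + 2.2
    mixed_allowed = True
except TypeError:
    mixed_allowed = False
```

Note that explicit_sum is not equal to Decimal('3.3'), because 2.2 has no exact binary representation.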

Unfortunately, in Py2.x, implicit mixed comparisons do not
raise an exception, and instead will silently fail by giving
an incorrect answer:

     >>> Decimal('1.1') < 2.2
     False

That is clearly the wrong thing to do.

Do you envisage any problems from allowing this instead?

     >>> Decimal('1.1') < 2.2
     True


Yes.

As any non-naïve float user is aware, the proper form of float comparisons is not to use < or > or == or !=, but rather, instead of using < (to follow along with your example), one should use:


Decimal('1.1') - 2.2 < epsilon

However, while even this is only useful in certain circumstances as a gross simplification [1], it immediately shows the need to do mixed arithmetic to produce (sometimes) correct results. More correct comparisons require much more code (even the 20-line C code in [1], which understands the float format to some extent, admits to being deficient in some circumstances).
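As a rough sketch of what such a tolerance comparison involves, here is an absolute-tolerance equality check. The helper name roughly_equal and the default epsilon are assumptions made for illustration, and a fixed absolute epsilon only makes sense for some magnitudes. Even this simple version has to convert the float and subtract, which is arithmetic across the two representations.

```python
from decimal import Decimal

def roughly_equal(d, f, epsilon=Decimal('1e-9')):
    """Hypothetical helper: absolute-tolerance equality of a Decimal and a float."""
    # Convert explicitly so the subtraction is Decimal - Decimal.
    difference = d - Decimal.from_float(f)
    return abs(difference) <= epsilon

close = roughly_equal(Decimal('1.1'), 1.1)  # differ only by float representation error
far = roughly_equal(Decimal('1.1'), 2.2)    # differ by about 1.1
```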

For all the reasons that mixed decimal and float arithmetic is bad, mixed decimal and float comparisons are also bad. To do proper comparisons, you need to know the number of significant digits of both numbers, and the precision and numeric ranges being dealt with by the application.

For the single purpose of sorting, one could argue that, even without knowing the significant digits, precision, and numeric ranges, the sort would probably cluster floats and decimals that should compare equal much as it would if those were known, and that the result would probably be close to the truth, but only if the decimal vs float key were the last in the composite sort key. I don't think Python informs its comparison operations that they are being used as part of a sort, nor would there be a way for user-written sorts to inform the comparison operations of that fact.

Seems like it would be better to raise an exception, and in the documentation for the exception point out that turning off the exception (if it should be decided that that should be possible, which could be good for compatibility) would regress to the current behavior, which doesn't sort numerically, but by type.
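If implicit comparison raised an exception, a caller who really wants a numeric ordering of a mixed list could still get one by stating the conversion explicitly. A minimal sketch, where the key function as_decimal is a hypothetical name and the choice to convert floats to their exact Decimal value is an assumption:

```python
from decimal import Decimal

def as_decimal(value):
    """Hypothetical sort key: map floats to their exact Decimal value."""
    if isinstance(value, float):
        return Decimal.from_float(value)
    return value

values = [Decimal('1.1'), 2.2, 0.5, Decimal('0.75')]

# Only the keys are compared, and every key is a Decimal, so the sort
# never relies on an implicit Decimal/float comparison.
ordered = sorted(values, key=as_decimal)
```

The original objects are returned unchanged; only the ordering is decided through the explicit conversion.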


[1]
http://www.cprogramming.com/tutorial/floating_point/understanding_floating_point_representation.html


Glenn
