On 3/18/2010 5:23 AM, Steven D'Aprano wrote:
On Thu, 18 Mar 2010 08:58:25 am Raymond Hettinger wrote:
On Mar 17, 2010, at 1:59 PM, Steven D'Aprano wrote:
On Thu, 18 Mar 2010 07:44:21 am Raymond Hettinger wrote:
The spectrum of options from worst to best is
1) compare but give the wrong answer
2) compare but give the right answer
3) refuse to compare.
Why is 3 the best? If there is a right answer to give, surely
giving the right answer it is better than not?
From the early days of the decimal module,
we've thought that mixed float-decimal operations
are 1) a bit perilous and 2) have few, if any good
use cases.
When it comes to *arithmetic* operations, I agree. Is there anyone on
python-dev willing to argue the case for allowing implicit mixed
float/Decimal arithmetic operations? The arguments in the PEP seem
pretty convincing to me, and I'm not suggesting we change that.
But comparison operations are different. For starters, you don't need to
worry about whether to return a float or a Decimal, because you always
get a bool. In theory, both Decimals and floats are representations of
the same underlying thing, namely real numbers, and it seems strange to
me that I can't ask whether two such real numbers are equal just
because their storage implementation is different.
I can see three reasonable reasons for avoiding mixed comparisons:
(1) To avoid confusing float-naive users (but they're confused by pure
float comparisons too).
(2) To avoid mixed arithmetic operations (but comparisons aren't
arithmetic).
(3) If Decimals and floats compare equal, they must hash equal, and
currently they don't (but Mark Dickinson thinks he has a solution for
that).
Accordingly, any mixed operations should be explicit
rather than implicit:
Decimal('1.1') + Decimal.from_float(2.2)
is better than:
Decimal('1.1') + 2.2
Agreed. The user should explicitly choose whether they want a float
answer or a Decimal answer.
To help the user avoid confusion, we flag the latter with a
TypeError: unsupported operand type(s) for +: 'Decimal' and 'float'.
Unfortunately, in Py2.x, implicit mixed comparisons do not
raise an exception, and instead will silently fail by giving
an incorrect answer:
>>> Decimal('1.1')< 2.2
False
That is clearly the wrong thing to do.
Do you envisage any problems from allowing this instead?
Decimal('1.1')< 2.2
True
Yes.
As any non-naïve float user is aware, the proper form of float
comparisons is not to use < or > or == or !=, but rather, instead of
using < (to follow along with your example), one should use:
Decimal('1.1') - 2.2 < epsilon
However, while even this is only useful in certain circumstances as a
gross simplification [1], it immediately shows the need to do mixed
arithmetic to produce (sometimes) correct results. More correct
comparisons require much more code (even the 20-line C code in [1],
which understands the float format to some extent, admits to being
deficient in some circumstances).
For all the reasons that mixed decimal and float arithmetic is bad,
mixed decimal and float comparisons are also bad. To do proper
comparisons, you need to know the number of significant digits of both
numbers, and the precision and numeric ranges being dealt with by the
application.
For the single purpose of sorting, one could make an argument that not
knowing the significant digits, precision, and numeric ranges, that the
sort would probably produce results where floats and decimals that
should compare equal would be clustered similarly as they would if the
significant digits, precision, and numeric ranges were known, and that
would probably be close to truth, but only if the decimal vs float key
were the last in the composite sort key. I don't think Python informs
its comparison operations that it is being used as part of a sort, nor
would there be a way for user-written sorts to inform the comparison
operations of that fact.
Seems like it would be better to raise an exception, and in the
documentation for the exception point out that turning off the exception
(if it should be decided that that should be possible, which could be
good for compatibility), would regress to the current behavior, which
doesn't sort numerically, but by type.
[1]
http://www.cprogramming.com/tutorial/floating_point/understanding_floating_point_representation.html
Glenn
_______________________________________________
Python-Dev mailing list
Python-Dev@python.org
http://mail.python.org/mailman/listinfo/python-dev
Unsubscribe:
http://mail.python.org/mailman/options/python-dev/archive%40mail-archive.com