http://gcc.gnu.org/bugzilla/show_bug.cgi?id=55303
Oleg Endo <olegendo at gcc dot gnu.org> changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|UNCONFIRMED |ASSIGNED
Last reconfirmed| |2013-03-02
AssignedTo|unassigned at gcc dot |olegendo at gcc dot gnu.org
|gnu.org |
Ever Confirmed|0 |1
--- Comment #1 from Oleg Endo <olegendo at gcc dot gnu.org> 2013-03-02 16:16:41
UTC ---
Created attachment 29567
--> http://gcc.gnu.org/bugzilla/attachment.cgi?id=29567
Working patch with a thinko.
This patch, albeit working, has a thinko.
The idea was to reduce the constraints of the clips/clipu insn comparison
constants by adding/subtracting a constant offset value before/after the actual
clipping insn. For example:
int
test_02 (int a)
{
return max (0, min (255, a));
}
becomes:
_test_02:
movi20 #128,r1
sub r1,r4
mov r4,r0
clips.b r0
rts
add r1,r0
The problem with this is that it won't work for values that will wrap-around
before/after the offset subtraction/addition.
E.g. plugging the value 0x80000000 (−2147483648) into the above case:
movi20 #128,r1
sub r1,r4 // r4 = 0x80000000 - 128 = 0x7FFFFF80
mov r4,r0
clips.b r0 // !(r0 < -128) && (r0 > 127) -> r0 = 127
rts
add r1,r0 // r0 = 127 + 128 = 255
// expected result: 0
Maybe this case could be handled by using subv/addv insns to catch
over/underflows somehow, but probably the resulting code would be more complex
(and thus slower) than two straight forward compare-and-branch sequences.
On the other hand, if it is known that the input value is in a certain range
(e.g. a sign/zero extended HImode or QImode), the offset approach should work
fine.
I will modify the attached patch so that it will allow only the HW clip
constants for now.