On 07/26/2018 12:06 AM, 瞿仙淼 wrote:
> 
>> 在 2018年7月25日,上午5:24,Jeff Law <l...@redhat.com> 写道:
>>
>> On 07/24/2018 12:18 PM, Sandra Loosemore wrote:
>>> On 07/24/2018 09:45 AM, Jeff Law wrote:
>>>> On 07/23/2018 10:21 PM, Sandra Loosemore wrote:
>>>> I'm not a big fan of more awk code, but I'm not going to object to it :-)
>>>>
>>>> Why does the port have its own little pass for condition code
>>>> optimization (cse_cc)?  What is it doing that can't be done with our
>>>> generic optimizers?
>>>
>>> This pass was included in the initial patch set we got from C-SKY, and
>>> as it didn't seem to break anything I left it in.  Perhaps C-SKY can
>>> provide a testcase that demonstrates why it's still useful in the
>>> current version of GCC; otherwise we can remove this from the initial
>>> port submission and restore it later if some performance analysis shows
>>> it is still worthwhile.
>> FWIW it looks like we model CC setting on just a few insns, (add,
>> subtract) so I'd be surprised if this little mini pass found much.  I'd
>> definitely like to hear from the csky authors here.
>>
>> Alternately, you could do add some instrumentation to flag when it
>> triggers, take a test or two that does, reduce it and we can then look
>> at the key RTL sequences and see what the pass is really doing.
>>
> 
> I wrote a case to reproduce this problem on C-SKY. C code is as follows:
> -----------------------------------------------------------------------
> int e1, e2;
> 
> void func (int a, int b, int c, int d, int f, int g)
> {
>   e1 = a > b ? f : g;
>   e2 = a > b ? c : d;
> 
>   return;
> }
> -----------------------------------------------------------------------
> 
> compile to assembler with option “-O3 -S” :
> -----------------------------------------------------------------------
> func:
>   cmplt a1, a0
>   ld.w  t1, (sp, 0)
>   ld.w  t0, (sp, 4)
>   movt  t0, t1
>   cmplt a1, a0
>   movt  a3, a2
>   lrw a1, e2
>   lrw a2, e1
>   st.w  a3, (a1, 0)
>   st.w  t0, (a2, 0)
>   rts
> -----------------------------------------------------------------------
> There is an extra “cmplt a1, a0" in the above code without cse_cc. This 
> situation mainly occurs when a relatively short branch jump is converted into 
> a conditional execution instruction. And the CSE pass can not reduce the same 
> conditional comparison instruction . Here is the rtx sequence after “cse2” 
> pass.
> 
> -----------------------------------------------------------------------
> (insn 28 13 29 2 (set (reg:CC 33 c)
>         (gt:CC (reg/v:SI 77 [ a ])
>             (reg/v:SI 78 [ b ]))) func.c:5 1099 {*cmpgtsi}
>      (nil))
> (insn 29 28 30 2 (set (reg/v:SI 82 [ g ])
>         (if_then_else:SI (eq (reg:CC 33 c)
>                 (const_int 0 [0]))
>             (reg/v:SI 82 [ g ])
>             (reg/v:SI 81 [ f ]))) func.c:5 983 {movf}
>      (expr_list:REG_DEAD (reg/v:SI 81 [ f ])
>         (expr_list:REG_DEAD (reg:CC 33 c)
>             (nil))))
> (insn 30 29 31 2 (set (reg:CC 33 c)
>         (gt:CC (reg/v:SI 77 [ a ])
>             (reg/v:SI 78 [ b ]))) func.c:5 1099 {*cmpgtsi}
>      (expr_list:REG_DEAD (reg/v:SI 78 [ b ])
>         (expr_list:REG_DEAD (reg/v:SI 77 [ a ])
>             (nil))))
> (insn 31 30 18 2 (set (reg/v:SI 80 [ d ])
>         (if_then_else:SI (eq (reg:CC 33 c)
>                 (const_int 0 [0]))
>             (reg/v:SI 80 [ d ])
>             (reg/v:SI 79 [ c ]))) func.c:5 983 {movf}
>      (expr_list:REG_DEAD (reg/v:SI 79 [ c ])
>         (expr_list:REG_DEAD (reg:CC 33 c)
>             (nil))))
> -----------------------------------------------------------------------
> 
> It doesn't seem to check the same conditional comparison instruction .So I 
> wrote this to solve this problem, but I am not sure if this is the best way : 
> )
> 
> PS, the same conditional comparison instruction cannot be reduced with the 
> latest version gcc with C-SKY because I just insert the “cse_cc” after 
> “cse1”, when I insert after “cse2”, this problem can be solved very well.
I think the cse_cc pass is really just working around one or more bugs
in CSE and/or a backend bug.  The RTL above clearly shows a common
subexpression that is not eliminated by CSE.

What CSE should be trying to do is changing the second and third
occurrences of (gt:CC (reg 77) (reg 78)) with (reg 33) which would
create nop-sets which would be subsequently deleted.  I suspect you do
not have an insn which matches that nop set of the CC register.  If you
fix that I suspect CSE will work better and eliminate the need for your
cse_cc pass.

jeff

Reply via email to