Re: RFA: Fix dse / postreload not to bypass add expanders

2011-11-04 Thread Paolo Bonzini

On 11/03/2011 08:01 PM, Eric Botcazou wrote:

+  info.insert_before = insn;
+  info.first = new_insn;
+  info.fixed_regs_live = insn_info-fixed_regs_live;
+  info.failure = false;
+  for (cur = new_insn; cur; cur = NEXT_INSN (cur))
+{
+  info.current = cur;
+  note_stores (PATTERN (cur), note_add_store,info);
+}


Unless I'm missing something, this is going all the way down to the end 
of the function, bypassing the CFG, so it is neither efficient nor correct.


For DSE you should set up backwards liveness simulation, and use that 
instead of note_stores and insn scanning.  I don't know postreload well 
enough, but liveness simulation might work there too.


Paolo


Re: RFA: Fix dse / postreload not to bypass add expanders

2011-11-04 Thread Eric Botcazou
 Unless I'm missing something, this is going all the way down to the end
 of the function, bypassing the CFG, so it is neither efficient nor correct.

No, gen_ functions doesn't emit the instructions.

 For DSE you should set up backwards liveness simulation, and use that
 instead of note_stores and insn scanning.

DSE already does this sort of insn scanning.

-- 
Eric Botcazou


Re: RFA: Fix dse / postreload not to bypass add expanders

2011-11-04 Thread amylaar

Quoting Paolo Bonzini bonz...@gnu.org:


Unless I'm missing something, this is going all the way down to the end
of the function, bypassing the CFG, so it is neither efficient nor
correct.


new_insn hasn't been emitted yet, hence it is a single insn or a short
chain of insn to implement a no-op move (for a post-modify with zero offset)
or, more likely, a two-address add.




Re: RFA: Fix dse / postreload not to bypass add expanders

2011-11-03 Thread Eric Botcazou
 This patch makes emit_inc_dec_insn_before use add3_insn / gen_move_insn
 so that the appropriate expanders are used to create the new instructions,
 and for dse it use the available register liveness information to check
 that no live fixed hard register, like a flags register, is clobbered in
 the process.  For postreload, there is no such information available, so we
 give up when we see a clobber / set that might be problematic.

2011-10-31  Joern Rennecke joern.renne...@embecosm.com

* regset.h (fixed_regset): Declare.
* dse.c: Include regset.h .
(struct insn_info): Add member fixed_regs_live.
(note_add_store_info): New typedef.
(note_add_store): New function.
(emit_inc_dec_insn_before): Expect arg to be of type insn_info_t .
Use gen_add3_insn / gen_move_insn.
Check new insn for unwanted clobbers before emitting it.
(check_for_inc_dec): Rename to...
(check_for_inc_dec_1:) ... this.  Return bool.  Take insn_info
parameter.  Changed all callers in file.
(check_for_inc_dec, copy_fixed_regs): New functions.
(scan_insn): Set fixed_regs_live field of insn_info.
* rtl.h (check_for_inc_dec): Update prototype.
* postreload.c (reload_cse_simplify): Take new signature of
check_ind_dec into account.
* reginfo.c (fixed_regset): New variable.
(init_reg_sets_1): Initialize it.

OK modulo the following:

+typedef struct
+{
+  rtx insert_before;

This field is never read.


+  rtx first, current;
+  regset fixed_regs_live;
+  bool failure;
+} note_add_store_info;
+
+/* Callback for emit_inc_dec_insn_before via note_stores.
+   Check if a register is clobbered which is life afterwards.  */

live


+static void
+note_add_store (rtx loc, const_rtx expr ATTRIBUTE_UNUSED, void *data)

Missing blank line.  The functions in dse.c have a blank line between head 
comment and body.


+{
+  rtx insn, *nextp;
+  note_add_store_info *info = (note_add_store_info *) data;
+  int r, n;
+
+  if (!REG_P (loc))
+return;
+  /* If this register is referenced by the current or an earlier insn,
+ that's OK.  E.g. this applies to the register that is being incremented
+ with this addition.  */

Blank line before the comment.


+  nextp = info-first;
+  do
+{
+  insn = *nextp;
+  nextp = NEXT_INSN (insn);
+  if (reg_referenced_p (loc, PATTERN (insn)))
+   return;
+}
+  while (insn != info-current);

Isn't that a convoluted way of writing this?

  for (insn = info-first;
   insn != NEXT_INSN (info-current);
   insn = NEXT_INSN (insn))
if (reg_referenced_p (loc, PATTERN (insn)))
  return;


+  if (!info-fixed_regs_live)
+{
+  info-failure =  true;
+  return;
+}

Missing comment explaining why we do that.



+  /* Now check if this is a live fixed register.  */
+  r = REGNO (loc);
+  n = HARD_REGNO_NREGS (r, GET_MODE (loc));
+  while (--n =  0)
+if (REGNO_REG_SET_P (info-fixed_regs_live, r+n))
+  info-failure =  true;

Blank line before the comment.  What's the point in the reverse iteration?

  for (i = 0; i  hard_regno_nregs[regno][GET_MODE (loc)]; i++)
if (REGNO_REG_SET_P (info-fixed_regs_live, regno + i))
   {
 info-failure =  true;
 return;
   }

hard_regno_nregs in small letters.


+  info.insert_before = insn;
+  info.first = new_insn;
+  info.fixed_regs_live = insn_info-fixed_regs_live;
+  info.failure = false;
+  for (cur = new_insn; cur; cur = NEXT_INSN (cur))
+{
+  info.current = cur;
+  note_stores (PATTERN (cur), note_add_store, info);
+}
+  if (info.failure)
+return 1;

Missing comment explaining what we're doing.



 /* Before we delete INSN, make sure that the auto inc/dec, if it is
-   there, is split into a separate insn.  */
+   there, is split into a separate insn.
+   Return true on success (or if there was nothing to do), false on failure.  
*/
 
-void
-check_for_inc_dec (rtx insn)
+static bool
+check_for_inc_dec_1 (insn_info_t insn_info)

Missing adjustment in the comment: Before we delete the insn described by
INSN_INFO, make sure...


+/* Entry point for postreload.  */
+bool
+check_for_inc_dec (rtx insn, regset fixed_regs_live)

No point in adding an argument if it is always null.  Missing blank line and 
head comment: Same as above, but take a naked INSN instead.  This is used by 
passes like that don't compute precise liveness information.


+/* Return a bitmap of the fixed registers contained in IN.  */
+static bitmap
+copy_fixed_regs (const_bitmap in)
+{
+  bitmap ret;
+
+  ret = ALLOC_REG_SET (NULL);
+  bitmap_and (ret, in, fixed_regset);
+  return ret;
+}

Missing blank line.


+/* Same information as fixed_reg_set but in regset form.  */
+regset fixed_regset;

Hum, you'd better have a good trick to remember which is which.  This isn't 
pretty, but let's mimic what is just above:

/* Same information as FIXED_REG_SET but in regset form.  */
regset 

RFA: Fix dse / postreload not to bypass add expanders

2011-11-01 Thread Joern Rennecke

This patch makes emit_inc_dec_insn_before use add3_insn / gen_move_insn
so that the appropriate expanders are used to create the new instructions,
and for dse it use the available register liveness information to check
that no live fixed hard register, like a flags register, is clobbered in the
process.  For postreload, there is no such information available, so we
give up when we see a clobber / set that might be problematic.

regtested for epiphany-elf with modified rtx_cost, where it fixes  
three ICE-on-valid-code:
FAIL: gcc.c-torture/execute/builtins/memcpy-chk.c compilation,  -O1   
(internal compiler error)
FAIL: gcc.c-torture/execute/builtins/memmove-chk.c compilation,  -O1   
(internal compiler error)
FAIL: gcc.c-torture/execute/memcpy-bi.c compilation,  -O1  (internal  
compiler error)


Bootstrapped and regression tested on i686-pc-linux-gnu .
2011-10-31  Joern Rennecke joern.renne...@embecosm.com

* regset.h (fixed_regset): Declare.
* dse.c: Include regset.h .
(struct insn_info): Add member fixed_regs_live.
(note_add_store_info): New typedef.
(note_add_store): New function.
(emit_inc_dec_insn_before): Expect arg to be of type insn_info_t .
Use gen_add3_insn / gen_move_insn.
Check new insn for unwanted clobbers before emitting it.
(check_for_inc_dec): Rename to...
(check_for_inc_dec_1:) ... this.  Return bool.  Take insn_info
parameter.  Changed all callers in file.
(check_for_inc_dec, copy_fixed_regs): New functions.
(scan_insn): Set fixed_regs_live field of insn_info.
* rtl.h (check_for_inc_dec): Update prototype.
* postreload.c (reload_cse_simplify): Take new signature of
check_ind_dec into account.
* reginfo.c (fixed_regset): New variable.
(init_reg_sets_1): Initialize it.

Index: postreload.c
===
--- postreload.c(revision 180683)
+++ postreload.c(working copy)
@@ -112,8 +112,8 @@ reload_cse_simplify (rtx insn, rtx testr
  if (REG_P (value)
   ! REG_FUNCTION_VALUE_P (value))
value = 0;
- check_for_inc_dec (insn);
- delete_insn_and_edges (insn);
+ if (check_for_inc_dec (insn, NULL))
+   delete_insn_and_edges (insn);
  return;
}
 
@@ -164,8 +164,8 @@ reload_cse_simplify (rtx insn, rtx testr
 
   if (i  0)
{
- check_for_inc_dec (insn);
- delete_insn_and_edges (insn);
+ if (check_for_inc_dec (insn, NULL))
+   delete_insn_and_edges (insn);
  /* We're done with this insn.  */
  return;
}
Index: regset.h
===
--- regset.h(revision 180683)
+++ regset.h(working copy)
@@ -1,6 +1,6 @@
 /* Define regsets.
Copyright (C) 1987, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004,
-   2005, 2006, 2007, 2008, 2009, 2010 Free Software Foundation, Inc.
+   2005, 2006, 2007, 2008, 2009, 2010, 2011 Free Software Foundation, Inc.
 
 This file is part of GCC.
 
@@ -115,6 +115,9 @@ #define EXECUTE_IF_AND_IN_REG_SET(REGSET
 
 extern regset regs_invalidated_by_call_regset;
 
+/* Same information as FIXED_REG_SET but in regset form.  */
+extern regset fixed_regset;
+
 /* An obstack for regsets.  */
 extern bitmap_obstack reg_obstack;
 
Index: dse.c
===
--- dse.c   (revision 180683)
+++ dse.c   (working copy)
@@ -33,6 +33,7 @@ Software Foundation; either version 3, o
 #include tm_p.h
 #include regs.h
 #include hard-reg-set.h
+#include regset.h
 #include flags.h
 #include df.h
 #include cselib.h
@@ -377,6 +378,13 @@ struct insn_info
  created.  */
   read_info_t read_rec;
 
+  /* The live fixed registers.  We assume only fixed registers can
+ cause trouble by being clobbered from an expanded pattern;
+ storing only the live fixed registers (rather than all registers)
+ means less memory needs to be allocated / copied for the individual
+ stores.  */
+  regset fixed_regs_live;
+
   /* The prev insn in the basic block.  */
   struct insn_info * prev_insn;
 
@@ -448,9 +456,9 @@ struct bb_info
   /* The following bitvector is indexed by the reg number.  It
  contains the set of regs that are live at the current instruction
  being processed.  While it contains info for all of the
- registers, only the pseudos are actually examined.  It is used to
- assure that shift sequences that are inserted do not accidently
- clobber live hard regs.  */
+ registers, only the hard registers are actually examined.  It is used
+ to assure that shift and/or add sequences that are inserted do not
+ accidently clobber live hard regs.  */
   bitmap regs_live;
 };
 
@@ -827,6 +835,51 @@ free_store_info (insn_info_t insn_info)
   insn_info-store_rec = NULL;
 }
 
+typedef struct
+{
+  rtx