from:"dorit at il dot ibm dot com"

[Bug tree-optimization/33869] [4.3 Regression] ICE verify_ssa failed (missing definition for SSA_NAME)

2007-10-29 Thread dorit at il dot ibm dot com

--- Comment #11 from dorit at il dot ibm dot com 2007-10-30 05:48 --- (In reply to comment #6) Richard, is this related to the issue you reported in http://gcc.gnu.org/ml/gcc-patches/2007-10/msg01127.html (looks like the same error)? Any idea why the fix you committed doesn't cover

[Bug tree-optimization/25371] -ftree-vectorize results in internal compiler error on AMD64

2007-07-01 Thread dorit at il dot ibm dot com

--- Comment #12 from dorit at il dot ibm dot com 2007-07-01 09:30 --- Subject: Re: -ftree-vectorize results in internal compiler error on AMD64 Zdenek's patch for cleaning the dataref analysis is also fixing this bug. http://gcc.gnu.org/ml/gcc-patches/2007-05/msg00634.html So now

[Bug tree-optimization/24659] Conversions are not vectorized

2007-06-29 Thread dorit at il dot ibm dot com

--- Comment #19 from dorit at il dot ibm dot com 2007-06-29 16:46 --- testing this patch for Altivec: Index: config/rs6000/altivec.md === *** config/rs6000/altivec.md(revision 126053) --- config/rs6000/altivec.md

[Bug rtl-optimization/32084] gfortran 4.3 13%-18% slower for induct.f90 than gcc 4.0-based competitor

2007-06-27 Thread dorit at il dot ibm dot com

--- Comment #5 from dorit at il dot ibm dot com 2007-06-27 11:57 --- (In reply to comment #4) (In reply to comment #3) The problem is in -ftree-vectorize The difference is, that without -ftree-vectorize the inner loop (do k = 1, 9) is completely unrolled, but with vectorization

[Bug tree-optimization/32075] can't determine dependence between p-a[x+i] and p-a[x+i+1] where x is invariant but defined in the function

2007-06-18 Thread dorit at il dot ibm dot com

--- Comment #2 from dorit at il dot ibm dot com 2007-06-18 11:03 --- I see this in the vectorizer dump file (with mainline from a few days ago): (compute_affine_dependence (stmt_a = D.3027_19 = p_7-a[D.3026_18]) (stmt_b = p_7-a[D.3025_17] = D.3027_19) Data ref a: (Data Ref: stmt

[Bug tree-optimization/32378] can't determine dependence (distinct sections of an array)

2007-06-18 Thread dorit at il dot ibm dot com

--- Comment #4 from dorit at il dot ibm dot com 2007-06-18 11:08 --- I see this in the vectorizer dump file (with mainline from a few days ago): (compute_affine_dependence (stmt_a = D.1423_50 = (*a_49(D))[D.1422_48]) (stmt_b = (*a_49(D))[D.1420_51] = D.1425_54) Data ref a: (Data

[Bug target/32274] FAIL: gcc.dg/vect/pr32224.c

2007-06-13 Thread dorit at il dot ibm dot com

--- Comment #1 from dorit at il dot ibm dot com 2007-06-13 08:41 --- Sorry about the breakage. Does it work for you if you change the testcase as follows?: Index: pr32224.c === --- pr32224.c (revision 125641) +++ pr32224

[Bug tree-optimization/32309] Unnecessary conversion from short to unsigend short breaks vectorization

2007-06-12 Thread dorit at il dot ibm dot com

--- Comment #4 from dorit at il dot ibm dot com 2007-06-12 17:46 --- it's on my (long) todo list... -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=32309

[Bug tree-optimization/32224] [4.3 Regression] ICE in vect_analyze_operations, at tree-vect-analyze.c:374

2007-06-07 Thread dorit at il dot ibm dot com

--- Comment #4 from dorit at il dot ibm dot com 2007-06-07 18:40 --- You're right. I'm testing this obvious patch: Index: tree-vect-analyze.c === *** tree-vect-analyze.c (revision 125526) --- tree-vect-analyze.c (working

[Bug tree-optimization/32216] [4.3 Regression] ICE: verify_stmts failed (invalid reference prefix) with -ftree-vectorize

2007-06-06 Thread dorit at il dot ibm dot com

--- Comment #5 from dorit at il dot ibm dot com 2007-06-06 08:33 --- (In reply to comment #4) (In reply to comment #3) Probably something similar is required for the VEC_UNPACK_FLOAT_*_EXPR tree-codes ? But these tree-codes are already there: sorry, I guess I was looking

[Bug tree-optimization/32216] [4.3 Regression] ICE: verify_stmts failed (invalid reference prefix) with -ftree-vectorize

2007-06-05 Thread dorit at il dot ibm dot com

--- Comment #3 from dorit at il dot ibm dot com 2007-06-06 03:28 --- (In reply to comment #1) veclower expands things when it wrongly concludes that they are not supported by the target in vecor mode. For demotion/promotion/conversion kinda operations this may be because it does

[Bug target/32107] New: bad codegen for vector initialization in Altivec

2007-05-27 Thread dorit at il dot ibm dot com

Severity: normal Priority: P3 Component: target AssignedTo: unassigned at gcc dot gnu dot org ReportedBy: dorit at il dot ibm dot com GCC build triplet: powerpc-linux GCC host triplet: powerpc-linux GCC target triplet: powerpc-linux http://gcc.gnu.org

[Bug tree-optimization/31945] New: missing type vector conversions patterns on spu

2007-05-16 Thread dorit at il dot ibm dot com

: unassigned at gcc dot gnu dot org ReportedBy: dorit at il dot ibm dot com GCC build triplet: spu GCC host triplet: spu GCC target triplet: spu http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31945

[Bug tree-optimization/31946] New: missed vectorization due to too strict peeling-for-alignment policy

2007-05-16 Thread dorit at il dot ibm dot com

: tree-optimization AssignedTo: unassigned at gcc dot gnu dot org ReportedBy: dorit at il dot ibm dot com http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31946

[Bug middle-end/31738] Fortran dot product vectorization is restricted

2007-05-16 Thread dorit at il dot ibm dot com

--- Comment #3 from dorit at il dot ibm dot com 2007-05-16 20:45 --- (In reply to comment #2) Here is what happens in the three loops that don't get vectorized: (1) the loop in testvectdp2: ... so the vectorizer is ok, except that in this case D.1437_32 doesn't seem to be used

[Bug tree-optimization/31873] New: missed optimization: we don't move invariant casts out of loops

2007-05-09 Thread dorit at il dot ibm dot com

casts out of loops Product: gcc Version: 4.3.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: tree-optimization AssignedTo: unassigned at gcc dot gnu dot org ReportedBy: dorit at il dot ibm

[Bug tree-optimization/25809] missed PRE optimization - move invariant casts out of loops

2007-05-09 Thread dorit at il dot ibm dot com

--- Comment #8 from dorit at il dot ibm dot com 2007-05-09 07:14 --- So I guess this should be handled somewhere else. I'll open a new missed-optimization PR instead (not against PRE this time). thanks. This is now PR31873 -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=25809

[Bug middle-end/31738] Fortran dot product vectorization is restricted

2007-05-08 Thread dorit at il dot ibm dot com

--- Comment #2 from dorit at il dot ibm dot com 2007-05-08 21:00 --- Here is what happens in the three loops that don't get vectorized: (1) the loop in testvectdp2: This is the loop we analyze: # prephitmp.192_37 = PHI storetmp.191_30(3), D.1443_42(5) # i_1 = PHI 1(3), i_44(5

[Bug middle-end/31699] [4.3 Regression] -march=opteron -ftree-vectorize generates wrong code

2007-05-02 Thread dorit at il dot ibm dot com

--- Comment #6 from dorit at il dot ibm dot com 2007-05-02 20:38 --- patch: http://gcc.gnu.org/ml/gcc-patches/2007-05/msg00111.html -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31699

[Bug middle-end/31699] [Regression 4.3] -march=opteron -ftree-vectorize generates wrong code

2007-04-26 Thread dorit at il dot ibm dot com

--- Comment #3 from dorit at il dot ibm dot com 2007-04-26 19:34 --- Created an attachment (id=13450) -- (http://gcc.gnu.org/bugzilla/attachment.cgi?id=13450action=view) patch -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31699

[Bug middle-end/31699] [Regression 4.3] -march=opteron -ftree-vectorize generates wrong code

2007-04-26 Thread dorit at il dot ibm dot com

--- Comment #4 from dorit at il dot ibm dot com 2007-04-26 19:37 --- I'm testing the attched patch. The problem is that we don't compute the peel factor correctly (when peeling to align a store) when we have multiple data-types in the loop (the computation assumes that VF is the number

[Bug testsuite/31589] gcc.dg/vect failures due to missing target specifiers

2007-04-26 Thread dorit at il dot ibm dot com

--- Comment #4 from dorit at il dot ibm dot com 2007-04-27 05:44 --- patch: http://gcc.gnu.org/ml/gcc-patches/2007-04/msg01739.html requires retesting on ia64 before I can commit it. -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31589

[Bug fortran/31615] testsuite failure in gfortran.dg/vect/vect-5.f90

2007-04-25 Thread dorit at il dot ibm dot com

--- Comment #7 from dorit at il dot ibm dot com 2007-04-25 21:30 --- Are you going to submit/install your patch? yes, I'll go ahead and submit the patch -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31615

[Bug fortran/31615] testsuite failure in gfortran.dg/vect/vect-5.f90

2007-04-19 Thread dorit at il dot ibm dot com

--- Comment #5 from dorit at il dot ibm dot com 2007-04-19 07:27 --- (In reply to comment #4) (In reply to comment #3) But then I wonder why we don't see the same failure on ia64? Because the failing part of the testcase is only done on ilp32 targets: ! { dg-final { scan-tree-dump

[Bug fortran/31615] testsuite failure in gfortran.dg/vect/vect-5.f90

2007-04-18 Thread dorit at il dot ibm dot com

--- Comment #3 from dorit at il dot ibm dot com 2007-04-18 10:18 --- Created dump file using -fdump-tree-vect-details thanks. So I don't understand why we expect to version for 3 different data-references, since there are only 2 in the loop that is vectorized. But then I wonder why we

[Bug tree-optimization/25809] missed PRE optimization - move invariant casts out of loops

2007-04-17 Thread dorit at il dot ibm dot com

--- Comment #5 from dorit at il dot ibm dot com 2007-04-17 07:22 --- Doing cast motion actually causes about 25 *more* failures in the vectorizer testsuite. I'm closing this as won't fix since it seems there was no other reason to do this. can you please send me the patch so that I

[Bug tree-optimization/25809] missed PRE optimization - move invariant casts out of loops

2007-04-17 Thread dorit at il dot ibm dot com

--- Comment #6 from dorit at il dot ibm dot com 2007-04-17 07:38 --- can you please send me the patch so that I could look at this failures before you close this PR? I'm going over my inbox top down, so I just saw that you had laready sent the patch... so I will look into it. (thanks

[Bug tree-optimization/25809] missed PRE optimization - move invariant casts out of loops

2007-04-17 Thread dorit at il dot ibm dot com

--- Comment #7 from dorit at il dot ibm dot com 2007-04-17 19:31 --- so I will look into it. (for reference: http://gcc.gnu.org/ml/gcc-patches/2007-04/msg01103.html). So I guess this should be handled somewhere else. I'll open a new missed-optimization PR instead (not against PRE

[Bug testsuite/31589] gcc.dg/vect failures due to missing target specifiers

2007-04-17 Thread dorit at il dot ibm dot com

--- Comment #2 from dorit at il dot ibm dot com 2007-04-17 20:10 --- 2 more are under investigation: no-section-anchors-vect-69.c vect-reduc-dot-u16a.c In the first testcase, the vectorizer can only prove that the data reference in the third loop is aligned on 8 bytes

[Bug fortran/31615] testsuite failure in gfortran.dg/vect/vect-5.f90

2007-04-17 Thread dorit at il dot ibm dot com

--- Comment #1 from dorit at il dot ibm dot com 2007-04-18 06:42 --- could you please provide the .vect dump file, as generated with -fdump-tree-vect-details? -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31615

[Bug fortran/31561] FAIL: gfortran.dg/vect/vect-4.f90

2007-04-14 Thread dorit at il dot ibm dot com

--- Comment #4 from dorit at il dot ibm dot com 2007-04-14 09:38 --- I think the only thing that really matters is that the loop is vectorized. I don't think the alignment details are important checking, even on platforms where they are relevant. So we should remove all scan-tree

[Bug tree-optimization/31460] if(a) a[i] = xxx; else a[i] = yyy; is not converted to if (a) ddd= xxx; else ddd = yyy; a[i] = ddd;

2007-04-03 Thread dorit at il dot ibm dot com

--- Comment #4 from dorit at il dot ibm dot com 2007-04-03 19:56 --- yes, this is indeed a known problem (I don't know if there's a PR open for it). It is one of the tree-ifcvt enhancements that Victor was going to tackle for 4.3 (item (2.3) in http://gcc.gnu.org/wiki

[Bug target/25413] wrong alignment or incorrect address computation in vectorized code on Pentium 4 SSE

2007-04-03 Thread dorit at il dot ibm dot com

--- Comment #6 from dorit at il dot ibm dot com 2007-04-03 20:22 --- So I see Devang had sent a patch for this over a year ago: http://gcc.gnu.org/ml/gcc-patches/2006-03/msg00167.html I don't know what ever happened to it. Maybe you want to give it a try? (you may need to implement

[Bug target/31334] Bad codegen for vector initializer with constants prop'd into a vector initializer

2007-04-03 Thread dorit at il dot ibm dot com

--- Comment #8 from dorit at il dot ibm dot com 2007-04-03 20:46 --- (In reply to comment #7) Something like: (define_insn_and_split altivec_dupmode [(set (match_operand:V 0 register_operand v) (vec_duplicate: (match_operand: 0 r))) (clobber (match_operand:V 3

[Bug target/30784] [4.3 regression] ICE on loop vectorization (-O1 -march=athlon-xp -ftree-vectorize)

2007-03-24 Thread dorit at il dot ibm dot com

--- Comment #7 from dorit at il dot ibm dot com 2007-03-24 08:00 --- patch: http://gcc.gnu.org/ml/gcc/2007-03/msg00918.html -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=30784

[Bug tree-optimization/31333] New: ICE with -fno-tree-dominator-opts -ftree-vectorize -msse

2007-03-24 Thread dorit at il dot ibm dot com

: P3 Component: tree-optimization AssignedTo: unassigned at gcc dot gnu dot org ReportedBy: dorit at il dot ibm dot com GCC build triplet: i386-linux GCC host triplet: i386-linux GCC target triplet: i386-linux http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31333

[Bug target/31334] New: Bad codegen for vectorized induction with altivec

2007-03-24 Thread dorit at il dot ibm dot com

codegen for vectorized induction with altivec Product: gcc Version: 4.3.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: target AssignedTo: unassigned at gcc dot gnu dot org ReportedBy: dorit at il dot ibm dot

[Bug target/30784] [4.3 regression] ICE on loop vectorization (-O1 -march=athlon-xp -ftree-vectorize)

2007-03-14 Thread dorit at il dot ibm dot com

--- Comment #4 from dorit at il dot ibm dot com 2007-03-14 12:13 --- I also saw this on powerpc64, on a different testcase (vectorizing longs with -m64). seems like constant propagation during dom3 propagates the vector initializer into a BIT_FIELD_EXPR, which results in invalid gimple

[Bug target/30784] [4.3 regression] ICE on loop vectorization (-O1 -march=athlon-xp -ftree-vectorize)

2007-03-14 Thread dorit at il dot ibm dot com

--- Comment #5 from dorit at il dot ibm dot com 2007-03-14 12:29 --- this is the testcase I have ICE-ing on powerpc64-yellowdog, when compiled with -ftree-vectorize -maltivec -m64 -O2: long stack_vars_sorted[32]; void partition_stack_vars (long stack_vars_num) { long si, n

[Bug tree-optimization/31041] [4.3 Regression] verify_stmts failed: invalid operand to binary operator with -O2 -ftree-vectorize

2007-03-05 Thread dorit at il dot ibm dot com

--- Comment #5 from dorit at il dot ibm dot com 2007-03-05 20:15 --- I'm travelling now, but can prepare a fix when I'm back (next week). -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31041

[Bug tree-optimization/30858] [4.3 Regression] ice for legal code with -O2 -ftree-vectorize

2007-02-21 Thread dorit at il dot ibm dot com

--- Comment #8 from dorit at il dot ibm dot com 2007-02-21 19:31 --- Is this acceptable ? sure, thanks -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=30858

[Bug tree-optimization/30858] [4.3 Regression] ice for legal code with -O2 -ftree-vectorize

2007-02-20 Thread dorit at il dot ibm dot com

--- Comment #6 from dorit at il dot ibm dot com 2007-02-20 22:56 --- proposed patches - http://gcc.gnu.org/ml/gcc-patches/2007-02/msg01734.html I have thrown most of Suse Linux 10.3 at it and it has crashed in a few places. would you mind giving these patches a try? (to see what's

[Bug c/30843] ice for legal code with -ftree-vectorize -O2

2007-02-19 Thread dorit at il dot ibm dot com

--- Comment #3 from dorit at il dot ibm dot com 2007-02-19 08:28 --- Looks like possibly some bad interaction between vectorization of induction and vectorization of strided-access. Will investigate. I looked into it with Ira, and looks like the problem is that during

[Bug c/30858] ice for legal code with -O2 -ftree-vectorize

2007-02-19 Thread dorit at il dot ibm dot com

--- Comment #2 from dorit at il dot ibm dot com 2007-02-19 12:45 --- Reduced testcase: int foo (int ko) { int j,i; for (j = 0; j ko; j++) i += (i 10) ? -5 : 7; return i; } Looking into it... -- dorit at il dot ibm dot com changed: What|Removed

[Bug c/30858] ice for legal code with -O2 -ftree-vectorize

2007-02-19 Thread dorit at il dot ibm dot com

--- Comment #3 from dorit at il dot ibm dot com 2007-02-19 12:56 --- (In reply to comment #0) Thanks for exercising the vectorizer and reporting these bugs! On the wider issue of the quality of the vectorizer, I have thrown most of Suse Linux 10.3 at it and it has crashed in a few

[Bug tree-optimization/30858] [4.3 Regression] ice for legal code with -O2 -ftree-vectorize

2007-02-19 Thread dorit at il dot ibm dot com

--- Comment #5 from dorit at il dot ibm dot com 2007-02-19 14:12 --- Looks like I wasn't careful enough with my fix for PR30771. Here is a fix for that fix I'm now testing: Index: tree-vect-analyze.c === --- tree-vect

[Bug tree-optimization/30795] [4.3 Regression] ice for legal code with -ftree-vectorize -O2

2007-02-18 Thread dorit at il dot ibm dot com

--- Comment #4 from dorit at il dot ibm dot com 2007-02-18 16:42 --- patch: http://gcc.gnu.org/ml/gcc-patches/2007-02/msg01555.html -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=30795

[Bug c/30843] ice for legal code with -ftree-vectorize -O2

2007-02-18 Thread dorit at il dot ibm dot com

--- Comment #2 from dorit at il dot ibm dot com 2007-02-18 21:50 --- I was able to reproduce it. Here's a reduced testcase: void dacP98FillRGBMap( unsigned char *pBuffer ) { unsigned long dw, dw1; unsigned long *pdw = (unsigned long *)(pBuffer); for( dw = 256, dw1 = 0; dw

[Bug tree-optimization/30795] [4.3 Regression] ice for legal code with -ftree-vectorize -O2

2007-02-15 Thread dorit at il dot ibm dot com

--- Comment #3 from dorit at il dot ibm dot com 2007-02-15 10:21 --- I'll look into it. -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=30795

[Bug c++/30771] ice for legal code with -O2 -ftree-vectorize

2007-02-12 Thread dorit at il dot ibm dot com

--- Comment #3 from dorit at il dot ibm dot com 2007-02-12 10:11 --- I'll look into it. -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=30771

[Bug tree-optimization/30771] ice for legal code with -O2 -ftree-vectorize

2007-02-12 Thread dorit at il dot ibm dot com

--- Comment #4 from dorit at il dot ibm dot com 2007-02-12 14:23 --- I'm testing the patch below. (wasn;t able to reproduce the problem in the attched testcase, but here's a reduced testcase for the problem that Richi described - thanks!: int a[128]; int main() { short i; for (i=0

[Bug tree-optimization/29145] unsafe use of restrict qualifier

2007-02-06 Thread dorit at il dot ibm dot com

--- Comment #11 from dorit at il dot ibm dot com 2007-02-06 08:18 --- (In reply to comment #10) One thing I can think of that this description misses is that the two pointers must be based-on *different* restrict-qualified pointers, unless that case is already handled elsewhere

[Bug tree-optimization/29145] unsafe use of restrict qualifier

2007-01-07 Thread dorit at il dot ibm dot com

--- Comment #8 from dorit at il dot ibm dot com 2007-01-07 20:22 --- I'm testing this patch, that makes us more conservative, and concludes that two pointers don't overlap only if both are based on restricted pointers, with based on trivially implemented as the pointer used

[Bug tree-optimization/30038] Call to sin(x), cos(x) should be transformed to sincos(x)

2006-12-12 Thread dorit at il dot ibm dot com

--- Comment #16 from dorit at il dot ibm dot com 2006-12-12 20:59 --- (In reply to comment #13) Looks like what's blocking vectorization of the loop is: sinc.f90:8: note: value used after loop. sinc.f90:8: note: not vectorized: relevant stmt not supported: D.1408_32 = (*radius_31)[D

[Bug tree-optimization/30038] Call to sin(x), cos(x) should be transformed to sincos(x)

2006-12-07 Thread dorit at il dot ibm dot com

--- Comment #11 from dorit at il dot ibm dot com 2006-12-07 20:19 --- (In reply to comment #10) Using the three patches: ... gfortran is able to use sincos - and does so for my example (comment #0; the example, however, cannot be vectorized). why? (what does -fdump-tree-vect

[Bug fortran/29779] [4.3 Regression] vectorizer fortran testcases failing

2006-12-06 Thread dorit at il dot ibm dot com

--- Comment #12 from dorit at il dot ibm dot com 2006-12-06 22:22 --- By the way, you wrote 2006-11-17: Should be submitted this weekend Any new ETA? It was already submitted: http://gcc.gnu.org/ml/gcc-patches/2006-12/msg00110.html -- http://gcc.gnu.org/bugzilla/show_bug.cgi

[Bug fortran/29779] [4.3 Regression] vectorizer fortran testcases failing

2006-11-16 Thread dorit at il dot ibm dot com

--- Comment #7 from dorit at il dot ibm dot com 2006-11-17 06:46 --- (In reply to comment #6) This patch should fix the problem: indeed it does, thanks! are you going to submit it to mainline? -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=29779

[Bug tree-optimization/29777] New: missed optimization: model missing widen_mult* idioms for SSE

2006-11-09 Thread dorit at il dot ibm dot com

at il dot ibm dot com GCC build triplet: i?86-*-* and x86_64-*-* GCC host triplet: i?86-*-* and x86_64-*-* GCC target triplet: i?86-*-* and x86_64-*-* http://gcc.gnu.org/bugzilla/show_bug.cgi?id=29777

[Bug tree-optimization/29778] New: missed optimization: model missing vec_pack/unpack idioms for ia64

2006-11-09 Thread dorit at il dot ibm dot com

: unassigned at gcc dot gnu dot org ReportedBy: dorit at il dot ibm dot com GCC build triplet: ia64-*-* GCC host triplet: ia64-*-* GCC target triplet: ia64-*-* http://gcc.gnu.org/bugzilla/show_bug.cgi?id=29778

[Bug middle-end/29779] New: vectorizer fortran testcases failing

2006-11-09 Thread dorit at il dot ibm dot com

Component: middle-end AssignedTo: unassigned at gcc dot gnu dot org ReportedBy: dorit at il dot ibm dot com GCC build triplet: ppc*-*-linux GCC host triplet: ppc*-*-linux GCC target triplet: ppc*-*-linux http://gcc.gnu.org/bugzilla/show_bug.cgi?id=29779

[Bug tree-optimization/29777] missed optimization: model missing widen_mult* idioms for SSE

2006-11-09 Thread dorit at il dot ibm dot com

--- Comment #2 from dorit at il dot ibm dot com 2006-11-09 20:26 --- But these files can be succesfully vectorized using current (gcc version 4.3.0 20061109) version on i686: gcc -O2 -msse2 -ftree-vectorize -fdump-tree-vect-all vect-widen-mult-sum.c vect-widen-mult-sum.c:16: note

[Bug tree-optimization/29145] unsafe use of restrict qualifier

2006-11-05 Thread dorit at il dot ibm dot com

--- Comment #6 from dorit at il dot ibm dot com 2006-11-05 15:48 --- (In reply to comment #5) This was something that slipped in, IIRC. I was of Ian's viewpoint, that may_alias_p should handle it, and it shouldn't be special to data-references. yes, it was originally added

[Bug middle-end/29268] New: missed optimization: need to generalize realignment support in the vectorizer

2006-09-28 Thread dorit at il dot ibm dot com

support in the vectorizer Product: gcc Version: 4.2.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: middle-end AssignedTo: unassigned at gcc dot gnu dot org ReportedBy: dorit at il dot

[Bug middle-end/29269] New: missing documentation for vcond (vector conditional operation)

2006-09-28 Thread dorit at il dot ibm dot com

Priority: P3 Component: middle-end AssignedTo: unassigned at gcc dot gnu dot org ReportedBy: dorit at il dot ibm dot com http://gcc.gnu.org/bugzilla/show_bug.cgi?id=29269

[Bug middle-end/29160] New: missed optimization: redundant casts prevent vectorization

2006-09-21 Thread dorit at il dot ibm dot com

vectorization Product: gcc Version: 4.2.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: middle-end AssignedTo: unassigned at gcc dot gnu dot org ReportedBy: dorit at il dot ibm dot com http://gcc.gnu.org

[Bug tree-optimization/29170] autovec cannot handle short+=short

2006-09-21 Thread dorit at il dot ibm dot com

--- Comment #4 from dorit at il dot ibm dot com 2006-09-21 19:30 --- By the way, the testcase gets vectorized if you compile with -fwrapv. -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=29170

[Bug middle-end/28684] Imprecise -funsafe-math-optimizations definition

2006-09-11 Thread dorit at il dot ibm dot com

--- Comment #4 from dorit at il dot ibm dot com 2006-09-11 10:57 --- You could help by looking at the source code (there are only a few dozens places mentioning flag_unsafe_math_optimizations) and auditing which places would be more suited to a new flag_reassociate_fp variable. we'd

[Bug tree-optimization/27742] [4.2 regression] ICE with -ftree-vectorizer-verbose

2006-08-31 Thread dorit at il dot ibm dot com

--- Comment #9 from dorit at il dot ibm dot com 2006-08-31 08:08 --- I have been unsuccessful in reproducing this problem on a i386-redhat-linux. I don't get a failure compiling the testcase from comment 8. I tried to compile the testcase from comment 7 and got the following errors

[Bug tree-optimization/26969] [4.1 Regression] ICE with -O1 -funswitch-loops -ftree-vectorize

2006-08-31 Thread dorit at il dot ibm dot com

--- Comment #10 from dorit at il dot ibm dot com 2006-08-31 08:22 --- I think this can be closed? (I opened a missed-optimization PR instead - PR28643) -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=26969

[Bug tree-optimization/26969] [4.1 Regression] ICE with -O1 -funswitch-loops -ftree-vectorize

2006-08-31 Thread dorit at il dot ibm dot com

--- Comment #12 from dorit at il dot ibm dot com 2006-09-01 05:43 --- oops - I didn't notice it was open against 4.1. So hopefully porting Victor's patch to 4.1 would fix it. -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=26969

[Bug target/27827] [4.0/4.1 Regression] gcc 4 produces worse x87 code on all platforms than gcc 3

2006-08-09 Thread dorit at il dot ibm dot com

--- Comment #55 from dorit at il dot ibm dot com 2006-08-09 19:10 --- Subject: Re: [4.0/4.1 Regression] gcc 4 produces worse x87 code on all platforms than gcc 3 Here's some questions I need to figure out: (1) Why do I have to throw the -funsafe-math-optimizations flag to enable

[Bug tree-optimization/28643] redundant phi-node in latch-block prevents vectorization

2006-08-08 Thread dorit at il dot ibm dot com

--- Comment #3 from dorit at il dot ibm dot com 2006-08-08 07:38 --- Err, SSA copy prop should be enough, actually, since after copy-prop, the phi will have no users (and they shouldn't care about code with no uses that doesn't access memory). Though it's interesting

[Bug middle-end/28628] New: Not forcing alignment of arrays in structs with -fsection-anchors

2006-08-07 Thread dorit at il dot ibm dot com

Priority: P3 Component: middle-end AssignedTo: unassigned at gcc dot gnu dot org ReportedBy: dorit at il dot ibm dot com GCC build triplet: powerpc-linux GCC host triplet: powerpc-linux GCC target triplet: powerpc-linux http://gcc.gnu.org/bugzilla/show_bug.cgi?id

[Bug middle-end/27770] [4.2 Regression] wrong code in spec tests for -ftree-vectorize -maltivec

2006-08-07 Thread dorit at il dot ibm dot com

--- Comment #25 from dorit at il dot ibm dot com 2006-08-07 07:09 --- (In reply to comment #24) Fixed, a new different bug for the missed optimization should be opened. It's PR28628. -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=27770

[Bug target/27827] [4.0/4.1 Regression] gcc 4 produces worse x87 code on all platforms than gcc 3

2006-08-07 Thread dorit at il dot ibm dot com

--- Comment #43 from dorit at il dot ibm dot com 2006-08-07 20:35 --- I'm all for this. info gcc says that w/o a guarantee of alignment, loops are duped, with an if selecting between vector and scalar loops, is this not accurate? yes I spent a day trying to get gcc to vectorize

[Bug middle-end/28643] New: redundant phi-node in latch-block prevents vectorization

2006-08-07 Thread dorit at il dot ibm dot com

: dorit at il dot ibm dot com GCC build triplet: powerpc-linux GCC host triplet: powerpc-linux GCC target triplet: powerpc-linux http://gcc.gnu.org/bugzilla/show_bug.cgi?id=28643

[Bug middle-end/27770] [4.2 Regression] wrong code in spec tests for -ftree-vectorize -maltivec

2006-07-23 Thread dorit at il dot ibm dot com

--- Comment #19 from dorit at il dot ibm dot com 2006-07-23 19:03 --- The fix we've agreed is best in principle is to speculatively increase the DECL_ALIGN of vectorisable variables before compiling functions. Dorit says that there is a patch related to this on the autovect branch

[Bug tree-optimization/26197] [4.2 regression] ICE in is_old_name with vectorizer

2006-03-01 Thread dorit at il dot ibm dot com

--- Comment #13 from dorit at il dot ibm dot com 2006-03-01 12:35 --- So I'll submit the patch to gcc-patches for approval. Can someone please check if this patch actually solves this PR? -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=26197

[Bug tree-optimization/26197] [4.2 regression] ICE in is_old_name with vectorizer

2006-02-28 Thread dorit at il dot ibm dot com

--- Comment #11 from dorit at il dot ibm dot com 2006-02-28 08:26 --- Created an attachment (id=10935) -- (http://gcc.gnu.org/bugzilla/attachment.cgi?id=10935action=view) tentative patch I get a similar error message when trying to bootstrap mainline with vectorization enabled: /home

[Bug tree-optimization/26420] -ftree-vectorizer-verbose=1 prints unvectorized loops information

2006-02-26 Thread dorit at il dot ibm dot com

--- Comment #2 from dorit at il dot ibm dot com 2006-02-26 11:01 --- For -ftree-vectorizer-verbose=1 the vectorizer reports each loop that got vectorized, and also the total number of loops that got vectorized, even if that number is zero. If preferable, we can report that 0 loops got

[Bug tree-optimization/26419] -ftree-vectorizer-verbose=n documentation is terse

2006-02-26 Thread dorit at il dot ibm dot com

--- Comment #3 from dorit at il dot ibm dot com 2006-02-26 11:05 --- patch: http://gcc.gnu.org/ml/gcc-patches/2006-02/msg01905.html -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=26419

[Bug tree-optimization/26359] [4.2 Regression] Over optimization of loop when using -ftree-vectorize

2006-02-21 Thread dorit at il dot ibm dot com

--- Comment #3 from dorit at il dot ibm dot com 2006-02-21 22:01 --- patch: http://gcc.gnu.org/ml/gcc-patches/2006-02/msg01710.html -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=26359

[Bug tree-optimization/26360] [4.2 Regression] Autovectorization of char - int loop gets ICE

2006-02-21 Thread dorit at il dot ibm dot com

--- Comment #3 from dorit at il dot ibm dot com 2006-02-21 22:02 --- patch: http://gcc.gnu.org/ml/gcc-patches/2006-02/msg01713.html -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=26360

[Bug tree-optimization/26362] ICE on the autovect-branch (gfortran example)

2006-02-20 Thread dorit at il dot ibm dot com

--- Comment #1 from dorit at il dot ibm dot com 2006-02-20 16:45 --- Looks like the vectorizer detects a strided access in this testcase. Strided accesses are not entirely supported for SSE right now (work in progress...), but it is enabled, so currently all strided testcases brake

[Bug tree-optimization/26362] ICE on the autovect-branch (gfortran example)

2006-02-20 Thread dorit at il dot ibm dot com

--- Comment #2 from dorit at il dot ibm dot com 2006-02-20 17:09 --- Actually there's this patch by rth that seems to fix this ICE; it's from a while back, I don't think it was fully tested at the time, and I'm not sure it provides all the missing bits/fixes for SSE support

[Bug tree-optimization/26360] [4.2 Regression] Autovectorization of char - int loop gets ICE

2006-02-19 Thread dorit at il dot ibm dot com

--- Comment #2 from dorit at il dot ibm dot com 2006-02-19 08:50 --- This happens because we actually rely on dce taking place after the vectorizer to clean up dead code. When we detect a pattern (widneing-summation in this case) we create a dummy stmt (pattern-stmt) that represents

[Bug tree-optimization/26359] [4.2 Regression] Over optimization of loop when using -ftree-vectorize

2006-02-19 Thread dorit at il dot ibm dot com

--- Comment #2 from dorit at il dot ibm dot com 2006-02-19 15:34 --- The problem is that during dce the call to is_hidden_global_store returns false cause the tag is not marked as global/static. This seems to fix it: Index: tree-ssa-alias.c

[Bug tree-optimization/26197] [4.2 regression] ICE in is_old_name, at tree-into-ssa.c:466

2006-02-19 Thread dorit at il dot ibm dot com

--- Comment #10 from dorit at il dot ibm dot com 2006-02-19 16:10 --- so maybe if an SFT has may-aliases then new_type_alias should add the may-aliases of the SFT as may-aliases of the new tag, instead of adding the SFT as a may-alias of the new tag. ? There's a comment

[Bug tree-optimization/26197] [4.2 regression] ICE in is_old_name, at tree-into-ssa.c:466

2006-02-13 Thread dorit at il dot ibm dot com

--- Comment #6 from dorit at il dot ibm dot com 2006-02-13 16:23 --- (In reply to comment #5) Probably related to http://gcc.gnu.org/ml/gcc-patches/2006-01/msg00446.html Would you expect then that calling mark_new_vars_to_rename, like you did in your patch, will fix this problem? I

[Bug tree-optimization/25918] gcc.dg/vect/vect-reduc-dot-s16.c scan-tree-dump-times vectorized 1 loops 1 and gcc.dg/vect/vect-reduc-pattern-2.c scan-tree-dump-times vectorized 2 loops 1 fail

2006-02-08 Thread dorit at il dot ibm dot com

--- Comment #6 from dorit at il dot ibm dot com 2006-02-08 14:17 --- (In reply to comment #4) ... This happens because the IA-64 port defines the widen_ssumv4hi3 pattern. The IA-64 port is the only one that defines this pattern, and hence is probably the only port broken here. All

[Bug tree-optimization/25918] gcc.dg/vect/vect-reduc-dot-s16.c scan-tree-dump-times vectorized 1 loops 1 and gcc.dg/vect/vect-reduc-pattern-2.c scan-tree-dump-times vectorized 2 loops 1 fail

2006-02-08 Thread dorit at il dot ibm dot com

--- Comment #7 from dorit at il dot ibm dot com 2006-02-08 14:19 --- (In reply to comment #5) Will take care of that. -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=25918

[Bug tree-optimization/25918] gcc.dg/vect/vect-reduc-dot-s16.c scan-tree-dump-times vectorized 1 loops 1 and gcc.dg/vect/vect-reduc-pattern-2.c scan-tree-dump-times vectorized 2 loops 1 fail

2006-01-26 Thread dorit at il dot ibm dot com

--- Comment #1 from dorit at il dot ibm dot com 2006-01-26 09:07 --- Can you please send the dump files generated by -fdump-tree-vect-details? reduc-dot-s16.c needs the sdot_prodv4hi pattern, which is implemented for ia64, so I'd expect one loop to be vectorized. I wonder what's

[Bug tree-optimization/25911] [4.2 Regression] ice in vect_recog_dot_prod_pattern

2006-01-24 Thread dorit at il dot ibm dot com

--- Comment #5 from dorit at il dot ibm dot com 2006-01-24 09:10 --- Patch: Index: tree-vect-patterns.c === --- tree-vect-patterns.c(revision 109954) +++ tree-vect-patterns.c(working copy) @@ -243,7 +243,8

[Bug tree-optimization/25809] New: missed PRE optimization - move invariant casts out of loops

2006-01-16 Thread dorit at il dot ibm dot com

Severity: normal Priority: P3 Component: tree-optimization AssignedTo: unassigned at gcc dot gnu dot org ReportedBy: dorit at il dot ibm dot com GCC build triplet: ppc64-yellowdog-linux GCC host triplet: ppc64-yellowdog-linux GCC target triplet: ppc64-yellowdog

[Bug libfortran/21468] vectorizing libfortran

2006-01-08 Thread dorit at il dot ibm dot com

--- Comment #10 from dorit at il dot ibm dot com 2006-01-08 13:49 --- Reopening since many of the intrinsics could still vectorize better. Could help if you list specific functions that you expect to get vectorized. As far as dotprod is concerned - if it's operating on floats, you

[Bug testsuite/25590] FAIL: gcc.dg/tree-ssa/gen-vect-11.c scan-tree-dump-times vectorized 1 loops 1

2006-01-03 Thread dorit at il dot ibm dot com

--- Comment #6 from dorit at il dot ibm dot com 2006-01-04 07:33 --- Maybe related to: 2005-12-26 Kazu Hirata [EMAIL PROTECTED] PR tree-optimization/25125 * convert.c (convert_to_integer): Don't narrow the type of a PLUX_EXPR or MINUS_EXPR if !flag_wrapv

[Bug testsuite/25590] FAIL: gcc.dg/tree-ssa/gen-vect-11.c scan-tree-dump-times vectorized 1 loops 1

2006-01-03 Thread dorit at il dot ibm dot com

--- Comment #7 from dorit at il dot ibm dot com 2006-01-04 07:36 --- (sorry, didn't notice it was already diagnosed as such) -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=25590

[Bug target/25413] wrong alignment or incorrect address computation in vectorized code on Pentium 4 SSE

2005-12-15 Thread dorit at il dot ibm dot com

--- Comment #2 from dorit at il dot ibm dot com 2005-12-15 12:41 --- The problem is that the vectorizer applies loop-peeling in order to align the data reference *(m-c+i), and peeling only works correctly if the data is naturally aligned (aligned on it's type size). This is what

[Bug target/25413] wrong alignment or incorrect address computation in vectorized code on Pentium 4 SSE

2005-12-15 Thread dorit at il dot ibm dot com

--- Comment #3 from dorit at il dot ibm dot com 2005-12-15 12:50 --- related discussion: http://gcc.gnu.org/ml/gcc/2005-12/msg00390.html -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=25413

[Bug target/24378] [4.1/4.2 Regression] gcc.dg/vect/pr24300.c (test for excess errors) fails

2005-12-14 Thread dorit at il dot ibm dot com

--- Comment #9 from dorit at il dot ibm dot com 2005-12-14 15:38 --- Thanks for testing the patch. I finally submitted it: http://gcc.gnu.org/ml/gcc-patches/2005-12/msg01071.html -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=24378

1 2 3 >

1 - 100 of 200 matches

Mail list logo