Re: [cfe-commits] [Patch] Add a warning for for loops with conditions that do not change

Douglas Gregor Mon, 12 Mar 2012 13:53:48 -0700

On Mar 12, 2012, at 1:38 PM, Richard Trieu wrote:

> 
> 
> On Sun, Mar 11, 2012 at 1:58 PM, Douglas Gregor <[email protected]> wrote:
> 
> On Feb 14, 2012, at 3:42 PM, Richard Trieu wrote:
> 
>> On Tue, Feb 7, 2012 at 10:38 AM, Douglas Gregor <[email protected]> wrote:
>> 
>> On Feb 6, 2012, at 1:00 PM, Eli Friedman wrote:
>> 
>> > On Mon, Feb 6, 2012 at 2:50 PM, Richard Trieu <[email protected]> wrote:
>> >> On Mon, Feb 6, 2012 at 2:02 PM, Eli Friedman <[email protected]> 
>> >> wrote:
>> >>>
>> >>> On Mon, Feb 6, 2012 at 1:55 PM, Richard Trieu <[email protected]> wrote:
>> >>>> The motivation of this path is to catch code like this:
>> >>>>
>> >>>> for (int i = 0; i < 10; ++i)
>> >>>>   for (int j = 0; j < 10; ++i)
>> >>>>     { }
>> >>>>
>> >>>> The second for loop increments i instead of j causing an infinite loop.
>> >>>>  The
>> >>>> warning also checks the body of the for loop so it will trigger on:
>> >>>>
>> >>>> for (int i; i <10; ) { }
>> >>>>
>> >>>> But not trigger on:
>> >>>>
>> >>>> for (int i; i< 10; ) { ++i; }
>> >>>>
>> >>>> I'm still fine-tuning the trigger conditions, but would like some
>> >>>> feedback
>> >>>> on this patch.
>> >>>
>> >>> Adding an additional recursive traversal of the body of every parsed
>> >>> for loop is very expensive.
>> >>>
>> >>> -Eli
>> >>
>> >>
>> >> Is it that expensive?  I would think that once the AST was constructed, 
>> >> the
>> >> visitor would be pretty fast, especially if no action is taken on most of
>> >> the nodes.  I also made the warning default ignore and put in an early
>> >> return to prevent the visitors from running in the default case.
>> >>
>> >> Do you have any suggestions on removing the recursive traversal?
>> >
>> > Okay, thinking about it a bit more, maybe it's not that expensive, but
>> > you should at least measure to make sure.
>> 
>> I'm still very nervous about adding such an AST traversal. Presumably, it's 
>> not going to be a concern in practice because most of the time, the 
>> increment statement of the 'for' loop will mention one of the variables in 
>> the condition, and therefore we'll short-circuit the expensive walk of the 
>> loop body. It would be helpful to know that's the case.
>> 
>> > I don't have any good suggestion for an alternative.
>> 
>> 
>> Nor do I.
>> 
>> A couple more comments:
>> 
>> +    ValueDecl *VD = E->getDecl();
>> +    for (SmallVectorImpl<ValueDecl*>::iterator I = Decls.begin(),
>> +                                               E = Decls.end();
>> +         I != E; ++I)
>> +      if (*I == VD) {
>> +        FoundDecl = true;
>> +        return;
>> +      }
>> 
>> This is linear; please use a SmallPtrSet instead.
>> Done. 
>> 
>> Plus, I think you want to narrow this check to only consider VarDecls. 
>> Functions and enumerators are not interesting.
>> Done. 
>> 
>> +      S.Diag(Second->getLocStart(), diag::warn_variables_not_in_loop_body)
>> +          << Second->getSourceRange();
>> 
>> This warning needs to specify which variables were not used or point to them 
>> in the source.
>> The diagnostic now underlines all the variables in the condition.   
>> 
>> Do we want to actually look for modification, e.g., any use of the variable 
>> that isn't immediately consumed by an lvalue-to-rvalue conversion?
>> Yes, that is exactly what we are checking for.  The VisitCastExpr checks for 
>> LvalueToRvalue casts and skips any DeclRefExpr's that are direct 
>> sub-expressions.
>> 
>>        - Doug
>> 
>> I also did a comparison between runs with and without -Wloop-analysis.  
>> [snip]
> 
> Thanks for all of the performance testing. A few more comments on the actual 
> code:
> 
> Index: include/clang/Basic/DiagnosticSemaKinds.td
> ===================================================================
> --- include/clang/Basic/DiagnosticSemaKinds.td        (revision 150501)
> +++ include/clang/Basic/DiagnosticSemaKinds.td        (working copy)
> @@ -14,6 +14,11 @@
>  let Component = "Sema" in {
>  let CategoryName = "Semantic Issue" in {
>  
> +// For loop analysis
> +def warn_variables_not_in_loop_body : Warning<
> +  "variables used in loop condition not modified in loop body">,
> +  InGroup<DiagGroup<"loop-analysis">>, DefaultIgnore;
> 
> It would be really nice if this diagnostic could mention the variables by 
> name ("variables 'i' and 'n' used in loop condition are not modified in loop 
> body"), like we do with missing enumerators in a switch. It'll take some 
> special casing for 1/2/many variables, but it will be much nicer for the 
> user. 
> 
> I will look into this.
>  
> +  void VisitDeclRefExpr(DeclRefExpr *E) {
> +    VarDecl *VD = dyn_cast<VarDecl>(E->getDecl());
> +    if (!VD) return;
> +
> +    PDiag << E->getSourceRange();
> +
> +    // Dont do analysis on pointers.
> +    if (VD->getType()->isAnyPointerType()) {
> +      Simple = false;
> +      return;
> +    }
> +
> +    Decls.insert(VD);
> +  }
> 
> Presumably, anything volatile-qualified should make the expression non-simple.
> The check for volatile is performed after the DeclMatcher finishes.
> 
> +  void VisitCastExpr(CastExpr *E) {
> +    // Don't do analysis on function calls or arrays.
> +    if (E->getCastKind() == CK_FunctionToPointerDecay ||
> +        E->getCastKind() == CK_ArrayToPointerDecay) {
> +      Simple = false;
> +      return;
> +    }
> +
> +     Visit(E->getSubExpr());
> +  }
> 
> If you want to avoid calls, why not just intercept CallExpr and 
> CXXConstructExpr and call those non-simple?
> 
> Backing up a step, it seems strange that DeclExtractor assumes that an 
> expression is 'simple' unless it sees some very small number of cases that 
> aren't consider simple. That feels very brittle to me because, e.g., 
> CXXConstructExpr is certainly not simple (for non-trivial constructors), yet 
> it would be considered simple by this approach. Similarly for CXXNewExpr and 
> a host of other non-trivial expressions that aren't covered. If we have to 
> have the notion of 'simple', then we need to build it constructively by 
> white-listing expressions that are known to be simple and banning (by 
> default) all other expressions.
> 
> I'm a bit worried about how many visitor methods I need to write to, but I'll 
> give it a try and see what happens.


I could settle for a full audit of all of the expressions, but the fact that 
there are several obvious missing cases worries me a lot.

> +  // DeclMatcher checks to see if the decls are used in a non-evauluated
> +  // context.  
> +  class DeclMatcher : public StmtVisitor<DeclMatcher> {
> 
> Should this use an EvaluatedExprVisitor to provide more accurate diagnostics? 
> e.g., the presence of sizeof(x) doesn't mean that 'x' has been used in a 
> meaningful way.
> 
> An EvaluatedExprVisitor is used to find the decls used in the loop condition, 
> which is called the DeclExtractor.  A separate visitor, the DeclMatch, is 
> used on the loop conditional, the loop increment, and the loop body to see if 
> the decls are in use.  For this, non-evaluated contexts that could change the 
> decl are important, so an EvaluatedExprVisitor can not be used here.

How would a non-evaluated context change the value of the variable?

        - Doug

_______________________________________________
cfe-commits mailing list
[email protected]
http://lists.cs.uiuc.edu/mailman/listinfo/cfe-commits

Re: [cfe-commits] [Patch] Add a warning for for loops with conditions that do not change

Reply via email to