On Mar 8, 2011, at 2:05 PM, [email protected] wrote:
In fact, for realistic parameter sets, computing the gradients (with
some automatic differentiation data type) consumes about 90% of the
total runtime of my program. Evaluating the function without differentiation (data type double) is quite cheap.

Peter, this probably means that your automatic differentiation program is poor for this kind of problem.

There is a theorem that you can always compute the gradient of f(x) (any function from R^n to R) with about as many additional operations as are required to compute f(x) **once**. The technique for doing so is called an "adjoint method", or "reverse accumulation".
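(To make this concrete, here is a minimal hand-written sketch of reverse accumulation for one small function, f(x) = (x0*x1 + sin(x0))^2; the function and variable names are purely illustrative and not taken from any particular AD library. The point is that the reverse sweep costs roughly the same number of operations as evaluating f itself, independent of the number of inputs.)

#include <cmath>
#include <cstdio>

int main() {
    double x0 = 1.3, x1 = -0.7;

    /* Forward sweep: evaluate f(x0,x1) = (x0*x1 + sin(x0))^2,
       keeping the intermediate values. */
    double a = x0 * x1;
    double b = std::sin(x0);
    double c = a + b;
    double f = c * c;

    /* Reverse sweep: propagate adjoints df/d(intermediate) backwards.
       This costs about as many operations as the sweep above,
       no matter how many inputs there are. */
    double dc = 2.0 * c;                       /* df/dc  */
    double da = dc;                            /* df/da  */
    double db = dc;                            /* df/db  */
    double dx0 = da * x1 + db * std::cos(x0);  /* df/dx0 */
    double dx1 = da * x0;                      /* df/dx1 */

    std::printf("f = %g, grad = (%g, %g)\n", f, dx0, dx1);
    return 0;
}

A reverse-mode AD tool automates exactly this backward sweep, typically by recording the forward operations on a "tape" and replaying them in reverse.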

So you want an AD program that advertises "reverse mode" differentiation or something similar.

In contrast, AD programs that use "forward mode" or "forward accumulation" differentiation are poorly suited to differentiating multivariate functions (R^n to R), since their cost grows in proportion to n. (Forward accumulation is good for differentiating functions from R to R^m.)
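(For comparison, here is a minimal sketch of forward accumulation using dual numbers for the same illustrative function; the Dual type below is hand-rolled, not from any library. Each pass through f carries a single directional derivative, so recovering the full gradient takes n passes, one per input, which is where the cost proportional to n comes from.)

#include <cmath>
#include <cstdio>

struct Dual { double val, dot; };  /* value and one directional derivative */

Dual operator*(Dual a, Dual b) { return { a.val * b.val, a.dot * b.val + a.val * b.dot }; }
Dual operator+(Dual a, Dual b) { return { a.val + b.val, a.dot + b.dot }; }
Dual sin(Dual a) { return { std::sin(a.val), std::cos(a.val) * a.dot }; }

/* Same illustrative function as above: f(x) = (x0*x1 + sin(x0))^2. */
Dual f(Dual x0, Dual x1) { Dual c = x0 * x1 + sin(x0); return c * c; }

int main() {
    double x[2] = { 1.3, -0.7 }, grad[2];
    /* One pass per input variable: seed dot = 1 for that variable only. */
    for (int i = 0; i < 2; ++i) {
        Dual x0 = { x[0], i == 0 ? 1.0 : 0.0 };
        Dual x1 = { x[1], i == 1 ? 1.0 : 0.0 };
        grad[i] = f(x0, x1).dot;
    }
    std::printf("grad = (%g, %g)\n", grad[0], grad[1]);
    return 0;
}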

Steven

