Re: [math] Usage of DifferentiableMultivariateRealFunction

Andreas Niekler Mon, 14 May 2012 07:36:37 -0700

Thank you all very much.

I have updated my DifferentiableMultivariateRealFunction and this iswhat it looks like now (appended). I use this and stick it into aMultiStartDifferentiableMultivariateRealOptimizer. Unfortunately thisprocedure end with the creation of some negative parameters (impossibleby method) which i cannot oversee right now (wrong gradients, wrongprocedure?). Here is the procedure i'm using. Hope this clarifies mytask and maybe to get some more hints to optimize my usage of commons Math:


parameter[2] = 1.0;
parameter[1] = 1.0;
parameter[0] = 1.0;

NonLinearConjugateGradientOptimizer underlying = newNonLinearConjugateGradientOptimizer(ConjugateGradientFormula.FLETCHER_REEVES);

                                        
JDKRandomGenerator g = new JDKRandomGenerator();
g.setSeed(753289573253l);
                                
RandomVectorGenerator generator =
new UncorrelatedRandomVectorGenerator(3, new GaussianRandomGenerator(g));

MultiStartDifferentiableMultivariateRealOptimizer optimizer = newMultiStartDifferentiableMultivariateRealOptimizer(underlying, 10,generator);GaussianProcessRegressionMarginalLikelihood gprml = newGaussianProcessRegressionMarginalLikelihood(input, X);

                                
RealPointValuePair pair = null;
                                
try {
pair = optimizer.optimize(gprml, GoalType.MAXIMIZE, parameter);
} catch (OptimizationException e) {
// TODO Auto-generated catch block
e.printStackTrace();
} catch (FunctionEvaluationException e) {
// TODO Auto-generated catch block
e.printStackTrace();
}

Am 14.05.2012 15:42, schrieb Luc Maisonobe:

Le 14/05/2012 14:11, Andreas Niekler a écrit :

Hello,


Hi Andreas,


after reading a lot through the tutorial this is the code that i came up
with regarding the implementation of a gaussian process regression
optimisation (File appended):

initCovarianceAndGradients(): initialisation of matrices and
calculations which are needed by both marginal likelihood calculation
and gradient calculation:

Within this function i calculate some things globally which are strongly
reused by the value() and gradient() functions. What i do not really
understand is the passing of the double[] argument to the value()
function and the value() function of the gradient() method. Are those
methods called by the optimizer with the updated parameters? If this is
the case i have to recalculate the global calculations with each call to
the value() and gradient() methods.


Yes, the double[] argument is updated at each call and correspond to the
current estimate as the algorithm iterates towards the solution.

You cannot even rely on the calls being always scheduled in the same
way. As an example, the Gauss-Newton optimizer performs the two calls
with function first and gradient afterwards at each iteration, but the
Levenberg-Marquardt optimizer has two embedded loops and computes
Jacobians on the external loop and the function value on the internal
loop. So you should probably not compute everything beforehand in the
hope it will be used later on.

Luc


Thanks for clarification

Am 14.05.2012 12:53, schrieb Gilles Sadowski:

Hello.


thanks for the reply. But i wonder what is the input for value and
gradient.
in DifferentiableMultivariateRealFunction this needs to be a double
array
but what needs to be provided there? The parameters for the function to
optimize?

Thank you very much again

Andreas

Do please have a look to the examples, as your question (and my
answer) is too vague if not supported by proper code. I guess the
answer to your question is 'yes', the double[] array is indeed the set
of parameters, but again, do check the examples, I would not like to
be misguiding you. Besides the user guide which should provide you
with the answer, have a look to this implementation [1], line 153. In
this implementation, x[i] and y[i] are the data points, yhat[i] are
the model predictions, and a[] are the parameters. You should be able
to find your way with this example.


I've also just added another bit of code show-casing the usage of the
"non-linear least-squares" optimizers (svn revision 1338144).


Best regards,
Gilles

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]





---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]


--
Dipl.-Ing. (FH) Andreas Niekler
Mitarbeiter und Promovend
Bereich Multimedia-Produktionssysteme und -technologien

Hochschule für Technik, Wirtschaft und Kultur Leipzig
Fachbereich Medien

Besucher
Gustav-Freytag-Straße 40
04277 Leipzig

Telefon: +49 0341 30 76 2378

Email: [email protected]
http://www.fbm.htwk-leipzig.de

/**
 * 
 */
package de.uni_leipzig.asv.toolchain.outputApplication.TimeSeriesAnalysis.cov;

import org.apache.commons.math.FunctionEvaluationException;
import org.apache.commons.math.analysis.DifferentiableMultivariateRealFunction;
import org.apache.commons.math.analysis.MultivariateRealFunction;
import org.apache.commons.math.analysis.MultivariateVectorialFunction;
import org.apache.commons.math.linear.CholeskyDecomposition;
import org.apache.commons.math.linear.CholeskyDecompositionImpl;
import org.apache.commons.math.linear.DecompositionSolver;
import org.apache.commons.math.linear.LUDecompositionImpl;
import org.apache.commons.math.linear.MatrixUtils;
import org.apache.commons.math.linear.NonSquareMatrixException;
import org.apache.commons.math.linear.NotPositiveDefiniteMatrixException;
import org.apache.commons.math.linear.NotSymmetricMatrixException;
import org.apache.commons.math.linear.RealMatrix;
import org.apache.commons.math.stat.StatUtils;

/**
 * @author Niekler
 *
 */
public class GaussianProcessRegressionMarginalLikelihood implements
                DifferentiableMultivariateRealFunction {

        RealMatrix K;
        RealMatrix invK;
        
        RealMatrix y;
        CholeskyDecomposition L;
        double L_diag;
        RealMatrix alpha;
        
        RealMatrix alpha2;
        
        private double[][] cov;
        private double [][][] paramCov;
        
        private double[] targets;
        private double[] X;
        
        
        
        public GaussianProcessRegressionMarginalLikelihood(double[] targets, 
double[] X) {
        
                this.targets = targets;
                this.X = X;
                
                
                this.cov = new double[targets.length][targets.length];
                
        }

        private void initCovarianceAndGradients(double[] parameter)
        {
                double mean = StatUtils.mean(this.targets);             
                double[] y_array = new double[this.targets.length];
                
                paramCov = new double[parameter.length][][];
                
                for(int i = 0; i < parameter.length; i++)
                {
                        paramCov[i] = new 
double[targets.length][targets.length];
                }
                
                for(int i = 0; i < this.targets.length;i++)
                {
                        y_array[i] = (targets[i]- mean);
                        //y_array[i] = targets[i]- mean;
                }
                
                for(int i = 0; i < this.X.length;i++)
                {
                        
                        for(int j = 0; j < this.X.length;j++)
                        {
                                
                                double covar = 
(Double)squaredExponential.cov(X[i], 
X[j],parameter[0],parameter[1],parameter[2]);
                                this.cov[i][j] = covar;
                                
                                double dSigmaF = 
(Double)squaredExponential.dSigmaF(X[i], 
X[j],parameter[0],parameter[1],parameter[2]);
                                this.paramCov[0][i][j] = dSigmaF;
                                
                                double dSigmaN = 
(Double)squaredExponential.dSigmaN(X[i], 
X[j],parameter[0],parameter[1],parameter[2]);
                                this.paramCov[1][i][j] = dSigmaN;
                                
                                double dL = (Double)squaredExponential.dL(X[i], 
X[j],parameter[0],parameter[1],parameter[2]);
                                this.paramCov[2][i][j] = dL;
                                
                        }
                }
                
                y = MatrixUtils.createColumnRealMatrix(y_array);
                
                K =  MatrixUtils.createRealMatrix(cov);
                
                //identity matrix for I
                RealMatrix k_eye = 
MatrixUtils.createRealIdentityMatrix(cov.length);
                                
                //k_eye = k_eye.scalarMultiply(Math.pow(Float.MIN_VALUE * 1000, 
2));
                k_eye = k_eye.scalarMultiply(Math.pow(parameter[2], 2));
                
                //RealMatrix choleskyInput = 
K.add(k_eye.scalarMultiply(Math.pow(parameter[2], 2)));
                RealMatrix choleskyInput = K.add(k_eye);
                
                CholeskyDecomposition L = null;
                                
                try {
                                L = new 
CholeskyDecompositionImpl(choleskyInput);
                        } catch (NonSquareMatrixException e) {
                                        e.printStackTrace();
                        } catch (NotSymmetricMatrixException e) {
                                        e.printStackTrace();
                                } catch (NotPositiveDefiniteMatrixException e) {
                                        e.printStackTrace();
                                }
                                
                DecompositionSolver solverLTransponse = new 
LUDecompositionImpl(L.getLT()).getSolver();
                DecompositionSolver solverL = new 
LUDecompositionImpl(L.getL()).getSolver();
                
                //inverse of Ltranspose for left devision
                RealMatrix L_transpose_1 = solverLTransponse.getInverse();
                //inverse of Ltranspose for left devision
                RealMatrix L_1 = solverL.getInverse();
                
                //L\y
                RealMatrix L_y = L_1.multiply(y);
                
                //alpha = L'\(L\y)
                alpha = L_transpose_1.multiply(L_y);
                
                double L_diag = 0.0;
                
                for(int i = 0; i < L.getL().getColumnDimension();i++)
                {
                        L_diag += Math.log(L.getL().getEntry(i, i)); 
                }
                
                
                DecompositionSolver solverK = new 
LUDecompositionImpl(K).getSolver();
                this.invK = solverK.getInverse();
                
                alpha2 = invK.multiply(y);
        }
        
        /* log p(y|X,theta) = -(1/2) * y^T*Ky^(-1) * y - 1/2 * log * |Ky| - 
(n/2) * log(2*pi)
         * 
         */
        @Override
        public double value(double[] parameter) throws 
FunctionEvaluationException,
                        IllegalArgumentException {
                
                this.initCovarianceAndGradients(parameter);
                
                double logpyX = - y.transpose().multiply(alpha).getData()[0][0] 
* 0.5
                                                - L_diag
                                                - X.length * Math.log(2 * 
Math.PI) * 0.5;
                
                System.out.println("logPYX: " +logpyX + "param: " + parameter);
                
                return logpyX;
        }

        @Override
        public MultivariateVectorialFunction gradient() {
                return new MultivariateVectorialFunction() {
            public double[] value( double[] parameter) {
                
                initCovarianceAndGradients(parameter);
                
                RealMatrix innerMatrix = 
alpha2.multiply(alpha2.transpose()).subtract(invK);
                
                double[] result = new double[paramCov.length];
                
                for(int i = 0; i < paramCov.length; i++)
                {
                        result[i] = parameter[i] * 
innerMatrix.multiply(MatrixUtils.createRealMatrix(paramCov[i])).getTrace() * 
0.5;
                }
                
                return result;
            }
        };
        }

        @Override
        public MultivariateRealFunction partialDerivative(final int k) {
                return new MultivariateRealFunction() {
            public double value(double[] parameter) {
                initCovarianceAndGradients(parameter);
                
                RealMatrix innerMatrix = 
alpha2.multiply(alpha2.transpose()).subtract(invK);
                
                double[] result = new double[paramCov.length];
                
                for(int i = 0; i < paramCov.length; i++)
                {
                        result[i] = parameter[i] * 
innerMatrix.multiply(MatrixUtils.createRealMatrix(paramCov[i])).getTrace() * 
0.5;
                }
                
                return result[k];
            }
        };
        }

}

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [math] Usage of DifferentiableMultivariateRealFunction

Reply via email to