[GitHub] spark pull request: [MLLIB] [spark-2352] Implementation of an 1-hi...

witgo Wed, 19 Nov 2014 09:05:05 -0800

Github user witgo commented on a diff in the pull request:

    https://github.com/apache/spark/pull/1290#discussion_r20589805
  
    --- Diff: 
mllib/src/main/scala/org/apache/spark/mllib/ann/ArtificialNeuralNetwork.scala 
---
    @@ -0,0 +1,528 @@
    +/*
    + * Licensed to the Apache Software Foundation (ASF) under one or more
    + * contributor license agreements.  See the NOTICE file distributed with
    + * this work for additional information regarding copyright ownership.
    + * The ASF licenses this file to You under the Apache License, Version 2.0
    + * (the "License"); you may not use this file except in compliance with
    + * the License.  You may obtain a copy of the License at
    + *
    + *    http://www.apache.org/licenses/LICENSE-2.0
    + *
    + * Unless required by applicable law or agreed to in writing, software
    + * distributed under the License is distributed on an "AS IS" BASIS,
    + * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
    + * See the License for the specific language governing permissions and
    + * limitations under the License.
    + */
    +
    +package org.apache.spark.mllib.ann
    +
    +import breeze.linalg.{DenseVector, Vector => BV, axpy => brzAxpy}
    +
    +import org.apache.spark.mllib.linalg.{Vector, Vectors}
    +import org.apache.spark.mllib.optimization._
    +import org.apache.spark.rdd.RDD
    +import org.apache.spark.util.random.XORShiftRandom
    +
    +/*
    + * Implements a Artificial Neural Network (ANN)
    + *
    + * The data consists of an input vector and an output vector, combined 
into a single vector
    + * as follows:
    + *
    + * [ ---input--- ---output--- ]
    + *
    + * NOTE: output values should be in the range [0,1]
    + *
    + * For a network of H hidden layers:
    + *
    + * hiddenLayersTopology(h) indicates the number of nodes in hidden layer 
h, excluding the bias
    + * node. h counts from 0 (first hidden layer, taking inputs from input 
layer) to H - 1 (last
    + * hidden layer, sending outputs to the output layer).
    + *
    + * hiddenLayersTopology is converted internally to topology, which adds 
the number of nodes
    + * in the input and output layers.
    + *
    + * noInput = topology(0), the number of input nodes
    + * noOutput = topology(L-1), the number of output nodes
    + *
    + * input = data( 0 to noInput-1 )
    + * output = data( noInput to noInput + noOutput - 1 )
    + *
    + * W_ijl is the weight from node i in layer l-1 to node j in layer l
    + * W_ijl goes to position ofsWeight(l) + j*(topology(l-1)+1) + i in the 
weights vector
    + *
    + * B_jl is the bias input of node j in layer l
    + * B_jl goes to position ofsWeight(l) + j*(topology(l-1)+1) + 
topology(l-1) in the weights vector
    + *
    + * error function: E( O, Y ) = sum( O_j - Y_j )
    + * (with O = (O_0, ..., O_(noOutput-1)) the output of the ANN,
    + * and (Y_0, ..., Y_(noOutput-1)) the input)
    + *
    + * node_jl is node j in layer l
    + * node_jl goes to position ofsNode(l) + j
    + *
    + * The weights gradient is defined as dE/dW_ijl and dE/dB_jl
    + * It has same mapping as W_ijl and B_jl
    + *
    + * For back propagation:
    + * delta_jl = dE/dS_jl, where S_jl the output of node_jl, but before 
applying the sigmoid
    + * delta_jl has the same mapping as node_jl
    + *
    + * Where E = ((estOutput-output),(estOutput-output)),
    + * the inner product of the difference between estimation and target 
output with itself.
    + *
    + */
    +
    +/**
    + * Artificial neural network (ANN) model
    + *
    + * @param weights the weights between the neurons in the ANN.
    + * @param topology array containing the number of nodes per layer in the 
network, including
    + * the nodes in the input and output layer, but excluding the bias nodes.
    + */
    +class ArtificialNeuralNetworkModel private[mllib](val weights: Vector, val 
topology: Array[Int])
    +  extends Serializable with ANNHelper {
    +
    +  /**
    +   * Predicts values for a single data point using the trained model.
    +   *
    +   * @param testData represents a single data point.
    +   * @return prediction using the trained model.
    +   */
    +  def predict(testData: Vector): Vector = {
    +    Vectors.dense(computeValues(testData.toArray, weights.toArray))
    +  }
    +
    +  /**
    +   * Predict values for an RDD of data points using the trained model.
    +   *
    +   * @param testDataRDD RDD representing the input vectors.
    +   * @return RDD with predictions using the trained model as (input, 
output) pairs.
    +   */
    +  def predict(testDataRDD: RDD[Vector]): RDD[(Vector,Vector)] = {
    +    testDataRDD.map(T => (T, predict(T)) )
    +  }
    +
    +  private def computeValues(arrData: Array[Double], arrWeights: 
Array[Double]): Array[Double] = {
    +    val arrNodes = forwardRun(arrData, arrWeights)
    +    arrNodes.slice(arrNodes.size - topology(L), arrNodes.size)
    +  }
    +}
    +
    +/**
    + * Performs the training of an Artificial Neural Network (ANN)
    + *
    + * @param topology A vector containing the number of nodes per layer in 
the network, including
    + * the nodes in the input and output layer, but excluding the bias nodes.
    + * @param maxNumIterations The maximum number of iterations for the 
training phase.
    + * @param convergenceTol Convergence tolerance for LBFGS. Smaller value 
for closer convergence.
    + */
    +class ArtificialNeuralNetwork private[mllib](
    +    topology: Array[Int],
    +    maxNumIterations: Int,
    +    convergenceTol: Double)
    +  extends Serializable {
    +
    +  private val gradient = new ANNLeastSquaresGradient(topology)
    +  private val updater = new ANNUpdater()
    +  private val optimizer = new LBFGS(gradient, updater).
    --- End diff --
    
    We should use `GradientDescent`?



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark pull request: [MLLIB] [spark-2352] Implementation of an 1-hi...

Reply via email to