Somewhat equivalently, how could I take each of the gradient updates instead of scan just summing all gradient updates automatically?
On Friday, October 7, 2016 at 4:16:46 PM UTC-4, John Moore wrote: > > Hi All, > > My understanding of BPTT is to unfold the network, take the gradients > through time, then average the weight updates. > How do I obtain the weight updates at each timestep? I know that scan > automatically performs BPTT for you, so that it gives you only one weight > update. > > Any insight appreciated. > > Thanks, > John > -- --- You received this message because you are subscribed to the Google Groups "theano-users" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/d/optout.
