Somewhat equivalently, how could I take each of the gradient updates 
instead of scan just summing all gradient updates automatically?

On Friday, October 7, 2016 at 4:16:46 PM UTC-4, John Moore wrote:
>
> Hi All, 
>
> My understanding of BPTT is to unfold the network, take the gradients 
> through time, then average the weight updates.
> How do I obtain the weight updates at each timestep? I know that scan 
> automatically performs BPTT for you, so that it gives you only one weight 
> update. 
>
> Any insight appreciated.
>
> Thanks,
> John
>

-- 

--- 
You received this message because you are subscribed to the Google Groups 
"theano-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

Reply via email to