Hi,
It might be a very simple question for most of you but I am trying to 
understand how exactly theano calculate its gradient.
Lets consider the example posted on Theano's documentation:

x = T.dmatrix('x')
s = T.sum(1/(1+ T.exp(-x)))
gs = T.grad(s,x)
dlogistic = function([x], gs)

dlogistic([[0, 1], [-1, -2]])

array([[ 0.25      ,  0.19661193],

       [ 0.19661193,  0.10499359]])


If I won't be having T.sum, then the general gradient of "s" w.r.t "x" will be 
(1+ T.exp(-x)) and later I can put the value of x and will be done.

But this T.sum() operator is confusing me.

It would be great if you can help me out here.


-- 

--- 
You received this message because you are subscribed to the Google Groups 
"theano-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

Reply via email to