Thank you for your reply, but I'm trying to understand why

    hid_init = T.ones((input.shape[0], self.num_units), dtype=theano.config.floatX) * self.hid_init.get_value()

always uses the initial value assigned to self.hid_init (which is a row of zeros). The computation graph seems not to actually evaluate the get_value() call at all. It's true that the size of the weight doesn't depend on the minibatch size, but I'm initializing the LSTM's hidden and cell states here.
