I have understood it, and it works for me. I have modified the code so that the learning rate is updated after training each epoch and after validating.
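Roughly, the modified training loop looks like this (a minimal sketch; n_epochs, n_train_batches, n_valid_batches and the 0.95 decay factor are illustrative, and l_r is the shared variable from the code below):

    for epoch in range(n_epochs):
        for minibatch_index in range(n_train_batches):
            cost_ij = train_model(minibatch_index)
        validation_losses = [validate_model(i) for i in range(n_valid_batches)]
        # decay the shared learning rate once per epoch, after validating,
        # instead of appending (l_r, 0.95 * l_r) to the Theano updates
        l_r.set_value(numpy.cast[theano.config.floatX](0.95 * l_r.get_value()))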
The cost came out as nan because the learning-rate value was too high, though I am not entirely sure why.

On Tuesday, October 3, 2017 at 11:36:00 PM UTC+2, Michael Klachko wrote:
>
> The learning rate schedule is defined in this line: updates.append((l_r, 0.95*l_r)), and is used in this line: train_model = theano.function([index], cost, updates=updates ...
> If you don't understand what's going on, read about Theano functions.
>
> On Thursday, September 7, 2017 at 6:58:34 AM UTC-7, Beatriz G. wrote:
>>
>> Hi.
>>
>> I have tried to apply your code, but it does not work for me: I get "nan" as the training cost.
>>
>> l_r = theano.shared(np.array(learning_rate, dtype=theano.config.floatX))
>> ...
>> cost = layer8.negative_log_likelihood(y)
>> ...
>>
>> validate_model = theano.function(
>>     [index],
>>     layer8.errors(y),
>>     givens={
>>         x: valid_set_x[index * batch_size: (index + 1) * batch_size],
>>         y: valid_set_y[index * batch_size: (index + 1) * batch_size],
>>         is_train: numpy.cast['int32'](0)
>>     }
>> )
>>
>> # create a list of all model parameters to be fit by gradient descent
>> params = (layer0.params + layer1.params + layer2.params + layer3.params +
>>           layer4.params + layer5.params + layer6.params + layer7.params +
>>           layer8.params)
>>
>> # create a list of gradients for all model parameters
>> grads = T.grad(cost, params)
>>
>> # learning rate update
>> updates = [
>>     (param_i, param_i - l_r * grad_i)
>>     for param_i, grad_i in zip(params, grads)
>> ]
>> updates.append((l_r, 0.95 * l_r))
>>
>> train_model = theano.function([index], cost, updates=updates,
>>     givens={
>>         x: train_set_x[index * batch_size: (index + 1) * batch_size],
>>         y: train_set_y[index * batch_size: (index + 1) * batch_size],
>>         is_train: np.cast['int32'](1)})
>>
>> In the while loop:
>>
>> cost_ij = train_model(minibatch_index)
>>
>> When it is time to validate:
>>
>> validation_losses = [validate_model(i) for i in range(n_valid_batches)]
>>
>> The learning rate updates its value, but the validation error stays at 90% and the training cost is "nan".
>>
>> Thank you in advance.
>>
>> Regards.
>>
>> On Monday, October 6, 2014 at 6:40:20 PM UTC+2, Pascal Lamblin wrote:
>>>
>>> On Mon, Oct 06, 2014, Frédéric Bastien wrote:
>>> > Exacte.
>>>
>>> Also, you can make l_r a shared variable, and update it via a Theano
>>> expression, if it is convenient to do so. For instance:
>>>
>>> l_r = theano.shared(np.array(0.1, dtype=theano.config.floatX))
>>> ...
>>> updates.append((l_r, 0.9 * l_r))  # Or however you want to change your learning rate
>>>
>>> train_model = theano.function([index], cost, updates=updates, givens=...)
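One caveat about this updates-based schedule (an aside with an illustrative calculation; the 500-minibatch epoch size is an assumption): the appended update runs on every call to train_model, i.e. once per minibatch, not once per epoch, so the rate shrinks very quickly:

    print(0.95 ** 500)   # ~7.3e-12: the learning-rate multiplier after one epoch of 500 minibatches

Decaying per epoch instead (for example with l_r.set_value, as in the loop at the top of this thread) keeps the schedule on the intended time scale.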
>>> > Fred
>>> >
>>> > On Mon, Oct 6, 2014 at 11:14 AM, Ofir Levy <micr...@gmail.com> wrote:
>>> > > ok I think I got it
>>> > >
>>> > > learning_rate = 0.1
>>> > >
>>> > > l_r = T.scalar('l_r', dtype=theano.config.floatX)
>>> > >
>>> > > updates = []
>>> > > for param_i, grad_i in zip(params, grads):
>>> > >     updates.append((param_i, param_i - l_r * grad_i))
>>> > >
>>> > > train_model = theano.function([index, l_r], cost, updates=updates,
>>> > >     givens={
>>> > >         x: train_set_x[index * batch_size: (index + 1) * batch_size],
>>> > >         y: train_set_y[index * batch_size: (index + 1) * batch_size]})
>>> > >
>>> > > and in the training loop:
>>> > >
>>> > > cost_ij = train_model(minibatch_index, learning_rate)
>>> > >
>>> > > On Monday, October 6, 2014 at 5:38:33 PM UTC+3, Ofir Levy wrote:
>>> > >> for the CNN example we currently have the following code:
>>> > >>
>>> > >> learning_rate = 0.1
>>> > >>
>>> > >> updates = []
>>> > >> for param_i, grad_i in zip(params, grads):
>>> > >>     updates.append((param_i, param_i - learning_rate * grad_i))
>>> > >>
>>> > >> train_model = theano.function([index], cost, updates=updates,
>>> > >>     givens={
>>> > >>         x: train_set_x[index * batch_size: (index + 1) * batch_size],
>>> > >>         y: train_set_y[index * batch_size: (index + 1) * batch_size]})
>>> > >>
>>> > >> and in the training loop:
>>> > >>
>>> > >> cost_ij = train_model(minibatch_index)
>>> > >>
>>> > >> Can you kindly tell me how to change it to have an adaptive learning rate?
>>> > >>
>>> > >> On Thursday, July 17, 2014 at 9:48:24 PM UTC+3, Frédéric Bastien wrote:
>>> > >>> Make a Theano variable that is the learning rate and pass it as an input to your Theano function.
>>> > >>>
>>> > >>> You could also use a shared variable if you don't want to pass it explicitly each time, but only change it from time to time:
>>> > >>>
>>> > >>> http://deeplearning.net/software/theano/tutorial/examples.html#using-shared-variables
>>> > >>>
>>> > >>> Fred
>>> > >>>
>>> > >>> On Thu, Jul 17, 2014 at 2:42 PM, <sun...@gmail.com> wrote:
>>> > >>>> Hi,
>>> > >>>>
>>> > >>>> I would like to change the learning rate during the learning procedure: a large learning rate in the initial stage and a small one later on. How can I do this?
>>> > >>>>
>>> > >>>> Thanks
>>> > >>>> Jiancheng
>>>
>>> --
>>> Pascal
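Since Ofir's variant makes l_r an ordinary input to train_model, any schedule can be computed in plain Python and passed in on each call. A minimal sketch of the large-then-small schedule Jiancheng asked about (the decay formula and constants are illustrative):

    base_lr = 0.1

    for epoch in range(n_epochs):
        # step the rate down as training progresses: 0.1, 0.091, 0.083, ...
        learning_rate = numpy.cast[theano.config.floatX](base_lr / (1.0 + 0.1 * epoch))
        for minibatch_index in range(n_train_batches):
            cost_ij = train_model(minibatch_index, learning_rate)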