Hi, Tassilo,
Have you solved this problem?
I encounter the same problem at present without any idea for it.
I just run some separate programs in the same time then the function 
compiling outputs the similar problem.
Thanks a lot for any idea.

在 2015年3月9日星期一 UTC-4下午1:51:30,Tassilo Klein写道:
>
> Hi Fred,
>
> I just wanted to follow up on this issue as I am sort of deadlocked with 
> this problem now. Do you see any solution?
> Btw, the non-thread safety issue of Theano has not been fixed yet, right? 
> Because it sounds somewhat related to that.
>
> -Tassilo
>
> On Mon, Mar 2, 2015 at 7:07 PM, Tassilo Klein <[email protected] 
> <javascript:>> wrote:
>
>> Hi Fred,
>>
>> yes, master and slave have the same directory. Sometimes it works better, 
>> sometimes worse - feels like a race condition. It is weird. Here are 
>> stack-traces for different error outputs.
>>
>> Cheers,
>> Tassilo
>>
>>
>> 15/03/02 18:55:57 WARN TaskSetManager: Lost task 64.0 in stage 7.19 (TID 
>> 17710, node004.cm.cluster): org.apache.spark.api.python.PythonException: 
>> Traceback (most recent call last): 
>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/worker.py", line 
>> 101, in main 
>>     process() 
>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/worker.py", line 
>> 96, in process 
>>     serializer.dump_stream(func(split_index, iterator), outfile) 
>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/serializers.py", 
>> line 236, in dump_stream 
>>     vs = list(itertools.islice(iterator, batch)) 
>>   File "/home/tjklein/cnn-3d/spark_LORM_refactored.py", line 1046, in 
>> distributed_gradient_computation 
>>     return broadcast_ADMM_gradient_function.value(*param_list) 
>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/broadcast.py", 
>> line 106, in value 
>>     self._value = self.load(self._path) 
>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/broadcast.py", 
>> line 95, in load 
>>     return cPickle.loads(data) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/compile/function_module.py",
>>  line 747, in _constructor_Function 
>>     f = maker.create(input_storage, trustme = True) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/compile/function_module.py",
>>  line 1415, in create 
>>     input_storage=input_storage_lists) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/link.py", 
>> line 525, in make_thunk 
>>     output_storage=output_storage)[:3] 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/vm.py", line 
>> 897, in make_all 
>>     no_recycling)) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/op.py", line 
>> 1002, in make_thunk 
>>     compute_map, no_recycling) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/op.py", line 
>> 739, in make_thunk 
>>     output_storage=node_output_storage) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", line 
>> 1072, in make_thunk 
>>     keep_lock=keep_lock) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", line 
>> 1014, in __compile__ 
>>     keep_lock=keep_lock) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", line 
>> 1441, in cthunk_factory 
>>     key=key, lnk=self, keep_lock=keep_lock) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py", 
>> line 1076, in module_from_key 
>>     module = lnk.compile_cmodule(location) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", line 
>> 1353, in compile_cmodule 
>>     preargs=preargs) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py", 
>> line 1889, in compile_str 
>>     raise MissingGXX("g++ not available! We can't compile c code.") 
>> MissingGXX: (MissingGXX('The following error happened while compiling the 
>> node', Elemwise{neg,no_inplace}(y), '\n', "g++ not available! We can't 
>> compile c code.", '[Elemwise{neg,no_inplace}(y)]'), <function 
>> _constructor_Function at 0x2aaacc32d230>, 
>> (<theano.compile.function_module.FunctionMaker object at 0x2aab2a53ff10>, 
>> [<None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <array([[ 
>> 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>], [None, None, None, None, None, None, 
>> None, None, None, None, None, None, None, None, None, None, None, None, 
>> None, None, None, None, None, None, None, None, None, None, None, None, 
>> None, None, None, None, None, None, None, None, None, None, None, None, 
>> None, None, None, None, None, None, None, None, None, None, None, None, 
>> None, None, None, None, None, None, None, None, array([[ 0.,  0.,  0.,  0.,  
>> 0.],
>>
>>
>> aceback (most recent call last): 
>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/worker.py", line 
>> 101, in main 
>>     process() 
>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/worker.py", line 
>> 96, in process 
>>     serializer.dump_stream(func(split_index, iterator), outfile) 
>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/serializers.py", 
>> line 236, in dump_stream 
>>     vs = list(itertools.islice(iterator, batch)) 
>>   File "/home/tjklein/cnn-3d/spark_LORM_refactored.py", line 1046, in 
>> distributed_gradient_computation 
>>     return broadcast_ADMM_gradient_function.value(*param_list) 
>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/broadcast.py", 
>> line 106, in value 
>>     self._value = self.load(self._path) 
>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/broadcast.py", 
>> line 95, in load 
>>     return cPickle.loads(data) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/compile/function_module.py",
>>  line 747, in _constructor_Function 
>>     f = maker.create(input_storage, trustme = True) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/compile/function_module.py",
>>  line 1415, in create 
>>     input_storage=input_storage_lists) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/link.py", 
>> line 525, in make_thunk 
>>     output_storage=output_storage)[:3] 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/vm.py", line 
>> 897, in make_all 
>>     no_recycling)) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/op.py", line 
>> 1002, in make_thunk 
>>     compute_map, no_recycling) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/op.py", line 
>> 739, in make_thunk 
>>     output_storage=node_output_storage) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", line 
>> 1072, in make_thunk 
>>     keep_lock=keep_lock) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", line 
>> 1014, in __compile__ 
>>     keep_lock=keep_lock) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", line 
>> 1441, in cthunk_factory 
>>     key=key, lnk=self, keep_lock=keep_lock) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py", 
>> line 1054, in module_from_key 
>>     module = self._get_from_hash(module_hash, key, keep_lock=keep_lock) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py", 
>> line 954, in _get_from_hash 
>>     module = self._get_from_key(None, key_data) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py", 
>> line 949, in _get_from_key 
>>     return self._get_module(name) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py", 
>> line 632, in _get_module 
>>     self.module_from_name[name] = dlimport(name) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py", 
>> line 293, in dlimport 
>>     rval = __import__(module_name, {}, {}, [module_name]) 
>> ImportError: (ImportError('The following error happened while compiling the 
>> node', Elemwise{add,no_inplace}(TensorConstant{(1,) of -1}, 
>> AdvancedSubtensor1.0), '\n', 'No module named 
>> tmpuawGWX.7ecbb18ed585719993b1efa9e2a60ff5'), <function 
>> _constructor_Function at 0x2aaacc3570c8>, 
>> (<theano.compile.function_module.FunctionMaker object at 0x2aab2a314110>, 
>> [<None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <array([[ 
>> 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  0.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  0.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  0.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  0.], 
>>        [ 1.,  1.,  1.,  1.,  1.]])>]
>>
>>
>> 15/03/02 18:56:42 WARN TaskSetManager: Lost task 56.1 in stage 7.19 (TID 
>> 17719, node004.cm.cluster): org.apache.spark.api.python.PythonException: 
>> Traceback (most recent call last): 
>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/worker.py", line 
>> 101, in main 
>>     process() 
>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/worker.py", line 
>> 96, in process 
>>     serializer.dump_stream(func(split_index, iterator), outfile) 
>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/serializers.py", 
>> line 236, in dump_stream 
>>     vs = list(itertools.islice(iterator, batch)) 
>>   File "/home/tjklein/cnn-3d/spark_LORM_refactored.py", line 1046, in 
>> distributed_gradient_computation 
>>     return broadcast_ADMM_gradient_function.value(*param_list) 
>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/broadcast.py", 
>> line 106, in value 
>>     self._value = self.load(self._path) 
>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/broadcast.py", 
>> line 95, in load 
>>     return cPickle.loads(data) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/compile/function_module.py",
>>  line 747, in _constructor_Function 
>>     f = maker.create(input_storage, trustme = True) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/compile/function_module.py",
>>  line 1415, in create 
>>     input_storage=input_storage_lists) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/link.py", 
>> line 525, in make_thunk 
>>     output_storage=output_storage)[:3] 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/vm.py", line 
>> 897, in make_all 
>>     no_recycling)) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/op.py", line 
>> 1002, in make_thunk 
>>     compute_map, no_recycling) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/op.py", line 
>> 739, in make_thunk 
>>     output_storage=node_output_storage) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", line 
>> 1072, in make_thunk 
>>     keep_lock=keep_lock) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", line 
>> 1014, in __compile__ 
>>     keep_lock=keep_lock) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", line 
>> 1441, in cthunk_factory 
>>     key=key, lnk=self, keep_lock=keep_lock) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py", 
>> line 1045, in module_from_key 
>>     module = self._get_from_key(key) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py", 
>> line 949, in _get_from_key 
>>     return self._get_module(name) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py", 
>> line 632, in _get_module 
>>     self.module_from_name[name] = dlimport(name) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py", 
>> line 293, in dlimport 
>>     rval = __import__(module_name, {}, {}, [module_name]) 
>> ImportError: (ImportError('The following error happened while compiling the 
>> node', Elemwise{add,no_inplace}(TensorConstant{(1,) of -1}, 
>> AdvancedSubtensor1.0), '\n', 'No module named 
>> tmpuawGWX.7ecbb18ed585719993b1efa9e2a60ff5'), <
>>
>>
>> 15/03/02 18:57:02 INFO TaskSetManager: Lost task 51.1 in stage 7.19 (TID 
>> 17724) on executor node004.cm.cluster: 
>> org.apache.spark.api.python.PythonException (Traceback (most recent call 
>> last): 
>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/worker.py", line 
>> 101, in main 
>>     process() 
>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/worker.py", line 
>> 96, in process 
>>     serializer.dump_stream(func(split_index, iterator), outfile) 
>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/serializers.py", 
>> line 236, in dump_stream 
>>     vs = list(itertools.islice(iterator, batch)) 
>>   File "/home/tjklein/cnn-3d/spark_LORM_refactored.py", line 1046, in 
>> distributed_gradient_computation 
>>     return broadcast_ADMM_gradient_function.value(*param_list) 
>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/broadcast.py", 
>> line 106, in value 
>>     self._value = self.load(self._path) 
>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/broadcast.py", 
>> line 95, in load 
>>     return cPickle.loads(data) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/compile/function_module.py",
>>  line 747, in _constructor_Function 
>>     f = maker.create(input_storage, trustme = True) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/compile/function_module.py",
>>  line 1415, in create 
>>     input_storage=input_storage_lists) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/link.py", 
>> line 525, in make_thunk 
>>     output_storage=output_storage)[:3] 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/vm.py", line 
>> 897, in make_all 
>>     no_recycling)) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/op.py", line 
>> 739, in make_thunk 
>>     output_storage=node_output_storage) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", line 
>> 1072, in make_thunk 
>>     keep_lock=keep_lock) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", line 
>> 1014, in __compile__ 
>>     keep_lock=keep_lock) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", line 
>> 1441, in cthunk_factory 
>>     key=key, lnk=self, keep_lock=keep_lock) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py", 
>> line 1054, in module_from_key 
>>     module = self._get_from_hash(module_hash, key, keep_lock=keep_lock) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py", 
>> line 957, in _get_from_hash 
>>     key_data.add_key(key, save_pkl=bool(key[0])) 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py", 
>> line 467, in add_key 
>>     self.save_pkl() 
>>   File 
>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py", 
>> line 484, in save_pkl 
>>     with open(self.key_pkl, 'wb') as f: 
>> IOError: [Errno [Errno 2] No such file or directory: 
>> '/scratch/users/tjklein/215178/theano/compiledir_Linux-2.6-el6.x86_64-x86_64-with-redhat-6.6-Santiago-x86_64-2.7.9-64/tmpQrM4nE/key.pkl']
>>
>> ​
>>
>> On Mon, Mar 2, 2015 at 11:21 AM, Frédéric Bastien <[email protected] 
>> <javascript:>> wrote:
>>
>>> Hi,
>>>
>>> Can you give the full stack trace of the error?
>>>
>>> Do you use the same compiledir for the master and all process when it is 
>>> on the same compute node?
>>>
>>> Fred
>>>
>>> On Sun, Mar 1, 2015 at 5:17 PM, Tassilo Klein <[email protected] 
>>> <javascript:>> wrote:
>>>
>>>> Hi Fred,
>>>>
>>>> I got the latest version of Theano from the repository. I ran it on a 
>>>> single machine and also tried it on a distributed cluster. But still have 
>>>> the same issues as before, in both scenarios.
>>>>
>>>> I get many:
>>>>
>>>> INFO (theano.gof.compilelock): To manually release the lock, delete 
>>>> /scratch/users/tjklein/STANDALONE/theano/compiledir_Linux-2.6-el6.x86_64-x86_64-with-redhat-6.6-Santiago-x86_64-2.7.9-64/lock_dir
>>>>
>>>> INFO (theano.gof.compilelock): Waiting for existing lock by process 
>>>> '252635' (I am process '249796')
>>>>
>>>> INFO (theano.gof.compilelock): To manually release the lock, delete 
>>>> /scratch/users/tjklein/STANDALONE/theano/compiledir_Linux-2.6-el6.x86_64-x86_64-with-redhat-6.6-Santiago-x86_64-2.7.9-64/lock_dir
>>>>
>>>> INFO (theano.gof.compilelock): Waiting for existing lock by process 
>>>> '252635' (I am process '238059')
>>>>
>>>> INFO (theano.gof.compilelock): To manually release the lock, delete 
>>>> /scratch/users/tjklein/STANDALONE/theano/compiledir_Linux-2.6-el6.x86_64-x86_64-with-redhat-6.6-Santiago-x86_64-2.7.9-64/lock_dir
>>>>
>>>> INFO (theano.gof.compilelock): Waiting for existing lock by process 
>>>> '252635' (I am process '237960')
>>>>
>>>> INFO (theano.gof.compilelock): To manually release the lock, delete 
>>>> /scratch/users/tjklein/STANDALONE/theano/compiledir_Linux-2.6-el6.x86_64-x86_64-with-redhat-6.6-Santiago-x86_64-2.7.9-64/lock_dir
>>>>
>>>> INFO (theano.gof.compilelock): Waiting for existing lock by process 
>>>> '252635' (I am process '238179')
>>>>
>>>> INFO (theano.gof.compilelock): To manually release the lock, delete 
>>>> /scratch/users/tjklein/STANDALONE/theano/compiledir_Linux-2.6-el6.x86_64-x86_64-with-redhat-6.6-Santiago-x86_64-2.7.9-64/lock_dir
>>>>
>>>> INFO (theano.gof.compilelock): Waiting for existing lock by process 
>>>> '252635' (I am process '247811')
>>>>
>>>> INFO (theano.gof.compilelock): To manually release the lock, delete 
>>>> /scratch/users/tjklein/STANDALONE/theano/compiledir_Linux-2.6-el6.x86_64-x86_64-with-redhat-6.6-Santiago-x86_64-2.7.9-64/lock_dir
>>>>
>>>> INFO (theano.gof.compilelock): Waiting for existing lock by process 
>>>> '252635' (I am process '237684')
>>>>
>>>> INFO (theano.gof.compilelock): To manually release the lock, delete 
>>>> /scratch/users/tjklein/STANDALONE/theano/compiledir_Linux-2.6-el6.x86_64-x86_64-with-redhat-6.6-Santiago-x86_64-2.7.9-64/lock_dir
>>>>
>>>> INFO (theano.gof.compilelock): Waiting for existing lock by process 
>>>> '252635' (I am process '251284')
>>>>
>>>> and
>>>>
>>>> IOError: [Errno [Errno 2] No such file or directory: 
>>>> '/scratch/users/STANDALONE/theano/compiledir_Linux-2.6-el6.x86_64-x86_64-with-redhat-6.6-Santiago-x86_64-2.7.9-64/tmptlW68q/key.pkl']
>>>>  
>>>> <function _constructor_Function at 0x2aaacc1b7b90>: 
>>>> (<theano.compile.function_module.FunctionMaker object at 0x2aab24139b50>, 
>>>>
>>>>
>>>>
>>>> Let me know if you want me to try out something else.
>>>>
>>>> Best, 
>>>>
>>>>  Tassilo
>>>>
>>>> On Wed, Feb 25, 2015 at 5:39 PM, Frédéric Bastien <
>>>> [email protected] <javascript:>> wrote:
>>>>
>>>>> This could cause duplicate compilation and warning related to that. 
>>>>> Can you try with up to date Theano without your work around?
>>>>> Le 22 févr. 2015 09:16, "György Solymosi" <[email protected] 
>>>>> <javascript:>> a écrit :
>>>>>
>>>>>> Hi Fred,
>>>>>>
>>>>>> Sorry to jump in, but I had some experiences with the same problem, 
>>>>>> so I simply switched off the lock and I had no issues -perhaps 
>>>>>> seemingly- 
>>>>>> with multiprocessing some -even 4 another- Theano tasks. Is it possible 
>>>>>> or 
>>>>>> it was just an illusion and it generated some issues I didn't see in 
>>>>>> background?
>>>>>>
>>>>>> George
>>>>>>
>>>>>> -- 
>>>>>>
>>>>>> --- 
>>>>>> You received this message because you are subscribed to the Google 
>>>>>> Groups "theano-users" group.
>>>>>> To unsubscribe from this group and stop receiving emails from it, 
>>>>>> send an email to [email protected] <javascript:>.
>>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>>
>>>>> -- 
>>>>>
>>>>> --- 
>>>>> You received this message because you are subscribed to a topic in the 
>>>>> Google Groups "theano-users" group.
>>>>> To unsubscribe from this topic, visit 
>>>>> https://groups.google.com/d/topic/theano-users/Pi4zQpfn5Ts/unsubscribe
>>>>> .
>>>>> To unsubscribe from this group and all its topics, send an email to 
>>>>> [email protected] <javascript:>.
>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>
>>>>
>>>> -- 
>>>>
>>>> --- 
>>>> You received this message because you are subscribed to the Google 
>>>> Groups "theano-users" group.
>>>> To unsubscribe from this group and stop receiving emails from it, send 
>>>> an email to [email protected] <javascript:>.
>>>> For more options, visit https://groups.google.com/d/optout.
>>>>
>>>
>>> -- 
>>>
>>> --- 
>>> You received this message because you are subscribed to a topic in the 
>>> Google Groups "theano-users" group.
>>> To unsubscribe from this topic, visit 
>>> https://groups.google.com/d/topic/theano-users/Pi4zQpfn5Ts/unsubscribe.
>>> To unsubscribe from this group and all its topics, send an email to 
>>> [email protected] <javascript:>.
>>> For more options, visit https://groups.google.com/d/optout.
>>>
>>
>>
>

-- 

--- 
You received this message because you are subscribed to the Google Groups 
"theano-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

Reply via email to