Thank you anyway.

在 2016年8月5日星期五 UTC-4下午6:24:07,Tassilo Klein写道:
>
> Unless something was dramatically changed in the architecture, I am afraid 
> there is no solution to this the problem. I had to change the framework to 
> get it working.
>
> On Fri, Aug 5, 2016 at 10:30 PM, <[email protected] <javascript:>> wrote:
>
>> Hi, Tassilo,
>> Have you solved this problem?
>> I encounter the same problem at present without any idea for it.
>> I just run some separate programs in the same time then the function 
>> compiling outputs the similar problem.
>> Thanks a lot for any idea.
>>
>> 在 2015年3月9日星期一 UTC-4下午1:51:30,Tassilo Klein写道:
>>>
>>> Hi Fred,
>>>
>>> I just wanted to follow up on this issue as I am sort of deadlocked with 
>>> this problem now. Do you see any solution?
>>> Btw, the non-thread safety issue of Theano has not been fixed yet, 
>>> right? Because it sounds somewhat related to that.
>>>
>>> -Tassilo
>>>
>>> On Mon, Mar 2, 2015 at 7:07 PM, Tassilo Klein <[email protected]> wrote:
>>>
>>>> Hi Fred,
>>>>
>>>> yes, master and slave have the same directory. Sometimes it works 
>>>> better, sometimes worse - feels like a race condition. It is weird. Here 
>>>> are stack-traces for different error outputs.
>>>>
>>>> Cheers,
>>>> Tassilo
>>>>
>>>>
>>>> 15/03/02 18:55:57 WARN TaskSetManager: Lost task 64.0 in stage 7.19 (TID 
>>>> 17710, node004.cm.cluster): org.apache.spark.api.python.PythonException: 
>>>> Traceback (most recent call last): 
>>>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/worker.py", 
>>>> line 101, in main 
>>>>     process() 
>>>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/worker.py", 
>>>> line 96, in process 
>>>>     serializer.dump_stream(func(split_index, iterator), outfile) 
>>>>   File 
>>>> "/scratch/users/tjklein/215178/spark/python/pyspark/serializers.py", line 
>>>> 236, in dump_stream 
>>>>     vs = list(itertools.islice(iterator, batch)) 
>>>>   File "/home/tjklein/cnn-3d/spark_LORM_refactored.py", line 1046, in 
>>>> distributed_gradient_computation 
>>>>     return broadcast_ADMM_gradient_function.value(*param_list) 
>>>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/broadcast.py", 
>>>> line 106, in value 
>>>>     self._value = self.load(self._path) 
>>>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/broadcast.py", 
>>>> line 95, in load 
>>>>     return cPickle.loads(data) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/compile/function_module.py",
>>>>  line 747, in _constructor_Function 
>>>>     f = maker.create(input_storage, trustme = True) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/compile/function_module.py",
>>>>  line 1415, in create 
>>>>     input_storage=input_storage_lists) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/link.py", 
>>>> line 525, in make_thunk 
>>>>     output_storage=output_storage)[:3] 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/vm.py", 
>>>> line 897, in make_all 
>>>>     no_recycling)) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/op.py", 
>>>> line 1002, in make_thunk 
>>>>     compute_map, no_recycling) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/op.py", 
>>>> line 739, in make_thunk 
>>>>     output_storage=node_output_storage) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", 
>>>> line 1072, in make_thunk 
>>>>     keep_lock=keep_lock) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", 
>>>> line 1014, in __compile__ 
>>>>     keep_lock=keep_lock) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", 
>>>> line 1441, in cthunk_factory 
>>>>     key=key, lnk=self, keep_lock=keep_lock) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py",
>>>>  line 1076, in module_from_key 
>>>>     module = lnk.compile_cmodule(location) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", 
>>>> line 1353, in compile_cmodule 
>>>>     preargs=preargs) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py",
>>>>  line 1889, in compile_str 
>>>>     raise MissingGXX("g++ not available! We can't compile c code.") 
>>>> MissingGXX: (MissingGXX('The following error happened while compiling the 
>>>> node', Elemwise{neg,no_inplace}(y), '\n', "g++ not available! We can't 
>>>> compile c code.", '[Elemwise{neg,no_inplace}(y)]'), <function 
>>>> _constructor_Function at 0x2aaacc32d230>, 
>>>> (<theano.compile.function_module.FunctionMaker object at 0x2aab2a53ff10>, 
>>>> [<None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>>>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>>>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>>>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>>>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>>>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>>>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <array([[ 
>>>> 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>], [None, None, None, None, None, None, 
>>>> None, None, None, None, None, None, None, None, None, None, None, None, 
>>>> None, None, None, None, None, None, None, None, None, None, None, None, 
>>>> None, None, None, None, None, None, None, None, None, None, None, None, 
>>>> None, None, None, None, None, None, None, None, None, None, None, None, 
>>>> None, None, None, None, None, None, None, None, array([[ 0.,  0.,  0.,  
>>>> 0.,  0.],
>>>>
>>>>
>>>> aceback (most recent call last): 
>>>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/worker.py", 
>>>> line 101, in main 
>>>>     process() 
>>>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/worker.py", 
>>>> line 96, in process 
>>>>     serializer.dump_stream(func(split_index, iterator), outfile) 
>>>>   File 
>>>> "/scratch/users/tjklein/215178/spark/python/pyspark/serializers.py", line 
>>>> 236, in dump_stream 
>>>>     vs = list(itertools.islice(iterator, batch)) 
>>>>   File "/home/tjklein/cnn-3d/spark_LORM_refactored.py", line 1046, in 
>>>> distributed_gradient_computation 
>>>>     return broadcast_ADMM_gradient_function.value(*param_list) 
>>>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/broadcast.py", 
>>>> line 106, in value 
>>>>     self._value = self.load(self._path) 
>>>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/broadcast.py", 
>>>> line 95, in load 
>>>>     return cPickle.loads(data) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/compile/function_module.py",
>>>>  line 747, in _constructor_Function 
>>>>     f = maker.create(input_storage, trustme = True) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/compile/function_module.py",
>>>>  line 1415, in create 
>>>>     input_storage=input_storage_lists) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/link.py", 
>>>> line 525, in make_thunk 
>>>>     output_storage=output_storage)[:3] 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/vm.py", 
>>>> line 897, in make_all 
>>>>     no_recycling)) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/op.py", 
>>>> line 1002, in make_thunk 
>>>>     compute_map, no_recycling) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/op.py", 
>>>> line 739, in make_thunk 
>>>>     output_storage=node_output_storage) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", 
>>>> line 1072, in make_thunk 
>>>>     keep_lock=keep_lock) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", 
>>>> line 1014, in __compile__ 
>>>>     keep_lock=keep_lock) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", 
>>>> line 1441, in cthunk_factory 
>>>>     key=key, lnk=self, keep_lock=keep_lock) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py",
>>>>  line 1054, in module_from_key 
>>>>     module = self._get_from_hash(module_hash, key, keep_lock=keep_lock) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py",
>>>>  line 954, in _get_from_hash 
>>>>     module = self._get_from_key(None, key_data) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py",
>>>>  line 949, in _get_from_key 
>>>>     return self._get_module(name) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py",
>>>>  line 632, in _get_module 
>>>>     self.module_from_name[name] = dlimport(name) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py",
>>>>  line 293, in dlimport 
>>>>     rval = __import__(module_name, {}, {}, [module_name]) 
>>>> ImportError: (ImportError('The following error happened while compiling 
>>>> the node', Elemwise{add,no_inplace}(TensorConstant{(1,) of -1}, 
>>>> AdvancedSubtensor1.0), '\n', 'No module named 
>>>> tmpuawGWX.7ecbb18ed585719993b1efa9e2a60ff5'), <function 
>>>> _constructor_Function at 0x2aaacc3570c8>, 
>>>> (<theano.compile.function_module.FunctionMaker object at 0x2aab2a314110>, 
>>>> [<None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>>>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>>>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>>>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>>>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>>>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, 
>>>> <None>, <None>, <None>, <None>, <None>, <None>, <None>, <None>, <array([[ 
>>>> 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>, <array([[ 0.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  0.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  0.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  0.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  0.], 
>>>>        [ 1.,  1.,  1.,  1.,  1.]])>]
>>>>
>>>>
>>>> 15/03/02 18:56:42 WARN TaskSetManager: Lost task 56.1 in stage 7.19 (TID 
>>>> 17719, node004.cm.cluster): org.apache.spark.api.python.PythonException: 
>>>> Traceback (most recent call last): 
>>>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/worker.py", 
>>>> line 101, in main 
>>>>     process() 
>>>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/worker.py", 
>>>> line 96, in process 
>>>>     serializer.dump_stream(func(split_index, iterator), outfile) 
>>>>   File 
>>>> "/scratch/users/tjklein/215178/spark/python/pyspark/serializers.py", line 
>>>> 236, in dump_stream 
>>>>     vs = list(itertools.islice(iterator, batch)) 
>>>>   File "/home/tjklein/cnn-3d/spark_LORM_refactored.py", line 1046, in 
>>>> distributed_gradient_computation 
>>>>     return broadcast_ADMM_gradient_function.value(*param_list) 
>>>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/broadcast.py", 
>>>> line 106, in value 
>>>>     self._value = self.load(self._path) 
>>>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/broadcast.py", 
>>>> line 95, in load 
>>>>     return cPickle.loads(data) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/compile/function_module.py",
>>>>  line 747, in _constructor_Function 
>>>>     f = maker.create(input_storage, trustme = True) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/compile/function_module.py",
>>>>  line 1415, in create 
>>>>     input_storage=input_storage_lists) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/link.py", 
>>>> line 525, in make_thunk 
>>>>     output_storage=output_storage)[:3] 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/vm.py", 
>>>> line 897, in make_all 
>>>>     no_recycling)) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/op.py", 
>>>> line 1002, in make_thunk 
>>>>     compute_map, no_recycling) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/op.py", 
>>>> line 739, in make_thunk 
>>>>     output_storage=node_output_storage) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", 
>>>> line 1072, in make_thunk 
>>>>     keep_lock=keep_lock) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", 
>>>> line 1014, in __compile__ 
>>>>     keep_lock=keep_lock) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", 
>>>> line 1441, in cthunk_factory 
>>>>     key=key, lnk=self, keep_lock=keep_lock) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py",
>>>>  line 1045, in module_from_key 
>>>>     module = self._get_from_key(key) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py",
>>>>  line 949, in _get_from_key 
>>>>     return self._get_module(name) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py",
>>>>  line 632, in _get_module 
>>>>     self.module_from_name[name] = dlimport(name) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py",
>>>>  line 293, in dlimport 
>>>>     rval = __import__(module_name, {}, {}, [module_name]) 
>>>> ImportError: (ImportError('The following error happened while compiling 
>>>> the node', Elemwise{add,no_inplace}(TensorConstant{(1,) of -1}, 
>>>> AdvancedSubtensor1.0), '\n', 'No module named 
>>>> tmpuawGWX.7ecbb18ed585719993b1efa9e2a60ff5'), <
>>>>
>>>>
>>>> 15/03/02 18:57:02 INFO TaskSetManager: Lost task 51.1 in stage 7.19 (TID 
>>>> 17724) on executor node004.cm.cluster: 
>>>> org.apache.spark.api.python.PythonException (Traceback (most recent call 
>>>> last): 
>>>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/worker.py", 
>>>> line 101, in main 
>>>>     process() 
>>>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/worker.py", 
>>>> line 96, in process 
>>>>     serializer.dump_stream(func(split_index, iterator), outfile) 
>>>>   File 
>>>> "/scratch/users/tjklein/215178/spark/python/pyspark/serializers.py", line 
>>>> 236, in dump_stream 
>>>>     vs = list(itertools.islice(iterator, batch)) 
>>>>   File "/home/tjklein/cnn-3d/spark_LORM_refactored.py", line 1046, in 
>>>> distributed_gradient_computation 
>>>>     return broadcast_ADMM_gradient_function.value(*param_list) 
>>>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/broadcast.py", 
>>>> line 106, in value 
>>>>     self._value = self.load(self._path) 
>>>>   File "/scratch/users/tjklein/215178/spark/python/pyspark/broadcast.py", 
>>>> line 95, in load 
>>>>     return cPickle.loads(data) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/compile/function_module.py",
>>>>  line 747, in _constructor_Function 
>>>>     f = maker.create(input_storage, trustme = True) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/compile/function_module.py",
>>>>  line 1415, in create 
>>>>     input_storage=input_storage_lists) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/link.py", 
>>>> line 525, in make_thunk 
>>>>     output_storage=output_storage)[:3] 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/vm.py", 
>>>> line 897, in make_all 
>>>>     no_recycling)) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/op.py", 
>>>> line 739, in make_thunk 
>>>>     output_storage=node_output_storage) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", 
>>>> line 1072, in make_thunk 
>>>>     keep_lock=keep_lock) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", 
>>>> line 1014, in __compile__ 
>>>>     keep_lock=keep_lock) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cc.py", 
>>>> line 1441, in cthunk_factory 
>>>>     key=key, lnk=self, keep_lock=keep_lock) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py",
>>>>  line 1054, in module_from_key 
>>>>     module = self._get_from_hash(module_hash, key, keep_lock=keep_lock) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py",
>>>>  line 957, in _get_from_hash 
>>>>     key_data.add_key(key, save_pkl=bool(key[0])) 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py",
>>>>  line 467, in add_key 
>>>>     self.save_pkl() 
>>>>   File 
>>>> "/home/tjklein/anaconda/lib/python2.7/site-packages/theano/gof/cmodule.py",
>>>>  line 484, in save_pkl 
>>>>     with open(self.key_pkl, 'wb') as f: 
>>>> IOError: [Errno [Errno 2] No such file or directory: 
>>>> '/scratch/users/tjklein/215178/theano/compiledir_Linux-2.6-el6.x86_64-x86_64-with-redhat-6.6-Santiago-x86_64-2.7.9-64/tmpQrM4nE/key.pkl']
>>>>
>>>> ​
>>>>
>>>> On Mon, Mar 2, 2015 at 11:21 AM, Frédéric Bastien <
>>>> [email protected]> wrote:
>>>>
>>>>> Hi,
>>>>>
>>>>> Can you give the full stack trace of the error?
>>>>>
>>>>> Do you use the same compiledir for the master and all process when it 
>>>>> is on the same compute node?
>>>>>
>>>>> Fred
>>>>>
>>>>> On Sun, Mar 1, 2015 at 5:17 PM, Tassilo Klein <[email protected]> 
>>>>> wrote:
>>>>>
>>>>>> Hi Fred,
>>>>>>
>>>>>> I got the latest version of Theano from the repository. I ran it on a 
>>>>>> single machine and also tried it on a distributed cluster. But still 
>>>>>> have 
>>>>>> the same issues as before, in both scenarios.
>>>>>>
>>>>>> I get many:
>>>>>>
>>>>>> INFO (theano.gof.compilelock): To manually release the lock, delete 
>>>>>> /scratch/users/tjklein/STANDALONE/theano/compiledir_Linux-2.6-el6.x86_64-x86_64-with-redhat-6.6-Santiago-x86_64-2.7.9-64/lock_dir
>>>>>>
>>>>>> INFO (theano.gof.compilelock): Waiting for existing lock by process 
>>>>>> '252635' (I am process '249796')
>>>>>>
>>>>>> INFO (theano.gof.compilelock): To manually release the lock, delete 
>>>>>> /scratch/users/tjklein/STANDALONE/theano/compiledir_Linux-2.6-el6.x86_64-x86_64-with-redhat-6.6-Santiago-x86_64-2.7.9-64/lock_dir
>>>>>>
>>>>>> INFO (theano.gof.compilelock): Waiting for existing lock by process 
>>>>>> '252635' (I am process '238059')
>>>>>>
>>>>>> INFO (theano.gof.compilelock): To manually release the lock, delete 
>>>>>> /scratch/users/tjklein/STANDALONE/theano/compiledir_Linux-2.6-el6.x86_64-x86_64-with-redhat-6.6-Santiago-x86_64-2.7.9-64/lock_dir
>>>>>>
>>>>>> INFO (theano.gof.compilelock): Waiting for existing lock by process 
>>>>>> '252635' (I am process '237960')
>>>>>>
>>>>>> INFO (theano.gof.compilelock): To manually release the lock, delete 
>>>>>> /scratch/users/tjklein/STANDALONE/theano/compiledir_Linux-2.6-el6.x86_64-x86_64-with-redhat-6.6-Santiago-x86_64-2.7.9-64/lock_dir
>>>>>>
>>>>>> INFO (theano.gof.compilelock): Waiting for existing lock by process 
>>>>>> '252635' (I am process '238179')
>>>>>>
>>>>>> INFO (theano.gof.compilelock): To manually release the lock, delete 
>>>>>> /scratch/users/tjklein/STANDALONE/theano/compiledir_Linux-2.6-el6.x86_64-x86_64-with-redhat-6.6-Santiago-x86_64-2.7.9-64/lock_dir
>>>>>>
>>>>>> INFO (theano.gof.compilelock): Waiting for existing lock by process 
>>>>>> '252635' (I am process '247811')
>>>>>>
>>>>>> INFO (theano.gof.compilelock): To manually release the lock, delete 
>>>>>> /scratch/users/tjklein/STANDALONE/theano/compiledir_Linux-2.6-el6.x86_64-x86_64-with-redhat-6.6-Santiago-x86_64-2.7.9-64/lock_dir
>>>>>>
>>>>>> INFO (theano.gof.compilelock): Waiting for existing lock by process 
>>>>>> '252635' (I am process '237684')
>>>>>>
>>>>>> INFO (theano.gof.compilelock): To manually release the lock, delete 
>>>>>> /scratch/users/tjklein/STANDALONE/theano/compiledir_Linux-2.6-el6.x86_64-x86_64-with-redhat-6.6-Santiago-x86_64-2.7.9-64/lock_dir
>>>>>>
>>>>>> INFO (theano.gof.compilelock): Waiting for existing lock by process 
>>>>>> '252635' (I am process '251284')
>>>>>>
>>>>>> and
>>>>>>
>>>>>> IOError: [Errno [Errno 2] No such file or directory: 
>>>>>> '/scratch/users/STANDALONE/theano/compiledir_Linux-2.6-el6.x86_64-x86_64-with-redhat-6.6-Santiago-x86_64-2.7.9-64/tmptlW68q/key.pkl']
>>>>>>  
>>>>>> <function _constructor_Function at 0x2aaacc1b7b90>: 
>>>>>> (<theano.compile.function_module.FunctionMaker object at 
>>>>>> 0x2aab24139b50>, 
>>>>>>
>>>>>>
>>>>>>
>>>>>> Let me know if you want me to try out something else.
>>>>>>
>>>>>> Best, 
>>>>>>
>>>>>>  Tassilo
>>>>>>
>>>>>> On Wed, Feb 25, 2015 at 5:39 PM, Frédéric Bastien <
>>>>>> [email protected]> wrote:
>>>>>>
>>>>>>> This could cause duplicate compilation and warning related to that. 
>>>>>>> Can you try with up to date Theano without your work around?
>>>>>>> Le 22 févr. 2015 09:16, "György Solymosi" <[email protected]> a 
>>>>>>> écrit :
>>>>>>>
>>>>>>>> Hi Fred,
>>>>>>>>
>>>>>>>> Sorry to jump in, but I had some experiences with the same problem, 
>>>>>>>> so I simply switched off the lock and I had no issues -perhaps 
>>>>>>>> seemingly- 
>>>>>>>> with multiprocessing some -even 4 another- Theano tasks. Is it 
>>>>>>>> possible or 
>>>>>>>> it was just an illusion and it generated some issues I didn't see in 
>>>>>>>> background?
>>>>>>>>
>>>>>>>> George
>>>>>>>>
>>>>>>>> -- 
>>>>>>>>
>>>>>>>> --- 
>>>>>>>> You received this message because you are subscribed to the Google 
>>>>>>>> Groups "theano-users" group.
>>>>>>>> To unsubscribe from this group and stop receiving emails from it, 
>>>>>>>> send an email to [email protected].
>>>>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>>>>
>>>>>>> -- 
>>>>>>>
>>>>>>> --- 
>>>>>>> You received this message because you are subscribed to a topic in 
>>>>>>> the Google Groups "theano-users" group.
>>>>>>> To unsubscribe from this topic, visit 
>>>>>>> https://groups.google.com/d/topic/theano-users/Pi4zQpfn5Ts/unsubscribe
>>>>>>> .
>>>>>>> To unsubscribe from this group and all its topics, send an email to 
>>>>>>> [email protected].
>>>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>>>
>>>>>>
>>>>>> -- 
>>>>>>
>>>>>> --- 
>>>>>> You received this message because you are subscribed to the Google 
>>>>>> Groups "theano-users" group.
>>>>>> To unsubscribe from this group and stop receiving emails from it, 
>>>>>> send an email to [email protected].
>>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>>
>>>>>
>>>>> -- 
>>>>>
>>>>> --- 
>>>>> You received this message because you are subscribed to a topic in the 
>>>>> Google Groups "theano-users" group.
>>>>> To unsubscribe from this topic, visit 
>>>>> https://groups.google.com/d/topic/theano-users/Pi4zQpfn5Ts/unsubscribe
>>>>> .
>>>>> To unsubscribe from this group and all its topics, send an email to 
>>>>> [email protected].
>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>
>>>>
>>>>
>>> -- 
>>
>> --- 
>> You received this message because you are subscribed to a topic in the 
>> Google Groups "theano-users" group.
>> To unsubscribe from this topic, visit 
>> https://groups.google.com/d/topic/theano-users/Pi4zQpfn5Ts/unsubscribe.
>> To unsubscribe from this group and all its topics, send an email to 
>> [email protected] <javascript:>.
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>

-- 

--- 
You received this message because you are subscribed to the Google Groups 
"theano-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

Reply via email to