Hi!
I'm running julia release 0.3 on an hpc system that features two 6-core
2.4GHz Intel Westmere processors (a total of 12 processor cores) per node.
I noticed my program is significantly slower on that machine than on my
mac, where I run this on 1 core (2.4GHz Intel Core i5). (runtime is 60 secs
vs 90 secs)
I would expect the hpc cores to be slightly slower but was surprised it was
that much. I was wondering if any of the options I set to build this are
culprit for this? I do run the binary of release 0.3 on my mac, whereas I
build from source with those options on the hpc:
OPENBLAS_TARGET_ARCH=NEHALEM
OPENBLAS_DYNAMIC_ARCH = 0
OPENBLAS_USE_THREAD=0
JULIA_CPU_TARGET=core2
I can build julia without the final option. I cannot build it without the
first three options.
I cannot run multiple nodes without the final option, since I get an error
Target architecture mismatch. Please delete or regenerate
sys.{so,dll,dylib}.