On 14-8-2011 23:05, Daniel Fischer wrote:
On Sunday 14 August 2011, 22:42:13, Wishnu Prasetya wrote:
On 14-8-2011 22:17, Daniel Fischer wrote:
We don't know the times for a non-threaded run (or an -N1 run), so it
could be anything from a slowdown to a> 4× speedup (but it's likely
to be a speedup by a factor< 4×).
Well, the -N1 is below. The sequential version of the program has almost
the same profile:
SPARKS: 5 (1 converted, 4 pruned)
INIT time 0.00s ( 0.00s elapsed)
MUT time 2.78s ( 2.99s elapsed)
GC time 4.35s ( 4.15s elapsed)
EXIT time 0.00s ( 0.00s elapsed)
Total time 7.13s ( 7.14s elapsed)
Am I correct then to say that the speed up with respect to sequential is
equal to: tot-elapse-time-N1 / tot-elapse-N4 ? So I have 7.14 / 2.36 =
3.0 speed up, and not 1.46 as Iustin said?
Yes (with respect to wall-clock time of course).
That's not too bad, though it should be possible to increase that factor.
I'll probably have to do something with that GC :)
But that should be the first target, there's probably some low-hanging
fruit there.
Maybe increasing the size of the allocation area (+RTS -Ax) or the heap
(+RTS -Hx) would already do some good.
Also do heap profiling to find out what most likely takes so much GC time
(before compiling for profiling, running with +RTS -hT could produce useful
information).
Ah ok. Thanks a lot for all the answers and tips.
--Wish.
_______________________________________________
Haskell-Cafe mailing list
[email protected]
http://www.haskell.org/mailman/listinfo/haskell-cafe