[Pharo-dev] Re: Array sum. is very slow

Jimmie Houchin Fri, 07 Jan 2022 04:20:55 -0800

As I stated this is a micro benchmark and very much not anythingresembling a real app, Your comments are true if you are writing yourapp. But if you want to stress the language you are going to do thingswhich are seemingly non-sense and abusive.

Also as I stated. The test has to be sufficient to stress fasterlanguages or it is meaningless.

If I remove the #sum and the #average calls from the inner loops, thisis what we get.


Julia      0.2256 seconds
Python   5.318  seconds
Pharo    3.5    seconds

This test does not sufficiently stress the language. Nor does it provideany valuable insight into summing and averaging which is done a lot, inlots of places in every iteration.

If you notice that inner array changes the array every iteration. Soevery call to #sum and #average is getting different data.


Full Test

Julia     1.13  minutes
Python   24.02 minutes
Pharo    2:09:04

Code for the above is now published. You can let me know if I am doingsomething unequal to the various languages.

And just remember anything you do which sufficiently changes the testhas to be done in all the languages to give a fair test. This isn't alets make Pharo look good test. I do want Pharo to look good, but honestly.

Yes, I know that I can bind to BLAS or other external libraries. Butthat is not a test of Pharo. The Python is plain Python3 no Numpy, justusing the the default list [] for the array.

Julia is a whole other world. It is faster than Numpy. This is theirdomain and they optimize, optimize, optimize all the math. In fact theyhave reached the point that some pure Julia code beats pure Fortran.


In all of this I just want Pharo to do the best it can.

With the above results unless you already had an investment in Pharo,you wouldn't even look. :(


Thanks for exploring this with me.


Jimmie




On 1/6/22 18:24, John Brant wrote:

On Jan 6, 2022, at 4:35 PM, Jimmie Houchin <[email protected]> wrote:

No, it is an array of floats. The only integers in the test are in the indexes 
of the loops.

Number random. "generates a float  0.8188008774329387"

So in the randarray below it is an array of 28800 floats.

It just felt so wrong to me that Python3 was so much faster. I don't care if 
Nim, Crystal, Julia are faster. But...


I am new to Iceberg and have never shared anything on Github so this is all new 
to me. I uploaded my language test so you can see what it does. It is a 
micro-benchmark. It does things that are not realistic in an app. But it does 
stress a language in areas important to my app.


https://github.com/jlhouchin/LanguageTestPharo


Let me know if there is anything else I can do to help solve this problem.

I am a lone developer in my spare time. So my apologies for any ugly code.

Are you sure that you have the same algorithm in Python? You are calling sum 
and average inside the loop where you are modifying the array:

        1 to: nsize do: [ :j || n |
                n := narray at: j.
                narray at: j put: (self loop1calc: i j: j n: n).
                nsum := narray sum.
                navg := narray average ]

As a result, you are calculating the sum of the 28,800 size array 28,800 times (plus 
another 28,800 times for the average). If I write a similar loop in Python, it looks 
like it would take almost 9 minutes on my machine without using numpy to calculate 
the sum. The Pharo code takes ~40 seconds. If this is really how the code should be, 
then I would change it to not call sum twice (once for sum and once in average). This 
will almost result in a 2x speedup. You could also modify the algorithm to update the 
nsum value in the loop instead of summing the array each time. I think the updating 
would require <120,000 math ops vs the >1.6 billion that you are performing.


John Brant

[Pharo-dev] Re: Array sum. is very slow

Reply via email to