I tried to simplify the inner loop to:
~~~
for word::D in data
remainder = (remainder >>> word_size)
end
~~~
It still took more than half the time of the original (with table indexing
and two XORs) in my Win7-x64 machine. It shows that not much speed-ups with
the real operations can be expected with this loop structure. Int32 tables
instead of Int64 does not look promising, either. Are you sure the zlib
crc32 function was fed with the same data (100 million bytes fetched from
memory, not generated on demand)? If yes, the Julia loops can use some
improvements.
- [julia-users] bit-twiddling micro benchmark andrew cooke
- Re: [julia-users] bit-twiddling micro benchmark Chris Foster
- [julia-users] Re: bit-twiddling micro benchmark Mike Innes
- Re: [julia-users] Re: bit-twiddling micro benchm... Isaiah Norton
- Re: [julia-users] Re: bit-twiddling micro be... Mike Innes
- Re: [julia-users] Re: bit-twiddling micr... Jake Bolewski
- Re: [julia-users] Re: bit-twiddling micro benchm... Tim Holy
- [julia-users] Re: bit-twiddling micro benchmark Laszlo Hars
- [julia-users] Re: bit-twiddling micro benchmark Laszlo Hars
- Re: [julia-users] Re: bit-twiddling micro benchm... Chris Foster
- Re: [julia-users] Re: bit-twiddling micro be... andrew cooke
- Re: [julia-users] Re: bit-twiddling micr... andrew cooke
- Re: [julia-users] Re: bit-twiddling... andrew cooke
- Re: [julia-users] Re: bit-twiddling... Stefan Karpinski
- Re: [julia-users] Re: bit-twidd... andrew cooke
- Re: [julia-users] Re: bit-t... Laszlo Hars
- Re: [julia-users] Re: bit-t... andrew cooke
- Re: [julia-users] Re: bit-t... Laszlo Hars
