Era Scarecrow:

If it has to determine between compact and non-compact, then I really don't see how any speed improvement can be made,

Maybe it's better to consider two use cases: a dynamic-length array allocated on the heap (or with a given allocator, often a stack-style one), and a fixed-size array allocated in place, where there's no need for that run-time test.
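
In code the two cases could look like this (just a sketch; FixedBitArray and DynamicBitArray are placeholder names, not a proposed API):

struct FixedBitArray(size_t nBits) {
    // Storage lives inside the struct: no allocation and no run-time tag to test.
    uint[(nBits + 31) / 32] data;

    bool opIndex(size_t i) const pure nothrow {
        return ((data[i >> 5] >> (i & 31)) & 1) != 0;
    }
}

struct DynamicBitArray {
    // Storage on the heap (or from a user-given allocator); length chosen at run time.
    uint[] data;
    size_t nBits;
}

Neither type needs the compact/non-compact test, because that decision is made once, by picking the type.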


and if it's optimized it may be inlined which would be about as fast as you could get.

I think that "a[x] = 1;" is slower than "a.set(x);" even if the array a is allocated in place and everything is inlined. This is probably the most important operation an array of bits has to support. If it's missing, I still have to use my own implementation.
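
The reason is that opIndexAssign receives the bit value at run time, so it must handle both the set and the clear case, while a dedicated set() knows the value is 1. A minimal sketch (the struct and method names are only illustrative):

struct BitArraySketch {
    uint[] data;

    // opIndexAssign: the bool is a run-time value, so both paths
    // (OR a bit in, AND a bit out) must be present, typically as a branch.
    void opIndexAssign(bool b, size_t i) pure nothrow {
        if (b)
            data[i >> 5] |= 1u << (i & 31);
        else
            data[i >> 5] &= ~(1u << (i & 31));
    }

    // set: the value is statically known to be 1, so a single OR is enough.
    void set(size_t i) pure nothrow {
        data[i >> 5] |= 1u << (i & 31);
    }
}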


Likely similar to the hamming weight table mentioned in TDPL. Combined with the canUseBulk I think I could make it fairly fast.

There is a lot of literature about implementing this operation efficiently. For the first implementation a moderately fast (and short) piece of code is probably enough. Later, faster versions of this operation can go into Phobos, taken from papers.
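
Just to show the level of "moderately fast and short" I mean, a word-at-a-time count with the classic bit-parallel formula (popCount32 and countOnes are only illustrative names):

import std.stdio;

// Count the set bits in one 32-bit word (standard parallel-sum formula).
uint popCount32(uint x) pure nothrow {
    x = x - ((x >> 1) & 0x55555555);
    x = (x & 0x33333333) + ((x >> 2) & 0x33333333);
    x = (x + (x >> 4)) & 0x0F0F0F0F;
    return (x * 0x01010101) >> 24;
}

// Total count over the words that back a bit array.
size_t countOnes(const(uint)[] data) pure nothrow {
    size_t total = 0;
    foreach (w; data)
        total += popCount32(w);
    return total;
}

void main() {
    writeln(countOnes([0b1011u, 0xFFFF_FFFFu])); // prints 35
}

The papers go further (table lookups, SSE tricks, counting several words in parallel), but even this is much faster than testing the bits one by one.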


Yes, currently a debate on its partial ref/value issue is coming up. As it is it's usable, and if you want to be sure it's unique you can always dup it. Otherwise, as previously suggested, I'll either make two array types (full reference vs value), or take his separate-slices (reference) idea.

The idea of making two different types (for the dynamic and static versions) sounds nice.

There is a third use case: bit arrays that start small, can grow, and rarely grow bigger than one or two words. This asks for a dynamic-length bit array type that stores its data in place only while it's small (so every access needs a run-time test of the tag. Too bad the D GC doesn't support tagged pointers!). But this use case is probably not common enough to require a third array type, so I suggest not worrying about it now and focusing on the other two more common ones.
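
If someone wants to try it later, the usual shape is a tagged union, something like this rough sketch (SmallBitArray is a hypothetical name, not code I'm proposing):

struct SmallBitArray {
    union {
        uint[2] local;  // in-place storage, enough for up to 64 bits
        uint[] heap;    // heap storage for longer arrays
    }
    size_t nBits;
    bool isLocal = true;

    bool opIndex(size_t i) const pure nothrow {
        // The tag has to be tested on every single access.
        if (isLocal)
            return ((local[i >> 5] >> (i & 31)) & 1) != 0;
        return ((heap[i >> 5] >> (i & 31)) & 1) != 0;
    }
}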


 A Matrix is just a multidimensional array right?

Well, yes, but there are many ways to implement arrays and nD arrays. Sometimes the implementations differ.


I'll have to read up on it a bit; doing simple division (Likely whole rows of 32bit at a time) would do that job,

Take a look at my implementation in Bugzilla :-) It's another type, the third. The 1D and 2D cases are the most common.
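
The layout itself is nothing exotic: row-major, with each row padded up to whole 32-bit words, so indexing costs one multiply plus shifts and masks. A rough sketch (BitMatrix is a made-up name, this isn't the code in the attachment):

struct BitMatrix {
    uint[] data;
    size_t nRows, nCols;
    size_t wordsPerRow;  // each row padded up to whole 32-bit words

    this(size_t rows, size_t cols) {
        nRows = rows;
        nCols = cols;
        wordsPerRow = (cols + 31) / 32;
        data = new uint[rows * wordsPerRow];
    }

    // m[r, c] maps to one word load plus masking.
    bool opIndex(size_t r, size_t c) const pure nothrow {
        return ((data[r * wordsPerRow + (c >> 5)] >> (c & 31)) & 1) != 0;
    }
}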


but expanding width several times (compared to height) would be quite a bit slower; Although not too horrible.

A 2D array of bits probably doesn't need to change its size after creation; resizing it is not a common use case.


more likely bool array/matrix.. Although if you heavily use specific lines it could convert to bool[], and then back again, but if you go all over the place it would be worse than just using bool[].

It's not so simple :-) A bool[] causes many more cache misses if the matrix is not tiny. A bool[] or bool[][] is a good idea only if the matrix is very small, or if you don't have a bit matrix library and you don't want to write a lot of code. Converting the current line to bool[] is not a good idea.
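
To give rough numbers: a 1000 x 1000 matrix takes about 1 MB as bool[] (D stores one bool per byte) but only about 122 KB as packed bits, so scanning it touches roughly 8 times fewer cache lines.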

Bye,
bearophile
