Re: A couple of questions about arrays and slices

Cecil Ward via Digitalmars-d-learn Sat, 24 Jun 2023 07:47:00 -0700

On Saturday, 24 June 2023 at 12:05:26 UTC, Jonathan M Davis wrote:

On Saturday, June 24, 2023 1:43:53 AM MDT Cecil Ward viaDigitalmars-d-learn wrote:
On Saturday, 24 June 2023 at 07:36:26 UTC, Cecil Ward wrote:
> [...]
I just realised something, your point about altering the tableand having to rehash, is well taken. I hadn’t considered that.The reason for my foolishness in failing to realise that I’masking the impractical is my pattern of usage. I add all theentries into the mapping table and have no interest in anylookups until it is fully built. Then a second function startsto do lookups while the data remains unchanging and that usagepattern can be guaranteed. I could even idup it if that wouldhelp, as copying < 32 uints wouldn’t take forever. A typicalvalue would be a mere 5 or less. I only picked 32 to becompletely safely ott.
Well, if the key were a struct or a class, the hashing functionwould be opHash. For built-in types, the runtime has hashingfunctions that it uses. Either way, with AAs, you really don'tworry about managing the memory, because it's completelyoutside of your control. You just put the elements in thereusing their associated keys, and if you want to try to speed itup after you've populated it, you use rehash so that theruntime can try to move the elements around within thecontainer so that lookup speeds will be closer to optimal.
As such, for the most part, when dealing with AAs and worryingabout efficiency, the question really becomes whether AAs arethe correct solution rather than much of anything having to dowith how you manage their memory.
With so few elements, it's also possible that usingstd.algorithm.searching.find would be faster - e.g. having adynamic array of strings where the matching int is at the sameindex in a dynamic array of ints - or you could usestd.typecons.Tuple!(string, int)[] with something likearr.find!(a => a[0] == key)() to find the tuple with the intyou want.
Simply comparing a small number of strings like that might befaster than what goes on with hashing the string and thenfinding the corresponding element within the AA - or it mightnot be. You'd have to test that to know. The AA woulddefinitely be faster with a large number of elements, but witha small number of elements, the algorithmic complexity doesn'treally matter, and the extra overhad with the AA lookups couldactually mean that the search through the dynamic array isfaster even though it's O(n). But you can only know which isfaster by testing it out with the actual data that you'redealing with.
Regardless, you need to remember that associative arrays arenot arrays in the C sense. Rather, they're hash tables, so theyfunction very differently from dynamic arrays, and the rehashfunction is the closest that you're going to get to affectinghow the elements are laid out internally or how much memory theAA is using.
- Jonathan M Davis

I started out looking into a number of runtime library routines,but in the end it seemed quicker to roll my own code for a cruderecursive descent parser/lexer that parses part of D’s grammarfor expressions, and (again partial grammar) parser for stringliteral expressions and so on. I find certain special elementsand execute actions which involve doing the AA lookup andreplacing variable names with ordinal numbers in decimal in theoutput stream. Admission: The parsing is the thing that has to befast, even though again the size of the D language text is notlikely to be huge at all. But 40 years ago, I came from a worldwith 2k RAM and 0.9 MHz clock rates so I have developed a habitof always thinking about speed before I do anything, needful ornot, to be honest. I once wrote a program that took 35 mins toevaluate 2+2 and print out the answer, so I’m now ashamed ofwriting slow code. Those were bad days, to be honest. 4 GHz+ andILP is nicer.

Re: A couple of questions about arrays and slices

Reply via email to