Re: Hash vs list

Levi Pearson Fri, 21 Apr 2006 19:52:07 -0700


On Apr 21, 2006, at 6:20 PM, Jeff Schroeder wrote:

I'm curious whether searching for a single value in a hash is generally
(always?) faster than searching for it in a list.  The programming I'm
doing is in PHP, but I think it's a question with a general answer.

Simple answer: No, searching for a single value in a hash is not always faster. The difference is that hash lookup is O(1) while lookup in an unordered list is O(n). Since you said you haven't had a data structures class, I'll assume you haven't learned O-notation and give a little summary.

What O-notation gives you is a measure of the time complexity of an algorithm. This means it lets you know how the amount of time scales up as the problem size (represented by n) grows. So, O(1) means the amount of time stays the same regardless of n. O(n) means the time to solve the problem grows linearly with n. So, if it takes x milliseconds to look up a single value in an unordered list of size 1, it takes roughly n*x to look up a single value in a list of size n in the worst case. Other common complexities are O(log n) (better than O(n)) and O(n^2) (worse than O(n)).
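To make the O(n) claim concrete, here is a minimal sketch in Python (used purely for illustration, since the question is language-independent). The function name linear_search_steps is made up for this example. It counts how many comparisons a front-to-back scan performs, so you can watch the work grow with n.

```python
def linear_search_steps(items, target):
    """Scan items front to back; return (found, number_of_comparisons)."""
    comparisons = 0
    for item in items:
        comparisons += 1
        if item == target:
            return True, comparisons
    return False, comparisons


for n in (10, 100, 1000):
    data = list(range(n))
    # Worst case: the target is absent, so every element gets examined.
    found, steps = linear_search_steps(data, -1)
    print(n, steps)
```

In the worst case the comparison count equals the list length, which is exactly the linear growth that O(n) describes.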

Anyway, what O-notation doesn't tell you is what the constant factors in the algorithm are. It's all about how the algorithm scales. This means that if you know your problem will always be a certain size, the optimal algorithm might be different than if you know your problem set will grow, depending on the constant factors.

Now, back to reality. Hash tables offer O(1) lookup (on average), which is implemented by running a hashing function on the key value. The result of this function is an index to an array. If there was a collision in the hash table (common with large tables), there is a small list to search through to find the desired element. If there was no collision, the value is stored directly in the array. So, you see there is a bit of overhead involved. There's also quite a bit of space overhead, since the table array is often rather sparse.
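The lookup path described above can be sketched roughly as follows. This is a toy illustration, not how PHP or any particular language implements its hashes. The class name ChainedHashTable and its methods are invented for the example, and for simplicity every slot holds a small list, even when there is no collision.

```python
class ChainedHashTable:
    """Toy hash table: an array of buckets, each a small list of (key, value)."""

    def __init__(self, num_buckets=8):
        # Most buckets stay empty in a sparse table; that is the space overhead.
        self.buckets = [[] for _ in range(num_buckets)]

    def _index(self, key):
        # The hashing function turns the key into an index into the array.
        return hash(key) % len(self.buckets)

    def put(self, key, value):
        bucket = self.buckets[self._index(key)]
        for i, (existing_key, _) in enumerate(bucket):
            if existing_key == key:
                bucket[i] = (key, value)  # overwrite an existing entry
                return
        bucket.append((key, value))

    def get(self, key, default=None):
        # On a collision, this loop is the "small list to search through".
        for existing_key, value in self.buckets[self._index(key)]:
            if existing_key == key:
                return value
        return default


table = ChainedHashTable(num_buckets=4)
table.put(0, "zero")
table.put(4, "four")  # 0 and 4 land in the same bucket when there are 4 buckets
print(table.get(4), len(table.buckets[0]))
```

Each lookup pays for one hash computation plus a short scan of one bucket, regardless of how many entries the table holds overall, as long as the buckets stay short.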

So, for small lists, a naive search algorithm may actually be faster than a hash lookup. This will vary widely depending on the hash and search implementations, though, and certainly the hash will look better and better as the problem size increases.
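If you want to see where the crossover falls on your own machine, a rough timing sketch like the one below can help. It uses Python's built-in list and dict as stand-ins for "list" and "hash". The helper name compare_lookup is made up for this example, and the numbers it prints depend heavily on the interpreter and hardware, so treat them as indicative only.

```python
import timeit


def compare_lookup(n, repeats=10000):
    """Time membership tests for the last key in a list and in a dict of size n."""
    keys = list(range(n))
    as_list = keys
    as_hash = dict.fromkeys(keys)
    target = n - 1  # last element: the worst case for a front-to-back scan
    list_time = timeit.timeit(lambda: target in as_list, number=repeats)
    hash_time = timeit.timeit(lambda: target in as_hash, number=repeats)
    return list_time, hash_time


for n in (2, 10, 100, 1000):
    list_time, hash_time = compare_lookup(n)
    print(f"n={n:5d}  list={list_time:.5f}s  dict={hash_time:.5f}s")
```

For very small n the two times are usually close, and the gap widens in the hash's favor as n grows. The same experiment is easy to repeat in PHP with in_array() against isset() on an array key.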

I hope that made some sense. Let me know if there's anything that needs clarification.


                --Levi

/*
PLUG: http://plug.org, #utah on irc.freenode.net
Unsubscribe: http://plug.org/mailman/options/plug
Don't fear the penguin.
*/
