On Sat, Apr 9, 2022 at 5:27 AM Elijah Stone <elro...@elronnd.net> wrote:
> Hashing is O(1) (or, if you prefer, O(#y) for ~.y, same as sorting).  A
> sufficiently smart(tm) hash function will avoid inordinate collision
> rates, so I am not sure what worst case behaviour you are referring to.

Collision rates are indeed an important constraint.

> If you are able to share your 1e8-sized dataset where sorting before
> removing duplicates was faster than using ~., that would be great.  Maybe
> I can make it faster :)

That was an intermediate result from almost ten years ago.

Anyways, it's worth spending a little time thinking about worst case
performance.

Thanks,

-- 
Raul
----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm

Reply via email to