Hello Ignite users,

I am considering random sampling over a large amount of data, and I am
wondering what the most efficient way to do this would be. Right now, I
think I might need cluster-based randomness using a MOD function, as
described here: https://www.alandix.com/academic/topics/random/sampling-SQL.html

I currently have a UUID column (uuid4), which I think could be used for
this, but I might need some bit manipulation to strip the non-random parts
(the fixed version and variant bits) out of the UUID.
Do you think this is indeed the most straightforward way to do it?
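
To make the idea concrete, here is a minimal sketch of what I have in mind,
written in Python purely for illustration (it is not Ignite SQL). It assumes
RFC 4122 version-4 UUIDs, where only the 4 version bits and the 2 variant
bits are fixed. The lowest 62 bits are then entirely random, so a single
mask avoids the fixed bits. The names sample_bucket and in_sample are
hypothetical helpers, not part of any library.

```python
import uuid

# In a version-4 UUID the low 64 bits start with the 2-bit variant field,
# so the lowest 62 bits are purely random (assumption: RFC 4122 layout).
RANDOM_LOW_BITS = 62
MASK = (1 << RANDOM_LOW_BITS) - 1


def sample_bucket(u: uuid.UUID, buckets: int) -> int:
    """Map a UUID to a bucket in [0, buckets) using only its random low bits."""
    return (u.int & MASK) % buckets


def in_sample(u: uuid.UUID, buckets: int = 100, keep: int = 1) -> bool:
    """Keep roughly keep/buckets of all rows (1% with the defaults)."""
    return sample_bucket(u, buckets) < keep


if __name__ == "__main__":
    rows = [uuid.uuid4() for _ in range(100_000)]
    sampled = [u for u in rows if in_sample(u, buckets=100, keep=5)]
    # Expect a fraction close to 0.05; the exact value varies per run.
    print(len(sampled) / len(rows))
```

Because the bucket depends only on the stored UUID, the same rows are
selected on every run, which is repeatable but not a fresh random draw each
time. The modulo bias is negligible here, since 2^62 is vastly larger than
the number of buckets.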


Kind regards,

Stéphane Thibaud
