Recommended way of random sampling

Stéphane Thibaud Sun, 19 May 2019 05:00:42 -0700

Hello Ignite users,

I am considering random sampling over a large amount of data, and I am
wondering what the most efficient way to do this would be. Right now, I think
I might need cluster-based randomness using a MOD function, as described
here: https://www.alandix.com/academic/topics/random/sampling-SQL.html


I currently have a UUID column (uuid4), which I think can be used for this,
but I might need some bit manipulation to strip the non-random parts from
the UUID.
Do you think this is indeed the most straightforward way to do it?
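
To make the idea concrete, here is a rough sketch in Python of what I have in
mind. It assumes the UUIDs follow the standard version 4 layout, where the low
64 bits start with 2 fixed variant bits followed by 62 random bits. The names
random_bits and in_sample are only illustrative, and I have not checked how
the same masking and MOD would be written in Ignite SQL.

```python
import uuid

# Assumption: version 4 UUIDs, so the low 62 bits are random.
# The 2 bits above them hold the fixed variant marker and are masked off.
RANDOM_LOW_MASK = (1 << 62) - 1


def random_bits(u: uuid.UUID) -> int:
    """Return the 62 random low-order bits of a version 4 UUID."""
    return u.int & RANDOM_LOW_MASK


def in_sample(u: uuid.UUID, percent: int) -> bool:
    """Keep a row when its random bits land in the first `percent` of 100 buckets."""
    return random_bits(u) % 100 < percent


if __name__ == "__main__":
    ids = [uuid.uuid4() for _ in range(10_000)]
    sample = [u for u in ids if in_sample(u, 10)]
    # Roughly 10% of the rows; the exact count varies per run.
    print(len(sample))
```

Because the decision depends only on the stored UUID, the same rows would be
selected every time, which is fine for a repeatable sample but not for drawing
a fresh one on each query.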


Kind regards,

Stéphane Thibaud
