Hi, I am trying to process some logs and the data is sorted(*almost*) by timestamp. If I do a full sort it takes a lot of time. Is there some way to sort more efficiently (like restricting sort to per partition).
Thanks in advance
Hi, I am trying to process some logs and the data is sorted(*almost*) by timestamp. If I do a full sort it takes a lot of time. Is there some way to sort more efficiently (like restricting sort to per partition).
Thanks in advance