hi, now I'm doing something like this on a data frame to make use of table partitioning
df.filter($"sex" === "male").write.parquet("path/to/table/sex=male") df.filter($"sex" === "female").write.parquet("path/to/table/sex=female") this will filter dataset multiple times, are there better way to do this? thanks.