hi,

now I'm doing something like this on a data frame to make use of table
partitioning

df.filter($"sex" === "male").write.parquet("path/to/table/sex=male")
df.filter($"sex" === "female").write.parquet("path/to/table/sex=female")

this will filter dataset multiple times, are there better way to do this?


thanks.

Reply via email to