BryanCutler commented on issue #24926: [SPARK-28128][PYTHON][SQL] Pandas
Grouped UDFs skip empty partitions
URL: https://github.com/apache/spark/pull/24926#issuecomment-505099961
Thanks @dongjoon-hyun and @HyukjinKwon !
This
BryanCutler commented on issue #24926: [SPARK-28128][PYTHON][SQL] Pandas
Grouped UDFs skip empty partitions
URL: https://github.com/apache/spark/pull/24926#issuecomment-504546730
Looks like this cut almost 30s off of test time lol, happy Friday
@shaneknapp !
BryanCutler commented on issue #24926: [SPARK-28128][PYTHON][SQL] Pandas
Grouped UDFs skip empty partitions
URL: https://github.com/apache/spark/pull/24926#issuecomment-504490665
> do we have a test that processes empty partitions?
We have one for pandas scalar UDFs, but not
BryanCutler commented on issue #24926: [SPARK-28128][PYTHON][SQL] Pandas
Grouped UDFs skip empty partitions
URL: https://github.com/apache/spark/pull/24926#issuecomment-504256446
cc @HyukjinKwon @icexelloss
This is an
BryanCutler commented on issue #24926: [SPARK-28128][PYTHON][SQL] Pandas
Grouped UDFs skip empty partitions
URL: https://github.com/apache/spark/pull/24926#issuecomment-504256239
This probably won't have an impact if data is not small, but given how many
tests there are for these it