GitHub user chetkhatri commented on the issue:
https://github.com/apache/spark/pull/20081
@cloud-fan spark.sql.files.maxRecordsPerFile did not work out when I was
working with my 30 TB Spark Hive workload, whereas repartition and coalesce
made sense.
