[GitHub] spark issue #21608: [SPARK-24626] [SQL] Improve Analyze Table command

2018-06-22 Thread maropu
Github user maropu commented on the issue: https://github.com/apache/spark/pull/21608 cc: @wzhfy @gatorsmile

[GitHub] spark issue #21608: [SPARK-24626] [SQL] Improve Analyze Table command

2018-06-22 Thread maropu
Github user maropu commented on the issue: https://github.com/apache/spark/pull/21608 OK, can you put the result in the description? Also, can you make the title more precise, e.g., "Parallelize size computation in ANALYZE command"?

[GitHub] spark issue #21608: [SPARK-24626] [SQL] Improve Analyze Table command

2018-06-22 Thread Achuth17
Github user Achuth17 commented on the issue: https://github.com/apache/spark/pull/21608 Yes, in the case where the data is stored in S3, I noticed a significant difference. Some rough numbers: when done serially for a table in S3 with 1000 partitions, the calculateTotalSize
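The serial-versus-parallel idea discussed in this thread can be illustrated with a minimal sketch. This is not the code from the pull request: it assumes partitions are plain directories on a local filesystem, and the names `dir_size`, `total_size_serial`, and `total_size_parallel` are hypothetical helpers introduced only for illustration. Threads are used on the assumption that the work is dominated by I/O latency, as listing files on a remote store typically is.

```python
import os
from concurrent.futures import ThreadPoolExecutor


def dir_size(path):
    """Sum the sizes of all regular files under one partition directory."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            total += os.path.getsize(os.path.join(root, name))
    return total


def total_size_serial(partition_paths):
    """Visit partitions one after another (hypothetical baseline)."""
    return sum(dir_size(p) for p in partition_paths)


def total_size_parallel(partition_paths, max_workers=8):
    """Compute each partition's size on a thread pool and add the results.

    max_workers is an arbitrary illustrative default, not a tuned value.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return sum(pool.map(dir_size, partition_paths))
```

Both functions return the same total; only the elapsed time differs, and any speedup depends on how much of each call is spent waiting on the storage layer.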

[GitHub] spark issue #21608: [SPARK-24626] [SQL] Improve Analyze Table command

2018-06-21 Thread maropu
Github user maropu commented on the issue: https://github.com/apache/spark/pull/21608 Does this PR improve actual performance numbers? (My question is whether the calculation is a bottleneck.)

[GitHub] spark issue #21608: [SPARK-24626] [SQL] Improve Analyze Table command

2018-06-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/21608 Can one of the admins verify this patch?
