[jira] [Commented] (HUDI-1668) GlobalSortPartitioner is getting called twice during bulk_insert.

2021-05-25 Thread Sugamber (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17351095#comment-17351095 ] Sugamber commented on HUDI-1668: [~nishith29] Yes, We can close this. Thank you!!! >

[jira] [Commented] (HUDI-1668) GlobalSortPartitioner is getting called twice during bulk_insert.

2021-05-24 Thread Nishith Agarwal (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17350651#comment-17350651 ] Nishith Agarwal commented on HUDI-1668: --- [~shivnarayan] [~sugamberku] Spark has a 3 stage sorting

[jira] [Commented] (HUDI-1668) GlobalSortPartitioner is getting called twice during bulk_insert.

2021-04-21 Thread Sugamber (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17326526#comment-17326526 ] Sugamber commented on HUDI-1668: [~shivnarayan]  I see Global sort executed twice in this example. >

[jira] [Commented] (HUDI-1668) GlobalSortPartitioner is getting called twice during bulk_insert.

2021-04-21 Thread Sugamber (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17326525#comment-17326525 ] Sugamber commented on HUDI-1668: I've attached the both screenshot. !Screenshot 2021-04-21 at 6.40.19

[jira] [Commented] (HUDI-1668) GlobalSortPartitioner is getting called twice during bulk_insert.

2021-04-21 Thread Sugamber (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17326397#comment-17326397 ] Sugamber commented on HUDI-1668: [~shivnarayan], I don't have spark 2.4.3 cluster.  I'll run the job and

[jira] [Commented] (HUDI-1668) GlobalSortPartitioner is getting called twice during bulk_insert.

2021-04-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17324294#comment-17324294 ] sivabalan narayanan commented on HUDI-1668: --- btw, min supported spark version is 2.4.3. Can you

[jira] [Commented] (HUDI-1668) GlobalSortPartitioner is getting called twice during bulk_insert.

2021-04-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17324293#comment-17324293 ] sivabalan narayanan commented on HUDI-1668: --- and btw, we have a direct row writing option (w/o

[jira] [Commented] (HUDI-1668) GlobalSortPartitioner is getting called twice during bulk_insert.

2021-04-17 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17324292#comment-17324292 ] sivabalan narayanan commented on HUDI-1668: --- [~sugamberku]: can you attach screenshots for spark