satish created HUDI-1555:
----------------------------
Summary: clustering bugs from large scale testing
Key: HUDI-1555
URL: https://issues.apache.org/jira/browse/HUDI-1555
Project: Apache Hudi
Issue Type: Sub-task
Reporter: satish
Assignee: satish

1) writeStatusRdd.isEmpty causes a 20-25% performance degradation.
2) While computing the number of groups, we use int instead of long, which reduces the parallelism of writing new files.

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
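
To illustrate item 2, here is a minimal sketch of how int arithmetic can collapse a group count at large scale. This is not Hudi's actual code: the class name, the method names, and the ceiling-division formula are all hypothetical, and the sizes are made up. It only shows the general failure mode, where a byte count above Integer.MAX_VALUE is narrowed to int before the division.

```java
public class GroupCountSketch {

    // Hypothetical helper: ceiling division done entirely in long arithmetic.
    static long numGroupsLong(long totalBytes, long targetBytesPerGroup) {
        return Math.max(1L, (totalBytes + targetBytesPerGroup - 1) / targetBytesPerGroup);
    }

    // Hypothetical buggy variant: both inputs are narrowed to int first.
    // A total above Integer.MAX_VALUE wraps around, so the quotient is wrong
    // and the clamp to 1 hides the problem as "just one group".
    static int numGroupsInt(long totalBytes, long targetBytesPerGroup) {
        int total = (int) totalBytes;           // keeps only the low 32 bits
        int target = (int) targetBytesPerGroup; // fits in int in this example
        return Math.max(1, (total + target - 1) / target);
    }

    public static void main(String[] args) {
        long totalBytes = 10L * 1024 * 1024 * 1024;     // 10 GiB of input (assumed)
        long targetBytesPerGroup = 1024L * 1024 * 1024; // 1 GiB per group (assumed)

        System.out.println("int math:  " + numGroupsInt(totalBytes, targetBytesPerGroup));
        System.out.println("long math: " + numGroupsLong(totalBytes, targetBytesPerGroup));
    }
}
```

With these assumed sizes the int variant yields a single group while the long variant yields ten, so fewer write tasks would run in parallel if the group count drives the write parallelism.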
