Hi all,

In the last contributor meetup, the topic around data correctness or data
corruption is rather concerning. Not only is the number of such issues that
have been reported recently, but also the way that Hive community is
handling these issues. The latter is the the topic of this discussion. I
think everyone agrees that the current practice is problematic and that
Hive community should treat data correctness more seriously. Therefore, I'd
like to find a "standard" procedural that we should follow. Here are my
initial thought:

1. JIRA should be correctly labeled and the title should reflect data
correctness.
2. JIRA should bear adequate description about the issue, including
affected version, JIRA that incurred the issue, any workaround, etc.
3. Once confirmed, advisory message should be sent to user@ and @dev
regarding the problem.
4. Once the JIRA is closed, a message should be sent again to the lists
advising the availability of the fix.

I know these may not be all clear or actionable, but I hope we can have
concrete steps to follow at the end of this discussion.

Please share your thoughts.

Thanks,
Xuefu

Reply via email to