[ https://issues.apache.org/jira/browse/PIG-2553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13495811#comment-13495811 ]
Prashant Kommireddi commented on PIG-2553: ------------------------------------------ That's a good point Dmitriy. The patch does not handle multiple relations being written to hbase. Is it sufficient to check for the schema (hdfs://, hbase://, file://,...) ? Rohini, you are right. Any implementation of StoreFunc similar to Hadoop MultipleOutputFormat would break this. As Dmitriy suggested, I think it makes sense to provide an option to users, in addition to logging a warning message? > Pig shouldn't allow attempts to write multiple relations into same directory > ---------------------------------------------------------------------------- > > Key: PIG-2553 > URL: https://issues.apache.org/jira/browse/PIG-2553 > Project: Pig > Issue Type: Improvement > Reporter: Dmitriy V. Ryaboy > Assignee: Prashant Kommireddi > Attachments: PIG-2553.patch > > > We've seen multiple occasions where users accidentally try to store 2 or more > different relations to the same destination directory. Currently, this passes > the Pig planner and fails on MR side due to concurrent attempts to create the > same part file on the reducer. This is extremely confusing to the user, and > hard to debug. > We should instead fail their scripts before they are even submitted, since we > can identify the erroneous condition from the beginning. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira