[ 
https://issues.apache.org/jira/browse/PIG-2553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13495811#comment-13495811
 ] 

Prashant Kommireddi commented on PIG-2553:
------------------------------------------

That's a good point Dmitriy. The patch does not handle multiple relations being 
written to hbase. Is it sufficient to check for the schema (hdfs://, hbase://, 
file://,...) ?

Rohini, you are right. Any implementation of StoreFunc similar to Hadoop 
MultipleOutputFormat would break this. As Dmitriy suggested, I think it makes 
sense to provide an option to users, in addition to logging a warning message?
                
> Pig shouldn't allow attempts to write multiple relations into same directory
> ----------------------------------------------------------------------------
>
>                 Key: PIG-2553
>                 URL: https://issues.apache.org/jira/browse/PIG-2553
>             Project: Pig
>          Issue Type: Improvement
>            Reporter: Dmitriy V. Ryaboy
>            Assignee: Prashant Kommireddi
>         Attachments: PIG-2553.patch
>
>
> We've seen multiple occasions where users accidentally try to store 2 or more 
> different relations to the same destination directory. Currently, this passes 
> the Pig planner and fails on MR side due to concurrent attempts to create the 
> same part file on the reducer. This is extremely confusing to the user, and 
> hard to debug.
> We should instead fail their scripts before they are even submitted, since we 
> can identify the erroneous condition from the beginning.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to