[ 
https://issues.apache.org/jira/browse/PIG-4901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15713443#comment-15713443
 ] 

Rohini Palaniswamy commented on PIG-4901:
-----------------------------------------

bq. output001.multi.gz/a.gz/f1.gz/a-f1-0,000.gz
Does the output file path have a comma in it?

> To use Multistorage for each Group
> ----------------------------------
>
>                 Key: PIG-4901
>                 URL: https://issues.apache.org/jira/browse/PIG-4901
>             Project: Pig
>          Issue Type: Improvement
>          Components: piggybank
>    Affects Versions: 0.11.1, 0.16.0
>         Environment: Hadoop 1.2.0
>            Reporter: Divya
>            Assignee: Adam Szita
>            Priority: Minor
>             Fix For: 0.17.0
>
>         Attachments: PIG-4901.2.patch, PIG-4901.patch
>
>
> I am trying to group my data and store it in HDFS with a folder for each 'name'
> and a subfolder for each 'YearMonth' under each name folder.
> Input:
> (Date)        (name)    (col3)    (col4)
> 2015-02-02    abc       y         z
> 2016-01-02    xyz       i         j
> 2015-03-02    abc       f         b
> 2015-02-06    abc       y         z
> 2016-03-02    xyz       a         q
>     
> Expected output in HDFS:
> abc folder
>     ->201502 subfolder
>            2015-02-02    abc       y         z
>            2015-02-06    abc       y         z
>     ->201503 subfolder
>            2015-03-02    abc       f         b
> xyz folder
>     ->201601 subfolder
>            2016-01-02    xyz       i         j
>     ->201603 subfolder
>            2016-03-02    xyz       a         q
> I am not sure how to use the MultiStorage option on the name column after
> grouping the tuples by date.
> Any help is appreciated.
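
The layout requested in the description can be illustrated outside Pig. The following is only a minimal Python sketch of the intended partitioning, not the MultiStorage implementation or the attached patch: it derives a 'name/YearMonth' path for each sample row from the issue. The function names year_month and partition are hypothetical and exist only for this illustration.

```python
from collections import defaultdict

# Sample rows copied from the issue description: (date, name, col3, col4).
ROWS = [
    ("2015-02-02", "abc", "y", "z"),
    ("2016-01-02", "xyz", "i", "j"),
    ("2015-03-02", "abc", "f", "b"),
    ("2015-02-06", "abc", "y", "z"),
    ("2016-03-02", "xyz", "a", "q"),
]


def year_month(date):
    """Turn 'YYYY-MM-DD' into 'YYYYMM' (hypothetical helper)."""
    year, month, _day = date.split("-")
    return year + month


def partition(rows):
    """Map each 'name/YYYYMM' path to the rows that belong under it."""
    layout = defaultdict(list)
    for row in rows:
        date, name = row[0], row[1]
        layout[f"{name}/{year_month(date)}"].append(row)
    return dict(layout)


if __name__ == "__main__":
    # Print one block per target subfolder, rows in input order.
    for path, members in sorted(partition(ROWS).items()):
        print(path)
        for row in members:
            print("    " + "\t".join(row))
```

The point of the sketch is that the target directory depends on two keys at once (the name and a value derived from the date), which is the behaviour the issue asks for from a single store step.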



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
