[ 
https://issues.apache.org/jira/browse/PIG-3223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13599488#comment-13599488
 ] 

Johnny Zhang commented on PIG-3223:
-----------------------------------

[~mkramer], my apologize for late reply. Actually I found you properly can load 
comma separated avro file when using globe wrap it. like

in = LOAD 'test_dir1/{test_glob1.avro,test_glob2.avro,test_glob3.avro}' USING 
org.apache.pig.piggybank.storage.avro.AvroStorage ();

Does this satisfy your requirement? If this is the case, there is no code 
change required. 
There is no test cover it right now, I just added one more test case to 
TestAvroStorage to make sure it won't break in the future.
 
                
> AvroStorage does not handle comma separated input paths
> -------------------------------------------------------
>
>                 Key: PIG-3223
>                 URL: https://issues.apache.org/jira/browse/PIG-3223
>             Project: Pig
>          Issue Type: Bug
>          Components: piggybank
>    Affects Versions: 0.10.0, 0.11
>            Reporter: Michael Kramer
>            Assignee: Johnny Zhang
>         Attachments: AvroStorage.patch, AvroStorage.patch-2, 
> AvroStorageUtils.patch, AvroStorageUtils.patch-2, PIG-3223.patch.txt
>
>
> In pig 0.11, a patch was issued to AvroStorage to support globs and comma 
> separated input paths (PIG-2492).  While this function works fine for 
> glob-formatted input paths, it fails when issued a standard comma separated 
> list of paths.  fs.globStatus does not seem to be able to parse out such a 
> list, and a java.net.URISyntaxException is thrown when toURI is called on the 
> path.  
> I have a working fix for this, but it's extremely ugly (basically checking if 
> the string of input paths is globbed, otherwise splitting on ",").  I'm sure 
> there's a more elegant solution.  I'd be happy to post the relevant methods 
> and "fixes" if necessary.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to