[ https://issues.apache.org/jira/browse/PIG-3617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13846891#comment-13846891 ]
Nezih Yigitbasi commented on PIG-3617:
--------------------------------------

Guys,

I plan to implement a solution that checks for an HBase prefix ("hbase://") in the fileSpec variable (see FileLocalizer.delete). However, this method currently uses the DataStorage implementations (either local or distributed) kept in PigContext to access the backend storage, and as far as I can see there is no HBase-specific implementation of the DataStorage interface.

So what do you guys think is the right way to fix this issue? Implementing the DataStorage interface for HBase, or simply using HBaseAdmin.deleteTable to delete the temp table?

> problem with temp file deletion in MAPREDUCE operator
> -----------------------------------------------------
>
>                 Key: PIG-3617
>                 URL: https://issues.apache.org/jira/browse/PIG-3617
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.12.0
>            Reporter: Nezih Yigitbasi
>
> Hi all,
> When I run a native MR job with the MAPREDUCE keyword and store the
> intermediate data in HBase with:
>     stored = MAPREDUCE 'my.jar'
>         STORE x INTO 'hbase://temp_table'
>         USING org.apache.pig.backend.hadoop.hbase.HBaseStorage('hbase_schema')
>         .... and the rest ....;
> Pig tries to delete the temp files, which in this case have an HBase path, and
> fails with the exception:
> Caused by: java.lang.IllegalArgumentException: java.net.URISyntaxException:
> Relative path in absolute URI: file:hbase:/temp_table
>     at org.apache.hadoop.fs.Path.initialize(Path.java:148)
>     at org.apache.hadoop.fs.Path.<init>(Path.java:126)
>     at org.apache.pig.backend.hadoop.datastorage.HDataStorage.isContainer(HDataStorage.java:197)
>     at org.apache.pig.backend.hadoop.datastorage.HDataStorage.asElement(HDataStorage.java:128)
>     at org.apache.pig.impl.io.FileLocalizer.delete(FileLocalizer.java:415)
>     at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher.launchPig(MapReduceLauncher.java:419)
>     at org.apache.pig.PigServer.launchPlan(PigServer.java:1322)

--
This message was sent by Atlassian JIRA
(v6.1.4#6159)
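
For the prefix check proposed in the comment above, here is a minimal sketch of the second option (special-casing "hbase://" locations instead of adding a DataStorage implementation). This is not Pig code: the class HBaseLocationCheck, the TableDeleter callback, and the method names are hypothetical and exist only for illustration. The callback stands in for whatever actually drops the table, for example a wrapper around HBaseAdmin.deleteTable, so that the sketch compiles without HBase on the classpath.

```java
/**
 * Hypothetical helper, not part of Pig. It only illustrates how a delete
 * routine could branch on the "hbase://" prefix before touching DataStorage.
 */
final class HBaseLocationCheck {
    static final String HBASE_PREFIX = "hbase://";

    /**
     * Hypothetical callback that drops the table. A real implementation
     * might wrap HBaseAdmin (HBase expects a table to be disabled before
     * it is deleted) and translate its IOException as the caller sees fit.
     */
    interface TableDeleter {
        void deleteTable(String tableName);
    }

    /** True if the location string starts with the HBase scheme prefix. */
    static boolean isHBaseLocation(String fileSpec) {
        return fileSpec != null && fileSpec.startsWith(HBASE_PREFIX);
    }

    /** Strips the prefix, e.g. "hbase://temp_table" becomes "temp_table". */
    static String tableName(String fileSpec) {
        if (!isHBaseLocation(fileSpec)) {
            throw new IllegalArgumentException("Not an HBase location: " + fileSpec);
        }
        return fileSpec.substring(HBASE_PREFIX.length());
    }

    /**
     * Deletes the temp table if fileSpec is an HBase location.
     * Returns false for any other location so the caller can fall back
     * to the existing DataStorage-based deletion.
     */
    static boolean deleteIfHBase(String fileSpec, TableDeleter deleter) {
        if (!isHBaseLocation(fileSpec)) {
            return false;
        }
        deleter.deleteTable(tableName(fileSpec));
        return true;
    }
}
```

With this shape, a delete routine would call deleteIfHBase first and only build a filesystem path when it returns false, which avoids handing "hbase://temp_table" to code that expects a file path.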