HBaseStorage has problems with processing multiregion tables
------------------------------------------------------------

                 Key: PIG-1828
                 URL: https://issues.apache.org/jira/browse/PIG-1828
             Project: Pig
          Issue Type: Bug
    Affects Versions: 0.8.0
         Environment: Hadoop 0.20.2, HBase 0.20.6, Distributed mode
            Reporter: Lukas


As brought up on the Pig user mailing list 
(http://www.mail-archive.com/user%40pig.apache.org/msg00606.html), Pig 
sometimes does not scan the full HBase table.
HBaseStorage seems to have problems scanning large tables: it launches just 
one map task instead of one map task per table region.
Ian Stevens, who raised this issue on the mailing list, attached a script 
to reproduce the problem (https://gist.github.com/766929).
In my case, however, the problem only occurred after the table had been split 
into more than one region.
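
For a quick check independent of the linked gist, a row count through Pig 
can be compared with the output of `count 'mytable'` in the HBase shell. 
This is only a minimal sketch: the table name 'mytable' and the column 
'cf:col' are placeholders, not names from the affected cluster.

```pig
-- Hypothetical table and column names; substitute your own.
-- '-loadKey' asks HBaseStorage to return the row key as the first field.
rows = LOAD 'hbase://mytable'
       USING org.apache.pig.backend.hadoop.hbase.HBaseStorage('cf:col', '-loadKey')
       AS (rowkey:bytearray, col:bytearray);

-- Count every tuple the loader returned.
grouped = GROUP rows ALL;
total   = FOREACH grouped GENERATE COUNT(rows);
DUMP total;
```

If the count reported by Pig is lower than the count from the HBase shell, 
and the job ran with a single map task on a table that has several regions, 
that would be consistent with the behaviour described above.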



-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
