Another possible reason is that we found sometimes hadoop framework does not return the correct count to the clients. In all these cases, the count is smaller than the number of rows actually loaded.
which version of hadoop are you using? Zheng On Mon, Feb 8, 2010 at 11:27 PM, Jeff Hammerbacher <[email protected]> wrote: > Hey wd, > > Actually, what version are you running? Your bug sounds an awful lot like > http://issues.apache.org/jira/browse/HIVE-327, which was fixed many moons > ago. > > Thanks, > Jeff > > On Mon, Feb 8, 2010 at 11:25 PM, Carl Steinbach <[email protected]> wrote: >> >> Hi wd, >> >> Please file a JIRA ticket for this issue. >> >> Thanks. >> >> Carl >> >> On Mon, Feb 8, 2010 at 7:05 PM, wd <[email protected]> wrote: >>> >>> hi, >>> >>> I've use hive map reduce to process some log files. I found out that hive >>> will output like "num1 rows loaded to table_name" message every run. But the >>> "num1" not equal to "select count(1) from table_name" execute result. >>> >>> I think this should be a bug. If we can not count the right num, why we >>> output that message? >> > > -- Yours, Zheng
