> I am sure the issue has something to do with an empty string passed to the
> substr function.


We can rule out the substr() function. I get the same stack trace with any
query like:

hive> select <anyColumn> from ushb where dtpartition='2010-10-25' limit 10;

But this query succeeds:

hive> select * from ushb where dtpartition='2010-10-25' limit 10 ;

So SOMETHING about the data makes Hive (Hadoop?) unhappy. More specifically
something about trying to select a particular column from the data on some
days. I'm looking at the data to see if I can sort what it is.

I have way more than 256 partitions per table. AFAIK, there is no partition
> limit.
>
> From your stack trace, you have some host name issue somewhere.
>

I see why you'd think that from the stack trace, though I can't imagine why
it'd have a "host name issue somewhere." The partition create statements
have no hostname component. The query has no hostname component.

This is definitely a curious problem.

-- 
Tim Ellis
Riot Games

Reply via email to