Hi Bejoy,

I modified the load command as below

LOAD DATA LOCAL INPATH '/home/cloudera/CrimeHive.csv' INTO TABLE 
crime_managed_native PARTITION (State='Alabama',State='California');

Now data is loaded however when I issue command as

Select *from crime_managed_native where State='Alabama'

No records are returned whereas (I have Alabama record in source file)

Select *from crime_managed_native where State='California'

It returns only California records.

Does that mean only California records got inserted in table. But I see that 
entire file is stored under 
/user/hive/warehouse/learn.db/crime_managed_native/State=California

and there is no directory for State=Alabama.

Also, what happens with rest of the records which do not have state as Alabama 
/ California.

Do we have any documents which talks about partitioning in detail.

Thanks,
Kuldeep



From: Bejoy KS [mailto:bejoy...@yahoo.com]
Sent: 30 July 2012 17:45
To: user@hive.apache.org
Subject: Re: Data Loaded but Select returns nothing!

Kuldeep

Couple of things I hoticed here are

Your table is bucketed, when you load data into a bucketed table you need to 
enable

hive.enforce.bucketing=true;

Bucketing needs a MR job so you need to load the non bucketed data into a 
normal table and from that load to a bucketed table using 'Insert Overwrite'.

Then another quick nit
Your table is partitioned so you need to load your data into some partition but 
you have not spefied a partition in Load.
Regards
Bejoy KS

Sent from handheld, please excuse typos.
________________________________
From: Kuldeep Chitrakar 
<kuldeep.chitra...@synechron.com<mailto:kuldeep.chitra...@synechron.com>>
Date: Mon, 30 Jul 2012 06:58:33 -0500
To: 
user@hive.apache.org<user@hive.apache.org<mailto:user@hive.apache.org%3cu...@hive.apache.org>>
ReplyTo: user@hive.apache.org<mailto:user@hive.apache.org>
Subject: Data Loaded but Select returns nothing!

Hi

I am trying to load a CSV file into HIve table.

Everything works fine but when a fire "select * from tablename" command. It 
does not retun anything.

--Create Table

CREATE TABLE IF NOT EXISTS learn.crime_managed_native (
NoState String,
TypeofCrime String,
Crime String,
Year int,
Count int)
PARTITIONED BY (State String)
CLUSTERED BY (Crime) SORTED BY (Year ASC) INTO 8 BUCKETS
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
STORED AS TEXTFILE;

--Load Data

LOAD DATA LOCAL INPATH '/home/cloudera/CrimeHive.csv' INTO TABLE 
crime_managed_native;

What could be the possible issue.

Thanks,
Kuldeep



Reply via email to