Hi All,
We have developed a custom storage handler implementing *HiveStorageHandler*.
We also have APIs/statistics for total count, max, min, etc. for the data
stored in our database.
See the example queries below:
1. select count(*) from my_table;
2. select max(id_column) from my_table;
So for above q
Hi all,
Noticed something extremely strange and concerning regarding subquery column
selection. Here's the query, and the sample results in Presto vs. Hive:
SELECT ipm.ip, ipm.master_id AS mid, COALESCE(ad.cookie, '') AS cid,
COALESCE(ad.url_cid, '') AS campaign_id, COALESCE(ad.url_si
> java.util.concurrent.ExecutionException: java.io.FileNotFoundException:
> /tmp/staging-slider-HHIwk3/lib/tez.tar.gz (Is a directory)
LLAP expects to find a tarball at the location tez.lib.uris points to - looks
like you've got a directory there?
Cheers,
Gopal
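
A quick way to confirm this diagnosis on a local staging path is to check whether the path is a directory or a real tar archive. The snippet below is only a minimal sketch: the helper name describe_tez_archive is hypothetical, it assumes the path is on the local filesystem (not HDFS), and it uses nothing from Hive, Tez or Slider.

```python
import os
import tarfile


def describe_tez_archive(path):
    """Classify a local path as 'directory', 'missing', 'tarball' or 'not-a-tarball'.

    Hypothetical helper for illustration; it only inspects the local filesystem.
    """
    # A directory named tez.tar.gz is the situation the error message describes.
    if os.path.isdir(path):
        return "directory"
    if not os.path.exists(path):
        return "missing"
    # tarfile.is_tarfile returns True for plain or compressed tar archives.
    if tarfile.is_tarfile(path):
        return "tarball"
    return "not-a-tarball"
```

Calling describe_tez_archive on the tez.tar.gz path from the stack trace would be expected to return "directory" if the diagnosis above is right; a correctly staged archive would return "tarball".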
Why do you want to do single inserts?
Hive is designed more for bulk loads.
In any case, newer versions of Hive 2 using Tez + LLAP improve it significantly
(also for bulk analysis). Nevertheless, it is good practice not to use single
inserts in an analysis system, but to try to combine and bulk-l
Hi,
I am new to Hive. I just created a simple table in Hive and inserted two
records. The first insertion took 16.4 sec, while the second took 14.3 sec.
Why is that so slow? Is this the normal performance you get in Hive using
INSERT? Is there a way to improve the performance of a single "inser
Hi Rajesh,
I thought the recommended way to start/stop LLAP was to use Slider.
This is what we do to start:
slider create llap --resources $BASEDIR/resources.json --template
$BASEDIR/appConfig.json
and to stop, before you can start again:
slider stop llap
slider destroy llap --force || slider destr
Hi All,
I am using Hadoop 2.7.3, Hive 2.2.0 and Tez 0.9.0 on Linux, and I am able to
execute Hive queries on YARN with Tez as the execution engine.
When I start the Hive LLAP service - ./hive --service llap --name @llap --instances
1 --cache 1024m --xmx 2048m --size 3225m --loglevel DEBUG --args " -XX:+UseG1GC
-XX:+Resize