Zero Bytes Files importance

2016-12-27 Thread Mahender Sarangam
   Hi

When we dump data, or run a Hive query that inserts data into another Hive 
table, a folder containing the actual data is created. Apart from that, we see 
a 0-byte file at the same level. I would like to understand the importance of 
these zero-byte files. What happens if we delete them? Please find below a 
screenshot with entries like 1, 2, 3 and the zero-byte files.

[inline image: screenshot]

/Mahender


Re: Anyway to avoid creating subdirectories by "Insert with union"

2016-12-27 Thread Mahender Sarangam
Hi Gopal,

Another question I have: whenever we run a UNION ALL statement, apart from 
the folders we also see zero-byte files in HDFS. Are these lock files 
(LCK)?

Mahender

On 2/24/2016 4:26 PM, Gopal Vijayaraghavan wrote:
>> SET mapred.input.dir.recursive=TRUE;
> ...
>> Can we set above setting as tblProperties or Hive Table properties.
> Not directly, those are MapReduce properties - they are not settable via
> Hive tables.
>
> That said, you can write your own SemanticAnalyzerHooks to do pretty much
> anything you want like that.
>
> You can use hooks to modify the job, after tables have been resolved.
>
>
> Ideally such a hook should not modify the plan (much), because it's too
> late to do it right.
>
> But I sometimes prototype Hive optimizer features as Hooks, like this one.
>
> https://github.com/t3rmin4t0r/captain-hook
>
>
> Cheers,
> Gopal
>
>
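
As the quoted reply notes, these are job properties rather than table
properties, so they are typically set per session before the query runs. A
minimal sketch, assuming a hypothetical table `union_target` whose location
contains subdirectories; `hive.mapred.supports.subdirectories` is not
mentioned in the thread and is included here as an assumption about what is
commonly set alongside it:

```sql
-- Session-level settings; they apply only to the current Hive session.
SET mapred.input.dir.recursive=true;
SET hive.mapred.supports.subdirectories=true;

-- Hypothetical table whose location contains subdirectories such as 1, 2, 3.
SELECT COUNT(*) FROM union_target;
```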



insert on condition

2016-12-27 Thread anup ahire
Hello All,

Is it possible to achieve "insert overwrite into a partition only if it
does not exist"?
The partition column is a date stored as a string.

Thanks,
Anup
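
For reference, Hive's INSERT OVERWRITE accepts an IF NOT EXISTS clause on a
static partition spec, which may cover this case. A minimal sketch, assuming
hypothetical tables `events` (partitioned by a string column `dt`) and
`staging_events`, with made-up column names:

```sql
-- Hypothetical tables and columns, for illustration only.
-- With IF NOT EXISTS, the insert is skipped when partition dt='2016-12-27'
-- already exists; otherwise the partition is created and populated.
INSERT OVERWRITE TABLE events PARTITION (dt = '2016-12-27') IF NOT EXISTS
SELECT user_id, action
FROM staging_events
WHERE event_date = '2016-12-27';
```

The clause applies to a fully specified (static) partition value, so each
date would need its own statement; it is not expected to work with dynamic
partition inserts.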