Ok. I found the solution myself:

The problem is the asterisk in the from clause. If I take it out:

select * from dfs.datatransfer.`ct_fremde/2015` where dir0=8

... everything works fine and I do get results back for 08 as well.


Anyway, this seems to be a bug, as there is an inconsistent behavior for 
different folders for the same query. Should I file a jira?

Greetings,

Uwe

-----Original Message-----
From: Geercken, Uwe [mailto:[email protected]] 
Sent: Mittwoch, 23. September 2015 09:21
To: [email protected]
Subject: dir0 shows no result

I have used the queries below to create parquet files from 2 CSV files:

create table dfs.datatransfer.`ct_fremde/2015/07` as select
to_timestamp(columns[0],'dd.MM.yyyy') as Datum, columns[1] as Airline_In, 
columns[2] as Trip_In, columns[3] as Ac_Typ, columns[4] as Ordertype,
to_time(columns[5],'HH:mm') as Start_Time, columns[6] as End_Time, columns[7] 
as Reg_In from dfs.datatransfer.`CT_Fremde_Juli_2015.tsv`


create table dfs.datatransfer.`ct_fremde/2015/08` as select
to_timestamp(columns[0],'dd.MM.yyyy') as Datum, columns[1] as Airline_In, 
columns[2] as Trip_In, columns[3] as Ac_Typ, columns[4] as Ordertype,
to_time(columns[5],'HH:mm') as Start_Time, columns[6] as End_Time, columns[7] 
as Reg_In from dfs.datatransfer.`CT_Fremde_August_2015.tsv`


when I query the data using following sql:

select distinct dir0 from dfs.datatransfer.`ct_fremde/2015/*`

... I get 07 and 08 as the result.


When I run a group by query:

select dir0,count(3) from dfs.datatransfer.`ct_fremde/2015/*` group by dir0

... I get 2115 for 07 and 2128 for 08 back.


Now when I run following query:

select * from dfs.datatransfer.`ct_fremde/2015/*` where dir0=7

... I get records back from the query


And when I run this query:

select * from dfs.datatransfer.`ct_fremde/2015/*` where dir0=8

... I do NOT get a result back


Am I doing something wrong here? Or what is going on here?

Greetings,

Uwe


Reply via email to