Alexandra Dunai created IMPALA-9822:
---------------------------------------
Summary: Impala does not notify user that row format delimited
fields is only logical when using STORED AS TEXTFILE
Key: IMPALA-9822
URL: https://issues.apache.org/jira/browse/IMPALA-9822
Project: IMPALA
Issue Type: Bug
Affects Versions: Impala 3.4.0
Reporter: Alexandra Dunai
When creating a table with added "ROW FORMAT DELIMITED FIELDS", Impala does not
alert the user that this is only logical when using STORED AS TEXTFILE.
You only discover that you made a mistake after trying to run a select from the
table.
Table creation:
{code:java}
[adunai-1.adunai.root.hwx.site:21000] default> CREATE EXTERNAL TABLE
sales_fact_1997(product_id INT,time_id INT,customer_id INT,promotion_id
INT,store_id INT,store_sales DECIMAL(10,4),store_cost DECIMAL(10,4),unit_sales
DECIMAL(10,4)) > row format delimited fields terminated by '\011' STORED AS
PARQUET > location '/user/impala/mondrian/sales_fact_1997';
Query: CREATE EXTERNAL TABLE sales_fact_1997(product_id INT,time_id
INT,customer_id INT,promotion_id INT,store_id INT,store_sales
DECIMAL(10,4),store_cost DECIMAL(10,4),unit_sales DECIMAL(10,4))row format
delimited fields terminated by '\011' STORED AS PARQUETlocation
'/user/impala/mondrian/sales_fact_1997'
+-------------------------+| summary |+-------------------------+| Table has
been created. |+-------------------------+
Fetched 1 row(s) in 0.10s
{code}
Select:
{code:java}
[adunai-1.adunai.root.hwx.site:21000] mondrian> select count(*) from
agg_c_10_sales_fact_1997;Query: select count(*) from
agg_c_10_sales_fact_1997Query submitted at: 2020-06-03 11:55:06 (Coordinator:
http://adunai-1.adunai.root.hwx.site:25000)Query progress can be monitored at:
http://adunai-1.adunai.root.hwx.site:25000/query_plan?query_id=d547fafd0162da4e:872a95c100000000ERROR:
File
'hdfs://adunai-2.adunai.root.hwx.site:8020/user/impala/mondrian/agg_c_10_sales_fact_1997/agg_c_10_sales_fact_1997.tsv'
has an invalid Parquet version number: 717. Please check that it is a valid
Parquet file. This error can also occur due to stale metadata. If you believe
this is a valid Parquet file, try running "refresh
mondrian.agg_c_10_sales_fact_1997".
{code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)