Juan Yu created IMPALA-5845:
-------------------------------
Summary: Impala should de-duplicate row parsing error
Key: IMPALA-5845
URL: https://issues.apache.org/jira/browse/IMPALA-5845
Project: IMPALA
Issue Type: Bug
Reporter: Juan Yu
The Impala log file grew very quickly, with many errors like:
I0824 10:44:46.527885 8679 runtime-state.cc:217] Error from query 804d64b80be65fda:a5349b0700000000: Error parsing row: file: hdfs://nameservice1/user/omc/data/databases/production/tables/misc/20170803014202.csv, before offset: 120795952
There are 622000 errors for only 141 unique files.
Impala already de-duplicates similar errors in many scenarios. Could row
parsing errors be de-duplicated as well, to reduce log size and make
troubleshooting easier?
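
To illustrate the kind of de-duplication being requested, here is a minimal Python sketch that collapses row parsing errors into one summary entry per file. It is not Impala's implementation (Impala's error handling is C++ code in the backend); the function name dedupe_row_errors, the summary fields, and the regular expression are hypothetical, and the message format is assumed from the single log line quoted above.

```python
import re

# Message format assumed from the one sample in this report; the real format
# may vary between Impala versions.
ROW_ERROR = re.compile(
    r"Error parsing row: file: (?P<file>.+?), before offset: (?P<offset>\d+)"
)


def dedupe_row_errors(messages):
    """Collapse row parsing errors into one summary entry per file.

    Returns a dict mapping file path -> {"count", "first_offset"}.
    Messages that do not match the assumed format are ignored.
    """
    summary = {}
    for msg in messages:
        match = ROW_ERROR.search(msg)
        if match is None:
            continue
        path = match.group("file")
        offset = int(match.group("offset"))
        # Record the first offset seen for the file, then only count repeats.
        entry = summary.setdefault(path, {"count": 0, "first_offset": offset})
        entry["count"] += 1
    return summary


if __name__ == "__main__":
    sample = [
        "Error parsing row: file: hdfs://ns/a.csv, before offset: 10",
        "Error parsing row: file: hdfs://ns/a.csv, before offset: 20",
        "Error parsing row: file: hdfs://ns/b.csv, before offset: 5",
    ]
    for path, info in dedupe_row_errors(sample).items():
        print(f"{path}: {info['count']} row parsing error(s), "
              f"first before offset {info['first_offset']}")
```

With this kind of aggregation, the 622000 messages in the log above would reduce to 141 entries, one per file, each carrying a count.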
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)