[
https://issues.apache.org/jira/browse/IMPALA-5845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Quanlong Huang updated IMPALA-5845:
-----------------------------------
Fix Version/s: Impala 4.1.1
> Impala should de-duplicate row parsing error
> --------------------------------------------
>
> Key: IMPALA-5845
> URL: https://issues.apache.org/jira/browse/IMPALA-5845
> Project: IMPALA
> Issue Type: Bug
> Components: Backend
> Affects Versions: Impala 3.1.0
> Reporter: Juan Yu
> Assignee: Riza Suminto
> Priority: Major
> Labels: ramp-up, supportability
> Fix For: Impala 4.2.0, Impala 4.1.1
>
>
> Impala log file grew very quickly with lots of error like
> I0824 10:44:46.527885 8679 runtime-state.cc:217] Error from query
> 804d64b80df65fda:a5349b0700000000: Error parsing row: file:
> hdfs://nameservice1/user/hive/tpcds.db/store_sales/00005.parq, before offset:
> 120795952
> There are 622000 errors for only 141 unique files
> Impala already de-duplicate similar error in lots of scenarios, could the row
> parsing error be de-duplicated as well to reduce log size and easier
> troubleshooting?
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]