[ 
https://issues.apache.org/jira/browse/IMPALA-5845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Quanlong Huang updated IMPALA-5845:
-----------------------------------
    Fix Version/s: Impala 4.1.1

> Impala should de-duplicate row parsing error
> --------------------------------------------
>
>                 Key: IMPALA-5845
>                 URL: https://issues.apache.org/jira/browse/IMPALA-5845
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Backend
>    Affects Versions: Impala 3.1.0
>            Reporter: Juan Yu
>            Assignee: Riza Suminto
>            Priority: Major
>              Labels: ramp-up, supportability
>             Fix For: Impala 4.2.0, Impala 4.1.1
>
>
> Impala log file grew very quickly with lots of error like
>  I0824 10:44:46.527885  8679 runtime-state.cc:217] Error from query 
> 804d64b80df65fda:a5349b0700000000: Error parsing row: file: 
> hdfs://nameservice1/user/hive/tpcds.db/store_sales/00005.parq, before offset: 
> 120795952
> There are 622000 errors for only 141 unique files
> Impala already de-duplicate similar error in lots of scenarios, could the row 
> parsing error be de-duplicated as well to reduce log size and easier 
> troubleshooting?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to