[ 
https://issues.apache.org/jira/browse/IMPALA-12630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17799524#comment-17799524
 ] 

ASF subversion and git services commented on IMPALA-12630:
----------------------------------------------------------

Commit 1b93c370b250211195a2b24ecf8c4e8c100dd679 in impala's branch 
refs/heads/master from Riza Suminto
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=1b93c370b ]

IMPALA-12630: Deflake TestOrcStats.test_orc_stats

After revert of IMPALA-9923 (b03e8ef95c856f499d17ea7815831e30e2e9f467),
TestOrcStats.test_orc_stats becomes flaky in the following test case:

select count(*) from tpch_orc_def.lineitem where l_orderkey = 1609411;

This is because tpch_orc_def.lineitem sometimes loaded as 1 ORC file in
one run, and loaded in as 4 ORC files in the other run (see JIRA
comments for more analysis).

This patch attempts to deflake the test by changing the test case to
count over orders table instead. orders table is not as large as
lineitem table, but more consistently written as 1 ORC file with more
than 1 stripe.

Testing:
- Pass TestOrcStats.test_orc_stats after reloading tpch workload.

Change-Id: Ibb7bb25b44878df3e037e750a985656ed6857a7d
Reviewed-on: http://gerrit.cloudera.org:8080/20794
Reviewed-by: Impala Public Jenkins <[email protected]>
Tested-by: Impala Public Jenkins <[email protected]>


> TestOrcStats.test_orc_stats fails in count-start on lineitem with filter
> ------------------------------------------------------------------------
>
>                 Key: IMPALA-12630
>                 URL: https://issues.apache.org/jira/browse/IMPALA-12630
>             Project: IMPALA
>          Issue Type: Bug
>            Reporter: Quanlong Huang
>            Assignee: Riza Suminto
>            Priority: Critical
>             Fix For: Impala 4.4.0
>
>         Attachments: load-tpch-core-hive-generated-orc-def-block.sql, 
> meta-lineitem.txt, meta-orders.txt, profile_1134.txt, profile_949.txt, 
> ubuntu-20.04-1134-load-tpch-core-hive-generated-orc-def-block.sql.log, 
> ubuntu-20.04-1158-load-tpch-core-hive-generated-orc-def-block.sql.log
>
>
> Saw the test failed several times recently:
> https://jenkins.impala.io/job/ubuntu-20.04-dockerised-tests/949
> https://jenkins.impala.io/job/ubuntu-20.04-from-scratch/1134
> {noformat}
> query_test/test_orc_stats.py:41: in test_orc_stats
>     self.run_test_case('QueryTest/orc-stats', vector, use_db=unique_database)
> common/impala_test_suite.py:776: in run_test_case
>     update_section=pytest.config.option.update_results)
> common/test_result_verifier.py:683: in verify_runtime_profile
>     % (function, field, expected_value, actual_value, op, actual))
> E   AssertionError: Aggregation of SUM over RowsRead did not match expected 
> results.
> E   EXPECTED VALUE:
> E   13501
> E   
> E   
> E   ACTUAL VALUE:
> E   20000
> E   
> E   OP:
> E   : {noformat}
> The query is
> {code:sql}
> select count(*) from tpch_orc_def.lineitem where l_orderkey = 1609411
> {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to