[ 
https://issues.apache.org/jira/browse/IMPALA-7371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16664092#comment-16664092
 ] 

bharath v commented on IMPALA-7371:
-----------------------------------

I've dug into the logs a bit and don't see anything obviously wrong. One 
interesting point though is that the job seems to be spending quite a bit of 
time around this test. (timestamps below).  Don't see that on core tests, so 
probably related to S3. 

One a failed run:

{noformat}
01:39:32.276 
query_test/test_hdfs_file_mods.py::TestHdfsFileMods::test_file_modifications[modification_type:
 move_file | exec_option: {'batch_size': 0, 'num_nodes': 0, 
'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 
'abort_on_error': 1, 'exec_single_node_rows_threshold': 0} | table_format: 
text/none] PASSED
01:39:58.413 
query_test/test_hdfs_file_mods.py::TestHdfsFileMods::test_file_modifications[modification_type:
 append | exec_option: {'batch_size': 0, 'num_nodes': 0, 
'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 
'abort_on_error': 1, 'exec_single_node_rows_threshold': 0} | table_format: 
text/none] PASSED
01:44:33.785 
query_test/test_insert.py::TestInsertQueries::test_insert[exec_option: 
{'sync_ddl': 0, 'batch_size': 0, 'num_nodes': 0, 
'disable_codegen_rows_threshold': 0, 'disable_codegen': True, 'abort_on_error': 
1, 'exec_single_node_rows_threshold': 0} | table_format: text/none] FAILED

===== ~3hrs ======

04:32:22.108 
query_test/test_insert.py::TestInsertQueries::test_insert[exec_option: 
{'sync_ddl': 0, 'batch_size': 0, 'num_nodes': 0, 
'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 
'abort_on_error': 1, 'exec_single_node_rows_threshold': 0} | table_format: 
text/none] PASSED
04:33:21.934 
query_test/test_insert.py::TestInsertQueries::test_insert_overwrite[exec_option:
 {'sync_ddl': 0, 'batch_size': 0, 'num_nodes': 0, 
'disable_codegen_rows_threshold': 0, 'disable_codegen': True, 'abort_on_error': 
1, 'exec_single_node_rows_threshold': 0} | table_format: text/none] PASSED
04:34:26.631 
query_test/test_insert.py::TestInsertQueries::test_insert_overwrite[exec_option:
 {'sync_ddl': 0, 'batch_size': 0, 'num_nodes': 0, 
'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 
'abort_on_error': 1, 'exec_single_node_rows_threshold': 0} | table_format: 
text/none] PASSED
{noformat}

On a successful run

{noformat}
'exec_single_node_rows_threshold': 0} | table_format: text/none] PASSED
00:56:47.304 
query_test/test_hdfs_file_mods.py::TestHdfsFileMods::test_file_modifications[modification_type:
 move_file | exec_option: {'batch_size': 0, 'num_nodes': 0, 
'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 
'abort_on_error': 1, 'exec_single_node_rows_threshold': 0} | table_format: 
text/none] PASSED
00:57:07.228 
query_test/test_hdfs_file_mods.py::TestHdfsFileMods::test_file_modifications[modification_type:
 append | exec_option: {'batch_size': 0, 'num_nodes': 0, 
'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 
'abort_on_error': 1, 'exec_single_node_rows_threshold': 0} | table_format: 
text/none] PASSED
01:49:47.897 
query_test/test_insert.py::TestInsertQueries::test_insert[exec_option: 
{'sync_ddl': 0, 'batch_size': 0, 'num_nodes': 0, 
'disable_codegen_rows_threshold': 0, 'disable_codegen': True, 'abort_on_error': 
1, 'exec_single_node_rows_threshold': 0} | table_format: text/none] PASSED

======= ~1.5 hrs =====

03:13:49.498 
query_test/test_insert.py::TestInsertQueries::test_insert[exec_option: 
{'sync_ddl': 0, 'batch_size': 0, 'num_nodes': 0, 
'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 
'abort_on_error': 1, 'exec_single_node_rows_threshold': 0} | table_format: 
text/none] PASSED
03:14:16.074 
query_test/test_insert.py::TestInsertQueries::test_insert_overwrite[exec_option:
 {'sync_ddl': 0, 'batch_size': 0, 'num_nodes': 0, 
'disable_codegen_rows_threshold': 0, 'disable_codegen': True, 'abort_on_error': 
1, 'exec_single_node_rows_threshold': 0} | table_format: text/none] PASSED
03:14:44.101 
query_test/test_insert.py::TestInsertQueries::test_insert_overwrite[exec_option:
 {'sync_ddl': 0, 'batch_size': 0, 'num_nodes': 0, 
'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 
'abort_on_error': 1, 'exec_single_node_rows_threshold': 0} | table_format: 
text/none] PASSED
{noformat}

On the failed run, I could see a similar gap in the impalad and catalogd logs. 
Not super clear where this time is spent. Needs more profiling of the jenkins 
runs when this happens.

{noformat}
    CodeGen: Total=0 Peak=209.00 KB
I0728 00:59:26.070897  8356 query-state.cc:424] Cancel: 
query_id=fa4b08ba88a16c6b:6be1b0a200000000
I0728 00:59:26.070952  8356 query-exec-mgr.cc:155] ReleaseQueryState(): 
query_id=fa4b08ba88a16c6b:6be1b0a200000000 refcnt=1
<----- gap ------>
I0728 03:26:50.400184  6411 impala-beeswax-server.cc:52] query(): 
query=invalidate metadata functional. alltypesinsert
I0728 03:26:50.400235  6411 impala-beeswax-server.cc:427] query: Query {
  01: query (string) = "invalidate metadata functional. alltypesinsert",
  03: configuration (list) = list<string>[7] {
    [0] = "DISABLE_CODEGEN_ROWS_THRESHOLD=0",
    [1] = "BATCH_SIZE=0",
    [2] = "NUM_NODES=0",
    [3] = "SYNC_DDL=0",
    [4] = "DISABLE_CODEGEN=False",
    [5] = "ABORT_ON_ERROR=1",
    [6] = "EXEC_SINGLE_NODE_ROWS_THRESHOLD=0",
  },
  04: hadoop_user (string) = "jenkins",
}

{noformat}


{noformat}
I0728 00:56:12.747411   900 HdfsTable.java:942] Loaded file and block metadata 
for functional.alltypesinsert
I0728 00:56:12.747599   900 HdfsTable.java:1273] Incrementally loaded table 
metadata for: functional.alltypesinsert
I0728 00:56:12.750581   473 catalog-server.cc:490] Collected update: 
TABLE:functional.alltypesinsert, version=3444, original size=16379
I0728 00:56:12.750932   473 catalog-server.cc:490] Collected update: 
CATALOG_SERVICE_ID, version=3444, original size=49
I0728 00:56:13.648466   479 catalog-server.cc:245] A catalog update with 2 
entries is assembled. Catalog version: 3444 Last sent catalog version: 3443
<------ gap ----->
I0728 03:26:50.402408   899 CatalogServiceCatalog.java:1555] Invalidating table 
metadata: functional.alltypesinsert
I0728 03:26:50.542961   473 catalog-server.cc:490] Collected update: 
TABLE:functional.alltypesinsert, version=3445, original size=61
I0728 03:26:50.543259   473 catalog-server.cc:490] Collected update: 
CATALOG_SERVICE_ID, version=3445, original size=49
I0728 03:26:52.543041   479 catalog-server.cc:245] A catalog update with 2 
entries is assembled. Catalog version: 3445 Last sent catalog versio
{noformat}

> TestInsertQueries.test_insert fails on S3 with 0 rows returned
> --------------------------------------------------------------
>
>                 Key: IMPALA-7371
>                 URL: https://issues.apache.org/jira/browse/IMPALA-7371
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Catalog
>    Affects Versions: Impala 3.0
>            Reporter: David Knupp
>            Assignee: bharath v
>            Priority: Critical
>         Attachments: catalogd_excerpt.INFO, impalad_excerpt.INFO, 
> profile.txt, profile_excerpt.log
>
>
> Stacktrace
> {noformat}
> query_test/test_insert.py:118: in test_insert
>     multiple_impalad=vector.get_value('exec_option')['sync_ddl'] == 1)
> /data/jenkins/workspace/impala-cdh6.0.x-core-s3/repos/Impala/tests/common/impala_test_suite.py:426:
>  in run_test_case
>     self.__verify_results_and_errors(vector, test_section, result, use_db)
> /data/jenkins/workspace/impala-cdh6.0.x-core-s3/repos/Impala/tests/common/impala_test_suite.py:299:
>  in __verify_results_and_errors
>     replace_filenames_with_placeholder)
> /data/jenkins/workspace/impala-cdh6.0.x-core-s3/repos/Impala/tests/common/test_result_verifier.py:434:
>  in verify_raw_results
>     VERIFIER_MAP[verifier](expected, actual)
> /data/jenkins/workspace/impala-cdh6.0.x-core-s3/repos/Impala/tests/common/test_result_verifier.py:261:
>  in verify_query_result_is_equal
>     assert expected_results == actual_results
> E   assert Comparing QueryTestResults (expected vs actual):
> E     75,false,0,0,0,0,0,0,'04/01/09','0' != None
> E     76,true,1,1,1,10,1.100000023841858,10.1,'04/01/09','1' != None
> E     77,false,2,2,2,20,2.200000047683716,20.2,'04/01/09','2' != None
> E     78,true,3,3,3,30,3.299999952316284,30.3,'04/01/09','3' != None
> E     79,false,4,4,4,40,4.400000095367432,40.4,'04/01/09','4' != None
> E     80,true,5,5,5,50,5.5,50.5,'04/01/09','5' != None
> E     81,false,6,6,6,60,6.599999904632568,60.6,'04/01/09','6' != None
> E     82,true,7,7,7,70,7.699999809265137,70.7,'04/01/09','7' != None
> E     83,false,8,8,8,80,8.800000190734863,80.8,'04/01/09','8' != None
> E     84,true,9,9,9,90,9.899999618530273,90.90000000000001,'04/01/09','9' != 
> None
> E     85,false,0,0,0,0,0,0,'04/02/09','0' != None
> E     86,true,1,1,1,10,1.100000023841858,10.1,'04/02/09','1' != None
> E     87,false,2,2,2,20,2.200000047683716,20.2,'04/02/09','2' != None
> E     88,true,3,3,3,30,3.299999952316284,30.3,'04/02/09','3' != None
> E     89,false,4,4,4,40,4.400000095367432,40.4,'04/02/09','4' != None
> E     90,true,5,5,5,50,5.5,50.5,'04/02/09','5' != None
> E     91,false,6,6,6,60,6.599999904632568,60.6,'04/02/09','6' != None
> E     92,true,7,7,7,70,7.699999809265137,70.7,'04/02/09','7' != None
> E     93,false,8,8,8,80,8.800000190734863,80.8,'04/02/09','8' != None
> E     94,true,9,9,9,90,9.899999618530273,90.90000000000001,'04/02/09','9' != 
> None
> E     95,false,0,0,0,0,0,0,'04/03/09','0' != None
> E     96,true,1,1,1,10,1.100000023841858,10.1,'04/03/09','1' != None
> E     97,false,2,2,2,20,2.200000047683716,20.2,'04/03/09','2' != None
> E     98,true,3,3,3,30,3.299999952316284,30.3,'04/03/09','3' != None
> E     99,false,4,4,4,40,4.400000095367432,40.4,'04/03/09','4' != None
> E     Number of rows returned (expected vs actual): 25 != 0
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to