Quanlong Huang created IMPALA-11593:
---------------------------------------

             Summary: TestMemReservations.test_per_backend_min_reservation 
failed by NullPointerException from libhdfs in S3 builds
                 Key: IMPALA-11593
                 URL: https://issues.apache.org/jira/browse/IMPALA-11593
             Project: IMPALA
          Issue Type: Bug
            Reporter: Quanlong Huang
            Assignee: Quanlong Huang


Saw this failure on an S3 build:
{noformat}
custom_cluster/test_mem_reservations.py:102: in test_per_backend_min_reservation
    assert t.error is None
E   assert 'ImpalaBeeswaxException:\n Query aborted:Disk I/O error on 
impala-ec2-centos79-m6i-4xlarge-ondemand-1db1.vpc.cloudera....warehouse/alltypes/year=2009/month=9/090901.txt\nError(255):
 Unknown error 255\nRoot cause: NullPointerException: \n\n' is None
E    +  where 'ImpalaBeeswaxException:\n Query aborted:Disk I/O error on 
impala-ec2-centos79-m6i-4xlarge-ondemand-1db1.vpc.cloudera....warehouse/alltypes/year=2009/month=9/090901.txt\nError(255):
 Unknown error 255\nRoot cause: NullPointerException: \n\n' = 
<QuerySubmitThread(Thread-165, stopped 140272709113600)>.error
{noformat}
Impalad logs for the query:
{noformat}
I0915 03:12:33.839942 21677 impala-server.cc:1333] 
09439d05a2468038:3816f0f200000000] Registered query 
query_id=09439d05a2468038:3816f0f200000000 
session_id=874c5100c59607af:a86e04c8f62bb9a9
I0915 03:12:33.889168 21677 Frontend.java:1628] 
09439d05a2468038:3816f0f200000000] Analyzing query: select max(t.c1), 
avg(t.c2), min(t.c3), avg(c4), avg(c5), avg(c6)
    from (select
        max(tinyint_col) over (order by int_col) c1,
        avg(tinyint_col) over (order by smallint_col) c2,
        min(tinyint_col) over (order by smallint_col desc) c3,
        rank() over (order by int_col desc) c4,
        dense_rank() over (order by bigint_col) c5,
        first_value(tinyint_col) over (order by bigint_col desc) c6
        from functional.alltypes) t; db: default
I0915 03:12:33.981251 21677 FeSupport.java:315] 
09439d05a2468038:3816f0f200000000] Requesting prioritized load of table(s): 
functional.alltypes
I0915 03:12:33.986737 21677 thrift-util.cc:99] 
09439d05a2468038:3816f0f200000000] TSocket::open() connect() <Host: localhost 
Port: 26000>: Connection refused
I0915 03:12:34.582643 21677 BaseAuthorizationChecker.java:113] 
09439d05a2468038:3816f0f200000000] Authorization check took 693 ms
I0915 03:12:34.582674 21677 Frontend.java:1671] 
09439d05a2468038:3816f0f200000000] Analysis and authorization finished.
I0915 03:12:34.723712 21208 control-service.cc:148] 
4a4ebd3b7575254c:eb71cd8000000000] ExecQueryFInstances(): 
query_id=4a4ebd3b7575254c:eb71cd8000000000 
coord=impala-ec2-centos79-m6i-4xlarge-ondemand-1db1.vpc.cloudera.com:27000 
#instances=1
I0915 03:12:34.738032 21758 query-state.cc:942] 
4a4ebd3b7575254c:eb71cd8000000002] Executing instance. 
instance_id=4a4ebd3b7575254c:eb71cd8000000002 fragment_idx=1 
per_fragment_instance_idx=1 coord_state_idx=1 #in-flight=1
I0915 03:12:34.850791 21820 admission-controller.cc:1819] 
09439d05a2468038:3816f0f200000000] Trying to admit 
id=09439d05a2468038:3816f0f200000000 in pool_name=default-pool 
executor_group_name=default per_host_mem_estimate=1.34 GB 
dedicated_coord_mem_estimate=1.10 GB max_requests=-1 max_queued=200 
max_mem=-1.00 B
I0915 03:12:34.850811 21820 admission-controller.cc:1827] 
09439d05a2468038:3816f0f200000000] Stats: agg_num_running=1, agg_num_queued=0, 
agg_mem_reserved=1.56 GB,  local_host(local_mem_admitted=0, 
num_admitted_running=0, num_queued=0, backend_mem_reserved=192.46 MB, 
topN_query_stats: queries=[4a4ebd3b7575254c:eb71cd8000000000], 
total_mem_consumed=192.46 MB, fraction_of_pool_total_mem=1; pool_level_stats: 
num_running=1, min=192.46 MB, max=192.46 MB, pool_total_mem=192.46 MB, 
average_per_query=192.46 MB)
I0915 03:12:34.850852 21820 admission-controller.cc:1218] 
09439d05a2468038:3816f0f200000000] Admitting query 
id=09439d05a2468038:3816f0f200000000
I0915 03:12:34.850939 21820 impala-server.cc:2159] 
09439d05a2468038:3816f0f200000000] Registering query locations
I0915 03:12:34.850998 21820 coordinator.cc:150] 
09439d05a2468038:3816f0f200000000] Exec() 
query_id=09439d05a2468038:3816f0f200000000 stmt=select max(t.c1), avg(t.c2), 
min(t.c3), avg(c4), avg(c5), avg(c6)
    from (select
        max(tinyint_col) over (order by int_col) c1,
        avg(tinyint_col) over (order by smallint_col) c2,
        min(tinyint_col) over (order by smallint_col desc) c3,
        rank() over (order by int_col desc) c4,
        dense_rank() over (order by bigint_col) c5,
        first_value(tinyint_col) over (order by bigint_col desc) c6 
        from functional.alltypes) t;
I0915 03:12:34.851434 21820 coordinator.cc:474] 
09439d05a2468038:3816f0f200000000] starting execution on 3 backends for 
query_id=09439d05a2468038:3816f0f200000000
I0915 03:12:34.856995 21208 control-service.cc:148] 
09439d05a2468038:3816f0f200000000] ExecQueryFInstances(): 
query_id=09439d05a2468038:3816f0f200000000 
coord=impala-ec2-centos79-m6i-4xlarge-ondemand-1db1.vpc.cloudera.com:27001 
#instances=2
I0915 03:12:34.858456 21820 coordinator.cc:533] 
09439d05a2468038:3816f0f200000000] started execution on 3 backends for 
query_id=09439d05a2468038:3816f0f200000000
I0915 03:12:34.860503 21841 query-state.cc:942] 
09439d05a2468038:3816f0f200000002] Executing instance. 
instance_id=09439d05a2468038:3816f0f200000002 fragment_idx=1 
per_fragment_instance_idx=1 coord_state_idx=0 #in-flight=2
I0915 03:12:34.860591 21843 query-state.cc:942] 
09439d05a2468038:3816f0f200000000] Executing instance. 
instance_id=09439d05a2468038:3816f0f200000000 fragment_idx=0 
per_fragment_instance_idx=0 coord_state_idx=0 #in-flight=3
I0915 03:12:35.057634 21208 coordinator.cc:1032] Backend completed: 
host=impala-ec2-centos79-m6i-4xlarge-ondemand-1db1.vpc.cloudera.com:27002 
remaining=3 query_id=09439d05a2468038:3816f0f200000000
I0915 03:12:35.057649 21208 coordinator-backend-state.cc:371] 
query_id=09439d05a2468038:3816f0f200000000: first in-progress backend: 
impala-ec2-centos79-m6i-4xlarge-ondemand-1db1.vpc.cloudera.com:27001
I0915 03:12:35.149704 21208 coordinator.cc:1032] Backend completed: 
host=impala-ec2-centos79-m6i-4xlarge-ondemand-1db1.vpc.cloudera.com:27000 
remaining=2 query_id=09439d05a2468038:3816f0f200000000
I0915 03:12:35.149719 21208 coordinator-backend-state.cc:371] 
query_id=09439d05a2468038:3816f0f200000000: first in-progress backend: 
impala-ec2-centos79-m6i-4xlarge-ondemand-1db1.vpc.cloudera.com:27001
I0915 03:12:35.106189 21377 status.cc:71] Disk I/O error on 
impala-ec2-centos79-m6i-4xlarge-ondemand-1db1.vpc.cloudera.com:27001: Failed to 
open HDFS file 
s3a://impala-test-uswest2-1/test-warehouse/alltypes/year=2009/month=9/090901.txt
Error(255): Unknown error 255
Root cause: NullPointerException: 
    @          0x1f096f4  impala::Status::Status()
    @          0x1f1dfdb  impala::Status::Status()
    @          0x2ea28bb  impala::io::OpenHdfsFileOp::Execute()
    @          0x2ea3884  impala::SynchronousWorkItem::WorkerExecute()
    @          0x2ea4071  impala::SynchronousThreadPool::Worker()
    @          0x2ea6a89  
boost::detail::function::void_function_invoker2<>::invoke()
    @          0x2ea6734  boost::function2<>::operator()()
    @          0x2ea56f1  impala::ThreadPool<>::WorkerThread()
    @          0x2ea8819  boost::_mfi::mf1<>::operator()()
    @          0x2ea8645  boost::_bi::list2<>::operator()<>()
    @          0x2ea82d2  boost::_bi::bind_t<>::operator()()
    @          0x2ea7eaf  
boost::detail::function::void_function_obj_invoker0<>::invoke()
    @          0x221e4f7  boost::function0<>::operator()()
    @          0x29a898f  impala::Thread::SuperviseThread()
    @          0x29b12f0  boost::_bi::list5<>::operator()<>()
    @          0x29b1214  boost::_bi::bind_t<>::operator()()
    @          0x29b11d5  boost::detail::thread_data<>::run()
    @          0x42018b1  thread_proxy
    @     0x7f4590612ea4  start_thread
    @     0x7f458d00cb0c  __clone
I0915 03:12:35.195694 21854 hdfs-scan-node.cc:515] 
09439d05a2468038:3816f0f200000002] Scan node (id=0) ran into a parse error for 
scan range 
s3a://impala-test-uswest2-1/test-warehouse/alltypes/year=2009/month=9/090901.txt(0:20179).
 Processed 0 bytes.
I0915 03:12:35.196115 21841 query-state.cc:951] 
09439d05a2468038:3816f0f200000002] Instance completed. 
instance_id=09439d05a2468038:3816f0f200000002 #in-flight=1 
status=DISK_IO_ERROR: Disk I/O error on 
impala-ec2-centos79-m6i-4xlarge-ondemand-1db1.vpc.cloudera.com:27001: Failed to 
open HDFS file 
s3a://impala-test-uswest2-1/test-warehouse/alltypes/year=2009/month=9/090901.txt
Error(255): Unknown error 255
Root cause: NullPointerException:
I0915 03:12:35.196135 21829 query-state.cc:462] 
09439d05a2468038:3816f0f200000000] UpdateBackendExecState(): last report for 
09439d05a2468038:3816f0f200000000
I0915 03:12:35.198632 21208 coordinator.cc:1032] Backend completed: 
host=impala-ec2-centos79-m6i-4xlarge-ondemand-1db1.vpc.cloudera.com:27001 
remaining=1 query_id=09439d05a2468038:3816f0f200000000
I0915 03:12:35.198649 21208 coordinator.cc:752] ExecState: query 
id=09439d05a2468038:3816f0f200000000 
finstance=09439d05a2468038:3816f0f200000002 on 
host=impala-ec2-centos79-m6i-4xlarge-ondemand-1db1.vpc.cloudera.com:27001 
(EXECUTING -> ERROR) status=Disk I/O error on 
impala-ec2-centos79-m6i-4xlarge-ondemand-1db1.vpc.cloudera.com:27001: Failed to 
open HDFS file 
s3a://impala-test-uswest2-1/test-warehouse/alltypes/year=2009/month=9/090901.txt
Error(255): Unknown error 255
Root cause: NullPointerException:
I0915 03:12:35.198689 21208 coordinator-backend-state.cc:974] 
query_id=09439d05a2468038:3816f0f200000000 target backend=127.0.0.1:27001: Not 
cancelling because the backend is already done: Disk I/O error on 
impala-ec2-centos79-m6i-4xlarge-ondemand-1db1.vpc.cloudera.com:27001: Failed to 
open HDFS file 
s3a://impala-test-uswest2-1/test-warehouse/alltypes/year=2009/month=9/090901.txt
Error(255): Unknown error 255
Root cause: NullPointerException:
I0915 03:12:35.198695 21208 coordinator-backend-state.cc:974] 
query_id=09439d05a2468038:3816f0f200000000 target backend=127.0.0.1:27000: Not 
cancelling because the backend is already done:
I0915 03:12:35.198702 21208 coordinator-backend-state.cc:974] 
query_id=09439d05a2468038:3816f0f200000000 target backend=127.0.0.1:27002: Not 
cancelling because the backend is already done:
I0915 03:12:35.198706 21208 coordinator.cc:999] CancelBackends() 
query_id=09439d05a2468038:3816f0f200000000, tried to cancel 0 backends
I0915 03:12:35.198752 21208 coordinator.cc:1375] Release admission control 
resources for query_id=09439d05a2468038:3816f0f200000000
{noformat}
This could be due to the same cause of IMPALA-11592. Maybe there is an issue 
inside the hdfs client. Hadoop version: hadoop-3.1.1.7.2.16.0-171



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to