andygrove commented on PR #1059:
URL: 
https://github.com/apache/datafusion-ballista/pull/1059#issuecomment-2374024160

   Thanks @palaska. This looks great. I wasn't able to run any queries though 
with these changes.
   
   I built with `cargo build --release`.
   
   I started the scheduler:
   
   ```
   $ ./target/release/ballista-scheduler 
   2024-09-25T12:57:29.760309Z  INFO main ThreadId(01) 
ballista_scheduler::cluster: Initializing Sled database in temp directory    
   2024-09-25T12:57:29.769023Z  INFO main ThreadId(01) 
ballista_scheduler::scheduler_process: Ballista v0.12.0 Scheduler listening on 
0.0.0.0:50050    
   2024-09-25T12:57:29.769038Z  INFO main ThreadId(01) 
ballista_scheduler::scheduler_process: Starting Scheduler grpc server with task 
scheduling policy of PullStaged    
   2024-09-25T12:57:29.769137Z  INFO main ThreadId(01) 
ballista_scheduler::cluster::kv: Initializing heartbeat listener    
   2024-09-25T12:57:29.769155Z  INFO main ThreadId(01) 
ballista_scheduler::scheduler_server::query_stage_scheduler: Starting 
QueryStageScheduler    
   2024-09-25T12:57:29.769175Z  INFO tokio-runtime-worker ThreadId(31) 
ballista_core::event_loop: Starting the event loop query_stage  
   ```
   
   I started the executor:
   
   ```
   $ ./target/release/ballista-executor -c 8
   2024-09-25T12:57:34.575905Z  INFO main ThreadId(01) 
ballista_executor::executor_process: Running with config:    
   2024-09-25T12:57:34.575919Z  INFO main ThreadId(01) 
ballista_executor::executor_process: work_dir: /tmp/.tmpnbFKWN    
   2024-09-25T12:57:34.575921Z  INFO main ThreadId(01) 
ballista_executor::executor_process: concurrent_tasks: 8    
   2024-09-25T12:57:34.576719Z  INFO tokio-runtime-worker ThreadId(32) 
ballista_executor::executor_process: Ballista v0.12.0 Rust Executor Flight 
Server listening on 0.0.0.0:50051    
   2024-09-25T12:57:34.576759Z  INFO tokio-runtime-worker ThreadId(30) 
ballista_executor::execution_loop: Starting poll work loop with scheduler    
   ```
   
   The executor immediately started producing errors:
   
   ```
   2024-09-25T12:57:34.577270Z  WARN tokio-runtime-worker ThreadId(30) 
ballista_executor::execution_loop: Executor poll work loop failed. If this 
continues to happen the Scheduler might be marked as dead. Error: status: 
Cancelled, message: "h2 protocol error: http2 error", details: [], metadata: 
MetadataMap { headers: {} }    
   2024-09-25T12:57:34.678711Z  WARN tokio-runtime-worker ThreadId(02) 
ballista_executor::execution_loop: Executor poll work loop failed. If this 
continues to happen the Scheduler might be marked as dead. Error: status: 
Cancelled, message: "h2 protocol error: http2 error", details: [], metadata: 
MetadataMap { headers: {} }    
   ```
   
   I then went back to the scheduler process and saw errors there as well:
   
   ```
   thread 'tokio-runtime-worker' panicked at 
/home/andy/git/apache/datafusion-ballista/ballista/scheduler/src/scheduler_server/grpc.rs:95:56:
   called `Option::unwrap()` on a `None` value
   note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
   thread 'tokio-runtime-worker' panicked at 
/home/andy/git/apache/datafusion-ballista/ballista/scheduler/src/scheduler_server/grpc.rs:95:56:
   called `Option::unwrap()` on a `None` value
   ```
   
   Here is the backtrace:
   
   ```
   thread 'tokio-runtime-worker' panicked at 
/home/andy/git/apache/datafusion-ballista/ballista/scheduler/src/scheduler_server/grpc.rs:95:56:
   called `Option::unwrap()` on a `None` value
   stack backtrace:
      0: rust_begin_unwind
      1: core::panicking::panic_fmt
      2: core::panicking::panic
      3: core::option::unwrap_failed
      4: ballista_scheduler::scheduler_server::grpc::<impl 
ballista_core::serde::generated::ballista::scheduler_grpc_server::SchedulerGrpc 
for 
ballista_scheduler::scheduler_server::SchedulerServer<T,U>>::poll_work::{{closure}}
      5: 
<<ballista_core::serde::generated::ballista::scheduler_grpc_server::SchedulerGrpcServer<T>
 as tower_service::Service<http::request::Request<B>>>::call::PollWorkSvc<T> as 
tonic::server::service::UnaryService<ballista_core::serde::generated::ballista::PollWorkParams>>::call::{{closure}}
      6: 
<ballista_core::serde::generated::ballista::scheduler_grpc_server::SchedulerGrpcServer<T>
 as tower_service::Service<http::request::Request<B>>>::call::{{closure}}
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to