andygrove commented on PR #1059: URL: https://github.com/apache/datafusion-ballista/pull/1059#issuecomment-2374024160
Thanks @palaska. This looks great. I wasn't able to run any queries though with these changes.

I built with `cargo build --release`.

I started the scheduler:

```
$ ./target/release/ballista-scheduler
2024-09-25T12:57:29.760309Z INFO main ThreadId(01) ballista_scheduler::cluster: Initializing Sled database in temp directory
2024-09-25T12:57:29.769023Z INFO main ThreadId(01) ballista_scheduler::scheduler_process: Ballista v0.12.0 Scheduler listening on 0.0.0.0:50050
2024-09-25T12:57:29.769038Z INFO main ThreadId(01) ballista_scheduler::scheduler_process: Starting Scheduler grpc server with task scheduling policy of PullStaged
2024-09-25T12:57:29.769137Z INFO main ThreadId(01) ballista_scheduler::cluster::kv: Initializing heartbeat listener
2024-09-25T12:57:29.769155Z INFO main ThreadId(01) ballista_scheduler::scheduler_server::query_stage_scheduler: Starting QueryStageScheduler
2024-09-25T12:57:29.769175Z INFO tokio-runtime-worker ThreadId(31) ballista_core::event_loop: Starting the event loop query_stage
```

I started the executor:

```
$ ./target/release/ballista-executor -c 8
2024-09-25T12:57:34.575905Z INFO main ThreadId(01) ballista_executor::executor_process: Running with config:
2024-09-25T12:57:34.575919Z INFO main ThreadId(01) ballista_executor::executor_process: work_dir: /tmp/.tmpnbFKWN
2024-09-25T12:57:34.575921Z INFO main ThreadId(01) ballista_executor::executor_process: concurrent_tasks: 8
2024-09-25T12:57:34.576719Z INFO tokio-runtime-worker ThreadId(32) ballista_executor::executor_process: Ballista v0.12.0 Rust Executor Flight Server listening on 0.0.0.0:50051
2024-09-25T12:57:34.576759Z INFO tokio-runtime-worker ThreadId(30) ballista_executor::execution_loop: Starting poll work loop with scheduler
```

The executor immediately started producing errors:

```
2024-09-25T12:57:34.577270Z WARN tokio-runtime-worker ThreadId(30) ballista_executor::execution_loop: Executor poll work loop failed. If this continues to happen the Scheduler might be marked as dead. Error: status: Cancelled, message: "h2 protocol error: http2 error", details: [], metadata: MetadataMap { headers: {} }
2024-09-25T12:57:34.678711Z WARN tokio-runtime-worker ThreadId(02) ballista_executor::execution_loop: Executor poll work loop failed. If this continues to happen the Scheduler might be marked as dead. Error: status: Cancelled, message: "h2 protocol error: http2 error", details: [], metadata: MetadataMap { headers: {} }
```

I then went back to the scheduler process and saw errors there as well:

```
thread 'tokio-runtime-worker' panicked at /home/andy/git/apache/datafusion-ballista/ballista/scheduler/src/scheduler_server/grpc.rs:95:56:
called `Option::unwrap()` on a `None` value
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
thread 'tokio-runtime-worker' panicked at /home/andy/git/apache/datafusion-ballista/ballista/scheduler/src/scheduler_server/grpc.rs:95:56:
called `Option::unwrap()` on a `None` value
```

Here is the backtrace:

```
thread 'tokio-runtime-worker' panicked at /home/andy/git/apache/datafusion-ballista/ballista/scheduler/src/scheduler_server/grpc.rs:95:56:
called `Option::unwrap()` on a `None` value
stack backtrace:
   0: rust_begin_unwind
   1: core::panicking::panic_fmt
   2: core::panicking::panic
   3: core::option::unwrap_failed
   4: ballista_scheduler::scheduler_server::grpc::<impl ballista_core::serde::generated::ballista::scheduler_grpc_server::SchedulerGrpc for ballista_scheduler::scheduler_server::SchedulerServer<T,U>>::poll_work::{{closure}}
   5: <<ballista_core::serde::generated::ballista::scheduler_grpc_server::SchedulerGrpcServer<T> as tower_service::Service<http::request::Request<B>>>::call::PollWorkSvc<T> as tonic::server::service::UnaryService<ballista_core::serde::generated::ballista::PollWorkParams>>::call::{{closure}}
   6: <ballista_core::serde::generated::ballista::scheduler_grpc_server::SchedulerGrpcServer<T> as tower_service::Service<http::request::Request<B>>>::call::{{closure}}
```
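
For illustration only, here is a minimal sketch of how a handler can turn a `None` into an error response instead of panicking the worker thread, which is the failure mode the backtrace shows in `poll_work`. Everything named here (`PollWorkRequest`, `remote_addr`, `HandlerError`, `peer_ip_unwrap`, `peer_ip_checked`) is a hypothetical stand-in and not Ballista's or tonic's actual API. The sketch also assumes, without confirmation from the logs above, that the missing value is something like the caller's address; the real `Option` at `grpc.rs:95` may be something else.

```rust
use std::net::SocketAddr;

/// Hypothetical stand-in for an incoming request whose peer address
/// may be unavailable (an assumption; not the real Ballista/tonic type).
#[derive(Debug)]
pub struct PollWorkRequest {
    pub remote_addr: Option<SocketAddr>,
}

/// Hypothetical error type standing in for a gRPC error status.
#[derive(Debug, PartialEq)]
pub enum HandlerError {
    MissingRemoteAddr,
}

/// Panicking variant: mirrors the `Option::unwrap()` on `None` failure mode.
/// Calling this with `remote_addr: None` aborts the calling task.
pub fn peer_ip_unwrap(req: &PollWorkRequest) -> String {
    req.remote_addr.unwrap().ip().to_string()
}

/// Checked variant: a missing address becomes an `Err` the caller can
/// translate into an error response, so the worker thread keeps running.
pub fn peer_ip_checked(req: &PollWorkRequest) -> Result<String, HandlerError> {
    let addr = req.remote_addr.ok_or(HandlerError::MissingRemoteAddr)?;
    Ok(addr.ip().to_string())
}
```

With the checked variant, a request built with `remote_addr: None` yields `Err(HandlerError::MissingRemoteAddr)` rather than a panic, so the client would see an explicit error rather than a dropped connection.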