liurenjie1024 commented on issue #30: URL: https://github.com/apache/arrow-ballista/issues/30#issuecomment-1275529223
> Our eventual goal is to support running a plan on 100s of parquet files without having to fetch them all before (or concurrently). However, we currently have other things blocking this goal so additional work to the scheduler is on hold for now I'm a little confused here. Avoiding fetching 100s of parquet files is more like an optimizer issue? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
