Thanks Jack and Ryan. Here is an overview of the current design in the PR [1].

1. To avoid duplication between the endpoints, we now have a single endpoint, planTableScan, which accepts column projections, a filter, etc. and initiates a plan. The server will respond to the client in one of two ways:
   * The server has completed planning and returns plan-tasks and file-scan-tasks.
   * The server has not completed planning and returns a plan-id.
2. If the client receives a plan-id, it calls the fetchPlanningResult endpoint, which takes the plan-id as input. The server then returns a status indicating whether planning has completed. The client continues to call this endpoint until the server returns a completed status with plan-tasks and file-scan-tasks.
3. If a client wants to release the resources held for a given plan, it can call cancelPlanning, a new delete endpoint that takes the plan-id as input.
4. A new endpoint, fetchScanTasks, lets a client get the file-scan-tasks associated with a plan-task by providing that plan-task as input.

[1] https://github.com/apache/iceberg/pull/9695

Regards,
Rahil Chertara

From: "rdb...@gmail.com" <rdb...@gmail.com>
Reply-To: "dev@iceberg.apache.org" <dev@iceberg.apache.org>
Date: Tuesday, September 3, 2024 at 1:24 PM
To: "dev@iceberg.apache.org" <dev@iceberg.apache.org>
Subject: RE: [VOTE] Merge REST Spec Change To Add New Scan Planning APIs

+1

I think it would be good to give an overview of the current proposal, since it has evolved quite a bit from the original, like Jack said.

On Tue, Sep 3, 2024 at 9:09 AM Jack Ye <yezhao...@gmail.com> wrote:

Thanks for continuing to push for this, Rahil. Personally I am +1 (binding) for this, with just some minor comments on the latest PR. But I think the initial DISCUSS thread [1] was quite a while ago, and a lot has changed after many comments and reviews.
Should we restart another DISCUSS thread before voting, to make sure people are aware of the latest design and address any additional comments?

Best,
Jack Ye

[1] https://lists.apache.org/thread/qq13468x6gk0vxnsckzc5xd02tjlvpkm

On Mon, Sep 2, 2024 at 9:22 PM Chertara, Rahil <rcher...@amazon.com.invalid> wrote:

Hi all,

I've opened a PR [1] to add REST spec changes for a new protocol around table scan planning.

For context around the design discussions, see the original google doc proposal [2], the dev list discussion thread [3], and finally the discussion that has happened on the spec change PR.

Please vote on merging this change. The vote will remain open for at least 72 hours.

[] +1
[] +0
[] -1, do not merge because ...

[1] https://github.com/apache/iceberg/pull/9695
[2] https://docs.google.com/document/d/1FdjCnFZM1fNtgyb9-v9fU4FwOX4An-pqEwSaJe8RgUg/edit#heading=h.cftjlkb2wh4h
[3] https://lists.apache.org/thread/qq13468x6gk0vxnsckzc5xd02tjlvpkm

Thanks,
Rahil Chertara
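
For readers skimming the thread, the client-side flow from the overview at the top (planTableScan, then polling fetchPlanningResult, then fetchScanTasks, with cancelPlanning for cleanup) might look roughly like the sketch below. It is only an illustration under stated assumptions, not the API in the PR: the FakePlanningServer class, the Python method names, the "submitted"/"completed" status strings, the response keys, and the file paths are all hypothetical stand-ins, and no real HTTP calls are made.

```python
import time
from typing import Callable, Dict, List


class FakePlanningServer:
    """Hypothetical in-memory stand-in for a REST catalog (not the real API)."""

    def __init__(self, polls_until_done: int = 2) -> None:
        self._polls_until_done = polls_until_done
        self._polls_left: Dict[str, int] = {}

    def plan_table_scan(self, select: List[str], filter_expr: str) -> dict:
        # This fake always takes the "planning not completed" path and
        # returns only a plan-id.
        plan_id = f"plan-{len(self._polls_left) + 1}"
        self._polls_left[plan_id] = self._polls_until_done
        return {"status": "submitted", "plan-id": plan_id}

    def fetch_planning_result(self, plan_id: str) -> dict:
        self._polls_left[plan_id] -= 1
        if self._polls_left[plan_id] > 0:
            return {"status": "submitted"}
        return {
            "status": "completed",
            "plan-tasks": ["task-a", "task-b"],
            "file-scan-tasks": ["data/f0.parquet"],
        }

    def fetch_scan_tasks(self, plan_task: str) -> dict:
        return {"file-scan-tasks": [f"data/{plan_task}.parquet"]}

    def cancel_planning(self, plan_id: str) -> None:
        self._polls_left.pop(plan_id, None)


def plan_scan(
    server: FakePlanningServer,
    select: List[str],
    filter_expr: str,
    max_polls: int = 10,
    poll_interval_s: float = 0.1,
    sleep: Callable[[float], None] = time.sleep,
) -> List[str]:
    """Run the plan/poll/fetch flow and return all file-scan-tasks."""
    response = server.plan_table_scan(select, filter_expr)
    plan_id = response.get("plan-id")

    polls = 0
    while response["status"] != "completed":
        if polls >= max_polls:
            # Give up and release server-side resources for this plan.
            server.cancel_planning(plan_id)
            raise TimeoutError(f"planning did not complete for {plan_id}")
        sleep(poll_interval_s)
        response = server.fetch_planning_result(plan_id)
        polls += 1

    # A completed response may carry file-scan-tasks directly, plus
    # plan-tasks that each expand into more file-scan-tasks.
    files = list(response.get("file-scan-tasks", []))
    for plan_task in response.get("plan-tasks", []):
        files.extend(server.fetch_scan_tasks(plan_task)["file-scan-tasks"])
    return files
```

Calling plan_scan(FakePlanningServer(), ["id"], "id > 5") walks through all four endpoints once. The max_polls cap and the cancel-on-timeout behavior are design choices of this sketch, not something the proposal prescribes.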