Hi David and community, Apologies for the delayed response. Your original message was not delivered to my personal inbox, which caused a slight delay, so I am replying to the list here in a new thread. https://lists.apache.org/thread/6fkh2vo598y47mmrzvqxbg8pwjl8v2ly
Thank you for your proposal and for taking the initiative to open PR 54939 to demonstrate your approach! I amvery excited to review your work specifically focused on Phase 1: Client-Side Plan-ID Caching. To set clear expectations for your GSoC timeline, and as a heads-up to the broader Spark developer community: Because the underlying SPIP (SPARK-55163) is still actively being discussed and has not yet received formal PMC approval, your GSoC project will function purely as an experimental prototype. Your open Pull Requests will be used by mentors to evaluate your GSoC deliverables and milestones. However, please be aware that your code will not be merged into the mainline Apache Spark repository during the GSoC program. Successfully completing your GSoC project and passing the evaluations is tied to the quality of your prototype and testing, not to getting the code merged. Your prototype will be incredibly valuable in helping the community benchmark the latency improvements for Spark Connect. I look forward to reviewing your finalized proposal! Regards, Vaquar Khan *Linkedin *-https://www.linkedin.com/in/vaquar-khan-b695577/ *Book *- https://us.amazon.com/stores/Vaquar-Khan/author/B0DMJCG9W6?ref=ap_rdr&shoppingPortalEnabled=true *GitBook*-https://vaquarkhan.github.io/microservices-recipes-a-free-gitbook/ *Stack *-https://stackoverflow.com/users/4812170/vaquar-khan *github*-https://github.com/vaquarkhan On Saturday, March 21, 2026 3:10:31 AM PDT, David Gvadzabia wrote: Hi Spark developers! My name is David Gvadzabia, and I am a Computer Science student at Lafayette College interested in contributing to SPARK-55163 for GSoC 2026. I have been reading through the Spark Connect metadata caching discussion and exploring the current implementation in the Python Connect client. As an initial contribution, I opened a small foundational PR in this area: Reuse metadata plans for DataFrames https://github.com/apache/spark/pull/54939 My goal with this first step was to keep the change small and... https://lists.apache.org/thread/6fkh2vo598y47mmrzvqxbg8pwjl8v2ly
