Thanks for the reply. Can i ask something Can I join slack communication channel of beam
On Mon, 24 Feb, 2025, 22:44 Danny McCormick, <dannymccorm...@google.com> wrote: > Hey Aditya, glad to hear that you are interested in this project. I've > tried to answer your questions below: > > > What are the key technical challenges in integrating Beam with Pinecone > and Tecton? > > The main challenges will be around understanding how those systems (and > other similar systems) work, how their client libraries are set up, how > Beam handles sources/sinks to enable efficient execution, and being able to > stitch all of those pieces together into a working connector. This will > require an understanding of the Beam model and will require reasoning > through some distributed systems principles. > > > Should the connectors support both batch and streaming modes? > > Yes, we will need to support both. > > > Are there any existing patterns or reference implementations to follow? > > Yes, here is an example enrichment handler with the Feast Feature store - > https://github.com/apache/beam/blob/42bbc1ed432bf912f895271b3d3954cb70e69cf8/sdks/python/apache_beam/transforms/enrichment_handlers/feast_feature_store.py#L83 > and > here is an example sink for writing TFRecords - > https://github.com/apache/beam/blob/42bbc1ed432bf912f895271b3d3954cb70e69cf8/sdks/python/apache_beam/io/tfrecordio.py#L299 > . > > We'd need similar concepts for writing to and enriching from various > feature stores and vector DBs. > > Thanks, > Danny > > On Sat, Feb 22, 2025 at 11:02 AM Aditya <adiworkprof...@gmail.com> wrote: > >> *Hi Danny and Beam Dev Team,* >> >> I hope you're doing well. I am interested in contributing to the *"Beam >> ML Vector DB/Feature Store Integrations"* project as part of GSoC and >> would love to get more insights into the project’s scope and expectations. >> About Me >> >> I am a software engineer passionate about distributed systems and machine >> learning infrastructure. I have been actively contributing to Apache >> projects and open-source communities. Below is a summary of my >> contributions: >> >> *Previous Contributions:* >> >> - *Apache Airflow* >> - 10+ contributions via PRs and issues >> - 5+ merged PRs >> - Active daily participation in the Slack community >> - Currently working on HTTP operator improvements >> - *Shell_sage* >> - Implemented a logging flag feature >> - Created SQLite database integration for log storage >> - Successfully merged PR >> - *Other Apache Projects* >> - Contributions to Apache ZooKeeper >> - Documentation improvements for Apache Maven >> - Active participation in MSS and SugarLabs >> >> *My Profiles:* >> >> - *GitHub:* https://github.com/aditya0yadav >> - *LinkedIn:* https://www.linkedin.com/in/2580aditya/ >> >> I would love to understand more about this project, specifically: >> >> 1. What are the key technical challenges in integrating Beam with >> Pinecone and Tecton? >> 2. Should the connectors support both batch and streaming modes? >> 3. Are there any existing patterns or reference implementations to >> follow? >> >> Looking forward to your guidance and hoping to contribute meaningfully to >> the project. >> >> *Best regards,* >> Aditya Yadav >> adiworkprof...@gmail.com >> >