+1 sounds cool, we should definitely implement those ideas soon.

On Wed, 13 Apr 2022 at 14:55, Alexander Alten (<[email protected]>) wrote:
> +1 from my side :)
>
> On 13. Apr 2022, at 14:54, Bertty Contreras <[email protected]> wrote:
> >
> > Hi folks,
> >
> > The deadline for Google Summer of Code (GSoC) [1] is in a few days
> > (19 April), and we want to propose two ideas that students could
> > implement inside Apache Wayang (Incubating). Working on them will help
> > the students learn the internals of Wayang and also learn about the
> > cost model. The ideas are:
> >
> > - The first comes from the paper [Expand your Training Limits!
> > Generating Training Data for ML-based Data Management](
> > https://www.agora-ecosystem.com/publications_pdf/expand_training_limits.pdf
> > ), where the authors generate training data for an ML model that
> > provides the cost model. This would help generate the data needed to
> > train the current cost model, and it would make it easier for more
> > people to tune the model.
> >
> > - The second idea comes from [Zero-Shot Cost Models for Out-of-the-box
> > Learned Cost Prediction](https://arxiv.org/pdf/2201.00561.pdf), where
> > the idea is to create a pre-trained model that keeps learning as new
> > queries come in. This could help people who cannot wait for a trained
> > model, and it could also help build a model that does not need to be
> > calibrated.
> >
> > If you have another idea, we can add it too :D, the deadline
> >
> > Best regards,
> > Bertty
> >
> > [1] https://summerofcode.withgoogle.com
> >
