aokolnychyi commented on issue #24515: [SPARK-14083][WIP] Basic bytecode analyzer to speed up Datasets URL: https://github.com/apache/spark/pull/24515#issuecomment-488754018 @dbtsai and I collaborated on this. The approach we take was originally described by @JoshRosen. The implementation and scope are very different, though. As the PR description says, the goal is to start a discussion and not to suggest a properly working optimal approach. To the best of my knowledge, there are multiple companies that are trying/tried to implement this functionality. It would be great if everyone can share their progress/ideas/pain points so that we can, hopefully, improve typed Datasets in Spark. @rednaxelafx @kiszk @maropu @rxin @dongjoon-hyun @cloud-fan @gatorsmile
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
