tgravescs commented on issue #25983: [SPARK-29327][MLLIB]Support specifying features via multiple columns URL: https://github.com/apache/spark/pull/25983#issuecomment-545646668 Yes we have an external XGBoost repo you should be able to see the gpu code @firestarman can you point to it and explain a bit? I get what your saying but I also think sometimes its the chicken and egg problem and other external algorithms outside mllib could benefit but don't want to hassle with modifying spark. we would have to investigate if other mllib algo's would be able to skip the vector assembly or if this api would just be an easier to use api and hide the vector creation from the user.
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
