ankitsultana opened a new issue, #8951: URL: https://github.com/apache/pinot/issues/8951
Native Join support has just recently been added to Pinot via the Multi-Stage Execution Engine Support. For some high-throughput join use-cases, we should be able to improve the performance quite a bit by avoiding the shuffle step. This can be done by ensuring that data for the same partition lies on the same servers. We have started looking into Colocated Joins and have broken it down into two parts: 1. Supporting co-location of data (via a new concept called Table Groups). 2. Adding support for Co-located join in the existing join framework. We also wrote a design doc for the first part: https://docs.google.com/document/d/1HWkRmjUxLBnkKETw21GJ--yQdiFa9hEE4v6Y68nNWx4/edit#heading=h.xhb8dmu7bi7k Would be great to get the community's feedback. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
