Jackie-Jiang commented on issue #7978: URL: https://github.com/apache/pinot/issues/7978#issuecomment-1281543027
It is very smart. There are some extra cares need to be taken: - There should be only one main record for each id, or it might cross match other records, and get unexpected result - To resolve `IN_PARTITIONED_SUBQUERY` (`IN_ID_SET` after resolving `IdSet`), we need to scan all the ids because inverted index won't be applied for `IN_ID_SET`. When there are very few matching ids, this could be slower than sending 2 queries, where the second query is constructed using the ids from first query in an `IN` clause. Ideally we can build this natively to handle these inefficiencies. The subquery can happen at segment level instead of partition level (`IN_PARTITIONeD_SUBQUERY`) or table level (`IN_SUBQUERY`) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
