[GitHub] [pinot] Jackie-Jiang commented on issue #7978: Support Structured, Repeated Records

GitBox Mon, 17 Oct 2022 14:55:46 -0700


Jackie-Jiang commented on issue #7978:
URL: https://github.com/apache/pinot/issues/7978#issuecomment-1281543027


   It is very smart. A few extra precautions are needed:
   - There should be only one main record for each id; otherwise the query 
might cross-match other records and return unexpected results.
   - To resolve `IN_PARTITIONED_SUBQUERY` (which becomes `IN_ID_SET` once the 
`IdSet` is resolved), we need to scan all the ids, because the inverted index 
is not applied for `IN_ID_SET`. When very few ids match, this could be slower 
than sending 2 queries, where the second query is constructed from the ids 
returned by the first query in an `IN` clause.
   
   Ideally we can build this natively to handle these inefficiencies. The 
subquery can happen at the segment level instead of the partition level 
(`IN_PARTITIONED_SUBQUERY`) or the table level (`IN_SUBQUERY`).
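
The two alternatives compared in the second bullet can be sketched as query builders. This is only an illustration, not code from Pinot: the helper names, the `records` table, the `id` column, and the `type = 'main'` filter are all hypothetical, and ids are assumed to be integers. The single-query form is assumed to follow the documented `IN_PARTITIONED_SUBQUERY(col, '<ID_SET subquery>') = 1` pattern.

```python
from typing import Iterable


def build_in_query(table: str, id_column: str, ids: Iterable[int]) -> str:
    """Second query of the two-query approach: a plain IN clause over the
    ids returned by the first query (hypothetical helper)."""
    unique_ids = sorted({int(i) for i in ids})  # de-duplicate, force integers
    if not unique_ids:
        raise ValueError("no matching ids; the second query can be skipped")
    id_list = ", ".join(str(i) for i in unique_ids)
    return f"SELECT * FROM {table} WHERE {id_column} IN ({id_list})"


def build_partitioned_subquery(table: str, id_column: str, inner_filter: str) -> str:
    """Single-query alternative: the inner query collects an IdSet that is
    matched per partition (hypothetical helper, assumed syntax)."""
    inner = f"SELECT ID_SET({id_column}) FROM {table} WHERE {inner_filter}"
    inner_escaped = inner.replace("'", "''")  # escape quotes for the SQL string literal
    return (
        f"SELECT * FROM {table} "
        f"WHERE IN_PARTITIONED_SUBQUERY({id_column}, '{inner_escaped}') = 1"
    )
```

With only a handful of matching ids, the string from `build_in_query` stays short and leaves the filter as an ordinary `IN` predicate, which is the case where the two-query route may be the faster one.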


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


