Rajesh Balamohan created HIVE-24205:

             Summary: Optimise CuckooSetBytes
                 Key: HIVE-24205
                 URL: https://issues.apache.org/jira/browse/HIVE-24205
             Project: Hive
          Issue Type: Improvement
            Reporter: Rajesh Balamohan
         Attachments: Screenshot 2020-09-28 at 4.29.24 PM.png

{{FilterStringColumnInList, StringColumnInList}}  etc use CuckooSetBytes for 

!Screenshot 2020-09-28 at 4.29.24 PM.png|width=714,height=508!

One option to optimize would be to add boundary conditions on "length" with the 
min/max length stored in the hashes. This would significantly reduce the number 
of hash computation that needs to happen. E.g 

This message was sent by Atlassian Jira

Reply via email to