[ 
https://issues.apache.org/jira/browse/FLINK-25262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hehuiyuan updated FLINK-25262:
------------------------------
    Description: 
Send data to lookup table  by hash , which  can improve cache hit rate, futher 
improve processing performance and reduce the size of cache.

 

Shoulder we consider to introducing it?

 

 

!image-2021-12-12-15-18-08-574.png|width=419,height=193!

 

I have a simple test.  The parallelism is 10 and the kafka source has 100 
million records and the hbase lookuptable has 100 thousands records. It need 
100 minutes for forward and 5 minutes for hash.

  was:
Send data to lookup table  by hash , which  can improve cache hit rate, futher 
improve processing performance and reduce the size of cache.

Shoulder we consider to introducing it?

 

!image-2021-12-12-15-18-08-574.png|width=419,height=193!


> Support to send data to  lookup table for KeyGroupStreamPartitioner way for 
> SQL
> -------------------------------------------------------------------------------
>
>                 Key: FLINK-25262
>                 URL: https://issues.apache.org/jira/browse/FLINK-25262
>             Project: Flink
>          Issue Type: Improvement
>            Reporter: hehuiyuan
>            Priority: Minor
>         Attachments: image-2021-12-12-15-15-48-540.png, 
> image-2021-12-12-15-18-08-574.png
>
>
> Send data to lookup table  by hash , which  can improve cache hit rate, 
> futher improve processing performance and reduce the size of cache.
>  
> Shoulder we consider to introducing it?
>  
>  
> !image-2021-12-12-15-18-08-574.png|width=419,height=193!
>  
> I have a simple test.  The parallelism is 10 and the kafka source has 100 
> million records and the hbase lookuptable has 100 thousands records. It need 
> 100 minutes for forward and 5 minutes for hash.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to