[ 
https://issues.apache.org/jira/browse/FLINK-2533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chengxiang Li updated FLINK-2533:
---------------------------------
    Assignee: GaoLun

> Gap based random sample optimization
> ------------------------------------
>
>                 Key: FLINK-2533
>                 URL: https://issues.apache.org/jira/browse/FLINK-2533
>             Project: Flink
>          Issue Type: Improvement
>          Components: Core
>            Reporter: Chengxiang Li
>            Assignee: GaoLun
>            Priority: Minor
>
> For fraction-based random samplers such as BernoulliSampler and PoissonSampler, 
> a gap-based random sampler could use an O(k) sampling implementation (k being 
> the number of sampled elements) instead of the previous O(n) implementation 
> (n being the input size). It should perform better when the sample 
> fraction is very small. [This 
> blog|http://erikerlandson.github.io/blog/2014/09/11/faster-random-samples-with-gap-sampling/]
>  describes gap-based random sampling in more detail.
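
To make the idea concrete, here is a minimal sketch of gap-based Bernoulli sampling. It is written in Python for brevity and is not Flink's implementation; the function name gap_bernoulli_sample and its parameters are hypothetical. Instead of drawing one random number per element, it draws the size of the gap to the next accepted element from a geometric distribution, so the number of random draws is roughly k rather than n.

```python
import math
import random


def gap_bernoulli_sample(items, fraction, rng=None):
    """Yield each element of `items` independently with probability `fraction`.

    Hypothetical helper for illustration only, not a Flink API.
    """
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must be in [0, 1]")
    rng = rng or random.Random()
    if fraction == 0.0:
        return  # nothing is ever sampled
    if fraction == 1.0:
        yield from items  # everything is sampled
        return

    # log(1 - fraction); log1p stays accurate when fraction is very small.
    log_q = math.log1p(-fraction)
    it = iter(items)
    while True:
        # u lies in (0, 1], so log(u) is always finite.
        u = 1.0 - rng.random()
        # Number of rejected elements before the next accepted one:
        # P(gap >= g) = (1 - fraction) ** g, a geometric distribution.
        gap = int(math.log(u) / log_q)
        try:
            for _ in range(gap):
                next(it)  # skip rejected elements without drawing randoms
            yield next(it)  # the accepted element
        except StopIteration:
            return  # input exhausted
```

With a plain iterator, skipping still touches every element, so the saving here is in random number generation; the full O(k) behaviour presumably needs an input that can skip ahead cheaply. For larger fractions the gaps are mostly zero, so the per-element approach is likely just as good there.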



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)