An approach to speed up counting the number of rows using the Kudu client API
could first retrieve the list of KuduScanTokens for the scan below and then use
several worker threads in parallel to add up the number of rows.
I implemented it in Java this afternoon and it took about 13s to go
Hi Dmitry, check Java Kudu API examples if you have not done it yet
https://github.com/apache/kudu/tree/master/examples
I remember it had a helper class that counts rows. Like Adar said, I do not
think there is a better / faster way - you just create a Kudu scanner, get
rows back and iterate over
Probably a scan with no predicates and a minimal projection. Then you
can iterate over the results and increment a count of rows.
Or, if you're using Impala, "SELECT COUNT(*) FROM FOO".
On Fri, Mar 22, 2019 at 3:23 AM Дмитрий Павлов wrote:
>
> Hi guys
>
> What is the quickest way to get total