[ https://issues.apache.org/jira/browse/ARROW-4661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Hatem Helal updated ARROW-4661:
-------------------------------
Description:
This was discussed here:
[https://github.com/apache/arrow/pull/3721]
For testing/benchmarking dictionary encoding, it's useful to control the number
of repeated values, and it would also be good to optionally include null values.
The ability to provide a custom alphabet would be handy for generating strings
with Unicode characters.
was:
This was discussed here:
[https://github.com/apache/arrow/pull/3721]
For testing/benchmarking dictionary encoding, it's useful to control the number
of repeated values, and it would also be good to optionally include null values.
> [C++] Consolidate random string generators for use in benchmarks and unittests
> ------------------------------------------------------------------------------
>
> Key: ARROW-4661
> URL: https://issues.apache.org/jira/browse/ARROW-4661
> Project: Apache Arrow
> Issue Type: Improvement
> Reporter: Hatem Helal
> Assignee: Hatem Helal
> Priority: Minor
> Fix For: 0.14.0
>
>
> This was discussed here:
> [https://github.com/apache/arrow/pull/3721]
> For testing/benchmarking dictionary encoding, it's useful to control the number
> of repeated values, and it would also be good to optionally include null
> values. The ability to provide a custom alphabet would be handy for
> generating strings with Unicode characters.
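
To make the request concrete, here is a minimal sketch of a generator with the three knobs described above: a bounded pool of repeated values, an optional null probability, and a caller-supplied alphabet. It is not Arrow's API. The names RandomStringOptions and GenerateRandomStrings, all field names, and the defaults are hypothetical, and the sketch assumes C++17 (for std::optional) and an alphabet given as UTF-8 encoded characters.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <optional>
#include <random>
#include <string>
#include <utility>
#include <vector>

// Hypothetical options struct; every name and default here is illustrative.
struct RandomStringOptions {
  std::size_t length = 100;       // number of output slots
  std::size_t num_unique = 10;    // size of the pool that values are drawn from
  std::size_t min_chars = 1;      // minimum alphabet symbols per string
  std::size_t max_chars = 8;      // maximum alphabet symbols per string
  double null_probability = 0.0;  // chance that a slot is null
  // Each entry is one character, already UTF-8 encoded, so multi-byte
  // code points are never split.
  std::vector<std::string> alphabet = {"a", "b", "c", "d"};
  std::uint32_t seed = 42;        // fixed seed keeps benchmarks repeatable
};

// Returns `length` slots; std::nullopt stands in for a null value.
std::vector<std::optional<std::string>> GenerateRandomStrings(
    const RandomStringOptions& opts) {
  assert(!opts.alphabet.empty());
  assert(opts.num_unique > 0);
  assert(opts.min_chars <= opts.max_chars);
  assert(opts.null_probability >= 0.0 && opts.null_probability <= 1.0);

  std::mt19937 rng(opts.seed);
  std::uniform_int_distribution<std::size_t> char_count(opts.min_chars,
                                                        opts.max_chars);
  std::uniform_int_distribution<std::size_t> symbol(0, opts.alphabet.size() - 1);

  // Build the pool of candidate values. Two pool entries can collide by
  // chance, so num_unique is an upper bound on the distinct values produced.
  std::vector<std::string> pool;
  pool.reserve(opts.num_unique);
  for (std::size_t i = 0; i < opts.num_unique; ++i) {
    std::string value;
    const std::size_t n = char_count(rng);
    for (std::size_t j = 0; j < n; ++j) {
      value += opts.alphabet[symbol(rng)];
    }
    pool.push_back(std::move(value));
  }

  // Fill the output by sampling the pool, inserting nulls as requested.
  std::bernoulli_distribution is_null(opts.null_probability);
  std::uniform_int_distribution<std::size_t> pick(0, pool.size() - 1);
  std::vector<std::optional<std::string>> out;
  out.reserve(opts.length);
  for (std::size_t i = 0; i < opts.length; ++i) {
    if (is_null(rng)) {
      out.push_back(std::nullopt);
    } else {
      out.push_back(pool[pick(rng)]);
    }
  }
  return out;
}
```

Under these assumptions, a dictionary-encoding benchmark could lower num_unique to increase repetition, and a Unicode test could pass an alphabet such as {"a", "\xC3\xA9", "\xE2\x82\xAC"} (the UTF-8 encodings of "a", U+00E9 and U+20AC).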