[ 
https://issues.apache.org/jira/browse/ARROW-2624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16491809#comment-16491809
 ] 

Antoine Pitrou commented on ARROW-2624:
---------------------------------------

The ASV benchmark suite has some support for generating random data for some 
types, but not for generating random schemas
(see the {{BuiltinsGenerator}} in {{python/benchmarks/common.py}}).
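
For illustration only, a random schema generator could be sketched roughly as 
follows. This uses only the Python standard library and plain string type 
names rather than pyarrow types. All names here ({{random_type}}, 
{{random_schema}}, {{random_rows}}, {{conforms}}) are hypothetical and do not 
exist in the Arrow codebase.

```python
import random

# Hypothetical primitive type names; a real generator would map to pyarrow types.
PRIMITIVES = ["int64", "float64", "string", "bool"]


def random_type(rng, depth=0, max_depth=2):
    """Return a primitive name, ("list", child) or ("struct", {name: child})."""
    if depth >= max_depth or rng.random() < 0.5:
        return rng.choice(PRIMITIVES)
    if rng.random() < 0.5:
        return ("list", random_type(rng, depth + 1, max_depth))
    num_children = rng.randint(1, 3)
    return ("struct", {"f%d" % i: random_type(rng, depth + 1, max_depth)
                       for i in range(num_children)})


def random_schema(rng, num_fields=3, max_depth=2):
    """Return a dict mapping column names to random (possibly nested) types."""
    return {"col%d" % i: random_type(rng, 0, max_depth)
            for i in range(num_fields)}


def random_value(rng, typ, null_prob=0.1):
    """Generate one value matching typ; any value may be null (None)."""
    if rng.random() < null_prob:
        return None
    if typ == "int64":
        return rng.randint(-2**63, 2**63 - 1)
    if typ == "float64":
        return rng.uniform(-1e6, 1e6)
    if typ == "string":
        return "".join(rng.choice("abcxyz") for _ in range(rng.randint(0, 8)))
    if typ == "bool":
        return rng.random() < 0.5
    kind, child = typ
    if kind == "list":
        return [random_value(rng, child, null_prob)
                for _ in range(rng.randint(0, 4))]
    # Remaining case: struct, where child is a dict of field name -> type.
    return {name: random_value(rng, t, null_prob) for name, t in child.items()}


def random_rows(rng, schema, num_rows):
    """Generate num_rows dicts whose keys and values follow the schema."""
    return [{name: random_value(rng, t) for name, t in schema.items()}
            for _ in range(num_rows)]


def conforms(value, typ):
    """Check that a generated value matches the given type description."""
    if value is None:
        return True
    if typ == "int64":
        return isinstance(value, int) and not isinstance(value, bool)
    if typ == "float64":
        return isinstance(value, float)
    if typ == "string":
        return isinstance(value, str)
    if typ == "bool":
        return isinstance(value, bool)
    kind, child = typ
    if kind == "list":
        return isinstance(value, list) and all(conforms(v, child) for v in value)
    return (isinstance(value, dict) and set(value) == set(child)
            and all(conforms(value[n], t) for n, t in child.items()))
```

Seeding the generator (e.g. {{random.Random(42)}}) would keep any failing 
schema reproducible, which matters when the output is used to hunt edge cases.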

> [Python] Random schema and data generator for Arrow conversion and Parquet 
> testing
> ----------------------------------------------------------------------------------
>
>                 Key: ARROW-2624
>                 URL: https://issues.apache.org/jira/browse/ARROW-2624
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>            Reporter: Wes McKinney
>            Priority: Major
>
> See discussion in https://github.com/apache/arrow/issues/2067
> Being able to generate random complex schemas and corresponding example data 
> sets will help with exercising edge cases in many different parts of the 
> codebase. One practical example: reading and writing nested data in the 
> Parquet format.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
