[ https://issues.apache.org/jira/browse/ARROW-2624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16491809#comment-16491809 ]
Antoine Pitrou commented on ARROW-2624: --------------------------------------- The ASV benchmark suite has some support to generate random data for some types, but not random schema. (see the {{BuiltinsGenerator}} in {{python/benchmarks/common.py}}) > [Python] Random schema and data generator for Arrow conversion and Parquet > testing > ---------------------------------------------------------------------------------- > > Key: ARROW-2624 > URL: https://issues.apache.org/jira/browse/ARROW-2624 > Project: Apache Arrow > Issue Type: Improvement > Components: Python > Reporter: Wes McKinney > Priority: Major > > See discussion in https://github.com/apache/arrow/issues/2067 > Being able to generate random complex schemas and corresponding example data > sets will help with exercising edge cases in many different parts of the > codebase. One practical example: reading and writing nested data to Parquet > format -- This message was sent by Atlassian JIRA (v7.6.3#76005)