hugofragata opened a new issue, #12885: URL: https://github.com/apache/arrow/issues/12885
Hi all. My current use case for pyarrow is to read JSON and write the data as Parquet. I'm having issues with a specific JSON column that contains both integer and string values. I could use pyarrow.json.ParseOptions and pass it an explicit_schema; however, several schemas have this issue and I'd like a single solution for all of them.

Is it possible to keep the default inference behaviour while also ignoring or casting values that don't conform? By ignoring, I mean something similar to unexpected_field_behavior="ignore", but without an explicit_schema. By casting, I mean casting to a wider JSON data type; for example, int would be cast to str in this case.

Cheers!
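
To make the "cast to a wider type" idea concrete, here is a minimal sketch of one possible workaround: pre-process the newline-delimited JSON with the standard library before handing it to pyarrow, so that default inference sees a consistent type. It is not a pyarrow feature; the helper names `json_kind` and `widen_mixed_columns` are hypothetical, it only looks at top-level keys, and it assumes every line is a JSON object small enough to hold in memory.

```python
import json


def json_kind(value):
    """Return a coarse JSON type name for a decoded value (hypothetical helper)."""
    if value is None:
        return "null"
    if isinstance(value, bool):  # check bool before int: bool is a subclass of int
        return "bool"
    if isinstance(value, (int, float)):
        return "number"
    if isinstance(value, str):
        return "string"
    if isinstance(value, list):
        return "array"
    return "object"


def widen_mixed_columns(ndjson_text):
    """Stringify values of top-level keys whose JSON type varies between rows.

    Returns the rewritten newline-delimited JSON and the set of widened keys.
    Assumption: each non-empty line is a JSON object.
    """
    rows = [json.loads(line) for line in ndjson_text.splitlines() if line.strip()]

    # First pass: record which non-null JSON types each key takes.
    kinds = {}
    for row in rows:
        for key, value in row.items():
            kind = json_kind(value)
            if kind != "null":
                kinds.setdefault(key, set()).add(kind)
    mixed = {key for key, seen in kinds.items() if len(seen) > 1}

    # Second pass: turn non-string values of mixed keys into strings.
    for row in rows:
        for key in mixed:
            value = row.get(key)
            if value is not None and not isinstance(value, str):
                row[key] = json.dumps(value)

    return "\n".join(json.dumps(row) for row in rows) + "\n", mixed


sample = (
    '{"id": 1, "code": 7}\n'
    '{"id": 2, "code": "A7"}\n'
    '{"id": 3, "code": null}\n'
)
normalized, mixed = widen_mixed_columns(sample)
print(sorted(mixed))
print(normalized, end="")
```

The normalized text could then be passed to the reader as a file-like object, for example `pyarrow.json.read_json(io.BytesIO(normalized.encode("utf-8")))`, and the resulting table written with `pyarrow.parquet.write_table`. The trade-off is an extra pass over the data in Python, which may be too slow for large files.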