dojiong commented on issue #7845: URL: https://github.com/apache/arrow-datafusion/issues/7845#issuecomment-1769768742
> A third way could be to parse JSON data into Arrow Structs

> One limitation of this approach is that it requires all the JSON records to have the same schema

Yeah, JSON is schemaless, so it's hard to map it onto a fixed Arrow Struct schema.

----

> store BSON as Binary in datafusion

That could land quickly, without modifying Arrow's types. But there are some limitations:

`json_extract_path` in Postgres returns the `json` data type, so it's easy to process the returned data further (extract inner values or convert the type). DataFusion does not support converting binary to JSON element types (e.g. int, boolean). We would then have to add `json_extract_path_int`/`json_extract_path_str`/`json_extract_path_bool`..., which is annoying.

If we store BSON as binary, we should consider a pleasant way to convert BSON data to other data types.
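To illustrate the trade-off, here is a minimal Python sketch of one possible shape: a single generic extractor that returns the sub-document in the same binary encoding, plus a small set of cast helpers, instead of one typed extractor per element type. This is not DataFusion code. As assumptions, UTF-8 JSON bytes stand in for BSON (the Python standard library has no BSON codec), and the helper names `json_as_int`, `json_as_str`, and `json_as_bool` are hypothetical.

```python
import json
from typing import Any, Optional


def json_extract_path(doc: bytes, *path: str) -> Optional[bytes]:
    """Return the value at `path`, re-encoded as bytes, or None if the path is absent.

    Assumption: `doc` holds UTF-8 JSON text as a stand-in for BSON.
    """
    value: Any = json.loads(doc)
    for key in path:
        if isinstance(value, dict) and key in value:
            value = value[key]
        else:
            return None
    # The result stays in the same binary encoding, so calls can be chained.
    return json.dumps(value).encode("utf-8")


def json_as_int(doc: Optional[bytes]) -> Optional[int]:
    """Hypothetical cast helper: binary-encoded value -> int, or None on mismatch."""
    if doc is None:
        return None
    value = json.loads(doc)
    # bool is a subclass of int in Python, so exclude it explicitly.
    if isinstance(value, int) and not isinstance(value, bool):
        return value
    return None


def json_as_str(doc: Optional[bytes]) -> Optional[str]:
    """Hypothetical cast helper: binary-encoded value -> str, or None on mismatch."""
    if doc is None:
        return None
    value = json.loads(doc)
    return value if isinstance(value, str) else None


def json_as_bool(doc: Optional[bytes]) -> Optional[bool]:
    """Hypothetical cast helper: binary-encoded value -> bool, or None on mismatch."""
    if doc is None:
        return None
    value = json.loads(doc)
    return value if isinstance(value, bool) else None


row = b'{"user": {"name": "a", "age": 30, "active": true}}'

# Extract once, then cast; no json_extract_path_int/_str/_bool variants needed.
age = json_as_int(json_extract_path(row, "user", "age"))
name = json_as_str(json_extract_path(row, "user", "name"))
active = json_as_bool(json_extract_path(row, "user", "active"))
```

With this shape, the number of functions grows with the number of target types rather than with (extractors × types), at the cost of decoding the intermediate value twice.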
