dojiong commented on issue #7845:
URL: 
https://github.com/apache/arrow-datafusion/issues/7845#issuecomment-1769768742

   > A third way could be to parse JSON data into Arrow Structs
   > One limitation of this approach is that it requires all the JSON records 
to have the same schema
   
   Yeah, JSON is schemaless, so it's hard to map it onto a fixed Arrow Struct schema.
   
   ----
   
   > store BSON as Binary in datafusion
   
   That could land quickly, since it doesn't require modifying Arrow's types. But there 
are some limitations:
   
   `json_extract_path` in Postgres returns the `json` data type, so it's easy to 
process the returned data (extract inner values or convert the type). 
   DataFusion does not support converting binary to JSON element types (e.g. int, 
boolean).
   So we would have to add 
`json_extract_path_int`/`json_extract_path_str`/`json_extract_path_bool`..., 
which is annoying.
   
   If we store BSON as binary, we should consider a pleasant way to convert BSON 
data to other data types.
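
To make the trade-off concrete, here is a minimal sketch of the "one generic extract plus a separate cast" shape, which avoids a `json_extract_path_<type>` function per target type. It is only an illustration under stated assumptions: the helper names `json_extract_path` and `json_cast` are hypothetical and are not DataFusion functions, and UTF-8 JSON text stands in for BSON so the example runs with the Python standard library alone.

```python
import json
from typing import Any, Optional


# Hypothetical helpers for illustration; they are not part of DataFusion.
def json_extract_path(doc: Optional[bytes], *path: str) -> Optional[bytes]:
    """Return the sub-document at `path`, still encoded as binary, or None."""
    if doc is None:
        return None
    node: Any = json.loads(doc)
    for key in path:
        if isinstance(node, dict) and key in node:
            node = node[key]
        elif isinstance(node, list) and key.isdigit() and int(key) < len(node):
            node = node[int(key)]
        else:
            return None  # missing path behaves like SQL NULL
    return json.dumps(node).encode("utf-8")


def json_cast(value: Optional[bytes], target: type) -> Any:
    """Decode an extracted value; return None on NULL input or a type mismatch."""
    if value is None:
        return None
    decoded = json.loads(value)
    # bool is a subclass of int in Python, so reject it explicitly for int.
    if target is int and isinstance(decoded, bool):
        return None
    return decoded if isinstance(decoded, target) else None


# One binary "column value"; JSON text is used here in place of BSON.
row = b'{"user": {"id": 7, "name": "ann", "active": true, "tags": ["a", "b"]}}'

user_id = json_cast(json_extract_path(row, "user", "id"), int)
name = json_cast(json_extract_path(row, "user", "name"), str)
active = json_cast(json_extract_path(row, "user", "active"), bool)
second_tag = json_extract_path(row, "user", "tags", "1")
```

The extract step always returns binary, so it can be nested or chained, and the target type is chosen only at the final cast. Whether such a cast should return NULL or raise an error on a type mismatch is a design choice this sketch does not settle.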


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
