zhuqi-lucas opened a new issue, #19920:
URL: https://github.com/apache/datafusion/issues/19920

   ### Is your feature request related to a problem or challenge?
   
   DataFusion currently uses arrow-json's LineDelimitedReader, which is 
optimized for the NDJSON format (one JSON value per line). When a data source 
provides a JSON array instead (i.e., `[{...}, {...}]`), we run into parsing 
issues.
   
   I will try to add native support in DataFusion upstream by extending 
JsonFormat with a configuration option:
   ```rust
   enum JsonFormatType {
       LineDelimited,  // default, current behavior
       Array,          // new, for JSON array support
   }
   ```
   
   - Backward compatible - existing code continues to work unchanged
   
   - Clean integration - arrow-json's ReaderBuilder already supports both 
formats, so we just need to expose the option
   
   - Explicit control - users can specify which format they're working with 
rather than relying on auto-detection (which can be error-prone with large 
files)
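   
   To make the proposal concrete, here is a minimal, std-only sketch of how 
the option could be carried and what the `Array` variant implies for the 
input. `JsonReadOptions`, `with_format_type`, and `array_to_ndjson` are 
hypothetical names used only for illustration; they are not existing 
DataFusion or arrow-json APIs, and the splitter is a simplified stand-in, not 
a JSON validator.
   
   ```rust
   /// Layout of the JSON input (mirrors the enum proposed above).
   #[derive(Debug, Clone, Copy, PartialEq, Eq, Default)]
   pub enum JsonFormatType {
       /// One JSON value per line (NDJSON); the current behavior.
       #[default]
       LineDelimited,
       /// A single top-level JSON array: `[{...}, {...}]`.
       Array,
   }

   /// Hypothetical options struct; the real `JsonFormat` carries more settings.
   #[derive(Debug, Clone, Default)]
   pub struct JsonReadOptions {
       pub format_type: JsonFormatType,
   }

   impl JsonReadOptions {
       /// Builder-style setter, so existing callers keep the default.
       pub fn with_format_type(mut self, format_type: JsonFormatType) -> Self {
           self.format_type = format_type;
           self
       }
   }

   /// Illustrative helper: rewrite a top-level JSON array as one element per
   /// line. It tracks nesting depth and string state only; it does not check
   /// that brackets match by type or that values are well formed.
   pub fn array_to_ndjson(input: &str) -> Result<String, String> {
       let mut out = String::new();
       let mut depth: usize = 0; // 1 means directly inside the top-level array
       let mut in_string = false;
       let mut escaped = false;
       let mut closed = false;

       for c in input.chars() {
           if in_string {
               // Copy string contents verbatim, honoring backslash escapes.
               out.push(c);
               if escaped {
                   escaped = false;
               } else if c == '\\' {
                   escaped = true;
               } else if c == '"' {
                   in_string = false;
               }
               continue;
           }
           if c.is_whitespace() {
               continue; // whitespace outside strings is insignificant in JSON
           }
           if closed {
               return Err("trailing data after top-level array".to_string());
           }
           match c {
               '[' if depth == 0 => depth = 1,
               _ if depth == 0 => return Err("expected '[' at start".to_string()),
               '"' => {
                   in_string = true;
                   out.push(c);
               }
               '{' | '[' => {
                   depth += 1;
                   out.push(c);
               }
               '}' | ']' if depth > 1 => {
                   depth -= 1;
                   out.push(c);
               }
               ']' => {
                   // Closing bracket of the top-level array.
                   depth = 0;
                   closed = true;
                   if !out.is_empty() && !out.ends_with('\n') {
                       out.push('\n');
                   }
               }
               '}' => return Err("unbalanced '}'".to_string()),
               // A comma directly inside the top-level array separates records.
               ',' if depth == 1 => out.push('\n'),
               _ => out.push(c),
           }
       }
       if !closed {
           return Err("unterminated top-level array".to_string());
       }
       Ok(out)
   }
   ```
   
   With this sketch, `JsonReadOptions::default()` keeps `LineDelimited`, so 
existing callers are unaffected, while 
`JsonReadOptions::default().with_format_type(JsonFormatType::Array)` opts in 
explicitly. Passing NDJSON input to `array_to_ndjson` returns an error rather 
than guessing the format, which matches the explicit-control point above.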
   
   ### Describe the solution you'd like
   
   _No response_
   
   ### Describe alternatives you've considered
   
   _No response_
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

