Rafferty97 commented on issue #12852: URL: https://github.com/apache/datafusion/issues/12852#issuecomment-2469435302
This looks to be caused by the call to `Schema::try_merge` at `datasource/file_format/csv.rs:328`. That function merges schemas together by joining on field name, so a side effect is the merging of fields within a single schema that happen to have the same name. I think an important question is whether it makes sense for a schema to have multiple fields with the same name in the first place, or whether it should be disallowed. While maybe it's defensible given fields can be accessed by index, it does make it pretty hard to sensibly merge schemas based on field name as `Schema::try_merge` does. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
