dheeraj12347 commented on issue #37577: URL: https://github.com/apache/beam/issues/37577#issuecomment-3889264933
Hi @atheendre130505, I'd like to take this issue! I've worked with Python SDKs and data serialization in other Apache projects (like Fory), so I'm familiar with the challenges of schema inference and Row object construction. The description provides a very clear reproduction case. I'll start by investigating apache_beam/yaml/yaml_provider.py to see how we can better unify the schema when YamlProviders.create encounters mixed types, perhaps by enforcing a consistent wrapper or improving the diagnostic error. Please assign this to me, and I'll work on a fix. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
