StephanEwen commented on issue #7506: [FLINK-11347] Optimize the ParquetAvroWriters factory URL: https://github.com/apache/flink/pull/7506#issuecomment-455159341 The schema must be serializable, hence we convert it to a string and back. The schema is in the closure of the factory, which itself is part of the user code that is shipped for distributed execution, hence the requirement to be serializable. The parsing also happens just once when the writer is created, so my assumption is that the cost is acceptable. I would close this PR, because the solution here is not possible. Please reopen the issue, if you disagree and would like t pursue this further.
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services
