StephanEwen commented on issue #7506: [FLINK-11347] Optimize the 
ParquetAvroWriters factory
URL: https://github.com/apache/flink/pull/7506#issuecomment-455159341
 
 
   The schema must be serializable, hence we convert it to a string and back.
   The schema is in the closure of the factory, which itself is part of the 
user code that is shipped for distributed execution, hence the requirement to 
be serializable.
   
   The parsing also happens just once when the writer is created, so my 
assumption is that the cost is acceptable.
   
   I would close this PR, because the solution here is not possible.
   Please reopen the issue, if you disagree and would like t pursue this 
further.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

Reply via email to