HeartSaVioR commented on pull request #1877:
URL: https://github.com/apache/iceberg/pull/1877#issuecomment-739437218


   Yeah I'm happy to document it. Thanks!
   
   One thing I wonder is how much it is helpful or even it hurts to have table 
property for this, as we are in consensus that this might be dangerous on batch 
query if they don't notice the behavior (depending on cardinality of 
partitions). I also commented on previous PR and it got merged without 
answering it. For now we don't explain what is fanout writer in the doc, so 
they have no idea and fear to enable this, but once we document the behavior 
without proper warn, they may be misunderstanding the behavior as good for all 
cases and update the table property. (I guess you're concerning about 
documenting this due to this, do I understand correctly?)
   
   Personally I'm not 100% sure we'd like to add table property which is only 
good for specific workload, but at least we could document this with warning 
that this opens files for cardinality of partitions in the data in each task, 
so only recommended to use it in streaming write (not mentioning table property 
here). WDYT?


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to