GitHub user mallman commented on the issue:

    https://github.com/apache/spark/pull/16578
  
    > I'm reluctant to generalize this PR without practical experience applying
    > it to other column-oriented file formats. The only format I'm familiar with and
    > have production experience with is Parquet.
    
    I want to expand on what I wrote. A lot of this patch is already 
generalized, e.g. the catalyst code. The tricky part is porting and testing the 
file access code. While columnar formats operate under the same principles, the 
devil is in the details, so to speak. Hence my reluctance to sign off on a 
broad generalization of this patch to other file formats.
    
    BTW, one thing that's occurred to me is the possibility of putting this 
functionality behind a configuration setting for the first one or two releases 
in which it exists. In the case of a bug we've overlooked, the end user can 
disable the optimization. What do you think?
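
    To make the idea concrete, here is a rough sketch of what I mean, written in plain Python rather than Spark's Scala so it runs on its own. The flag name and the `read_columns` helper are made up for illustration and are not part of this patch; a real setting would go through Spark's SQL configuration.

```python
# Hypothetical flag name, used only for this sketch; not taken from the patch.
FLAG = "spark.sql.example.newOptimization.enabled"


def read_columns(requested, all_columns, conf):
    """Return the columns that would be read from the file.

    Flag on (the default here): read only the requested columns,
    i.e. the optimized path.
    Flag off: fall back to reading every column, i.e. the old behavior.
    """
    enabled = conf.get(FLAG, "true").lower() == "true"
    if enabled:
        return [c for c in all_columns if c in requested]
    return list(all_columns)


all_columns = ["id", "name", "address"]

# Default configuration: the optimization is applied.
print(read_columns({"name"}, all_columns, {}))

# A user who hits a bug turns the flag off and gets the old behavior back.
print(read_columns({"name"}, all_columns, {FLAG: "false"}))
```

    The point is that the fallback path stays in the code base for a release or two, so switching the flag off is enough to work around a bug without waiting for a fix.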


