[GitHub] spark issue #19294: [SPARK-21549][CORE] Respect OutputFormats with no output...

mridulm Thu, 28 Sep 2017 00:50:30 -0700

Github user mridulm commented on the issue:

    https://github.com/apache/spark/pull/19294
  
    @szhem Unfortunately, I do not have enough context about how sql writes 
paths to comment on when which branch will be taken.
    I was going purely from code in `HadoopMapReduceCommitProtocol`, if 
(assuming) the methods are invoked, they will throw NPE due to null path. It 
might be possible that in case of outputformat's without directory the method 
is never invoked - or it could in future evolution of code even if as an impl 
detail it currently does not or ... : I cannot unfortunately comment 
authoritatively about sql data sources.
    
    Having said that, this PR is definitely a big improvement over what 
currently exists in spark - it fixes spark core and also allows a bunch of 
other data source's (like shc for example, and I am sure others too - phoenix 
perhaps ?) to work when they currently fail.
    
    I want someone more knowledgable with spark sql to take a look - perhaps 
minor enhancements will make it work for all cases (if it does not already).



---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark issue #19294: [SPARK-21549][CORE] Respect OutputFormats with no output...

Reply via email to