[ 
https://issues.apache.org/jira/browse/SPARK-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14014910#comment-14014910
 ] 

Colin Patrick McCabe commented on SPARK-1518:
---------------------------------------------

It seems reasonable to have a list of supported versions in the Maven build.  
That wouldn't exclude people from building against other versions, of course, 
but they might have to supply the version themselves, for example as a Maven 
property via {{\-D}}.

bq. For example, do you consider 0.23 to be Hadoop 2? It's YARN-based and it 
was (and still is AFAIK) widely used at Yahoo

0.23 is not Hadoop 2.  It's a branch that Yahoo! uses internally.  Everyone 
else (Hortonworks, Cloudera, WANDisco, Intel, and so on) has moved on to 
branch-2.  Yahoo! also has some clusters running on branch-2, and that is 
their future too.  More info here: 
http://osdir.com/ml/general-hadoop-apache/2012-04/msg00000.html

bq. What about CDH4? Its version number is 2.0.0-something

Technically CDH4 is "Cloudera's distribution of Hadoop including Apache Hadoop 
2.0.0."  Its evolution didn't stop with 2.0.0, though.  We are still going to 
make another release in the cdh4 line where we backport some things, but CDH5 
is where the focus is now.

bq. In any case we will be sensible about old versions, but my philosophy is 
always to support the broadest range possible, and from everything I've seen 
it's paid off – users appreciate when you do not force their hand to upgrade. 
This is why Yahoo for example continues to be our biggest contributor on YARN 
support, even though their YARN is pretty different.

Agree.

> Spark master doesn't compile against hadoop-common trunk
> --------------------------------------------------------
>
>                 Key: SPARK-1518
>                 URL: https://issues.apache.org/jira/browse/SPARK-1518
>             Project: Spark
>          Issue Type: Bug
>          Components: Spark Core
>            Reporter: Marcelo Vanzin
>            Assignee: Colin Patrick McCabe
>            Priority: Critical
>
> FSDataOutputStream::sync() has disappeared from trunk in Hadoop; 
> FileLogger.scala is calling it.
> I've changed it locally to hsync() so I can compile the code, but haven't 
> checked yet whether those are equivalent. hsync() seems to have been there 
> forever, so it hopefully works with all versions Spark cares about.
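
One way to stay source-compatible across versions that expose different flush 
method names is to look the method up by reflection instead of calling it 
directly.  The sketch below only illustrates that idea and is not the fix 
applied in Spark: {{OldStream}}, {{NewStream}}, {{firstAvailable}} and 
{{flush}} are hypothetical names, and the two stream classes are stand-ins so 
the example runs without Hadoop on the classpath.  It also does not settle 
whether the two calls are equivalent.

```java
import java.lang.reflect.Method;

public class FlushCompat {
    // Hypothetical stand-in for a stream class that only offers the older method.
    public static class OldStream {
        public String lastCall = "";
        public void sync() { lastCall = "sync"; }
    }

    // Hypothetical stand-in for a stream class that only offers the newer method.
    public static class NewStream {
        public String lastCall = "";
        public void hsync() { lastCall = "hsync"; }
    }

    // Returns the first public no-argument method on cls whose name matches
    // one of the candidates, tried in the order given.
    public static Method firstAvailable(Class<?> cls, String... names) {
        for (String name : names) {
            try {
                return cls.getMethod(name);
            } catch (NoSuchMethodException e) {
                // Not present on this class; fall through to the next candidate.
            }
        }
        throw new IllegalArgumentException(
            "none of the candidate methods exist on " + cls.getName());
    }

    // Prefers hsync() and falls back to sync() when hsync() is missing.
    public static void flush(Object stream) throws Exception {
        firstAvailable(stream.getClass(), "hsync", "sync").invoke(stream);
    }

    public static void main(String[] args) throws Exception {
        OldStream oldStream = new OldStream();
        NewStream newStream = new NewStream();
        flush(oldStream);
        flush(newStream);
        System.out.println(oldStream.lastCall + " " + newStream.lastCall);
    }
}
```

In real code the lookup would be done once against the actual stream class and 
the resulting {{Method}} cached, since reflection on every flush adds overhead.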



--
This message was sent by Atlassian JIRA
(v6.2#6252)
