I had an interaction on my project <https://github.com/nchammas/flintrock/issues/139#issuecomment-236268723> today that suggested some people may be confused about what the packages available on the downloads page are actually for.
Specifically, the various -hadoopx.x.tgz packages suggest that Hadoop itself is actually included in the package. I’m not 100% sure myself honestly, but as I explained in my comment linked above, I believe the -hadoopx.x.tgz just indicates the version of Hadoop that Spark was built against. Does it make sense to add a brief note to the downloads page <http://spark.apache.org/downloads.html> explaining this? I am assuming it would be too disruptive to change the package names to something more descriptive like -built-against-hadoopx.x.tgz. Nick