Hello group, Am having .gz files as part of my input and when reading on the support for gzip files, I stumbled upon this thread on StackOverflow <http://stackoverflow.com/questions/16302385/gzip-support-in-spark/16309699#16309699> which says that Spark supports gz files. But a few days back I saw a mail thread here in the group pointing to this link<https://www.inkling.com/read/hadoop-definitive-guide-tom-white-3rd/chapter-4/compression#8ca1fda1252b67145680b3a5e9d45b2a> and claiming that *Spark does not handle .gz files as they are not splittable*.
These two items seems to be ambiguous. Can anyone confirm on the real scenario ? Thanks! Regards, Ramkumar Chokkalingam , University of Washington. LinkedIn <http://www.linkedin.com/in/mynameisram>
