Hello group,

Am having .gz files as part of my input and when reading on the support for
gzip files, I stumbled upon this thread on StackOverflow
<http://stackoverflow.com/questions/16302385/gzip-support-in-spark/16309699#16309699>
which
says that Spark supports gz files. But a few days back I saw a mail thread
 here in the group pointing to this
link<https://www.inkling.com/read/hadoop-definitive-guide-tom-white-3rd/chapter-4/compression#8ca1fda1252b67145680b3a5e9d45b2a>
and
claiming that *Spark does not handle .gz files as they are not splittable*.


These two items seems to be ambiguous. Can anyone confirm on the real
scenario ? Thanks!

Regards,

Ramkumar Chokkalingam ,
University of Washington.
LinkedIn <http://www.linkedin.com/in/mynameisram>

Reply via email to