Hi,
have you tried to use spark-csv (https://github.com/databricks/spark-csv)
? after all you can reconduct an XL file to CSV
hth.
On Thu, Jul 21, 2016 at 4:25 AM, Felix Cheung
wrote:
> From looking at be CLConnect package, its loadWorkbook() function only
> supports reading from local fi
>From looking at be CLConnect package, its loadWorkbook() function only
>supports reading from local file path, so you might need a way to call HDFS
>command to get the file from HDFS first.
SparkR currently does not support this - you could read it in as a text file (I
don't think .xlsx is a t
Hi Yogesh ,
I have never tried reading XLS files using Spark . But I think you can
use sc.wholeTextFiles to read the complete xls at once , as xls files are
xml internally, you need to read them all to parse . Then I think you can
use apache poi to read them .
Also, you can copy you XLS data t
Hi,
I am trying to load and read excel sheets from HDFS in sparkR using
XLConnect package.
Can anyone help me in finding out how to read xls files from HDFS in sparkR ?
Regards,
Yogesh
-
To unsubscribe e-mail: user-unsubscr...@s