Re: can not show all data for this table

2016-06-15 Thread Mich Talebzadeh
at last some progress :) Dr Mich Talebzadeh LinkedIn * https://www.linkedin.com/profile/view?id=AAEWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw * http://talebzadehmich.wordpress.com On 15 June 2016 at 10:52, Lee

Re: can not show all data for this table

2016-06-15 Thread Lee Ho Yeung
Hi Mich, i find my problem cause now, i missed setting delimiter which is tab, but it got error, and i notice that only libre office and open and read well, even if Excel in window, it still can not separate in well format scala> val df =

Re: can not show all data for this table

2016-06-15 Thread Lee Ho Yeung
Hi Mich, https://drive.google.com/file/d/0Bxs_ao6uuBDUQ2NfYnhvUl9EZXM/view?usp=sharing https://drive.google.com/file/d/0Bxs_ao6uuBDUS1UzTWd1Q2VJdEk/view?usp=sharing this time I ensure headers cover all data, only some columns which have headers do not have data but still can not show all data

Re: can not show all data for this table

2016-06-14 Thread Mich Talebzadeh
there may be an issue with data in your csv file. like blank header line etc. sounds like you have an issue there. I normally get rid of blank lines before putting csv file in hdfs. can you actually select from that temp table. like sql("select TransactionDate, TransactionType, Description,

Re: can not show all data for this table

2016-06-14 Thread Lee Ho Yeung
filter also has error 16/06/14 19:00:27 WARN Utils: Service 'SparkUI' could not bind on port 4040. Attempting port 4041. Spark context available as sc. SQL context available as sqlContext. scala> import org.apache.spark.sql.SQLContext import org.apache.spark.sql.SQLContext scala> val sqlContext

can not show all data for this table

2016-06-14 Thread Lee Ho Yeung
after tried following commands, can not show data https://drive.google.com/file/d/0Bxs_ao6uuBDUVkJYVmNaUGx2ZUE/view?usp=sharing https://drive.google.com/file/d/0Bxs_ao6uuBDUc3ltMVZqNlBUYVk/view?usp=sharing /home/martin/Downloads/spark-1.6.1/bin/spark-shell --packages