[jira] [Commented] (SPARK-17633) texFile() and wholeTextFiles() count difference

2016-09-22 Thread Anshul (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15513110#comment-15513110 ] Anshul commented on SPARK-17633: The RDD is not cached in this scenario. > texFile() and wholeTextFiles()

[jira] [Commented] (SPARK-17633) texFile() and wholeTextFiles() count difference

2016-09-22 Thread Anshul (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15513168#comment-15513168 ] Anshul commented on SPARK-17633: What could be the possible reason for this? As Spark's transformations

[jira] [Commented] (SPARK-17633) texFile() and wholeTextFiles() count difference

2016-09-22 Thread Anshul (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15513116#comment-15513116 ] Anshul commented on SPARK-17633:
data.csv
1,"a"
2,"b"
val x=sc.textFile("data.csv")
x.count is 2
If I
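The comment above is cut off in this listing, so the comparison the reporter went on to make is not shown. One general reason the two calls can report different counts is that `textFile` returns one element per line, while `wholeTextFiles` returns one (path, content) pair per file; whether that is the cause discussed in the issue is not confirmed here. The following is a minimal sketch of that behaviour, assuming Spark is on the classpath and run in local mode. The object name `CountDifference`, the helper `countBoth`, the temporary directory, and the `local[1]` master are illustrative assumptions, not taken from the issue.

```scala
import java.nio.charset.StandardCharsets
import java.nio.file.{Files, Path}

import org.apache.spark.{SparkConf, SparkContext}

object CountDifference {

  // Hypothetical helper: counts the same data through both APIs.
  // `dir` is assumed to contain exactly one file named data.csv.
  def countBoth(sc: SparkContext, dir: Path): (Long, Long) = {
    // textFile: RDD[String], one element per line of the file.
    val lines = sc.textFile(dir.resolve("data.csv").toUri.toString)
    // wholeTextFiles: RDD[(String, String)], one (path, content) pair per file.
    val files = sc.wholeTextFiles(dir.toUri.toString)
    (lines.count(), files.count())
  }

  def main(args: Array[String]): Unit = {
    // Assumed local setup for illustration only.
    val conf = new SparkConf().setAppName("count-difference").setMaster("local[1]")
    val sc = new SparkContext(conf)
    try {
      // Recreate the two-line data.csv from the comment in a temp directory.
      val dir = Files.createTempDirectory("spark-17633")
      val csv = "1,\"a\"\n2,\"b\"\n"
      Files.write(dir.resolve("data.csv"), csv.getBytes(StandardCharsets.UTF_8))

      val (lineCount, fileCount) = countBoth(sc, dir)
      println(s"textFile count:       $lineCount")
      println(s"wholeTextFiles count: $fileCount")
    } finally {
      sc.stop()
    }
  }
}
```

With the two-line file from the comment, the first count reflects the number of lines and the second the number of files in the directory, so the two values are not expected to match.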

[jira] [Created] (SPARK-17633) texFile() and wholeTextFiles() count difference

2016-09-22 Thread Anshul (JIRA)
Anshul created SPARK-17633:
Summary: texFile() and wholeTextFiles() count difference
Key: SPARK-17633
URL: https://issues.apache.org/jira/browse/SPARK-17633
Project: Spark
Issue Type: Bug