[jira] [Commented] (SPARK-19177) SparkR Data Frame operation between columns elements

2017-01-13 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821986#comment-15821986 ] Shivaram Venkataraman commented on SPARK-19177: --- Thanks for the example - this is very

[jira] [Commented] (SPARK-17237) DataFrame fill after pivot causing org.apache.spark.sql.AnalysisException

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17237?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821983#comment-15821983 ] Apache Spark commented on SPARK-17237: -- User 'maropu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Commented] (SPARK-19113) Fix flaky test: o.a.s.sql.streaming.StreamSuite fatal errors from a source should be sent to the user

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821984#comment-15821984 ] Apache Spark commented on SPARK-19113: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Updated] (SPARK-19213) FileSourceScanExec usese sparksession from hadoopfsrelation creation time instead of the one active at time of execution

2017-01-13 Thread Robert Kruszewski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Kruszewski updated SPARK-19213: -- Description: If you look at

[jira] [Commented] (SPARK-19196) Explicitly prevent Insert into View or Create View As Insert

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821957#comment-15821957 ] Sean Owen commented on SPARK-19196: --- This JIRA has become corrupted somehow. Could you reopen a new

[jira] [Deleted] (SPARK-19196) Explicitly prevent Insert into View or Create View As Insert

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen deleted SPARK-19196: -- > Explicitly prevent Insert into View or Create View As Insert >

[jira] [Deleted] (SPARK-19201) Explicitly prevent Insert into View or Create View As Insert

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen deleted SPARK-19201: -- > Explicitly prevent Insert into View or Create View As Insert >

[jira] [Commented] (SPARK-19194) collection function: index

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821956#comment-15821956 ] Sean Owen commented on SPARK-19194: --- This issue has become corrupted somehow by JIRA and I'll have to

[jira] [Commented] (SPARK-19202) Explicitly prevent Insert into View or Create View As Insert

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821954#comment-15821954 ] Sean Owen commented on SPARK-19202: --- Deleting as a corrupted duplicate of SPARK-19196 > Explicitly

[jira] [Deleted] (SPARK-19194) collection function: index

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen deleted SPARK-19194: -- > collection function: index > -- > > Key: SPARK-19194 >

[jira] [Deleted] (SPARK-19202) Explicitly prevent Insert into View or Create View As Insert

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen deleted SPARK-19202: -- > Explicitly prevent Insert into View or Create View As Insert >

[jira] [Deleted] (SPARK-19197) Explicitly prevent Insert into View or Create View As Insert

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen deleted SPARK-19197: -- > Explicitly prevent Insert into View or Create View As Insert >

[jira] [Commented] (SPARK-19201) Explicitly prevent Insert into View or Create View As Insert

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821953#comment-15821953 ] Sean Owen commented on SPARK-19201: --- Deleting as a corrupted duplicate of SPARK-19196 > Explicitly

[jira] [Deleted] (SPARK-19199) Explicitly prevent Insert into View or Create View As Insert

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen deleted SPARK-19199: -- > Explicitly prevent Insert into View or Create View As Insert >

[jira] [Commented] (SPARK-19198) Explicitly prevent Insert into View or Create View As Insert

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821949#comment-15821949 ] Sean Owen commented on SPARK-19198: --- Deleting as a corrupted duplicate of SPARK-19196 > Explicitly

[jira] [Commented] (SPARK-19197) Explicitly prevent Insert into View or Create View As Insert

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821947#comment-15821947 ] Sean Owen commented on SPARK-19197: --- Deleting as a corrupted duplicate of SPARK-19196 > Explicitly

[jira] [Commented] (SPARK-19200) Explicitly prevent Insert into View or Create View As Insert

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821952#comment-15821952 ] Sean Owen commented on SPARK-19200: --- Deleting as a corrupted duplicate of SPARK-19196 > Explicitly

[jira] [Deleted] (SPARK-19200) Explicitly prevent Insert into View or Create View As Insert

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen deleted SPARK-19200: -- > Explicitly prevent Insert into View or Create View As Insert >

[jira] [Deleted] (SPARK-19198) Explicitly prevent Insert into View or Create View As Insert

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen deleted SPARK-19198: -- > Explicitly prevent Insert into View or Create View As Insert >

[jira] [Commented] (SPARK-19199) Explicitly prevent Insert into View or Create View As Insert

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821950#comment-15821950 ] Sean Owen commented on SPARK-19199: --- Deleting as a corrupted duplicate of SPARK-19196 > Explicitly

[jira] [Deleted] (SPARK-19192) collection function: index

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen deleted SPARK-19192: -- > collection function: index > -- > > Key: SPARK-19192 >

[jira] [Deleted] (SPARK-19195) collection function:index

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen deleted SPARK-19195: -- > collection function:index > - > > Key: SPARK-19195 >

[jira] [Commented] (SPARK-19193) collection function: index

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821945#comment-15821945 ] Sean Owen commented on SPARK-19193: --- Deleting as a corrupted duplicate of SPARK-19194 > collection

[jira] [Deleted] (SPARK-19191) collection function:index

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen deleted SPARK-19191: -- > collection function:index > - > > Key: SPARK-19191 >

[jira] [Commented] (SPARK-19191) collection function:index

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821943#comment-15821943 ] Sean Owen commented on SPARK-19191: --- Deleting as a corrupted duplicate of SPARK-19194 > collection

[jira] [Deleted] (SPARK-19193) collection function: index

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen deleted SPARK-19193: -- > collection function: index > -- > > Key: SPARK-19193 >

[jira] [Commented] (SPARK-19195) collection function:index

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19195?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821946#comment-15821946 ] Sean Owen commented on SPARK-19195: --- Deleting as a corrupted duplicate of SPARK-19194 > collection

[jira] [Commented] (SPARK-19192) collection function: index

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821944#comment-15821944 ] Sean Owen commented on SPARK-19192: --- Deleting as a corrupted duplicate of SPARK-19194 > collection

[jira] [Commented] (SPARK-18667) input_file_name function does not work with UDF

2017-01-13 Thread Ben (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821929#comment-15821929 ] Ben commented on SPARK-18667: - Happy to help. Were you also able to reproduce the second issue, regarding the

[jira] [Resolved] (SPARK-19187) querying from parquet partitioned table throws FileNotFoundException when some partitions' hdfs locations do not exist

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19187. --- Resolution: Duplicate > querying from parquet partitioned table throws FileNotFoundException when >

[jira] [Comment Edited] (SPARK-18667) input_file_name function does not work with UDF

2017-01-13 Thread Ben (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821506#comment-15821506 ] Ben edited comment on SPARK-18667 at 1/13/17 3:51 PM: -- So, I created a new example

[jira] [Commented] (SPARK-19116) LogicalPlan.statistics.sizeInBytes wrong for trivial parquet file

2017-01-13 Thread Andrew Ray (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821914#comment-15821914 ] Andrew Ray commented on SPARK-19116: The 2318 number is the size of the parquet files written to disk

[jira] [Commented] (SPARK-18801) Support resolve a nested view

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821910#comment-15821910 ] Apache Spark commented on SPARK-18801: -- User 'jiangxb1987' has created a pull request for this

[jira] [Assigned] (SPARK-19214) Inconsistencies between DataFrame and Dataset APIs

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19214: Assignee: (was: Apache Spark) > Inconsistencies between DataFrame and Dataset APIs >

[jira] [Assigned] (SPARK-19214) Inconsistencies between DataFrame and Dataset APIs

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19214: Assignee: Apache Spark > Inconsistencies between DataFrame and Dataset APIs >

[jira] [Commented] (SPARK-19214) Inconsistencies between DataFrame and Dataset APIs

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821905#comment-15821905 ] Apache Spark commented on SPARK-19214: -- User 'aray' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Commented] (SPARK-19215) Add necessary check for `RDD.checkpoint` to avoid potential mistakes

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821871#comment-15821871 ] Apache Spark commented on SPARK-19215: -- User 'WeichenXu123' has created a pull request for this

[jira] [Assigned] (SPARK-19215) Add necessary check for `RDD.checkpoint` to avoid potential mistakes

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19215: Assignee: (was: Apache Spark) > Add necessary check for `RDD.checkpoint` to avoid

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19215) Add necessary check for `RDD.checkpoint` to avoid potential mistakes

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19215: Assignee: Apache Spark > Add necessary check for `RDD.checkpoint` to avoid potential

[jira] [Commented] (SPARK-19136) Aggregator with case class as output type fails with ClassCastException

2017-01-13 Thread Andrew Ray (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821868#comment-15821868 ] Andrew Ray commented on SPARK-19136: I forgot you can also just do: {code}

[jira] [Created] (SPARK-19215) Add necessary check for `RDD.checkpoint` to avoid potential mistakes

2017-01-13 Thread Weichen Xu (JIRA)
Weichen Xu created SPARK-19215: -- Summary: Add necessary check for `RDD.checkpoint` to avoid potential mistakes Key: SPARK-19215 URL: https://issues.apache.org/jira/browse/SPARK-19215 Project: Spark

[jira] [Commented] (SPARK-17568) Add spark-submit option for user to override ivy settings used to resolve packages/artifacts

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821862#comment-15821862 ] Apache Spark commented on SPARK-17568: -- User 'themodernlife' has created a pull request for this

[jira] [Commented] (SPARK-18667) input_file_name function does not work with UDF

2017-01-13 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821852#comment-15821852 ] Liang-Chi Hsieh commented on SPARK-18667: - Hi [~someonehere15], Thanks for providing the info. I

[jira] [Created] (SPARK-19214) Inconsistencies between DataFrame and Dataset APIs

2017-01-13 Thread Alexander Alexandrov (JIRA)
Alexander Alexandrov created SPARK-19214: Summary: Inconsistencies between DataFrame and Dataset APIs Key: SPARK-19214 URL: https://issues.apache.org/jira/browse/SPARK-19214 Project: Spark

[jira] [Commented] (SPARK-19111) S3 Mesos history upload fails silently if too large

2017-01-13 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821773#comment-15821773 ] Steve Loughran commented on SPARK-19111: Just realised one more thing If the allocated threads

[jira] [Comment Edited] (SPARK-19177) SparkR Data Frame operation between columns elements

2017-01-13 Thread Vicente Masip (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821758#comment-15821758 ] Vicente Masip edited comment on SPARK-19177 at 1/13/17 1:20 PM: If I want

[jira] [Comment Edited] (SPARK-19177) SparkR Data Frame operation between columns elements

2017-01-13 Thread Vicente Masip (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821758#comment-15821758 ] Vicente Masip edited comment on SPARK-19177 at 1/13/17 1:20 PM: If I want

[jira] [Comment Edited] (SPARK-19177) SparkR Data Frame operation between columns elements

2017-01-13 Thread Vicente Masip (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821758#comment-15821758 ] Vicente Masip edited comment on SPARK-19177 at 1/13/17 1:19 PM: If I want

[jira] [Comment Edited] (SPARK-19177) SparkR Data Frame operation between columns elements

2017-01-13 Thread Vicente Masip (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821758#comment-15821758 ] Vicente Masip edited comment on SPARK-19177 at 1/13/17 1:18 PM: If I want

[jira] [Comment Edited] (SPARK-19177) SparkR Data Frame operation between columns elements

2017-01-13 Thread Vicente Masip (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821758#comment-15821758 ] Vicente Masip edited comment on SPARK-19177 at 1/13/17 1:19 PM: If I want

[jira] [Comment Edited] (SPARK-19177) SparkR Data Frame operation between columns elements

2017-01-13 Thread Vicente Masip (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821758#comment-15821758 ] Vicente Masip edited comment on SPARK-19177 at 1/13/17 1:19 PM: If I want

[jira] [Commented] (SPARK-19177) SparkR Data Frame operation between columns elements

2017-01-13 Thread Vicente Masip (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821758#comment-15821758 ] Vicente Masip commented on SPARK-19177: --- If I want to specify schema with gapply or I NEED to

[jira] [Assigned] (SPARK-19213) FileSourceScanExec usese sparksession from hadoopfsrelation creation time instead of the one active at time of execution

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19213: Assignee: Apache Spark > FileSourceScanExec usese sparksession from hadoopfsrelation

[jira] [Commented] (SPARK-19213) FileSourceScanExec usese sparksession from hadoopfsrelation creation time instead of the one active at time of execution

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821736#comment-15821736 ] Apache Spark commented on SPARK-19213: -- User 'robert3005' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19213) FileSourceScanExec usese sparksession from hadoopfsrelation creation time instead of the one active at time of execution

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19213: Assignee: (was: Apache Spark) > FileSourceScanExec usese sparksession from

[jira] [Created] (SPARK-19213) FileSourceScanExec usese sparksession from hadoopfsrelation creation time instead of the one active at time of execution

2017-01-13 Thread Robert Kruszewski (JIRA)
Robert Kruszewski created SPARK-19213: - Summary: FileSourceScanExec usese sparksession from hadoopfsrelation creation time instead of the one active at time of execution Key: SPARK-19213 URL:

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Commented] (SPARK-19208) MaxAbsScaler and MinMaxScaler are very inefficient

2017-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821728#comment-15821728 ] Sean Owen commented on SPARK-19208: --- You have 29,890,095 features. At extremes of scale this might make

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Deleted] (SPARK-19190) Optimize CartesianRDD to avoid partition re-computation and re-serialization

2017-01-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang deleted SPARK-19190: > Optimize CartesianRDD to avoid partition re-computation and re-serialization >

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Deleted] (SPARK-19203) Optimize CartesianRDD to avoid partition re-computation and re-serialization

2017-01-13 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang deleted SPARK-19203: > Optimize CartesianRDD to avoid partition re-computation and re-serialization >

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid parent RDD partition

[jira] [Updated] (SPARK-19189) Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

2017-01-13 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weichen Xu updated SPARK-19189: --- Summary: Optimize CartesianRDD to avoid parent RDD partition re-computation and re-serialization

[jira] [Commented] (SPARK-19189) Optimize CartesianRDD to avoid partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821711#comment-15821711 ] Apache Spark commented on SPARK-19189: -- User 'WeichenXu123' has created a pull request for this

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: (was: Apache Spark) > Optimize CartesianRDD to avoid partition

[jira] [Assigned] (SPARK-19189) Optimize CartesianRDD to avoid partition re-computation and re-serialization

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19189: Assignee: Apache Spark > Optimize CartesianRDD to avoid partition re-computation and

[jira] [Assigned] (SPARK-18971) Netty issue may cause the shuffle client hang

2017-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18971: Assignee: Apache Spark > Netty issue may cause the shuffle client hang >

<    1   2   3   >