[jira] [Commented] (SPARK-1795) Add recursive directory file search to fileInputStream

2014-07-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14067722#comment-14067722 ] Apache Spark commented on SPARK-1795: - User 'patrickotoole' has created a pull request

[jira] [Commented] (SPARK-1623) SPARK-1623. Broadcast cleaner should use getCanonicalPath when deleting files by name

2014-07-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14067725#comment-14067725 ] Apache Spark commented on SPARK-1623: - User 'nsuthar' has created a pull request for

[jira] [Commented] (SPARK-1597) Add a version of reduceByKey that takes the Partitioner as a second argument

2014-07-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14067727#comment-14067727 ] Apache Spark commented on SPARK-1597: - User 'techaddict' has created a pull request

[jira] [Commented] (SPARK-1630) PythonRDDs don't handle nulls gracefully

2014-07-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14067729#comment-14067729 ] Apache Spark commented on SPARK-1630: - User 'kalpit' has created a pull request for

[jira] [Commented] (SPARK-1022) Add unit tests for kafka streaming

2014-07-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14067730#comment-14067730 ] Apache Spark commented on SPARK-1022: - User 'tdas' has created a pull request for this

[jira] [Commented] (SPARK-1682) Add gradient descent w/o sampling and RDA L1 updater

2014-07-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14067739#comment-14067739 ] Apache Spark commented on SPARK-1682: - User 'dongwang218' has created a pull request

[jira] [Commented] (SPARK-2226) HAVING should be able to contain aggregate expressions that don't appear in the aggregation list.

2014-07-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14067765#comment-14067765 ] Apache Spark commented on SPARK-2226: - User 'willb' has created a pull request for

[jira] [Commented] (SPARK-2521) Broadcast RDD object once per TaskSet (instead of sending it for every task)

2014-07-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14067824#comment-14067824 ] Apache Spark commented on SPARK-2521: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-2045) Sort-based shuffle implementation

2014-07-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14067826#comment-14067826 ] Apache Spark commented on SPARK-2045: - User 'mateiz' has created a pull request for

[jira] [Commented] (SPARK-2598) RangePartitioner's binary search does not use the given Ordering

2014-07-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14067836#comment-14067836 ] Apache Spark commented on SPARK-2598: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-2047) Use less memory in AppendOnlyMap.destructiveSortedIterator

2014-07-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068042#comment-14068042 ] Apache Spark commented on SPARK-2047: - User 'aarondav' has created a pull request for

[jira] [Commented] (SPARK-2282) PySpark crashes if too many tasks complete quickly

2014-07-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068082#comment-14068082 ] Apache Spark commented on SPARK-2282: - User 'aarondav' has created a pull request for

[jira] [Commented] (SPARK-2603) Remove unnecessary toMap and toList in converting Java collections to Scala collections JsonRDD.scala

2014-07-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2603?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068093#comment-14068093 ] Apache Spark commented on SPARK-2603: - User 'yhuai' has created a pull request for

[jira] [Commented] (SPARK-2470) Fix PEP 8 violations

2014-07-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068162#comment-14068162 ] Apache Spark commented on SPARK-2470: - User 'nchammas' has created a pull request for

[jira] [Commented] (SPARK-2582) Make Block Manager Master pluggable

2014-07-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068238#comment-14068238 ] Apache Spark commented on SPARK-2582: - User 'harishreedharan' has created a pull

[jira] [Commented] (SPARK-2565) Update ShuffleReadMetrics as blocks are fetched

2014-07-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068262#comment-14068262 ] Apache Spark commented on SPARK-2565: - User 'sryza' has created a pull request for

[jira] [Commented] (SPARK-2103) Java + Kafka + Spark Streaming NoSuchMethodError in java.lang.Object.init

2014-07-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068263#comment-14068263 ] Apache Spark commented on SPARK-2103: - User 'jerryshao' has created a pull request for

[jira] [Commented] (SPARK-2549) Functions defined inside of other functions trigger failures

2014-07-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068368#comment-14068368 ] Apache Spark commented on SPARK-2549: - User 'ScrapCodes' has created a pull request

[jira] [Commented] (SPARK-1680) Clean up use of setExecutorEnvs in SparkConf

2014-07-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068584#comment-14068584 ] Apache Spark commented on SPARK-1680: - User 'tgravescs' has created a pull request for

[jira] [Commented] (SPARK-2608) scheduler backend create executor launch command not correctly

2014-07-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068710#comment-14068710 ] Apache Spark commented on SPARK-2608: - User 'scwf' has created a pull request for this

[jira] [Commented] (SPARK-2434) Generate runtime warnings for naive implementations

2014-07-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14069010#comment-14069010 ] Apache Spark commented on SPARK-2434: - User 'brkyvz' has created a pull request for

[jira] [Commented] (SPARK-2567) Resubmitted stage sometimes remains as active stage in the web UI

2014-07-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14069311#comment-14069311 ] Apache Spark commented on SPARK-2567: - User 'tsudukim' has created a pull request for

[jira] [Commented] (SPARK-2609) Log thread ID when spilling ExternalAppendOnlyMap

2014-07-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14069327#comment-14069327 ] Apache Spark commented on SPARK-2609: - User 'andrewor14' has created a pull request

[jira] [Commented] (SPARK-2505) Weighted Regularizer

2014-07-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14069354#comment-14069354 ] Apache Spark commented on SPARK-2505: - User 'dbtsai' has created a pull request for

[jira] [Commented] (SPARK-2514) Random RDD generator

2014-07-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14069778#comment-14069778 ] Apache Spark commented on SPARK-2514: - User 'dorx' has created a pull request for this

[jira] [Commented] (SPARK-2612) ALS has data skew for popular product

2014-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14069883#comment-14069883 ] Apache Spark commented on SPARK-2612: - User 'renozhang' has created a pull request for

[jira] [Commented] (SPARK-2615) Add == support for HiveQl

2014-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2615?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14069934#comment-14069934 ] Apache Spark commented on SPARK-2615: - User 'chenghao-intel' has created a pull

[jira] [Commented] (SPARK-2617) Correct doc and usage of preservesPartitioning

2014-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14070066#comment-14070066 ] Apache Spark commented on SPARK-2617: - User 'mengxr' has created a pull request for

[jira] [Commented] (SPARK-2614) Add the spark-examples-xxx-.jar to the Debian package created by assembly/pom.xml (e.g. -PDeb)

2014-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14070197#comment-14070197 ] Apache Spark commented on SPARK-2614: - User 'tzolov' has created a pull request for

[jira] [Commented] (SPARK-2260) Spark submit standalone-cluster mode is broken

2014-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14071296#comment-14071296 ] Apache Spark commented on SPARK-2260: - User 'andrewor14' has created a pull request

[jira] [Commented] (SPARK-2634) MapOutputTrackerWorker.mapStatuses should be thread-safe

2014-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14071317#comment-14071317 ] Apache Spark commented on SPARK-2634: - User 'zsxwing' has created a pull request for

[jira] [Commented] (SPARK-2638) Improve concurrency of fetching Map outputs

2014-07-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14071351#comment-14071351 ] Apache Spark commented on SPARK-2638: - User 'javadba' has created a pull request for

[jira] [Commented] (SPARK-2640) In local[N], free cores of the only executor should be touched by spark.task.cpus for every finish/start-up of tasks.

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14071448#comment-14071448 ] Apache Spark commented on SPARK-2640: - User 'woshilaiceshide' has created a pull

[jira] [Commented] (SPARK-2298) Show stage attempt in UI

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14071542#comment-14071542 ] Apache Spark commented on SPARK-2298: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-2644) Hive should not be enabled by default in the build.

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14071556#comment-14071556 ] Apache Spark commented on SPARK-2644: - User 'ScrapCodes' has created a pull request

[jira] [Commented] (SPARK-2646) log4j initialization not quite compatible with log4j 2.x

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14071745#comment-14071745 ] Apache Spark commented on SPARK-2646: - User 'srowen' has created a pull request for

[jira] [Commented] (SPARK-2647) DAGScheduler plugs others when processing one JobSubmitted event

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2647?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14071754#comment-14071754 ] Apache Spark commented on SPARK-2647: - User 'YanTangZhai' has created a pull request

[jira] [Commented] (SPARK-2648) through shuffling blocksByAddress avoid much reducers to fetch data from a executor at a time

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14071996#comment-14071996 ] Apache Spark commented on SPARK-2648: - User 'lianhuiwang' has created a pull request

[jira] [Commented] (SPARK-2651) Add maven scalastyle plugin

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14072164#comment-14072164 ] Apache Spark commented on SPARK-2651: - User 'rahulsinghaliitd' has created a pull

[jira] [Commented] (SPARK-1630) PythonRDDs don't handle nulls gracefully

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14072235#comment-14072235 ] Apache Spark commented on SPARK-1630: - User 'davies' has created a pull request for

[jira] [Commented] (SPARK-2569) Customized UDFs in hive not running with Spark SQL

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14072258#comment-14072258 ] Apache Spark commented on SPARK-2569: - User 'marmbrus' has created a pull request for

[jira] [Commented] (SPARK-2656) Python version without support for exact sample size

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2656?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14072585#comment-14072585 ] Apache Spark commented on SPARK-2656: - User 'dorx' has created a pull request for this

[jira] [Commented] (SPARK-2657) Use more compact data structures than ArrayBuffer in groupBy and cogroup

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14072608#comment-14072608 ] Apache Spark commented on SPARK-2657: - User 'mateiz' has created a pull request for

[jira] [Commented] (SPARK-2658) HiveQL: 1 = true should evaluate to true

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14072629#comment-14072629 ] Apache Spark commented on SPARK-2658: - User 'marmbrus' has created a pull request for

[jira] [Commented] (SPARK-2659) HiveQL: Division operator should always perform fractional division

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14072655#comment-14072655 ] Apache Spark commented on SPARK-2659: - User 'marmbrus' has created a pull request for

[jira] [Commented] (SPARK-2458) Make failed application log visible on History Server

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14072665#comment-14072665 ] Apache Spark commented on SPARK-2458: - User 'tsudukim' has created a pull request for

[jira] [Commented] (SPARK-2010) Support for nested data in PySpark SQL

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14072681#comment-14072681 ] Apache Spark commented on SPARK-2010: - User 'davies' has created a pull request for

[jira] [Commented] (SPARK-2568) RangePartitioner should go through the data only once

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14072729#comment-14072729 ] Apache Spark commented on SPARK-2568: - User 'mengxr' has created a pull request for

[jira] [Commented] (SPARK-2567) Resubmitted stage sometimes remains as active stage in the web UI

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14072809#comment-14072809 ] Apache Spark commented on SPARK-2567: - User 'kayousterhout' has created a pull request

[jira] [Commented] (SPARK-1726) Tasks that fail to serialize remain in active stages forever.

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14072808#comment-14072808 ] Apache Spark commented on SPARK-1726: - User 'kayousterhout' has created a pull request

[jira] [Commented] (SPARK-2663) Support the GroupingSet/ROLLUP/CUBE

2014-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14072832#comment-14072832 ] Apache Spark commented on SPARK-2663: - User 'chenghao-intel' has created a pull

[jira] [Commented] (SPARK-2652) Turning default configurations for PySpark

2014-07-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2652?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14072909#comment-14072909 ] Apache Spark commented on SPARK-2652: - User 'davies' has created a pull request for

[jira] [Commented] (SPARK-2665) Add EqualNS support for HiveQL

2014-07-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14072977#comment-14072977 ] Apache Spark commented on SPARK-2665: - User 'chenghao-intel' has created a pull

[jira] [Commented] (SPARK-2604) Spark Application hangs on yarn in edge case scenario of executor memory requirement

2014-07-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073085#comment-14073085 ] Apache Spark commented on SPARK-2604: - User 'twinkle-sachdeva' has created a pull

[jira] [Commented] (SPARK-2666) when task is FetchFailed cancel running tasks of failedStage

2014-07-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073116#comment-14073116 ] Apache Spark commented on SPARK-2666: - User 'lianhuiwang' has created a pull request

[jira] [Commented] (SPARK-2668) Support log4j log to yarn container log directory

2014-07-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073199#comment-14073199 ] Apache Spark commented on SPARK-2668: - User 'renozhang' has created a pull request for

[jira] [Commented] (SPARK-2669) Hadoop configuration is not localised when submitting job in yarn-cluster mode

2014-07-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073338#comment-14073338 ] Apache Spark commented on SPARK-2669: - User 'redbaron' has created a pull request for

[jira] [Commented] (SPARK-2479) Comparing floating-point numbers using relative error in UnitTests

2014-07-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073430#comment-14073430 ] Apache Spark commented on SPARK-2479: - User 'mengxr' has created a pull request for

[jira] [Commented] (SPARK-2464) Twitter Receiver does not stop correctly when streamingContext.stop is called

2014-07-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073518#comment-14073518 ] Apache Spark commented on SPARK-2464: - User 'tdas' has created a pull request for this

[jira] [Commented] (SPARK-2670) FetchFailedException should be thrown when local fetch has failed

2014-07-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073539#comment-14073539 ] Apache Spark commented on SPARK-2670: - User 'sarutak' has created a pull request for

[jira] [Commented] (SPARK-2675) LiveListenerBus should set higher capacity for its event queue

2014-07-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073546#comment-14073546 ] Apache Spark commented on SPARK-2675: - User 'concretevitamin' has created a pull

[jira] [Commented] (SPARK-2671) BlockObjectWriter should create parent directory when the directory doesn't exist

2014-07-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2671?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073548#comment-14073548 ] Apache Spark commented on SPARK-2671: - User 'sarutak' has created a pull request for

[jira] [Commented] (SPARK-2679) Ser/De for Double to enable calling Java API from python in MLlib

2014-07-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073833#comment-14073833 ] Apache Spark commented on SPARK-2679: - User 'dorx' has created a pull request for this

[jira] [Commented] (SPARK-2529) Clean the closure in foreach and foreachPartition

2014-07-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073964#comment-14073964 ] Apache Spark commented on SPARK-2529: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-2682) Javadoc generated from Scala source code is not in javadoc's index

2014-07-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14074109#comment-14074109 ] Apache Spark commented on SPARK-2682: - User 'yhuai' has created a pull request for

[jira] [Commented] (SPARK-2683) unidoc failed because org.apache.spark.util.CallSite uses Java keywords as value names

2014-07-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14074129#comment-14074129 ] Apache Spark commented on SPARK-2683: - User 'yhuai' has created a pull request for

[jira] [Commented] (SPARK-2686) Add Length support to Spark SQL and HQL and Strlen support to SQL

2014-07-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14074135#comment-14074135 ] Apache Spark commented on SPARK-2686: - User 'javadba' has created a pull request for

[jira] [Commented] (SPARK-2620) case class cannot be used as key for reduce

2014-07-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14074196#comment-14074196 ] Apache Spark commented on SPARK-2620: - User 'ash211' has created a pull request for

[jira] [Commented] (SPARK-2687) after receving allocated containers,amClient should remove ContainerRequest.

2014-07-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14074234#comment-14074234 ] Apache Spark commented on SPARK-2687: - User 'lianhuiwang' has created a pull request

[jira] [Commented] (SPARK-2547) The clustering documentaion example provided for spark 0.9.1/docs is having a error

2014-07-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2547?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14074282#comment-14074282 ] Apache Spark commented on SPARK-2547: - User 'yu-iskw' has created a pull request for

[jira] [Commented] (SPARK-2314) RDD actions are only overridden in Scala, not java or python

2014-07-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14074840#comment-14074840 ] Apache Spark commented on SPARK-2314: - User 'staple' has created a pull request for

[jira] [Commented] (SPARK-2680) Lower spark.shuffle.memoryFraction to 0.2 by default

2014-07-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14074972#comment-14074972 ] Apache Spark commented on SPARK-2680: - User 'mateiz' has created a pull request for

[jira] [Commented] (SPARK-2696) Reduce default spark.serializer.objectStreamReset

2014-07-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14075117#comment-14075117 ] Apache Spark commented on SPARK-2696: - User 'falaki' has created a pull request for

[jira] [Commented] (SPARK-1458) Expose sc.version in PySpark

2014-07-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14075126#comment-14075126 ] Apache Spark commented on SPARK-1458: - User 'JoshRosen' has created a pull request for

[jira] [Commented] (SPARK-2279) JavaSparkContext should allow creation of EmptyRDD

2014-07-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14075162#comment-14075162 ] Apache Spark commented on SPARK-2279: - User 'bobpaulin' has created a pull request for

[jira] [Commented] (SPARK-2010) Support for nested data in PySpark SQL

2014-07-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14075188#comment-14075188 ] Apache Spark commented on SPARK-2010: - User 'davies' has created a pull request for

[jira] [Commented] (SPARK-2700) Hidden files (such as .impala_insert_staging) should be filtered out by sqlContext.parquetFile

2014-07-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14075237#comment-14075237 ] Apache Spark commented on SPARK-2700: - User 'chutium' has created a pull request for

[jira] [Commented] (SPARK-2674) Add date and time types to inferSchema

2014-07-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14075284#comment-14075284 ] Apache Spark commented on SPARK-2674: - User 'davies' has created a pull request for

[jira] [Commented] (SPARK-2704) ConnectionManager threads should be named and daemon

2014-07-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14075478#comment-14075478 ] Apache Spark commented on SPARK-2704: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-2601) py4j.Py4JException on sc.pickleFile

2014-07-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14075498#comment-14075498 ] Apache Spark commented on SPARK-2601: - User 'JoshRosen' has created a pull request for

[jira] [Commented] (SPARK-1550) Successive creation of spark context fails in pyspark, if the previous initialization of spark context had failed.

2014-07-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14075517#comment-14075517 ] Apache Spark commented on SPARK-1550: - User 'JoshRosen' has created a pull request for

[jira] [Commented] (SPARK-2684) Update ExternalAppendOnlyMap to take an iterator as input

2014-07-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14075526#comment-14075526 ] Apache Spark commented on SPARK-2684: - User 'mateiz' has created a pull request for

[jira] [Commented] (SPARK-2532) Fix issues with consolidated shuffle

2014-07-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14075639#comment-14075639 ] Apache Spark commented on SPARK-2532: - User 'mridulm' has created a pull request for

[jira] [Commented] (SPARK-2420) Change Spark build to minimize library conflicts

2014-07-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14075754#comment-14075754 ] Apache Spark commented on SPARK-2420: - User 'srowen' has created a pull request for

[jira] [Commented] (SPARK-2614) Add the spark-examples-xxx-.jar to the Debian packages created with mvn ... -Pdeb (using assembly/pom.xml)

2014-07-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14075768#comment-14075768 ] Apache Spark commented on SPARK-2614: - User 'tzolov' has created a pull request for

[jira] [Commented] (SPARK-2710) Build SchemaRDD from a JdbcRDD with MetaData (no hard code case class)

2014-07-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14075789#comment-14075789 ] Apache Spark commented on SPARK-2710: - User 'chutium' has created a pull request for

[jira] [Commented] (SPARK-2713) Executors of same application in same host should only download files jars once

2014-07-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14075916#comment-14075916 ] Apache Spark commented on SPARK-2713: - User 'li-zhihui' has created a pull request for

[jira] [Commented] (SPARK-2714) DAGScheduler logs jobid when runJob finishes

2014-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14076232#comment-14076232 ] Apache Spark commented on SPARK-2714: - User 'YanTangZhai' has created a pull request

[jira] [Commented] (SPARK-2715) ExternalAppendOnlyMap adds max limit of times and max limit of disk bytes written for spilling

2014-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14076257#comment-14076257 ] Apache Spark commented on SPARK-2715: - User 'YanTangZhai' has created a pull request

[jira] [Commented] (SPARK-2677) BasicBlockFetchIterator#next can wait forever

2014-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14076278#comment-14076278 ] Apache Spark commented on SPARK-2677: - User 'witgo' has created a pull request for

[jira] [Commented] (SPARK-2410) Thrift/JDBC Server

2014-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14076453#comment-14076453 ] Apache Spark commented on SPARK-2410: - User 'liancheng' has created a pull request for

[jira] [Commented] (SPARK-2022) Spark 1.0.0 is failing if mesos.coarse set to true

2014-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14077110#comment-14077110 ] Apache Spark commented on SPARK-2022: - User 'tnachen' has created a pull request for

[jira] [Commented] (SPARK-1687) Support NamedTuples in RDDs

2014-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14077136#comment-14077136 ] Apache Spark commented on SPARK-1687: - User 'davies' has created a pull request for

[jira] [Commented] (SPARK-2550) Support regularization and intercept in pyspark's linear methods

2014-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14077158#comment-14077158 ] Apache Spark commented on SPARK-2550: - User 'miccagiann' has created a pull request

[jira] [Commented] (SPARK-2580) broken pipe collecting schemardd results

2014-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14077193#comment-14077193 ] Apache Spark commented on SPARK-2580: - User 'davies' has created a pull request for

[jira] [Commented] (SPARK-2305) pyspark - depend on py4j 0.8.1

2014-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14077202#comment-14077202 ] Apache Spark commented on SPARK-2305: - User 'JoshRosen' has created a pull request for

[jira] [Commented] (SPARK-2724) Python version of Random RDD without support for arbitrary distribution

2014-07-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14077286#comment-14077286 ] Apache Spark commented on SPARK-2724: - User 'dorx' has created a pull request for this

[jira] [Commented] (SPARK-2677) BasicBlockFetchIterator#next can wait forever

2014-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14077439#comment-14077439 ] Apache Spark commented on SPARK-2677: - User 'sarutak' has created a pull request for

[jira] [Commented] (SPARK-2632) Importing a method of class in Spark REPL causes the REPL to pulls in unnecessary stuff.

2014-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14077552#comment-14077552 ] Apache Spark commented on SPARK-2632: - User 'ScrapCodes' has created a pull request

[jira] [Commented] (SPARK-2576) slave node throws NoClassDefFoundError $line11.$read$ when executing a Spark QL query on HDFS CSV file

2014-07-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14077553#comment-14077553 ] Apache Spark commented on SPARK-2576: - User 'ScrapCodes' has created a pull request

  1   2   3   4   5   6   7   8   9   10   >