[jira] [Updated] (SPARK-6884) random forest predict probabilities functionality (like in sklearn)

2015-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6884: - Affects Version/s: (was: 1.4.0) 1.3.0 random forest predict

[jira] [Commented] (SPARK-6703) Provide a way to discover existing SparkContext's

2015-04-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14492888#comment-14492888 ] Patrick Wendell commented on SPARK-6703: Hey [~ilganeli] - sure thing. I've pinged

[jira] [Updated] (SPARK-6703) Provide a way to discover existing SparkContext's

2015-04-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6703: --- Assignee: Ilya Ganelin Provide a way to discover existing SparkContext's

[jira] [Commented] (SPARK-3727) Trees and ensembles: More prediction functionality

2015-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14492928#comment-14492928 ] Joseph K. Bradley commented on SPARK-3727: -- [~maxkaznady] [~mqk] I split this

[jira] [Commented] (SPARK-3727) DecisionTree, RandomForest: More prediction functionality

2015-04-13 Thread Max Kaznady (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14492839#comment-14492839 ] Max Kaznady commented on SPARK-3727: I implemented the same thing but for PySpark.

[jira] [Commented] (SPARK-6113) Stabilize DecisionTree and ensembles APIs

2015-04-13 Thread Max Kaznady (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14492959#comment-14492959 ] Max Kaznady commented on SPARK-6113: [~josephkb] Is it possible to host the API Design

[jira] [Resolved] (SPARK-6662) Allow variable substitution in spark.yarn.historyServer.address

2015-04-13 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-6662. -- Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Cheolsoo Park Allow variable

[jira] [Commented] (SPARK-3727) DecisionTree, RandomForest: More prediction functionality

2015-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14492887#comment-14492887 ] Joseph K. Bradley commented on SPARK-3727: -- Thanks for your initial work on this

[jira] [Updated] (SPARK-6703) Provide a way to discover existing SparkContext's

2015-04-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6703: --- Priority: Critical (was: Major) Provide a way to discover existing SparkContext's

[jira] [Updated] (SPARK-6884) Random forest: predict class probabilities

2015-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6884: - Assignee: Max Kaznady Random forest: predict class probabilities

[jira] [Updated] (SPARK-6884) Random forest: predict class probabilities

2015-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6884: - Assignee: (was: Max Kaznady) Random forest: predict class probabilities

[jira] [Commented] (SPARK-6703) Provide a way to discover existing SparkContext's

2015-04-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14492898#comment-14492898 ] Patrick Wendell commented on SPARK-6703: /cc [~velvia] Provide a way to discover

[jira] [Commented] (SPARK-6113) Stabilize DecisionTree and ensembles APIs

2015-04-13 Thread Max Kaznady (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14492989#comment-14492989 ] Max Kaznady commented on SPARK-6113: Other places need serious improvement as well,

[jira] [Commented] (SPARK-3727) DecisionTree, RandomForest: More prediction functionality

2015-04-13 Thread Max Kaznady (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14492906#comment-14492906 ] Max Kaznady commented on SPARK-3727: Yes, probabilities have to be added to other

[jira] [Commented] (SPARK-6884) Random forest: predict class probabilities

2015-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14492904#comment-14492904 ] Joseph K. Bradley commented on SPARK-6884: -- I'd recommend: Under spark.ml, have

[jira] [Updated] (SPARK-3727) Trees and ensembles: More prediction functionality

2015-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-3727: - Summary: Trees and ensembles: More prediction functionality (was: DecisionTree,

[jira] [Commented] (SPARK-6703) Provide a way to discover existing SparkContext's

2015-04-13 Thread Ilya Ganelin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14492909#comment-14492909 ] Ilya Ganelin commented on SPARK-6703: - Patrick - what's the time line for the 1.4

[jira] [Commented] (SPARK-6682) Deprecate static train and use builder instead for Scala/Java

2015-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14492867#comment-14492867 ] Joseph K. Bradley commented on SPARK-6682: -- Do you mean (a) tests to make sure

[jira] [Commented] (SPARK-6884) random forest predict probabilities functionality (like in sklearn)

2015-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14492892#comment-14492892 ] Joseph K. Bradley commented on SPARK-6884: -- Is this not a duplicate of

[jira] [Created] (SPARK-6885) Decision trees: predict class probabilities

2015-04-13 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-6885: Summary: Decision trees: predict class probabilities Key: SPARK-6885 URL: https://issues.apache.org/jira/browse/SPARK-6885 Project: Spark Issue

[jira] [Resolved] (SPARK-5988) Model import/export for PowerIterationClusteringModel

2015-04-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5988. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5450

[jira] [Created] (SPARK-6884) random forest predict probabilities functionality (like in sklearn)

2015-04-13 Thread Max Kaznady (JIRA)
Max Kaznady created SPARK-6884: -- Summary: random forest predict probabilities functionality (like in sklearn) Key: SPARK-6884 URL: https://issues.apache.org/jira/browse/SPARK-6884 Project: Spark

[jira] [Commented] (SPARK-3727) Trees and ensembles: More prediction functionality

2015-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14492931#comment-14492931 ] Joseph K. Bradley commented on SPARK-3727: -- [~maxkaznady] Implementations should

[jira] [Updated] (SPARK-6887) ColumnBuilder misses FloatType

2015-04-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6887: Description: To reproduce ... {code} import org.apache.spark.sql.types._ import org.apache.spark.sql.Row

[jira] [Assigned] (SPARK-6887) ColumnBuilder misses FloatType

2015-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6887: --- Assignee: Apache Spark (was: Yin Huai) ColumnBuilder misses FloatType

[jira] [Commented] (SPARK-6865) Decide on semantics for string identifiers in DataFrame API

2015-04-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493193#comment-14493193 ] Reynold Xin commented on SPARK-6865: As discussed offline, it would make more sense

[jira] [Updated] (SPARK-4638) Spark's MLlib SVM classification to include Kernels like Gaussian / (RBF) to find non linear boundaries

2015-04-13 Thread Mandar Chandorkar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mandar Chandorkar updated SPARK-4638: - Attachment: kernels-1.3.patch Patch for the kernels implementation taken against the

[jira] [Resolved] (SPARK-5972) Cache residuals for GradientBoostedTrees during training

2015-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-5972. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5330

[jira] [Comment Edited] (SPARK-6511) Publish hadoop provided build with instructions for different distros

2015-04-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493183#comment-14493183 ] Patrick Wendell edited comment on SPARK-6511 at 4/13/15 10:11 PM:

[jira] [Created] (SPARK-6886) Big closure in PySpark will fail during shuffle

2015-04-13 Thread Davies Liu (JIRA)
Davies Liu created SPARK-6886: - Summary: Big closure in PySpark will fail during shuffle Key: SPARK-6886 URL: https://issues.apache.org/jira/browse/SPARK-6886 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-6130) support if not exists for insert overwrite into partition in hiveQl

2015-04-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6130. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 4865

[jira] [Assigned] (SPARK-6886) Big closure in PySpark will fail during shuffle

2015-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6886: --- Assignee: Davies Liu (was: Apache Spark) Big closure in PySpark will fail during shuffle

[jira] [Commented] (SPARK-6886) Big closure in PySpark will fail during shuffle

2015-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493068#comment-14493068 ] Apache Spark commented on SPARK-6886: - User 'davies' has created a pull request for

[jira] [Assigned] (SPARK-6886) Big closure in PySpark will fail during shuffle

2015-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6886: --- Assignee: Apache Spark (was: Davies Liu) Big closure in PySpark will fail during shuffle

[jira] [Assigned] (SPARK-6368) Build a specialized serializer for Exchange operator.

2015-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6368: --- Assignee: Yin Huai (was: Apache Spark) Build a specialized serializer for Exchange

[jira] [Assigned] (SPARK-6368) Build a specialized serializer for Exchange operator.

2015-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6368: --- Assignee: Apache Spark (was: Yin Huai) Build a specialized serializer for Exchange

[jira] [Commented] (SPARK-6368) Build a specialized serializer for Exchange operator.

2015-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493092#comment-14493092 ] Apache Spark commented on SPARK-6368: - User 'yhuai' has created a pull request for

[jira] [Commented] (SPARK-6703) Provide a way to discover existing SparkContext's

2015-04-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493179#comment-14493179 ] Patrick Wendell commented on SPARK-6703: Yes, ideally we get it into 1.4 - though

[jira] [Commented] (SPARK-4638) Spark's MLlib SVM classification to include Kernels like Gaussian / (RBF) to find non linear boundaries

2015-04-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493180#comment-14493180 ] Sean Owen commented on SPARK-4638: -- [~mandar2812] Spark does not use patches in JIRA but

[jira] [Updated] (SPARK-5632) not able to resolve dot('.') in field name

2015-04-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5632: --- Issue Type: Sub-task (was: Bug) Parent: SPARK-6116 not able to resolve dot('.') in field

[jira] [Commented] (SPARK-6511) Publish hadoop provided build with instructions for different distros

2015-04-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493183#comment-14493183 ] Patrick Wendell commented on SPARK-6511: Just as an example I tried to wire Spark

[jira] [Commented] (SPARK-6511) Publish hadoop provided build with instructions for different distros

2015-04-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493200#comment-14493200 ] Sean Owen commented on SPARK-6511: -- Yeah that might be the fastest way to find all the

[jira] [Assigned] (SPARK-1701) Inconsistent naming: slice or partition

2015-04-13 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves reassigned SPARK-1701: Assignee: Thomas Graves Inconsistent naming: slice or partition

[jira] [Resolved] (SPARK-6742) Spark pushes down filters in old parquet path that reference partitioning columns

2015-04-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6742. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5390

[jira] [Updated] (SPARK-5632) not able to resolve dot('.') in field name

2015-04-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5632: --- Priority: Blocker (was: Major) Target Version/s: 1.4.0 Affects Version/s: 1.3.0

[jira] [Comment Edited] (SPARK-6865) Decide on semantics for string identifiers in DataFrame API

2015-04-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493193#comment-14493193 ] Reynold Xin edited comment on SPARK-6865 at 4/13/15 10:26 PM: --

[jira] [Resolved] (SPARK-6865) Decide on semantics for string identifiers in DataFrame API

2015-04-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-6865. Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Reynold Xin This is now decided.

[jira] [Updated] (SPARK-2873) Support disk spilling in Spark SQL aggregation / join

2015-04-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2873: Priority: Blocker (was: Major) Support disk spilling in Spark SQL aggregation / join

[jira] [Commented] (SPARK-1701) Inconsistent naming: slice or partition

2015-04-13 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493147#comment-14493147 ] Nicholas Chammas commented on SPARK-1701: - [~tgraves] - Shouldn't this issue be

[jira] [Comment Edited] (SPARK-4638) Spark's MLlib SVM classification to include Kernels like Gaussian / (RBF) to find non linear boundaries

2015-04-13 Thread Mandar Chandorkar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493174#comment-14493174 ] Mandar Chandorkar edited comment on SPARK-4638 at 4/13/15 10:05 PM:

[jira] [Created] (SPARK-6887) ColumnBuilder misses FloatType

2015-04-13 Thread Yin Huai (JIRA)
Yin Huai created SPARK-6887: --- Summary: ColumnBuilder misses FloatType Key: SPARK-6887 URL: https://issues.apache.org/jira/browse/SPARK-6887 Project: Spark Issue Type: Bug Components: SQL

[jira] [Assigned] (SPARK-6887) ColumnBuilder misses FloatType

2015-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6887: --- Assignee: Yin Huai (was: Apache Spark) ColumnBuilder misses FloatType

[jira] [Commented] (SPARK-6887) ColumnBuilder misses FloatType

2015-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493186#comment-14493186 ] Apache Spark commented on SPARK-6887: - User 'yhuai' has created a pull request for

[jira] [Commented] (SPARK-6820) Convert NAs to null type in SparkR DataFrames

2015-04-13 Thread Antonio Piccolboni (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493223#comment-14493223 ] Antonio Piccolboni commented on SPARK-6820: --- For the distinction between NAs and

[jira] [Commented] (SPARK-6889) Streamline contribution process with update to Contribution wiki, JIRA rules

2015-04-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493252#comment-14493252 ] Sean Owen commented on SPARK-6889: -- For those that would like to comment directly on the

[jira] [Commented] (SPARK-6703) Provide a way to discover existing SparkContext's

2015-04-13 Thread Evan Chan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493303#comment-14493303 ] Evan Chan commented on SPARK-6703: -- Hey folks, Thought I would just put in my 2 cents as

[jira] [Resolved] (SPARK-6881) Change the checkpoint directory name from checkpoints to checkpoint

2015-04-13 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-6881. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request

[jira] [Commented] (SPARK-6511) Publish hadoop provided build with instructions for different distros

2015-04-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493274#comment-14493274 ] Patrick Wendell commented on SPARK-6511: Can we just run HADOOP_HOME/bin/hadoop

[jira] [Created] (SPARK-6890) Local cluster mode in Mac is broken

2015-04-13 Thread Davies Liu (JIRA)
Davies Liu created SPARK-6890: - Summary: Local cluster mode in Mac is broken Key: SPARK-6890 URL: https://issues.apache.org/jira/browse/SPARK-6890 Project: Spark Issue Type: Bug

[jira] [Comment Edited] (SPARK-6151) schemaRDD to parquetfile with saveAsParquetFile control the HDFS block size

2015-04-13 Thread Littlestar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14491881#comment-14491881 ] Littlestar edited comment on SPARK-6151 at 4/14/15 1:04 AM:

[jira] [Resolved] (SPARK-6877) Add code generation support for Min

2015-04-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6877. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5487

[jira] [Commented] (SPARK-6890) Local cluster mode is broken

2015-04-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493448#comment-14493448 ] Andrew Or commented on SPARK-6890: -- I'm not actively working on this. Feel free to fix it

[jira] [Created] (SPARK-6891) ExecutorAllocationManager will request negative number executors

2015-04-13 Thread meiyoula (JIRA)
meiyoula created SPARK-6891: --- Summary: ExecutorAllocationManager will request negative number executors Key: SPARK-6891 URL: https://issues.apache.org/jira/browse/SPARK-6891 Project: Spark Issue

[jira] [Commented] (SPARK-6889) Streamline contribution process with update to Contribution wiki, JIRA rules

2015-04-13 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493254#comment-14493254 ] Patrick Wendell commented on SPARK-6889: Thanks for posting this Sean. Overall, I

[jira] [Updated] (SPARK-5931) Use consistent naming for time properties

2015-04-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5931: - Assignee: Ilya Ganelin (was: Andrew Or) Use consistent naming for time properties

[jira] [Commented] (SPARK-5111) HiveContext and Thriftserver cannot work in secure cluster beyond hadoop2.5

2015-04-13 Thread Yu Gao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493270#comment-14493270 ] Yu Gao commented on SPARK-5111: --- Hi Zhan, which spark version is going to have this fix? We

[jira] [Closed] (SPARK-5931) Use consistent naming for time properties

2015-04-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5931. Resolution: Fixed Fix Version/s: 1.4.0 Use consistent naming for time properties

[jira] [Assigned] (SPARK-6703) Provide a way to discover existing SparkContext's

2015-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6703: --- Assignee: Apache Spark (was: Ilya Ganelin) Provide a way to discover existing

[jira] [Commented] (SPARK-6703) Provide a way to discover existing SparkContext's

2015-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493306#comment-14493306 ] Apache Spark commented on SPARK-6703: - User 'ilganeli' has created a pull request for

[jira] [Assigned] (SPARK-6703) Provide a way to discover existing SparkContext's

2015-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6703: --- Assignee: Ilya Ganelin (was: Apache Spark) Provide a way to discover existing

[jira] [Updated] (SPARK-6890) Local cluster mode in Mac is broken

2015-04-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6890: - Affects Version/s: 1.4.0 Local cluster mode in Mac is broken ---

[jira] [Updated] (SPARK-6890) Local cluster mode is broken

2015-04-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6890: - Assignee: Marcelo Vanzin (was: Andrew Or) Local cluster mode is broken

[jira] [Commented] (SPARK-6890) Local cluster mode is broken

2015-04-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493447#comment-14493447 ] Marcelo Vanzin commented on SPARK-6890: --- Also, another possible way to fix this is

[jira] [Assigned] (SPARK-6890) Local cluster mode is broken

2015-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6890: --- Assignee: Apache Spark (was: Marcelo Vanzin) Local cluster mode is broken

[jira] [Commented] (SPARK-6890) Local cluster mode is broken

2015-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493478#comment-14493478 ] Apache Spark commented on SPARK-6890: - User 'vanzin' has created a pull request for

[jira] [Assigned] (SPARK-6890) Local cluster mode is broken

2015-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6890: --- Assignee: Marcelo Vanzin (was: Apache Spark) Local cluster mode is broken

[jira] [Commented] (SPARK-5111) HiveContext and Thriftserver cannot work in secure cluster beyond hadoop2.5

2015-04-13 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493483#comment-14493483 ] Zhan Zhang commented on SPARK-5111: --- [~crystal_gaoyu] I am not sure. You may try to

[jira] [Created] (SPARK-6888) Make DriverQuirks editable

2015-04-13 Thread Rene Treffer (JIRA)
Rene Treffer created SPARK-6888: --- Summary: Make DriverQuirks editable Key: SPARK-6888 URL: https://issues.apache.org/jira/browse/SPARK-6888 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-6872) external sort need to copy

2015-04-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6872. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5481

[jira] [Comment Edited] (SPARK-6682) Deprecate static train and use builder instead for Scala/Java

2015-04-13 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493218#comment-14493218 ] Yu Ishikawa edited comment on SPARK-6682 at 4/14/15 12:37 AM: --

[jira] [Updated] (SPARK-6890) Local cluster mode is broken

2015-04-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6890: - Summary: Local cluster mode is broken (was: Local cluster mode in Mac is broken) Local cluster mode is

[jira] [Updated] (SPARK-6890) Local cluster mode in Mac is broken

2015-04-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-6890: - Priority: Critical (was: Blocker) Local cluster mode in Mac is broken

[jira] [Commented] (SPARK-6889) Streamline contribution process with update to Contribution wiki, JIRA rules

2015-04-13 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493394#comment-14493394 ] Nicholas Chammas commented on SPARK-6889: - Thanks for continuing to work on

[jira] [Resolved] (SPARK-6303) Remove unnecessary Average in GeneratedAggregate

2015-04-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-6303. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 4996

[jira] [Commented] (SPARK-6511) Publish hadoop provided build with instructions for different distros

2015-04-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493283#comment-14493283 ] Marcelo Vanzin commented on SPARK-6511: --- I think {{hadoop classpath}} would be safer

[jira] [Assigned] (SPARK-5888) Add OneHotEncoder as a Transformer

2015-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5888: --- Assignee: Apache Spark (was: Sandy Ryza) Add OneHotEncoder as a Transformer

[jira] [Assigned] (SPARK-5888) Add OneHotEncoder as a Transformer

2015-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5888: --- Assignee: Sandy Ryza (was: Apache Spark) Add OneHotEncoder as a Transformer

[jira] [Commented] (SPARK-5888) Add OneHotEncoder as a Transformer

2015-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493302#comment-14493302 ] Apache Spark commented on SPARK-5888: - User 'sryza' has created a pull request for

[jira] [Updated] (SPARK-4766) ML Estimator Params should subclass Transformer Params

2015-04-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-4766: - Target Version/s: 1.4.0 (was: 1.3.0) ML Estimator Params should subclass Transformer

[jira] [Commented] (SPARK-6890) Local cluster mode is broken

2015-04-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493432#comment-14493432 ] Marcelo Vanzin commented on SPARK-6890: --- Do you have `SPARK_PREPEND_CLASSES` set by

[jira] [Updated] (SPARK-6891) ExecutorAllocationManager will request negative number executors

2015-04-13 Thread meiyoula (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] meiyoula updated SPARK-6891: Description: Below is the exception: 15/04/14 10:10:18 ERROR Utils: Uncaught exception in thread

[jira] [Updated] (SPARK-6889) Streamline contribution process with update to Contribution wiki, JIRA rules

2015-04-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6889: - Attachment: ContributingtoSpark.pdf SparkProjectMechanicsChallenges.pdf Streamline

[jira] [Resolved] (SPARK-5941) Unit Test loads the table `src` twice for leftsemijoin.q

2015-04-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5941. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 4506

[jira] [Commented] (SPARK-1529) Support setting spark.local.dirs to a hadoop FileSystem

2015-04-13 Thread Kannan Rajah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493251#comment-14493251 ] Kannan Rajah commented on SPARK-1529: - You can use the Compare functionality to see a

[jira] [Commented] (SPARK-6889) Streamline contribution process with update to Contribution wiki, JIRA rules

2015-04-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493299#comment-14493299 ] Marcelo Vanzin commented on SPARK-6889: --- I left a couple of comments on the docs

[jira] [Closed] (SPARK-4848) Allow different Worker configurations in standalone cluster

2015-04-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-4848. Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Nathan Kronenfeld Target

[jira] [Resolved] (SPARK-5794) add jar should return 0

2015-04-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5794. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 4586

[jira] [Commented] (SPARK-3727) Trees and ensembles: More prediction functionality

2015-04-13 Thread Michael Kuhlen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493455#comment-14493455 ] Michael Kuhlen commented on SPARK-3727: --- [~josephkb] The design document is great,

[jira] [Commented] (SPARK-6682) Deprecate static train and use builder instead for Scala/Java

2015-04-13 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493218#comment-14493218 ] Yu Ishikawa commented on SPARK-6682: I meant (a). I agree with that we only add a

[jira] [Commented] (SPARK-6511) Publish hadoop provided build with instructions for different distros

2015-04-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14493217#comment-14493217 ] Marcelo Vanzin commented on SPARK-6511: --- We add a bunch of things to that variable,
