[jira] [Created] (SPARK-9931) Flaky test: mllib/tests.py StreamingLogisticRegressionWithSGDTests. test_training_and_prediction

2015-08-13 Thread Davies Liu (JIRA)
Davies Liu created SPARK-9931: - Summary: Flaky test: mllib/tests.py StreamingLogisticRegressionWithSGDTests. test_training_and_prediction Key: SPARK-9931 URL: https://issues.apache.org/jira/browse/SPARK-9931

[jira] [Created] (SPARK-9934) Deprecate NIO ConnectionManager

2015-08-13 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-9934: -- Summary: Deprecate NIO ConnectionManager Key: SPARK-9934 URL: https://issues.apache.org/jira/browse/SPARK-9934 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-6795) Avoid reading Parquet footers on driver side when an global arbitrative schema is available

2015-08-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6795: -- Fix Version/s: 1.5.0 Avoid reading Parquet footers on driver side when an global arbitrative schema

[jira] [Updated] (SPARK-6795) Avoid reading Parquet footers on driver side when an global arbitrative schema is available

2015-08-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-6795: -- Target Version/s: 1.5.0 (was: 1.6.0) Avoid reading Parquet footers on driver side when an global

[jira] [Commented] (SPARK-5594) SparkException: Failed to get broadcast (TorrentBroadcast)

2015-08-13 Thread Kaveen Raajan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14694780#comment-14694780 ] Kaveen Raajan commented on SPARK-5594: -- Hi I'm also facing such error, when two

[jira] [Updated] (SPARK-9757) Can't create persistent data source tables with decimal

2015-08-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-9757: -- Description: {{ParquetHiveSerDe}} in Hive versions 1.2.0 doesn't support decimal. Persisting Parquet

[jira] [Created] (SPARK-9933) Test the new receiver scheduling

2015-08-13 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-9933: --- Summary: Test the new receiver scheduling Key: SPARK-9933 URL: https://issues.apache.org/jira/browse/SPARK-9933 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-9092) Make --num-executors compatible with dynamic allocation

2015-08-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9092: - Assignee: Niranjan Padmanabhan Make --num-executors compatible with dynamic allocation

[jira] [Updated] (SPARK-9931) Flaky test: mllib/tests.py StreamingLogisticRegressionWithSGDTests. test_training_and_prediction

2015-08-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-9931: -- Priority: Critical (was: Major) Flaky test: mllib/tests.py StreamingLogisticRegressionWithSGDTests.

[jira] [Resolved] (SPARK-9885) IsolatedClientLoader ignores shared prefixes and barrier prefixes when spark.sql.hive.metastore.jars is set to maven

2015-08-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-9885. --- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 8158

[jira] [Assigned] (SPARK-9757) Can't create persistent data source tables with decimal

2015-08-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian reassigned SPARK-9757: - Assignee: Cheng Lian Can't create persistent data source tables with decimal

[jira] [Resolved] (SPARK-9757) Can't create persistent data source tables with decimal

2015-08-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-9757. --- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 8130

[jira] [Assigned] (SPARK-9934) Deprecate NIO ConnectionManager

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9934: --- Assignee: Apache Spark (was: Reynold Xin) Deprecate NIO ConnectionManager

[jira] [Commented] (SPARK-9213) Improve regular expression performance (via joni)

2015-08-13 Thread Yadong Qi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14694928#comment-14694928 ] Yadong Qi commented on SPARK-9213: -- [~rxin] Use Joni regex instead of java regex, or add

[jira] [Comment Edited] (SPARK-5594) SparkException: Failed to get broadcast (TorrentBroadcast)

2015-08-13 Thread Kaveen Raajan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14694780#comment-14694780 ] Kaveen Raajan edited comment on SPARK-5594 at 8/13/15 6:31 AM:

[jira] [Commented] (SPARK-5594) SparkException: Failed to get broadcast (TorrentBroadcast)

2015-08-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14694790#comment-14694790 ] Josh Rosen commented on SPARK-5594: --- See my comments upthread regarding

[jira] [Updated] (SPARK-9328) Netty IO layer should implement read timeouts

2015-08-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-9328: --- Target Version/s: 1.6.0 (was: 1.5.0) Netty IO layer should implement read timeouts

[jira] [Updated] (SPARK-8949) Remove references to preferredNodeLocalityData in javadoc and print warning when used

2015-08-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-8949: --- Priority: Critical (was: Blocker) Remove references to preferredNodeLocalityData in javadoc and

[jira] [Commented] (SPARK-9705) outdated Python 3 and IPython information

2015-08-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14694836#comment-14694836 ] Reynold Xin commented on SPARK-9705: [~pmigdal] do you mind submitting a pull request

[jira] [Assigned] (SPARK-9767) Remove ConnectionManager

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9767: --- Assignee: Apache Spark Remove ConnectionManager

[jira] [Assigned] (SPARK-9767) Remove ConnectionManager

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9767: --- Assignee: (was: Apache Spark) Remove ConnectionManager

[jira] [Commented] (SPARK-9767) Remove ConnectionManager

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14694872#comment-14694872 ] Apache Spark commented on SPARK-9767: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-6795) Avoid reading Parquet footers on driver side when an global arbitrative schema is available

2015-08-13 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14694899#comment-14694899 ] Cheng Lian commented on SPARK-6795: --- As explained on GitHub, usually we only backport

[jira] [Commented] (SPARK-9883) Distance to each cluster given a point

2015-08-13 Thread Bertrand Dechoux (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14694920#comment-14694920 ] Bertrand Dechoux commented on SPARK-9883: - A colleague of mine is working on it

[jira] [Commented] (SPARK-9893) User guide for VectorSlicer

2015-08-13 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14694926#comment-14694926 ] Xusen Yin commented on SPARK-9893: -- Sure, I'll do it ASAP. User guide for VectorSlicer

[jira] [Updated] (SPARK-5180) Data source API improvement (Spark 1.5)

2015-08-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5180: --- Summary: Data source API improvement (Spark 1.5) (was: Data source API improvement) Data source

[jira] [Created] (SPARK-9932) Data source API improvement (Spark 1.6)

2015-08-13 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-9932: -- Summary: Data source API improvement (Spark 1.6) Key: SPARK-9932 URL: https://issues.apache.org/jira/browse/SPARK-9932 Project: Spark Issue Type: Umbrella

[jira] [Updated] (SPARK-6624) Convert filters into CNF for data sources

2015-08-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-6624: --- Parent Issue: SPARK-9932 (was: SPARK-5180) Convert filters into CNF for data sources

[jira] [Resolved] (SPARK-5180) Data source API improvement (Spark 1.5)

2015-08-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5180. Resolution: Fixed Fix Version/s: 1.5.0 Closing this one for 1.5. For future improvements,

[jira] [Updated] (SPARK-9346) Conversion is applied three times on partitioned data sources that require conversion

2015-08-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-9346: --- Parent Issue: SPARK-9932 (was: SPARK-5180) Conversion is applied three times on partitioned data

[jira] [Updated] (SPARK-8887) Explicitly define which data types can be used as dynamic partition columns

2015-08-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-8887: --- Parent Issue: SPARK-9932 (was: SPARK-5180) Explicitly define which data types can be used as

[jira] [Assigned] (SPARK-9934) Deprecate NIO ConnectionManager

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9934: --- Assignee: Reynold Xin (was: Apache Spark) Deprecate NIO ConnectionManager

[jira] [Commented] (SPARK-9934) Deprecate NIO ConnectionManager

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14694888#comment-14694888 ] Apache Spark commented on SPARK-9934: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-9213) Improve regular expression performance (via joni)

2015-08-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14694952#comment-14694952 ] Reynold Xin commented on SPARK-9213: Are there any semantic differences between the

[jira] [Commented] (SPARK-9594) Failed to get broadcast_33_piece0 while using Accumulators in UDF

2015-08-13 Thread Poorvi Lashkary (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695092#comment-14695092 ] Poorvi Lashkary commented on SPARK-9594: Why it is not assigned? Failed to get

[jira] [Comment Edited] (SPARK-9213) Improve regular expression performance (via joni)

2015-08-13 Thread Yadong Qi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14694928#comment-14694928 ] Yadong Qi edited comment on SPARK-9213 at 8/13/15 9:23 AM: ---

[jira] [Assigned] (SPARK-9935) EqualNotNull not processed in ORC

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9935: --- Assignee: Apache Spark EqualNotNull not processed in ORC -

[jira] [Created] (SPARK-9936) decimal precision lost when loading DataFrame from RDD

2015-08-13 Thread Tzach Zohar (JIRA)
Tzach Zohar created SPARK-9936: -- Summary: decimal precision lost when loading DataFrame from RDD Key: SPARK-9936 URL: https://issues.apache.org/jira/browse/SPARK-9936 Project: Spark Issue Type:

[jira] [Created] (SPARK-9935) EqualNotNull not processed in ORC

2015-08-13 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-9935: --- Summary: EqualNotNull not processed in ORC Key: SPARK-9935 URL: https://issues.apache.org/jira/browse/SPARK-9935 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-9935) EqualNotNull not processed in ORC

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9935: --- Assignee: (was: Apache Spark) EqualNotNull not processed in ORC

[jira] [Commented] (SPARK-9935) EqualNotNull not processed in ORC

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695082#comment-14695082 ] Apache Spark commented on SPARK-9935: - User 'HyukjinKwon' has created a pull request

[jira] [Updated] (SPARK-9935) EqualNotNull not processed in ORC

2015-08-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-9935: Component/s: SQL EqualNotNull not processed in ORC -

[jira] [Resolved] (SPARK-9918) Remove runs from KMeans under the pipeline API

2015-08-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-9918. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 8148

[jira] [Commented] (SPARK-9670) ML 1.5 QA: Examples: Check for new APIs requiring example code

2015-08-13 Thread Ram Sriharsha (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14694756#comment-14694756 ] Ram Sriharsha commented on SPARK-9670: -- Hey Yuhao There is already a JIRA to add a

[jira] [Commented] (SPARK-9670) ML 1.5 QA: Examples: Check for new APIs requiring example code

2015-08-13 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14694765#comment-14694765 ] yuhao yang commented on SPARK-9670: --- Oh, Thanks Ram. That should be a more comprehensive

[jira] [Updated] (SPARK-9937) GraphX Performance: Partition overhead scales quadratically

2015-08-13 Thread Tobias Bertelsen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tobias Bertelsen updated SPARK-9937: Attachment: Scaleservers-log.png GraphX Performance: Partition overhead scales

[jira] [Commented] (SPARK-9841) Params.clear needs to be public

2015-08-13 Thread Kai Sasaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695103#comment-14695103 ] Kai Sasaki commented on SPARK-9841: --- [~josephkb] Do you have any use case when public

[jira] [Commented] (SPARK-9906) User guide for LogisticRegressionSummary

2015-08-13 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695156#comment-14695156 ] Manoj Kumar commented on SPARK-9906: Sure ! User guide for LogisticRegressionSummary

[jira] [Updated] (SPARK-9935) EqualNotNull not processed in OrcRelation

2015-08-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-9935: Summary: EqualNotNull not processed in OrcRelation (was: EqualNotNull not processed in ORC)

[jira] [Commented] (SPARK-5594) SparkException: Failed to get broadcast (TorrentBroadcast)

2015-08-13 Thread Kaveen Raajan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695107#comment-14695107 ] Kaveen Raajan commented on SPARK-5594: -- Removing *spark.cleaner.ttl* configuration

[jira] [Commented] (SPARK-9499) Possible file handle leak in spilling/sort code

2015-08-13 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695148#comment-14695148 ] Herman van Hovell commented on SPARK-9499: -- This has been fixed by the PR for

[jira] [Updated] (SPARK-9937) GraphX Performance: Partition overhead scales quadratically

2015-08-13 Thread Tobias Bertelsen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tobias Bertelsen updated SPARK-9937: Attachment: Scaleservers-lin.png GraphX Performance: Partition overhead scales

[jira] [Commented] (SPARK-9594) Failed to get broadcast_33_piece0 while using Accumulators in UDF

2015-08-13 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695202#comment-14695202 ] Herman van Hovell commented on SPARK-9594: -- This is more of a question for the

[jira] [Created] (SPARK-9937) GraphX Performance: Partition overhead scales quadratically

2015-08-13 Thread Tobias Bertelsen (JIRA)
Tobias Bertelsen created SPARK-9937: --- Summary: GraphX Performance: Partition overhead scales quadratically Key: SPARK-9937 URL: https://issues.apache.org/jira/browse/SPARK-9937 Project: Spark

[jira] [Commented] (SPARK-9936) decimal precision lost when loading DataFrame from RDD

2015-08-13 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695274#comment-14695274 ] Liang-Chi Hsieh commented on SPARK-9936: I think this problem is solved in current

[jira] [Assigned] (SPARK-9938) Constant folding in binaryComparison

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9938: --- Assignee: Apache Spark Constant folding in binaryComparison

[jira] [Assigned] (SPARK-9938) Constant folding in binaryComparison

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9938: --- Assignee: (was: Apache Spark) Constant folding in binaryComparison

[jira] [Commented] (SPARK-9938) Constant folding in binaryComparison

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695285#comment-14695285 ] Apache Spark commented on SPARK-9938: - User 'yjshen' has created a pull request for

[jira] [Commented] (SPARK-9936) decimal precision lost when loading DataFrame from RDD

2015-08-13 Thread Tzach Zohar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695307#comment-14695307 ] Tzach Zohar commented on SPARK-9936: [~viirya] indeed! I've just located the

[jira] [Comment Edited] (SPARK-9705) outdated Python 3 and IPython information

2015-08-13 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-9705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695273#comment-14695273 ] Piotr Migdał edited comment on SPARK-9705 at 8/13/15 2:13 PM: --

[jira] [Commented] (SPARK-9705) outdated Python 3 and IPython information

2015-08-13 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-9705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695273#comment-14695273 ] Piotr Migdał commented on SPARK-9705: - I added it as I am having problems with running

[jira] [Created] (SPARK-9938) Constant folding in binaryComparison

2015-08-13 Thread Yijie Shen (JIRA)
Yijie Shen created SPARK-9938: - Summary: Constant folding in binaryComparison Key: SPARK-9938 URL: https://issues.apache.org/jira/browse/SPARK-9938 Project: Spark Issue Type: Improvement

[jira] [Closed] (SPARK-9936) decimal precision lost when loading DataFrame from RDD

2015-08-13 Thread Tzach Zohar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tzach Zohar closed SPARK-9936. -- Resolution: Fixed Fix Version/s: 1.5.0 decimal precision lost when loading DataFrame from RDD

[jira] [Assigned] (SPARK-9793) PySpark DenseVector, SparseVector should override __eq__

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9793: --- Assignee: Apache Spark PySpark DenseVector, SparseVector should override __eq__

[jira] [Commented] (SPARK-9793) PySpark DenseVector, SparseVector should override __eq__

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695326#comment-14695326 ] Apache Spark commented on SPARK-9793: - User 'yanboliang' has created a pull request

[jira] [Assigned] (SPARK-9793) PySpark DenseVector, SparseVector should override __eq__

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9793: --- Assignee: (was: Apache Spark) PySpark DenseVector, SparseVector should override __eq__

[jira] [Updated] (SPARK-9935) EqualNotNull not processed in OrcRelation

2015-08-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-9935: Description: it is an issue followed by SPARK-9814. Now datasources (after {{selectFilters()}} in

[jira] [Updated] (SPARK-8530) Add Python API for MinMaxScaler

2015-08-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-8530: - Shepherd: Joseph K. Bradley Assignee: yuhao yang Affects

[jira] [Commented] (SPARK-9919) Matrices should respect Java's equals and hashCode contract

2015-08-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695716#comment-14695716 ] Joseph K. Bradley commented on SPARK-9919: -- Update: We may just revert the

[jira] [Updated] (SPARK-9919) Matrices should respect Java's equals and hashCode contract

2015-08-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9919: - Target Version/s: 1.6.0 (was: 1.5.0) Matrices should respect Java's equals and hashCode

[jira] [Updated] (SPARK-9906) User guide for LogisticRegressionSummary

2015-08-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9906: - Assignee: Manoj Kumar User guide for LogisticRegressionSummary

[jira] [Created] (SPARK-9942) Broken pandas could crash PySpark SQL

2015-08-13 Thread Davies Liu (JIRA)
Davies Liu created SPARK-9942: - Summary: Broken pandas could crash PySpark SQL Key: SPARK-9942 URL: https://issues.apache.org/jira/browse/SPARK-9942 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-9725) spark sql query string field return empty/garbled string

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9725: --- Assignee: Apache Spark (was: Davies Liu) spark sql query string field return empty/garbled

[jira] [Comment Edited] (SPARK-8922) Add @since tags to mllib.evaluation

2015-08-13 Thread Shikai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14694729#comment-14694729 ] Shikai edited comment on SPARK-8922 at 8/13/15 4:41 PM: [~mengxr]

[jira] [Commented] (SPARK-9890) User guide for CountVectorizer

2015-08-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695548#comment-14695548 ] Joseph K. Bradley commented on SPARK-9890: -- I want to...but it's really late to

[jira] [Created] (SPARK-9943) Failed to serialize a deserialized UnsafeHashedRelation

2015-08-13 Thread Davies Liu (JIRA)
Davies Liu created SPARK-9943: - Summary: Failed to serialize a deserialized UnsafeHashedRelation Key: SPARK-9943 URL: https://issues.apache.org/jira/browse/SPARK-9943 Project: Spark Issue Type:

[jira] [Commented] (SPARK-9793) PySpark DenseVector, SparseVector should override __eq__

2015-08-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695723#comment-14695723 ] Joseph K. Bradley commented on SPARK-9793: -- Discussed with [~mengxr]: We're going

[jira] [Commented] (SPARK-9750) SparseMatrix should override equals

2015-08-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695721#comment-14695721 ] Joseph K. Bradley commented on SPARK-9750: -- Discussed with [~mengxr]: We're going

[jira] [Assigned] (SPARK-9943) Failed to serialize a deserialized UnsafeHashedRelation

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9943?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9943: --- Assignee: Davies Liu (was: Apache Spark) Failed to serialize a deserialized

[jira] [Commented] (SPARK-9943) Failed to serialize a deserialized UnsafeHashedRelation

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695751#comment-14695751 ] Apache Spark commented on SPARK-9943: - User 'davies' has created a pull request for

[jira] [Assigned] (SPARK-9943) Failed to serialize a deserialized UnsafeHashedRelation

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9943?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9943: --- Assignee: Apache Spark (was: Davies Liu) Failed to serialize a deserialized

[jira] [Commented] (SPARK-9604) Unsafe ArrayData and MapData is very very slow

2015-08-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695617#comment-14695617 ] Davies Liu commented on SPARK-9604: --- [~cloud_fan] Yeah, The test looks much better now.

[jira] [Assigned] (SPARK-9725) spark sql query string field return empty/garbled string

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9725: --- Assignee: Davies Liu (was: Apache Spark) spark sql query string field return empty/garbled

[jira] [Updated] (SPARK-9941) Try ML pipeline API on Kaggle competitions

2015-08-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9941: - Description: This is an umbrella JIRA to track some fun tasks :) We have built many features

[jira] [Created] (SPARK-9944) hive.metastore.warehouse.dir is not respected

2015-08-13 Thread Manku Timma (JIRA)
Manku Timma created SPARK-9944: -- Summary: hive.metastore.warehouse.dir is not respected Key: SPARK-9944 URL: https://issues.apache.org/jira/browse/SPARK-9944 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-9906) User guide for LogisticRegressionSummary

2015-08-13 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695526#comment-14695526 ] Feynman Liang commented on SPARK-9906: -- [~mengxr] please assign User guide for

[jira] [Updated] (SPARK-9893) User guide for VectorSlicer

2015-08-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9893: - Assignee: Xusen Yin User guide for VectorSlicer ---

[jira] [Assigned] (SPARK-7379) pickle.loads expects a string instead of bytes in Python 3.

2015-08-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-7379: - Assignee: Davies Liu pickle.loads expects a string instead of bytes in Python 3.

[jira] [Created] (SPARK-9941) Try ML pipeline API on Kaggle competitions

2015-08-13 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-9941: Summary: Try ML pipeline API on Kaggle competitions Key: SPARK-9941 URL: https://issues.apache.org/jira/browse/SPARK-9941 Project: Spark Issue Type:

[jira] [Updated] (SPARK-9941) Try ML pipeline API on Kaggle competitions

2015-08-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9941: - Description: This is an umbrella JIRA to track some fun tasks :) We have built many features

[jira] [Commented] (SPARK-9592) Last implemented based on AggregateExpression1 are calculating the values for entire DataFrame partition not on GroupedData partition.

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695665#comment-14695665 ] Apache Spark commented on SPARK-9592: - User 'yhuai' has created a pull request for

[jira] [Commented] (SPARK-9725) spark sql query string field return empty/garbled string

2015-08-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695694#comment-14695694 ] Apache Spark commented on SPARK-9725: - User 'davies' has created a pull request for

[jira] [Updated] (SPARK-9793) PySpark DenseVector, SparseVector should override __eq__

2015-08-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9793: - Target Version/s: 1.6.0 (was: 1.5.0) PySpark DenseVector, SparseVector should override

[jira] [Resolved] (SPARK-9499) Possible file handle leak in spilling/sort code

2015-08-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-9499. --- Resolution: Duplicate Assignee: Davies Liu (was: Josh Rosen) Fix Version/s: 1.5.0

[jira] [Commented] (SPARK-9725) spark sql query string field return empty/garbled string

2015-08-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695652#comment-14695652 ] Davies Liu commented on SPARK-9725: --- [~lian cheng] [~yhuai] Can you reproduce this

[jira] [Updated] (SPARK-9792) PySpark DenseMatrix, SparseMatrix should override __eq__

2015-08-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9792: - Target Version/s: 1.6.0 (was: 1.5.0) PySpark DenseMatrix, SparseMatrix should override

[jira] [Commented] (SPARK-9792) PySpark DenseMatrix, SparseMatrix should override __eq__

2015-08-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695722#comment-14695722 ] Joseph K. Bradley commented on SPARK-9792: -- Discussed with [~mengxr]: We're going

[jira] [Commented] (SPARK-9649) Flaky test: o.a.s.deploy.master.MasterSuite: recovery

2015-08-13 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9649?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14695744#comment-14695744 ] Andrew Or commented on SPARK-9649: -- Also resolved by commit:

[jira] [Created] (SPARK-9952) Fix N^2 loop when DAGScheduler.getPreferredLocsInternal accesses cacheLocs

2015-08-13 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-9952: - Summary: Fix N^2 loop when DAGScheduler.getPreferredLocsInternal accesses cacheLocs Key: SPARK-9952 URL: https://issues.apache.org/jira/browse/SPARK-9952 Project: Spark

[jira] [Updated] (SPARK-9952) Fix N^2 loop when DAGScheduler.getPreferredLocsInternal accesses cacheLocs

2015-08-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-9952: -- Priority: Critical (was: Major) Fix N^2 loop when DAGScheduler.getPreferredLocsInternal accesses

  1   2   3   >