[jira] [Created] (SPARK-13156) JDBC using multiple partitions creates additional tasks but only executes on one

2016-02-02 Thread Charles Drotar (JIRA)
Charles Drotar created SPARK-13156: -- Summary: JDBC using multiple partitions creates additional tasks but only executes on one Key: SPARK-13156 URL: https://issues.apache.org/jira/browse/SPARK-13156

[jira] [Updated] (SPARK-13156) JDBC using multiple partitions creates additional tasks but only executes on one

2016-02-02 Thread Charles Drotar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Charles Drotar updated SPARK-13156: --- Description: I can successfully kick off a query through JDBC to Teradata, and when it runs

[jira] [Commented] (SPARK-13065) streaming-twitter pass twitter4j.FilterQuery argument to TwitterUtils.createStream()

2016-02-02 Thread sachin aggarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15129802#comment-15129802 ] sachin aggarwal commented on SPARK-13065: - happy to see, thats exactly what I have added have a

[jira] [Commented] (SPARK-13145) checkAnswer should tolerate small float number error

2016-02-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15129810#comment-15129810 ] Sean Owen commented on SPARK-13145: --- Isn't this what ~== does? The error shouldn't be relative to the

[jira] [Created] (SPARK-13138) Add "logical" package prefix for ddl.scala

2016-02-02 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-13138: --- Summary: Add "logical" package prefix for ddl.scala Key: SPARK-13138 URL: https://issues.apache.org/jira/browse/SPARK-13138 Project: Spark Issue Type:

[jira] [Commented] (SPARK-13137) NullPoingException in schema inference for CSV when the first line is empty

2016-02-02 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127861#comment-15127861 ] Hyukjin Kwon commented on SPARK-13137: -- I will work on this. > NullPoingException in schema

[jira] [Assigned] (SPARK-13138) Add "logical" package prefix for ddl.scala

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13138: Assignee: Reynold Xin (was: Apache Spark) > Add "logical" package prefix for ddl.scala >

[jira] [Resolved] (SPARK-12362) Create a full-fledged built-in SQL parser

2016-02-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-12362. - Resolution: Fixed Fix Version/s: 2.0.0 > Create a full-fledged built-in SQL parser >

[jira] [Updated] (SPARK-13139) Create native DDL commands

2016-02-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-13139: Description: We currently delegate most DDLs directly to Hive, through NativePlaceholder in

[jira] [Resolved] (SPARK-13133) When the option --master of spark-submit script is inconsistent with SparkConf.setMaster in Spark appliction code, the behavior of Spark application is difficult to und

2016-02-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-13133. --- Resolution: Not A Problem That's an application error. You're specifying execution context in the

[jira] [Commented] (SPARK-12344) Remove env-based configurations

2016-02-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127893#comment-15127893 ] Sean Owen commented on SPARK-12344: --- Fwiw I fully support removing all env configs everywhere. It's

[jira] [Assigned] (SPARK-13137) NullPoingException in schema inference for CSV when the first line is empty

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13137: Assignee: Apache Spark > NullPoingException in schema inference for CSV when the first

[jira] [Assigned] (SPARK-13137) NullPoingException in schema inference for CSV when the first line is empty

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13137: Assignee: (was: Apache Spark) > NullPoingException in schema inference for CSV when

[jira] [Commented] (SPARK-13137) NullPoingException in schema inference for CSV when the first line is empty

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127908#comment-15127908 ] Apache Spark commented on SPARK-13137: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-12739) Details of batch in Streaming tab uses two Duration columns

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12739: Assignee: (was: Apache Spark) > Details of batch in Streaming tab uses two Duration

[jira] [Commented] (SPARK-12739) Details of batch in Streaming tab uses two Duration columns

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127907#comment-15127907 ] Apache Spark commented on SPARK-12739: -- User 'mariobriggs' has created a pull request for this

[jira] [Assigned] (SPARK-12739) Details of batch in Streaming tab uses two Duration columns

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12739: Assignee: Apache Spark > Details of batch in Streaming tab uses two Duration columns >

[jira] [Commented] (SPARK-13133) When the option --master of spark-submit script is inconsistent with SparkConf.setMaster in Spark appliction code, the behavior of Spark application is difficult to un

2016-02-02 Thread Li Ye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127909#comment-15127909 ] Li Ye commented on SPARK-13133: --- I cannot agree with you. I think it is a bug because the driver's log is

[jira] [Commented] (SPARK-13087) Grouping by a complex expression may lead to incorrect AttributeReferences in aggregations

2016-02-02 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127919#comment-15127919 ] Yin Huai commented on SPARK-13087: -- https://github.com/apache/spark/pull/11011 has been merged into

[jira] [Commented] (SPARK-12988) Can't drop columns that contain dots

2016-02-02 Thread Yan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127883#comment-15127883 ] Yan commented on SPARK-12988: - [~marmbrus] For the same reason of "`a.c` is an invalid column name. toDF(...)

[jira] [Updated] (SPARK-12772) Better error message for parsing failure?

2016-02-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12772: Issue Type: Bug (was: Sub-task) Parent: (was: SPARK-12362) > Better error message for

[jira] [Created] (SPARK-13139) Create native DDL commands

2016-02-02 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-13139: --- Summary: Create native DDL commands Key: SPARK-13139 URL: https://issues.apache.org/jira/browse/SPARK-13139 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-13133) When the option --master of spark-submit script is inconsistent with SparkConf.setMaster in Spark appliction code, the behavior of Spark application is difficult to un

2016-02-02 Thread Ji Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127911#comment-15127911 ] Ji Hao commented on SPARK-13133: Sean Owen, I think you should consider this issue, the error log may be

[jira] [Issue Comment Deleted] (SPARK-13133) When the option --master of spark-submit script is inconsistent with SparkConf.setMaster in Spark appliction code, the behavior of Spark application is dif

2016-02-02 Thread Ji Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ji Hao updated SPARK-13133: --- Comment: was deleted (was: Sean Owen, I think you should consider this issue, the error log may be more

[jira] [Commented] (SPARK-13133) When the option --master of spark-submit script is inconsistent with SparkConf.setMaster in Spark appliction code, the behavior of Spark application is difficult to un

2016-02-02 Thread Ji Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127912#comment-15127912 ] Ji Hao commented on SPARK-13133: Sean Owen, I think you should consider this issue, the error log may be

[jira] [Updated] (SPARK-12772) Better error message for parsing failure?

2016-02-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12772: Affects Version/s: 2.0.0 > Better error message for parsing failure? >

[jira] [Updated] (SPARK-12772) Better error message for parsing failure?

2016-02-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12772: Issue Type: Improvement (was: Bug) > Better error message for parsing failure? >

[jira] [Commented] (SPARK-13139) Create native DDL commands

2016-02-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127895#comment-15127895 ] Reynold Xin commented on SPARK-13139: - cc [~viirya] can you work on this? This is fairly important

[jira] [Updated] (SPARK-13009) spark-streaming-twitter_2.10 does not make it possible to access the raw twitter json

2016-02-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-13009: -- Labels: (was: twitter) Priority: Minor (was: Blocker) [~aedwip] you should never set Blocker

[jira] [Resolved] (SPARK-13087) Grouping by a complex expression may lead to incorrect AttributeReferences in aggregations

2016-02-02 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-13087. -- Resolution: Fixed Assignee: Michael Armbrust Fix Version/s: 2.0.0

[jira] [Commented] (SPARK-13065) streaming-twitter pass twitter4j.FilterQuery argument to TwitterUtils.createStream()

2016-02-02 Thread sachin aggarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127925#comment-15127925 ] sachin aggarwal commented on SPARK-13065: - List of changes: 1) Added support for passing

[jira] [Updated] (SPARK-13138) Add "logical" package prefix for ddl.scala

2016-02-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-13138: Description: ddl.scala is defined in the execution package, and yet its reference of "UnaryNode"

[jira] [Assigned] (SPARK-13138) Add "logical" package prefix for ddl.scala

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13138: Assignee: Apache Spark (was: Reynold Xin) > Add "logical" package prefix for ddl.scala >

[jira] [Commented] (SPARK-13138) Add "logical" package prefix for ddl.scala

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127866#comment-15127866 ] Apache Spark commented on SPARK-13138: -- User 'rxin' has created a pull request for this issue:

[jira] [Updated] (SPARK-13139) Create native DDL commands

2016-02-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-13139: Description: We currently delegate most DDLs directly to Hive, through NativePlaceholder in

[jira] [Created] (SPARK-13140) spark sql aggregate performance decrease

2016-02-02 Thread spencerlee (JIRA)
spencerlee created SPARK-13140: -- Summary: spark sql aggregate performance decrease Key: SPARK-13140 URL: https://issues.apache.org/jira/browse/SPARK-13140 Project: Spark Issue Type: Question

[jira] [Updated] (SPARK-13140) spark sql aggregate performance decrease

2016-02-02 Thread spencerlee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] spencerlee updated SPARK-13140: --- Remaining Estimate: 10h (was: 168h) Original Estimate: 10h (was: 168h) > spark sql aggregate

[jira] [Commented] (SPARK-13119) SparkR Ser/De fail to handle "columns(df)"

2016-02-02 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127928#comment-15127928 ] Sun Rui commented on SPARK-13119: - this is a known issue, refer to

[jira] [Commented] (SPARK-13119) SparkR Ser/De fail to handle "columns(df)"

2016-02-02 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15128336#comment-15128336 ] Xusen Yin commented on SPARK-13119: --- Thank you Sun Rui. As a novice to R, it costs me 2 days to locate

[jira] [Commented] (SPARK-12177) Update KafkaDStreams to new Kafka 0.9 Consumer API

2016-02-02 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15128377#comment-15128377 ] Cody Koeninger commented on SPARK-12177: It's probably worth either waiting for a point release

[jira] [Commented] (SPARK-13119) SparkR Ser/De fail to handle "columns(df)"

2016-02-02 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15128339#comment-15128339 ] Xusen Yin commented on SPARK-13119: --- The JIRA can be solved by SPARK-10312, so I'll close it. > SparkR

[jira] [Closed] (SPARK-13119) SparkR Ser/De fail to handle "columns(df)"

2016-02-02 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xusen Yin closed SPARK-13119. - Resolution: Duplicate > SparkR Ser/De fail to handle "columns(df)" >

[jira] [Commented] (SPARK-13125) makes the ratio of KafkaRDD partition to kafka topic partition configurable.

2016-02-02 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15128358#comment-15128358 ] Cody Koeninger commented on SPARK-13125: This doesn't make sense. You can either shuffle in

[jira] [Updated] (SPARK-13141) Dataframe created from Hive partitioned tables using HiveContext returns wrong results

2016-02-02 Thread Simone (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simone updated SPARK-13141: --- Description: I get wrong dataframe results using HiveContext with Spark 1.5.0 on CDH 5.5.1 in yarn-client

[jira] [Commented] (SPARK-12986) Fix pydoc warnings in mllib/regression.py

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127980#comment-15127980 ] Apache Spark commented on SPARK-12986: -- User 'nampham2' has created a pull request for this issue:

[jira] [Assigned] (SPARK-12986) Fix pydoc warnings in mllib/regression.py

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12986: Assignee: Apache Spark (was: Yu Ishikawa) > Fix pydoc warnings in mllib/regression.py >

[jira] [Commented] (SPARK-12986) Fix pydoc warnings in mllib/regression.py

2016-02-02 Thread Nam Pham (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127978#comment-15127978 ] Nam Pham commented on SPARK-12986: -- [~bryanc] was right. I have fixed the warnings and made a pull

[jira] [Assigned] (SPARK-12986) Fix pydoc warnings in mllib/regression.py

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12986: Assignee: Yu Ishikawa (was: Apache Spark) > Fix pydoc warnings in mllib/regression.py >

[jira] [Updated] (SPARK-12423) Mesos executor home should not be resolved on the driver's file system

2016-02-02 Thread Iulian Dragos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Iulian Dragos updated SPARK-12423: -- Fix Version/s: 2.0.0 > Mesos executor home should not be resolved on the driver's file system

[jira] [Resolved] (SPARK-12423) Mesos executor home should not be resolved on the driver's file system

2016-02-02 Thread Iulian Dragos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Iulian Dragos resolved SPARK-12423. --- Resolution: Fixed > Mesos executor home should not be resolved on the driver's file system >

[jira] [Commented] (SPARK-12423) Mesos executor home should not be resolved on the driver's file system

2016-02-02 Thread Iulian Dragos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15128036#comment-15128036 ] Iulian Dragos commented on SPARK-12423: --- Yes, closing this. > Mesos executor home should not be

[jira] [Created] (SPARK-13142) Problem accessing Web UI /logPage/ on Microsoft Windows

2016-02-02 Thread Neil Andrassy (JIRA)
Neil Andrassy created SPARK-13142: - Summary: Problem accessing Web UI /logPage/ on Microsoft Windows Key: SPARK-13142 URL: https://issues.apache.org/jira/browse/SPARK-13142 Project: Spark

[jira] [Commented] (SPARK-12874) ML StringIndexer does not protect itself from column name duplication

2016-02-02 Thread Wojciech Jurczyk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127934#comment-15127934 ] Wojciech Jurczyk commented on SPARK-12874: -- Thank you for feedback and willingness to help,

[jira] [Commented] (SPARK-13065) streaming-twitter pass twitter4j.FilterQuery argument to TwitterUtils.createStream()

2016-02-02 Thread sachin aggarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127935#comment-15127935 ] sachin aggarwal commented on SPARK-13065: - [~aedwip] I got a doubt after reading ur last comment

[jira] [Updated] (SPARK-13133) When the option --master of spark-submit script is inconsistent with SparkConf.setMaster in Spark appliction code, the behavior of Spark application is difficult to unde

2016-02-02 Thread Li Ye (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Ye updated SPARK-13133: -- Component/s: Deploy > When the option --master of spark-submit script is inconsistent with >

[jira] [Updated] (SPARK-13140) spark sql aggregate performance decrease

2016-02-02 Thread spencerlee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] spencerlee updated SPARK-13140: --- Issue Type: Bug (was: Question) > spark sql aggregate performance decrease >

[jira] [Created] (SPARK-13141) Dataframe created from Hive partitioned tables using HiveContext returns wrong results

2016-02-02 Thread Simone (JIRA)
Simone created SPARK-13141: -- Summary: Dataframe created from Hive partitioned tables using HiveContext returns wrong results Key: SPARK-13141 URL: https://issues.apache.org/jira/browse/SPARK-13141 Project:

[jira] [Updated] (SPARK-13141) Dataframe created from Hive partitioned tables using HiveContext returns wrong results

2016-02-02 Thread Simone (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simone updated SPARK-13141: --- Description: I get wrong dataframe results using HiveContext with Spark 1.5.0 on CDH 5.5.1 in yarn-client

[jira] [Commented] (SPARK-13132) LogisticRegression spends 35% of its time fetching the standardization parameter

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15128495#comment-15128495 ] Apache Spark commented on SPARK-13132: -- User 'idigary' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13132) LogisticRegression spends 35% of its time fetching the standardization parameter

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13132: Assignee: (was: Apache Spark) > LogisticRegression spends 35% of its time fetching

[jira] [Assigned] (SPARK-13132) LogisticRegression spends 35% of its time fetching the standardization parameter

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13132: Assignee: Apache Spark > LogisticRegression spends 35% of its time fetching the

[jira] [Created] (SPARK-13144) Enabling efficient and transparent use of GPUs for accelerating MLLib functions

2016-02-02 Thread Rajesh Bordawekar (JIRA)
Rajesh Bordawekar created SPARK-13144: - Summary: Enabling efficient and transparent use of GPUs for accelerating MLLib functions Key: SPARK-13144 URL: https://issues.apache.org/jira/browse/SPARK-13144

[jira] [Commented] (SPARK-12868) ADD JAR via sparkSQL JDBC will fail when using a HDFS URL

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15128443#comment-15128443 ] Apache Spark commented on SPARK-12868: -- User 'trystanleftwich' has created a pull request for this

[jira] [Created] (SPARK-13143) EC2 cluster silently not destroyed for non-default regions

2016-02-02 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created SPARK-13143: --- Summary: EC2 cluster silently not destroyed for non-default regions Key: SPARK-13143 URL: https://issues.apache.org/jira/browse/SPARK-13143 Project:

[jira] [Commented] (SPARK-13132) LogisticRegression spends 35% of its time fetching the standardization parameter

2016-02-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15128136#comment-15128136 ] Sean Owen commented on SPARK-13132: --- If this is really a measurable bottleneck and it is as simple to

[jira] [Commented] (SPARK-13143) EC2 cluster silently not destroyed for non-default regions

2016-02-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15128144#comment-15128144 ] Sean Owen commented on SPARK-13143: --- Yes EC2 bits have moved out to the amplab repo. > EC2 cluster

[jira] [Updated] (SPARK-13143) EC2 cluster silently not destroyed for non-default regions

2016-02-02 Thread Theodore Vasiloudis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Theodore Vasiloudis updated SPARK-13143: Description: If you start a cluster in a non-default region using the EC2 scripts

[jira] [Commented] (SPARK-13143) EC2 cluster silently not destroyed for non-default regions

2016-02-02 Thread Theodore Vasiloudis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15128126#comment-15128126 ] Theodore Vasiloudis commented on SPARK-13143: - In truth there should be a more permanent

[jira] [Commented] (SPARK-13129) Spark SQL can't query hive table, which is create by Hive HCatalog Streaming API

2016-02-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15128134#comment-15128134 ] Sean Owen commented on SPARK-13129: --- Yeah that's basically what you said above. The error is from Huve

[jira] [Commented] (SPARK-13139) Create native DDL commands

2016-02-02 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15128142#comment-15128142 ] Liang-Chi Hsieh commented on SPARK-13139: - Yes. I would like to do this. > Create native DDL

[jira] [Commented] (SPARK-13115) RandomForest is stuck at computing same stage over and over

2016-02-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15128146#comment-15128146 ] Sean Owen commented on SPARK-13115: --- Failing repeatedly is still likely to be a function of the rest of

[jira] [Commented] (SPARK-13143) EC2 cluster silently not destroyed for non-default regions

2016-02-02 Thread Theodore Vasiloudis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15128138#comment-15128138 ] Theodore Vasiloudis commented on SPARK-13143: - I'm actually a bit confused re. the fix. It

[jira] [Resolved] (SPARK-7009) Build assembly JAR via ant to avoid zip64 problems

2016-02-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7009. -- Resolution: Duplicate > Build assembly JAR via ant to avoid zip64 problems >

[jira] [Commented] (SPARK-12984) Not able to read CSV file using Spark 1.4.0

2016-02-02 Thread Jai Murugesh Rajasekaran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15128573#comment-15128573 ] Jai Murugesh Rajasekaran commented on SPARK-12984: -- Yes Sun Rui...It needs to be

[jira] [Commented] (SPARK-12868) ADD JAR via sparkSQL JDBC will fail when using a HDFS URL

2016-02-02 Thread Trystan Leftwich (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15128445#comment-15128445 ] Trystan Leftwich commented on SPARK-12868: -- Added: https://github.com/apache/spark/pull/11026

[jira] [Updated] (SPARK-9339) Use of Class.forName(String) should be replaced with version taking classloader

2016-02-02 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-9339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Zanghì updated SPARK-9339: Attachment: screenshot-1yarn-cluster error.png > Use of Class.forName(String) should be replaced

[jira] [Commented] (SPARK-9339) Use of Class.forName(String) should be replaced with version taking classloader

2016-02-02 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-9339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15128512#comment-15128512 ] Marco Zanghì commented on SPARK-9339: - I have a problem using Utils.classForName() . it returns always

[jira] [Updated] (SPARK-12868) ADD JAR via sparkSQL JDBC will fail when using a HDFS URL

2016-02-02 Thread Trystan Leftwich (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Trystan Leftwich updated SPARK-12868: - Fix Version/s: 1.6.1 Description: When trying to add a jar with a HDFS URI, i.E

[jira] [Commented] (SPARK-12177) Update KafkaDStreams to new Kafka 0.9 Consumer API

2016-02-02 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15128465#comment-15128465 ] Mark Grover commented on SPARK-12177: - Thanks Cody, will do! > Update KafkaDStreams to new Kafka 0.9

[jira] [Commented] (SPARK-12987) Drop fails when columns contain dots

2016-02-02 Thread kevin yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15128659#comment-15128659 ] kevin yu commented on SPARK-12987: -- it seems this jira is the duplicate of 12988. I closed my pr. >

[jira] [Assigned] (SPARK-13037) PySpark ml.recommendation support export/import

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13037: Assignee: (was: Apache Spark) > PySpark ml.recommendation support export/import >

[jira] [Commented] (SPARK-13037) PySpark ml.recommendation support export/import

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15129890#comment-15129890 ] Apache Spark commented on SPARK-13037: -- User 'vectorijk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13037) PySpark ml.recommendation support export/import

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13037: Assignee: Apache Spark > PySpark ml.recommendation support export/import >

[jira] [Resolved] (SPARK-12957) Derive and propagate data constrains in logical plan

2016-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-12957. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 10844

[jira] [Updated] (SPARK-12957) Derive and propagate data constrains in logical plan

2016-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-12957: - Assignee: Sameer Agarwal > Derive and propagate data constrains in logical plan >

[jira] [Resolved] (SPARK-13090) Add initial support for constraint propagation in SparkSQL

2016-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-13090. -- Resolution: Fixed Fix Version/s: 2.0.0 > Add initial support for constraint

[jira] [Reopened] (SPARK-12957) Derive and propagate data constrains in logical plan

2016-02-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust reopened SPARK-12957: -- > Derive and propagate data constrains in logical plan >

[jira] [Assigned] (SPARK-12957) Derive and propagate data constrains in logical plan

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12957: Assignee: Apache Spark (was: Sameer Agarwal) > Derive and propagate data constrains in

[jira] [Assigned] (SPARK-12957) Derive and propagate data constrains in logical plan

2016-02-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12957: Assignee: Sameer Agarwal (was: Apache Spark) > Derive and propagate data constrains in

[jira] [Assigned] (SPARK-13131) Use median time in benchmark

2016-02-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-13131: -- Assignee: Davies Liu > Use median time in benchmark > > >

[jira] [Updated] (SPARK-13131) Use median time in benchmark

2016-02-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-13131: --- Summary: Use median time in benchmark (was: Use best time in benchmark) > Use median time in

[jira] [Updated] (SPARK-13131) Use median time in benchmark

2016-02-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-13131: --- Component/s: SQL > Use median time in benchmark > > >

[jira] [Updated] (SPARK-13131) Use median time in benchmark

2016-02-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-13131: --- Description: Median time should be more stable than average time in benchmark. (was: Best time

[jira] [Commented] (SPARK-13145) checkAnswer should tolerate small float number error

2016-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15129913#comment-15129913 ] Xiangrui Meng commented on SPARK-13145: --- This is for Spark SQL, which uses string match to check

[jira] [Updated] (SPARK-13145) checkAnswer in SQL query suites should tolerate small float number error

2016-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13145: -- Summary: checkAnswer in SQL query suites should tolerate small float number error (was:

[jira] [Updated] (SPARK-13145) checkAnswer should tolerate small float number error

2016-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-13145: -- Component/s: SQL > checkAnswer should tolerate small float number error >

[jira] [Created] (SPARK-13157) ADD JAR command cannot handle path with @ character

2016-02-02 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-13157: -- Summary: ADD JAR command cannot handle path with @ character Key: SPARK-13157 URL: https://issues.apache.org/jira/browse/SPARK-13157 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-13145) checkAnswer in SQL query suites should tolerate small float number error

2016-02-02 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15129913#comment-15129913 ] Xiangrui Meng edited comment on SPARK-13145 at 2/3/16 6:51 AM: --- This is for

[jira] [Commented] (SPARK-10399) Off Heap Memory Access for non-JVM libraries (C++)

2016-02-02 Thread Kent Yao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15129928#comment-15129928 ] Kent Yao commented on SPARK-10399: -- How does this work go? > Off Heap Memory Access for non-JVM

[jira] [Updated] (SPARK-4036) Add Conditional Random Fields (CRF) algorithm to Spark MLlib

2016-02-02 Thread hujiayin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] hujiayin updated SPARK-4036: Attachment: crf-spark.zip latest CRF codes > Add Conditional Random Fields (CRF) algorithm to Spark MLlib

<    1   2   3   >