[jira] [Updated] (SPARK-13960) HTTP-based JAR Server doesn't respect spark.driver.host and there is no "spark.fileserver.host" option

2016-03-19 Thread Ilya Ostrovskiy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ilya Ostrovskiy updated SPARK-13960: Description: There is no option to specify which hostname/IP address the jar/file server

[jira] [Commented] (SPARK-13783) Model export/import for spark.ml: GBTs

2016-03-19 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15201937#comment-15201937 ] yuhao yang commented on SPARK-13783: I haven't started it yet. Go ahead please. > Model

[jira] [Updated] (SPARK-13965) TaskSetManager should kill the other running task attempts if any one task attempt succeeds for the same task

2016-03-19 Thread Devaraj K (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Devaraj K updated SPARK-13965: -- Summary: TaskSetManager should kill the other running task attempts if any one task attempt succeeds

[jira] [Resolved] (SPARK-13281) Switch broadcast of RDD to exception from warning

2016-03-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-13281. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11735

[jira] [Commented] (SPARK-12981) Dataframe distinct() followed by a filter(udf) in pyspark throws a casting error

2016-03-19 Thread Fabian Boehnlein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15197759#comment-15197759 ] Fabian Boehnlein commented on SPARK-12981: -- Seems similar to what I'm hitting when filtering

[jira] [Created] (SPARK-14013) Properly implement temporary functions in SessionCatalog

2016-03-19 Thread Andrew Or (JIRA)
Andrew Or created SPARK-14013: - Summary: Properly implement temporary functions in SessionCatalog Key: SPARK-14013 URL: https://issues.apache.org/jira/browse/SPARK-14013 Project: Spark Issue

[jira] [Resolved] (SPARK-13458) Datasets cannot be sorted

2016-03-19 Thread Oliver Beattie (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Oliver Beattie resolved SPARK-13458. Resolution: Fixed Fix Version/s: 2.0.0 Target Version/s: 2.0.0 > Datasets

[jira] [Commented] (SPARK-13863) TPCDS query 66 returns wrong results compared to TPC official result set

2016-03-19 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200160#comment-15200160 ] Xin Wu commented on SPARK-13863: In terms of the ordering. the only difference is that the row with Null

[jira] [Updated] (SPARK-13719) Bad JSON record raises java.lang.ClassCastException

2016-03-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-13719: - Assignee: Hyukjin Kwon > Bad JSON record raises java.lang.ClassCastException >

[jira] [Commented] (SPARK-13985) WAL for determistic batches with IDs

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200679#comment-15200679 ] Apache Spark commented on SPARK-13985: -- User 'marmbrus' has created a pull request for this issue:

[jira] [Commented] (SPARK-13998) HashingTF should extend UnaryTransformer

2016-03-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15201236#comment-15201236 ] Nick Pentreath commented on SPARK-13998: [~jlaskowski] I've moved this to a sub-task under

[jira] [Commented] (SPARK-13937) PySpark ML JavaWrapper, variable _java_obj should not be static

2016-03-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15197805#comment-15197805 ] Bryan Cutler commented on SPARK-13937: -- I'll submit a PR for this > PySpark ML JavaWrapper,

[jira] [Updated] (SPARK-13948) MiMa exclusions should not automatically apply to companion objects

2016-03-19 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-13948: --- Summary: MiMa exclusions should not automatically apply to companion objects (was: MiMa Check

[jira] [Resolved] (SPARK-14002) SQLBuilder should add subquery to Aggregate child when necessary

2016-03-19 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-14002. Resolution: Duplicate Fix Version/s: 2.0.0 This issue is actually covered by SPARK-13976.

[jira] [Updated] (SPARK-13968) Use MurmurHash3 for hashing String features

2016-03-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-13968: --- Summary: Use MurmurHash3 for hashing String features (was: User MurmurHash3 for hashing

[jira] [Created] (SPARK-13962) spark.ml Evaluators should support other numeric types for label

2016-03-19 Thread Nick Pentreath (JIRA)
Nick Pentreath created SPARK-13962: -- Summary: spark.ml Evaluators should support other numeric types for label Key: SPARK-13962 URL: https://issues.apache.org/jira/browse/SPARK-13962 Project: Spark

[jira] [Commented] (SPARK-13865) TPCDS query 87 returns wrong results compared to TPC official result set

2016-03-19 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200861#comment-15200861 ] Xiao Li commented on SPARK-13865: - First, if you use the query in this JIRA, you definitely should see a

[jira] [Assigned] (SPARK-14000) case class with a tuple field can't work in Dataset

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14000: Assignee: Apache Spark > case class with a tuple field can't work in Dataset >

[jira] [Assigned] (SPARK-13988) Large history files block new applications from showing up in History UI.

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13988: Assignee: (was: Apache Spark) > Large history files block new applications from

[jira] [Commented] (SPARK-13955) Spark in yarn mode fails

2016-03-19 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15199201#comment-15199201 ] Jeff Zhang commented on SPARK-13955: I just run spark-shell in yarn-client mode. > Spark in yarn

[jira] [Updated] (SPARK-13964) Feature hashing improvements

2016-03-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-13964: --- Description: Investigate improvements to Spark ML feature hashing (see e.g.

[jira] [Commented] (SPARK-13863) TPCDS query 66 returns wrong results compared to TPC official result set

2016-03-19 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198607#comment-15198607 ] Xiao Li commented on SPARK-13863: - It could be the same issue like

[jira] [Assigned] (SPARK-13963) Add binary toggle Param to ml.HashingTF

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13963: Assignee: Bryan Cutler (was: Apache Spark) > Add binary toggle Param to ml.HashingTF >

[jira] [Assigned] (SPARK-14006) Builds of 1.6 branch fail R style check

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14006: Assignee: Apache Spark > Builds of 1.6 branch fail R style check >

[jira] [Commented] (SPARK-13948) MiMa Check should catch if the visibility change to `private`

2016-03-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198328#comment-15198328 ] Dongjoon Hyun commented on SPARK-13948: --- Thank you! > MiMa Check should catch if the

[jira] [Resolved] (SPARK-13873) Avoid the copy in whole stage codegen when there is no joins

2016-03-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-13873. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11740

[jira] [Updated] (SPARK-13905) Change signature of as.data.frame() to be consistent with the R base package

2016-03-19 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sun Rui updated SPARK-13905: Summary: Change signature of as.data.frame() to be consistent with the R base package (was: Change

[jira] [Commented] (SPARK-13719) Bad JSON record raises java.lang.ClassCastException

2016-03-19 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198537#comment-15198537 ] Hyukjin Kwon commented on SPARK-13719: -- Sorry, I will remove the duplicate link because it is a

[jira] [Commented] (SPARK-13889) Integer overflow when calculating the max number of executor failure

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15201032#comment-15201032 ] Apache Spark commented on SPARK-13889: -- User 'carsonwang' has created a pull request for this issue:

[jira] [Commented] (SPARK-13992) Add support for off-heap caching

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200682#comment-15200682 ] Apache Spark commented on SPARK-13992: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Updated] (SPARK-13963) Add binary toggle Param to ml.HashingTF

2016-03-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-13963: --- Description: It would be handy to add a binary toggle Param to {{HashingTF}, as in the

[jira] [Comment Edited] (SPARK-13456) Cannot create encoders for case classes defined in Spark shell after upgrading to Scala 2.11

2016-03-19 Thread Arjen P. de Vries (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15199774#comment-15199774 ] Arjen P. de Vries edited comment on SPARK-13456 at 3/17/16 4:02 PM: I

[jira] [Assigned] (SPARK-14007) Manage the memory for hash map for shuffle hash join

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14007: Assignee: Davies Liu (was: Apache Spark) > Manage the memory for hash map for shuffle

[jira] [Commented] (SPARK-7992) Hide private classes/objects in in generated Java API doc

2016-03-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200106#comment-15200106 ] Xiangrui Meng commented on SPARK-7992: -- Great! I pinged you on the old genjavadoc PR page. Hopefully

[jira] [Commented] (SPARK-13949) PySpark ml DecisionTreeClassifier, Regressor support export/import

2016-03-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198296#comment-15198296 ] Joseph K. Bradley commented on SPARK-13949: --- Please do! > PySpark ml DecisionTreeClassifier,

[jira] [Assigned] (SPARK-13994) Investigate types that are not supported by vectorized parquet record reader

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13994: Assignee: (was: Apache Spark) > Investigate types that are not supported by

[jira] [Assigned] (SPARK-13977) Bring back ShuffledHashJoin

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13977: Assignee: Apache Spark (was: Davies Liu) > Bring back ShuffledHashJoin >

[jira] [Commented] (SPARK-13975) Cannot specify extra libs for executor from /extra-lib

2016-03-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15199775#comment-15199775 ] Sean Owen commented on SPARK-13975: --- You can transmit the files separately with --files, and then

[jira] [Commented] (SPARK-13832) TPC-DS Query 36 fails with Parser error

2016-03-19 Thread Roy Cecil (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15201962#comment-15201962 ] Roy Cecil commented on SPARK-13832: --- Xin, are you saying you the query passed compile in Spark 2.0 but

[jira] [Assigned] (SPARK-13973) `ipython notebook` is going away...

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13973: Assignee: (was: Apache Spark) > `ipython notebook` is going away... >

[jira] [Assigned] (SPARK-13965) Driver should kill the other running task attempts if any one task attempt succeeds for the same task

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13965: Assignee: (was: Apache Spark) > Driver should kill the other running task attempts if

[jira] [Assigned] (SPARK-13289) Word2Vec generate infinite distances when numIterations>5

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13289: Assignee: Apache Spark > Word2Vec generate infinite distances when numIterations>5 >

[jira] [Updated] (SPARK-13983) HiveThriftServer2 can not get "--hiveconf" or ''--hivevar" variables since 1.6 version (both multi-session and single session)

2016-03-19 Thread Teng Qiu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Teng Qiu updated SPARK-13983: - Description: HiveThriftServer2 should be able to get "\--hiveconf" or ''\-\-hivevar" variables from

[jira] [Created] (SPARK-13990) Automatically pick serializer when caching RDDs

2016-03-19 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-13990: -- Summary: Automatically pick serializer when caching RDDs Key: SPARK-13990 URL: https://issues.apache.org/jira/browse/SPARK-13990 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-13826) Revise ScalaDoc of the new Dataset API

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13826: Assignee: Cheng Lian (was: Apache Spark) > Revise ScalaDoc of the new Dataset API >

[jira] [Commented] (SPARK-13934) SqlParser.parseTableIdentifier cannot recognize table name start with scientific notation

2016-03-19 Thread Yang Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15197546#comment-15197546 ] Yang Wang commented on SPARK-13934: --- The master branch don't have this problem. So should create

[jira] [Commented] (SPARK-13988) Large history files block new applications from showing up in History UI.

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200534#comment-15200534 ] Apache Spark commented on SPARK-13988: -- User 'Parth-Brahmbhatt' has created a pull request for this

[jira] [Assigned] (SPARK-13908) Limit not pushed down

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13908: Assignee: Apache Spark > Limit not pushed down > - > >

[jira] [Commented] (SPARK-13865) TPCDS query 87 returns wrong results compared to TPC official result set

2016-03-19 Thread JESSE CHEN (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200869#comment-15200869 ] JESSE CHEN commented on SPARK-13865: I am onto that. Thanks. Also, good to know the parsing error is

[jira] [Commented] (SPARK-11319) PySpark silently accepts null values in non-nullable DataFrame fields.

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15199741#comment-15199741 ] Apache Spark commented on SPARK-11319: -- User 'francoisprunier' has created a pull request for this

[jira] [Commented] (SPARK-13938) word2phrase feature created in ML

2016-03-19 Thread Steve Weng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198082#comment-15198082 ] Steve Weng commented on SPARK-13938: looks good thanks! > word2phrase feature created in ML >

[jira] [Assigned] (SPARK-13993) PySpark ml.feature.RFormula/RFormulaModel support export/import

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13993: Assignee: (was: Apache Spark) > PySpark ml.feature.RFormula/RFormulaModel support

[jira] [Assigned] (SPARK-13948) MiMa Check should catch if the visibility change to `private`

2016-03-19 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen reassigned SPARK-13948: -- Assignee: Josh Rosen > MiMa Check should catch if the visibility change to `private` >

[jira] [Updated] (SPARK-13978) [GSoC 2016] Build monitoring UI and related infrastructure for Spark SQL and structured streaming

2016-03-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-13978: - Assignee: Tao Lin > [GSoC 2016] Build monitoring UI and related infrastructure for Spark SQL and >

[jira] [Commented] (SPARK-13978) [GSoC 2016] Build monitoring UI and related infrastructure for Spark SQL and structured streaming

2016-03-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202933#comment-15202933 ] Yin Huai commented on SPARK-13978: -- Thank you for your interests. We already have an interested

[jira] [Updated] (SPARK-10680) Flaky test: network.RequestTimeoutIntegrationSuite.timeoutInactiveRequests

2016-03-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10680: - Assignee: Shixiong Zhu (was: Josh Rosen) > Flaky test:

[jira] [Resolved] (SPARK-10680) Flaky test: network.RequestTimeoutIntegrationSuite.timeoutInactiveRequests

2016-03-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-10680. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11833

[jira] [Created] (SPARK-13980) Incrementally serialize blocks while unrolling them in MemoryStore

2016-03-19 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-13980: -- Summary: Incrementally serialize blocks while unrolling them in MemoryStore Key: SPARK-13980 URL: https://issues.apache.org/jira/browse/SPARK-13980 Project: Spark

[jira] [Resolved] (SPARK-14010) ColumnPruning is conflict with PushPredicateThroughProject

2016-03-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14010. Resolution: Duplicate > ColumnPruning is conflict with PushPredicateThroughProject >

[jira] [Commented] (SPARK-13783) Model export/import for spark.ml: GBTs

2016-03-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15201303#comment-15201303 ] Yanbo Liang commented on SPARK-13783: - Hi [~yuhaoyan], are you working on this issue? If not, I can

[jira] [Updated] (SPARK-12789) Support order by position in SQL

2016-03-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12789: Description: This is to support order by position in SQL, e.g. {noformat} select c1, c2, c3 from

[jira] [Commented] (SPARK-13952) spark.ml GBT algs need to use random seed

2016-03-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198859#comment-15198859 ] Nick Pentreath commented on SPARK-13952: [~josephkb] As far as I can see, the old MLlib GBT

[jira] [Commented] (SPARK-13774) IllegalArgumentException: Can not create a Path from an empty string for incorrect file path

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198475#comment-15198475 ] Apache Spark commented on SPARK-13774: -- User 'skambha' has created a pull request for this issue:

[jira] [Commented] (SPARK-13858) TPCDS query 21 returns wrong results compared to TPC official result set

2016-03-19 Thread JESSE CHEN (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198558#comment-15198558 ] JESSE CHEN commented on SPARK-13858: Good job, Bo! I would like to test this on my cluster if you

[jira] [Comment Edited] (SPARK-13969) Extend input format that feature hashing can handle

2016-03-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15201257#comment-15201257 ] Nick Pentreath edited comment on SPARK-13969 at 3/18/16 10:00 AM: -- What

[jira] [Created] (SPARK-13956) Spark 1.6.1 Client can't connect to 1.6.1 Server

2016-03-19 Thread Pranas Baliuka (JIRA)
Pranas Baliuka created SPARK-13956: -- Summary: Spark 1.6.1 Client can't connect to 1.6.1 Server Key: SPARK-13956 URL: https://issues.apache.org/jira/browse/SPARK-13956 Project: Spark Issue

[jira] [Commented] (SPARK-13983) HiveThriftServer2 can not get "--hiveconf" or ''--hivevar" variables since 1.6 version (both multi-session and single session)

2016-03-19 Thread Teng Qiu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200661#comment-15200661 ] Teng Qiu commented on SPARK-13983: -- now i can confirm that this issue was caused by PR

[jira] [Updated] (SPARK-13968) User MurmurHash for feature hashing

2016-03-19 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-13968: --- Summary: User MurmurHash for feature hashing (was: User MurmurHash in for feature hashing)

[jira] [Resolved] (SPARK-13885) Spark On Yarn attempt id representation regression

2016-03-19 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-13885. Resolution: Fixed Assignee: Saisai Shao Fix Version/s: 2.0.0 > Spark On

[jira] [Commented] (SPARK-13952) spark.ml GBT algs need to use random seed

2016-03-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198394#comment-15198394 ] Joseph K. Bradley commented on SPARK-13952: --- [~sethah] Would you have time to work on this? >

[jira] [Updated] (SPARK-13922) Filter rows with null attributes in parquet vectorized reader

2016-03-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-13922: - Assignee: Sameer Agarwal > Filter rows with null attributes in parquet vectorized reader >

[jira] [Updated] (SPARK-12721) SQL generation support for script transformation

2016-03-19 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-12721: - Assignee: Xiao Li > SQL generation support for script transformation >

[jira] [Created] (SPARK-13964) Feature hashing improvements

2016-03-19 Thread Nick Pentreath (JIRA)
Nick Pentreath created SPARK-13964: -- Summary: Feature hashing improvements Key: SPARK-13964 URL: https://issues.apache.org/jira/browse/SPARK-13964 Project: Spark Issue Type: Umbrella

[jira] [Commented] (SPARK-13861) TPCDS query 40 returns wrong results compared to TPC official result set

2016-03-19 Thread Bo Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198421#comment-15198421 ] Bo Meng commented on SPARK-13861: - This involves 2 issues: the first one will be the same as

[jira] [Assigned] (SPARK-13990) Automatically pick serializer when caching RDDs

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13990: Assignee: Apache Spark (was: Josh Rosen) > Automatically pick serializer when caching

[jira] [Created] (SPARK-13948) MiMa Check should catch if the visibility change to `private`

2016-03-19 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-13948: - Summary: MiMa Check should catch if the visibility change to `private` Key: SPARK-13948 URL: https://issues.apache.org/jira/browse/SPARK-13948 Project: Spark

[jira] [Commented] (SPARK-13954) spar-shell starts with exceptions

2016-03-19 Thread Pranas Baliuka (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198688#comment-15198688 ] Pranas Baliuka commented on SPARK-13954: Work around: {code}mkdir -p /tmp/spark-events{code} The

[jira] [Assigned] (SPARK-13815) Provide better Exception messages in Pipeline load methods

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13815: Assignee: Apache Spark > Provide better Exception messages in Pipeline load methods >

[jira] [Updated] (SPARK-13974) sub-query names do not need to be globally unique while generate SQL

2016-03-19 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-13974: --- Assignee: Wenchen Fan > sub-query names do not need to be globally unique while generate SQL >

[jira] [Resolved] (SPARK-13921) Store serialized blocks as multiple chunks in MemoryStore

2016-03-19 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-13921. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11748

[jira] [Updated] (SPARK-13960) HTTP-based JAR Server doesn't respect spark.driver.host and there is no "spark.fileserver.host" option

2016-03-19 Thread Ilya Ostrovskiy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ilya Ostrovskiy updated SPARK-13960: Description: There is no option to specify which hostname/IP address the jar/file server

[jira] [Updated] (SPARK-13982) SparkR - KMeans predict: Output column name of features is an unclear, automatically genetared text

2016-03-19 Thread Narine Kokhlikyan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13982?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Narine Kokhlikyan updated SPARK-13982: -- Summary: SparkR - KMeans predict: Output column name of features is an unclear,

[jira] [Commented] (SPARK-13978) [GSoC 2016] Build monitoring UI and related infrastructure for Spark SQL and structured streaming

2016-03-19 Thread Ilias Sarantopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202900#comment-15202900 ] Ilias Sarantopoulos commented on SPARK-13978: - Hi, I am interested in this project. I

[jira] [Commented] (SPARK-13939) Kafka createDirectStream not parallelizing properly

2016-03-19 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15199638#comment-15199638 ] Cody Koeninger commented on SPARK-13939: When you say print to screen, are you using print() on

[jira] [Resolved] (SPARK-13897) GroupedData vs GroupedDataset naming is confusing

2016-03-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-13897. - Resolution: Fixed Fix Version/s: 2.0.0 > GroupedData vs GroupedDataset naming is

[jira] [Commented] (SPARK-13859) TPCDS query 38 returns wrong results compared to TPC official result set

2016-03-19 Thread Dilip Biswal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15199097#comment-15199097 ] Dilip Biswal commented on SPARK-13859: -- I have looked into this issue. After changing the query to

[jira] [Updated] (SPARK-13897) GroupedData vs GroupedDataset naming is confusing

2016-03-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-13897: Description: A placeholder to figure out a better naming scheme for the two. Update

[jira] [Commented] (SPARK-13919) Resolving the Conflicts of ColumnPruning and PushPredicateThroughProject

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202116#comment-15202116 ] Apache Spark commented on SPARK-13919: -- User 'davies' has created a pull request for this issue:

[jira] [Resolved] (SPARK-13930) Apply fast serialization on collect limit

2016-03-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-13930. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11759

[jira] [Commented] (SPARK-13458) Datasets cannot be sorted

2016-03-19 Thread Rishabh Bhardwaj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15199324#comment-15199324 ] Rishabh Bhardwaj commented on SPARK-13458: -- [~obeattie] In the master branch,DataSet.scala have

[jira] [Updated] (SPARK-13942) Remove Shark-related docs for 2.x

2016-03-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-13942: -- Summary: Remove Shark-related docs for 2.x (was: Remove Shark-related docs and visibility for

[jira] [Assigned] (SPARK-13874) Move docs of streaming-flume, streaming-mqtt, streaming-zeromq, streaming-akka, streaming-twitter to Spark packages

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13874: Assignee: Apache Spark > Move docs of streaming-flume, streaming-mqtt, streaming-zeromq,

[jira] [Commented] (SPARK-14005) Make RDD more compatible with Scala's collection

2016-03-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15201422#comment-15201422 ] Sean Owen commented on SPARK-14005: --- I think we just discussed this? nonEmpty is just !isEmpty.

[jira] [Reopened] (SPARK-13458) Datasets cannot be sorted

2016-03-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-13458: --- > Datasets cannot be sorted > - > > Key: SPARK-13458 >

[jira] [Commented] (SPARK-13990) Automatically pick serializer when caching RDDs

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15200596#comment-15200596 ] Apache Spark commented on SPARK-13990: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Updated] (SPARK-13768) Set hive conf failed use --hiveconf when beeline connect to thriftserver

2016-03-19 Thread Weizhong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weizhong updated SPARK-13768: - Description: 1. Start thriftserver 2. ./bin/beeline -u '...' --hiveconf

[jira] [Assigned] (SPARK-13991) Extend mvn enforcer rule

2016-03-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13991: Assignee: (was: Apache Spark) > Extend mvn enforcer rule > >

[jira] [Created] (SPARK-14005) Make RDD more compatible with Scala's collection

2016-03-19 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-14005: Summary: Make RDD more compatible with Scala's collection Key: SPARK-14005 URL: https://issues.apache.org/jira/browse/SPARK-14005 Project: Spark Issue

[jira] [Commented] (SPARK-12148) SparkR: rename DataFrame to SparkDataFrame

2016-03-19 Thread Anupama Joshi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202376#comment-15202376 ] Anupama Joshi commented on SPARK-12148: --- Run the code in this sequence to reproduce the issue - >

[jira] [Resolved] (SPARK-913) log the size of each shuffle block in block manager

2016-03-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-913. - Resolution: Not A Problem > log the size of each shuffle block in block manager >

<    1   2   3   4   5   6   >