[jira] [Commented] (SPARK-5992) Locality Sensitive Hashing (LSH) for MLlib

2016-10-17 Thread Yun Ni (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15581416#comment-15581416 ] Yun Ni commented on SPARK-5992: --- Yes, I have implemented cosine, jaccard, euclidean and hamming distance.

[jira] [Commented] (SPARK-17898) --repositories needs username and password

2016-10-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15581473#comment-15581473 ] Sean Owen commented on SPARK-17898: --- Yeah, that's a general way to add authentication to an HTTP URL

[jira] [Updated] (SPARK-17817) PySpark RDD Repartitioning Results in Highly Skewed Partition Sizes

2016-10-17 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-17817: Component/s: PySpark > PySpark RDD Repartitioning Results in Highly Skewed Partition Sizes >

[jira] [Updated] (SPARK-16063) Add storageLevel to Dataset

2016-10-17 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-16063: Affects Version/s: SQL > Add storageLevel to Dataset > --- > >

[jira] [Updated] (SPARK-16063) Add storageLevel to Dataset

2016-10-17 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-16063: Affects Version/s: (was: SQL) > Add storageLevel to Dataset > --- > >

[jira] [Updated] (SPARK-17254) Filter operator should have “stop if false” semantics for sorted data

2016-10-17 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-17254: Priority: Major (was: Minor) > Filter operator should have “stop if false” semantics for

[jira] [Updated] (SPARK-17254) Filter operator should have “stop if false” semantics for sorted data

2016-10-17 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-17254: Attachment: (was: stop-after-physical-plan.pdf) > Filter operator should have “stop if

[jira] [Updated] (SPARK-17254) Filter operator should have “stop if false” semantics for sorted data

2016-10-17 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-17254: Attachment: stop-after-physical-plan.pdf > Filter operator should have “stop if false”

[jira] [Commented] (SPARK-17930) The SerializerInstance instance used when deserializing a TaskResult is not reused

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15581668#comment-15581668 ] Apache Spark commented on SPARK-17930: -- User 'witgo' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17930) The SerializerInstance instance used when deserializing a TaskResult is not reused

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17930: Assignee: Apache Spark > The SerializerInstance instance used when deserializing a

[jira] [Resolved] (SPARK-17951) BlockFetch with multiple threads slows down after spark 1.6

2016-10-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-17951. --- Resolution: Not A Problem This does not show a slow-down in an actual Spark operation though, and

[jira] [Commented] (SPARK-10590) Spark with YARN build is broken

2016-10-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15581470#comment-15581470 ] Sean Owen commented on SPARK-10590: --- Maven 3.3.x has been required (literally required by the Maven

[jira] [Resolved] (SPARK-17892) Query in CTAS is Optimized Twice (branch-2.0)

2016-10-17 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-17892. - Resolution: Fixed Fix Version/s: 2.0.2 > Query in CTAS is Optimized Twice (branch-2.0) >

[jira] [Comment Edited] (SPARK-17954) FetchFailedException executor cannot connect to another worker executor

2016-10-17 Thread Vitaly Gerasimov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15581147#comment-15581147 ] Vitaly Gerasimov edited comment on SPARK-17954 at 10/17/16 8:03 AM: I

[jira] [Comment Edited] (SPARK-17954) FetchFailedException executor cannot connect to another worker executor

2016-10-17 Thread Vitaly Gerasimov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15581147#comment-15581147 ] Vitaly Gerasimov edited comment on SPARK-17954 at 10/17/16 8:01 AM: I

[jira] [Commented] (SPARK-11115) Host verification is not correct for IPv6

2016-10-17 Thread Tao Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15581599#comment-15581599 ] Tao Meng commented on SPARK-11115: -- I find `org.apache.spark.util.Utils.localHostName` call

[jira] [Assigned] (SPARK-17930) The SerializerInstance instance used when deserializing a TaskResult is not reused

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17930: Assignee: (was: Apache Spark) > The SerializerInstance instance used when

[jira] [Commented] (SPARK-650) Add a "setup hook" API for running initialization code on each executor

2016-10-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15581774#comment-15581774 ] Sean Owen commented on SPARK-650: - BTW I am not suggesting an "empty RDD" for your case. That was specific

[jira] [Commented] (SPARK-17954) FetchFailedException executor cannot connect to another worker executor

2016-10-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15581803#comment-15581803 ] Sean Owen commented on SPARK-17954: --- I recall there were changes about the default bind behavior. This

[jira] [Commented] (SPARK-17929) Deadlock when AM restart and send RemoveExecutor on reset

2016-10-17 Thread xwebber (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15581446#comment-15581446 ] xwebber commented on SPARK-17929: - meet the same problem, seems the deadlock is obvious in code. will try

[jira] [Commented] (SPARK-17954) FetchFailedException executor cannot connect to another worker executor

2016-10-17 Thread Vitaly Gerasimov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15581532#comment-15581532 ] Vitaly Gerasimov commented on SPARK-17954: -- /etc/hosts for worker1.test (worker2.test hosts

[jira] [Commented] (SPARK-11115) Host verification is not correct for IPv6

2016-10-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15581616#comment-15581616 ] Sean Owen commented on SPARK-11115: --- The host name is not the same as the host address. I don't think

[jira] [Updated] (SPARK-16321) [Spark 2.0] Performance regression when reading parquet and using PPD and non-vectorized reader

2016-10-17 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated SPARK-16321: --- Component/s: (was: PySpark) SQL > [Spark 2.0] Performance regression when

[jira] [Updated] (SPARK-16063) Add storageLevel to Dataset

2016-10-17 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-16063: Component/s: SQL > Add storageLevel to Dataset > --- > > Key:

[jira] [Assigned] (SPARK-17969) I think it's user unfriendly to process standard json file with DataFrame

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17969: Assignee: Apache Spark > I think it's user unfriendly to process standard json file with

[jira] [Assigned] (SPARK-17969) I think it's user unfriendly to process standard json file with DataFrame

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17969: Assignee: (was: Apache Spark) > I think it's user unfriendly to process standard json

[jira] [Commented] (SPARK-17969) I think it's user unfriendly to process standard json file with DataFrame

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15581607#comment-15581607 ] Apache Spark commented on SPARK-17969: -- User 'codlife' has created a pull request for this issue:

[jira] [Commented] (SPARK-11115) Host verification is not correct for IPv6

2016-10-17 Thread Tao Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15581686#comment-15581686 ] Tao Meng commented on SPARK-11115: -- Yes. Maybe the method name is not very accurate. It is better to

[jira] [Comment Edited] (SPARK-17954) FetchFailedException executor cannot connect to another worker executor

2016-10-17 Thread Vitaly Gerasimov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15581147#comment-15581147 ] Vitaly Gerasimov edited comment on SPARK-17954 at 10/17/16 8:00 AM: I

[jira] [Updated] (SPARK-17731) Metrics for Structured Streaming

2016-10-17 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-17731: -- Fix Version/s: 2.0.2 > Metrics for Structured Streaming > > >

[jira] [Comment Edited] (SPARK-17950) Match SparseVector behavior with DenseVector

2016-10-17 Thread AbderRahman Sobh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15583877#comment-15583877 ] AbderRahman Sobh edited comment on SPARK-17950 at 10/18/16 12:05 AM: -

[jira] [Comment Edited] (SPARK-17950) Match SparseVector behavior with DenseVector

2016-10-17 Thread AbderRahman Sobh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15583877#comment-15583877 ] AbderRahman Sobh edited comment on SPARK-17950 at 10/18/16 12:07 AM: -

[jira] [Closed] (SPARK-3132) Avoid serialization for Array[Byte] in TorrentBroadcast

2016-10-17 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-3132. -- Resolution: Not A Problem Marking it as not-a-problem for now given Josh's comment. > Avoid

[jira] [Created] (SPARK-17979) Remove deprecated support for config SPARK_YARN_USER_ENV

2016-10-17 Thread Kishor Patil (JIRA)
Kishor Patil created SPARK-17979: Summary: Remove deprecated support for config SPARK_YARN_USER_ENV Key: SPARK-17979 URL: https://issues.apache.org/jira/browse/SPARK-17979 Project: Spark

[jira] [Commented] (SPARK-10915) Add support for UDAFs in Python

2016-10-17 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15583638#comment-15583638 ] Davies Liu commented on SPARK-10915: Currently all the aggregate functions are implemented in Scala,

[jira] [Commented] (SPARK-7721) Generate test coverage report from Python

2016-10-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15583642#comment-15583642 ] Josh Rosen commented on SPARK-7721: --- IIRC when I looked into this I hit problems with the HTML Publisher

[jira] [Assigned] (SPARK-17980) Fix refreshByPath for converted Hive tables

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17980: Assignee: Apache Spark > Fix refreshByPath for converted Hive tables >

[jira] [Commented] (SPARK-17980) Fix refreshByPath for converted Hive tables

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15583652#comment-15583652 ] Apache Spark commented on SPARK-17980: -- User 'ericl' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17980) Fix refreshByPath for converted Hive tables

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17980: Assignee: (was: Apache Spark) > Fix refreshByPath for converted Hive tables >

[jira] [Commented] (SPARK-15708) Tasks table in Detailed Stage page shows ip instead of hostname under Executor ID/Host

2016-10-17 Thread Alex Bozarth (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15583751#comment-15583751 ] Alex Bozarth commented on SPARK-15708: -- I'm not sure closing this as cannot reproduce was correct,

[jira] [Created] (SPARK-17981) Incorrectly Set Nullability to False in FilterExec

2016-10-17 Thread Xiao Li (JIRA)
Xiao Li created SPARK-17981: --- Summary: Incorrectly Set Nullability to False in FilterExec Key: SPARK-17981 URL: https://issues.apache.org/jira/browse/SPARK-17981 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-17981) Incorrectly Set Nullability to False in FilterExec

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17981: Assignee: Apache Spark (was: Xiao Li) > Incorrectly Set Nullability to False in

[jira] [Assigned] (SPARK-17957) Calling outer join and na.fill(0) and then inner join will miss rows

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17957: Assignee: Xiao Li (was: Apache Spark) > Calling outer join and na.fill(0) and then inner

[jira] [Commented] (SPARK-17368) Scala value classes create encoder problems and break at runtime

2016-10-17 Thread Aris Vlasakakis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15583873#comment-15583873 ] Aris Vlasakakis commented on SPARK-17368: - That is great, thank you for the help with this. >

[jira] [Commented] (SPARK-17957) Calling outer join and na.fill(0) and then inner join will miss rows

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15583874#comment-15583874 ] Apache Spark commented on SPARK-17957: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17957) Calling outer join and na.fill(0) and then inner join will miss rows

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17957: Assignee: Apache Spark (was: Xiao Li) > Calling outer join and na.fill(0) and then inner

[jira] [Assigned] (SPARK-17981) Incorrectly Set Nullability to False in FilterExec

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17981: Assignee: Xiao Li (was: Apache Spark) > Incorrectly Set Nullability to False in

[jira] [Commented] (SPARK-17981) Incorrectly Set Nullability to False in FilterExec

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15583872#comment-15583872 ] Apache Spark commented on SPARK-17981: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Commented] (SPARK-17950) Match SparseVector behavior with DenseVector

2016-10-17 Thread AbderRahman Sobh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15583877#comment-15583877 ] AbderRahman Sobh commented on SPARK-17950: -- Yes, the full array needs to be expanded since the

[jira] [Commented] (SPARK-10872) Derby error (XSDB6) when creating new HiveContext after restarting SparkContext

2016-10-17 Thread Angus Gerry (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15584052#comment-15584052 ] Angus Gerry commented on SPARK-10872: - Hi [~srowen], I'm chasing down something in our code base at

[jira] [Created] (SPARK-17982) Spark 2.0.0 CREATE VIEW statement fails when select statement contains limit clause

2016-10-17 Thread Franck Tago (JIRA)
Franck Tago created SPARK-17982: --- Summary: Spark 2.0.0 CREATE VIEW statement fails when select statement contains limit clause Key: SPARK-17982 URL: https://issues.apache.org/jira/browse/SPARK-17982

[jira] [Updated] (SPARK-17983) Can't filter over mixed case parquet columns of converted Hive tables

2016-10-17 Thread Eric Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Liang updated SPARK-17983: --- Description: We should probably revive https://github.com/apache/spark/pull/14750 in order to fix

[jira] [Created] (SPARK-17983) Can't filter over mixed case parquet columns of converted Hive tables

2016-10-17 Thread Eric Liang (JIRA)
Eric Liang created SPARK-17983: -- Summary: Can't filter over mixed case parquet columns of converted Hive tables Key: SPARK-17983 URL: https://issues.apache.org/jira/browse/SPARK-17983 Project: Spark

[jira] [Commented] (SPARK-4160) Standalone cluster mode does not upload all needed jars to driver node

2016-10-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15584137#comment-15584137 ] Marcelo Vanzin commented on SPARK-4160: --- You don't need to ask for permission to work on things. >

[jira] [Commented] (SPARK-14212) Add configuration element for --packages option

2016-10-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14212?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15584142#comment-15584142 ] Marcelo Vanzin commented on SPARK-14212: SPARK-15760 added the docs to 2.0 only, but I'm pretty

[jira] [Commented] (SPARK-17147) Spark Streaming Kafka 0.10 Consumer Can't Handle Non-consecutive Offsets

2016-10-17 Thread Justin Miller (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15584169#comment-15584169 ] Justin Miller commented on SPARK-17147: --- Could this possibly be related to why I'm seeing the

[jira] [Created] (SPARK-17984) Add support for numa aware

2016-10-17 Thread quanfuwang (JIRA)
quanfuwang created SPARK-17984: -- Summary: Add support for numa aware Key: SPARK-17984 URL: https://issues.apache.org/jira/browse/SPARK-17984 Project: Spark Issue Type: Task

[jira] [Commented] (SPARK-17982) Spark 2.0.0 CREATE VIEW statement fails when select statement contains limit clause

2016-10-17 Thread Franck Tago (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15584178#comment-15584178 ] Franck Tago commented on SPARK-17982: - == SQL == SELECT `gen_attr_0` AS `WHERE_ID`, `gen_attr_2` AS

[jira] [Resolved] (SPARK-12280) "--packages" command doesn't work in "spark-submit"

2016-10-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-12280. Resolution: Cannot Reproduce Please reopen with more info if you're still running into

[jira] [Updated] (SPARK-17984) Add support for numa aware

2016-10-17 Thread quanfuwang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] quanfuwang updated SPARK-17984: --- Description: This Jira is target to add support numa aware feature which can help improve

[jira] [Resolved] (SPARK-7882) HBase Input Format Example does not allow passing ZK parent node

2016-10-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-7882. --- Resolution: Not A Problem HBase examples are not included anymore. > HBase Input Format

[jira] [Resolved] (SPARK-5925) YARN - Spark progress bar stucks at 10% but after finishing shows 100%

2016-10-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-5925. --- Resolution: Won't Fix I don't think this can be fixed in Spark at all. There's no way to know

[jira] [Updated] (SPARK-17984) Add support for numa aware feature

2016-10-17 Thread quanfuwang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] quanfuwang updated SPARK-17984: --- Summary: Add support for numa aware feature (was: Add support for numa aware) > Add support for

[jira] [Created] (SPARK-17985) Bump commons-lang3 version to 3.5.

2016-10-17 Thread Takuya Ueshin (JIRA)
Takuya Ueshin created SPARK-17985: - Summary: Bump commons-lang3 version to 3.5. Key: SPARK-17985 URL: https://issues.apache.org/jira/browse/SPARK-17985 Project: Spark Issue Type: Bug

[jira] [Comment Edited] (SPARK-14212) Add configuration element for --packages option

2016-10-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14212?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15584142#comment-15584142 ] Marcelo Vanzin edited comment on SPARK-14212 at 10/18/16 3:50 AM: --

[jira] [Resolved] (SPARK-17504) Spark App Handle from SparkLauncher always returns UNKNOWN app state when used with Mesos in Client Mode

2016-10-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-17504. Resolution: Duplicate > Spark App Handle from SparkLauncher always returns UNKNOWN app

[jira] [Commented] (SPARK-17147) Spark Streaming Kafka 0.10 Consumer Can't Handle Non-consecutive Offsets

2016-10-17 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15584172#comment-15584172 ] Cody Koeninger commented on SPARK-17147: Well, are you using compacted topics? > Spark Streaming

[jira] [Assigned] (SPARK-17984) Add support for numa aware

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17984: Assignee: Apache Spark > Add support for numa aware > -- > >

[jira] [Commented] (SPARK-17984) Add support for numa aware

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15584206#comment-15584206 ] Apache Spark commented on SPARK-17984: -- User 'quanfuw' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17984) Add support for numa aware

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17984: Assignee: (was: Apache Spark) > Add support for numa aware >

[jira] [Resolved] (SPARK-8122) ParquetRelation.enableLogForwarding() may fail to configure loggers

2016-10-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-8122. --- Resolution: Won't Fix This code doesn't exist anymore in 2.x at least, so I'll assume this

[jira] [Updated] (SPARK-17984) Add support for numa aware

2016-10-17 Thread quanfuwang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] quanfuwang updated SPARK-17984: --- Shepherd: (was: quanfuwang) > Add support for numa aware > -- > >

[jira] [Updated] (SPARK-17984) Add support for numa aware

2016-10-17 Thread quanfuwang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] quanfuwang updated SPARK-17984: --- Issue Type: New Feature (was: Task) > Add support for numa aware > -- > >

[jira] [Resolved] (SPARK-6108) No application number limit in spark history server

2016-10-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-6108. --- Resolution: Won't Fix There are many ways currently to control how many applications are kept

[jira] [Resolved] (SPARK-5230) Print usage for spark-submit and spark-class in Windows

2016-10-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-5230. --- Resolution: Done Pretty sure I implemented this somewhere in the 1.x line. > Print usage for

[jira] [Assigned] (SPARK-17985) Bump commons-lang3 version to 3.5.

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17985: Assignee: (was: Apache Spark) > Bump commons-lang3 version to 3.5. >

[jira] [Commented] (SPARK-17985) Bump commons-lang3 version to 3.5.

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15584263#comment-15584263 ] Apache Spark commented on SPARK-17985: -- User 'ueshin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17985) Bump commons-lang3 version to 3.5.

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17985: Assignee: Apache Spark > Bump commons-lang3 version to 3.5. >

[jira] [Created] (SPARK-17986) SQLTransformer leaks temporary tables

2016-10-17 Thread Drew Robb (JIRA)
Drew Robb created SPARK-17986: - Summary: SQLTransformer leaks temporary tables Key: SPARK-17986 URL: https://issues.apache.org/jira/browse/SPARK-17986 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-17620) hive.default.fileformat=orc does not set OrcSerde

2016-10-17 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-17620. - Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 15495

[jira] [Updated] (SPARK-17970) store partition spec in metastore for data source table

2016-10-17 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-17970: Issue Type: Sub-task (was: New Feature) Parent: SPARK-17861 > store partition spec in

[jira] [Commented] (SPARK-17862) Feature flag SPARK-16980

2016-10-17 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15584374#comment-15584374 ] Reynold Xin commented on SPARK-17862: - cc [~ekhliang] this was done right? Can you put the flag here?

[jira] [Resolved] (SPARK-17974) Refactor FileCatalog classes to simplify the inheritance tree

2016-10-17 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-17974. - Resolution: Fixed Assignee: Eric Liang Fix Version/s: 2.1.0 > Refactor

[jira] [Updated] (SPARK-17974) Refactor FileCatalog classes to simplify the inheritance tree

2016-10-17 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-17974: Issue Type: Sub-task (was: Improvement) Parent: SPARK-17861 > Refactor FileCatalog

[jira] [Reopened] (SPARK-17974) Refactor FileCatalog classes to simplify the inheritance tree

2016-10-17 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin reopened SPARK-17974: - Reopening since the previous commit was not tested by Jenkins (failed Scala linter). > Refactor

[jira] [Assigned] (SPARK-17974) Refactor FileCatalog classes to simplify the inheritance tree

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17974: Assignee: Apache Spark (was: Eric Liang) > Refactor FileCatalog classes to simplify the

[jira] [Assigned] (SPARK-17974) Refactor FileCatalog classes to simplify the inheritance tree

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17974: Assignee: Eric Liang (was: Apache Spark) > Refactor FileCatalog classes to simplify the

[jira] [Closed] (SPARK-17956) ProjectExec has incorrect outputOrdering property

2016-10-17 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh closed SPARK-17956. --- Resolution: Won't Fix > ProjectExec has incorrect outputOrdering property >

[jira] [Updated] (SPARK-17986) SQLTransformer leaks temporary tables

2016-10-17 Thread Drew Robb (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Robb updated SPARK-17986: -- Description: The SQLTransformer creates a temporary table when called, and does not delete this

[jira] [Commented] (SPARK-17986) SQLTransformer leaks temporary tables

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15584466#comment-15584466 ] Apache Spark commented on SPARK-17986: -- User 'drewrobb' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17986) SQLTransformer leaks temporary tables

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17986: Assignee: Apache Spark > SQLTransformer leaks temporary tables >

[jira] [Assigned] (SPARK-17986) SQLTransformer leaks temporary tables

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17986: Assignee: (was: Apache Spark) > SQLTransformer leaks temporary tables >

[jira] [Commented] (SPARK-17813) Maximum data per trigger

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15584515#comment-15584515 ] Apache Spark commented on SPARK-17813: -- User 'koeninger' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17813) Maximum data per trigger

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17813: Assignee: Apache Spark > Maximum data per trigger > > >

[jira] [Assigned] (SPARK-17813) Maximum data per trigger

2016-10-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17813: Assignee: (was: Apache Spark) > Maximum data per trigger > >

[jira] [Updated] (SPARK-17973) is there any way to split Dataset into 2 or more based on the given condition

2016-10-17 Thread sriram kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sriram kumar updated SPARK-17973: - Description: I am not able to split a Dataset exactly by a condition. I have a scenario where I

[jira] [Commented] (SPARK-17368) Scala value classes create encoder problems and break at runtime

2016-10-17 Thread Jakob Odersky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15582882#comment-15582882 ] Jakob Odersky commented on SPARK-17368: --- [~arisofala...@gmail.com] Let me explain the fix to what I
