[jira] [Resolved] (SPARK-21374) Reading globbed paths from S3 into DF doesn't work if filesystem caching is disabled

2017-08-04 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-21374. - Resolution: Fixed Fix Version/s: 2.3.0 > Reading globbed paths from S3 into DF doesn't work if

[jira] [Commented] (SPARK-21631) Building Spark with SBT unsuccessful when source code in Mllib is modified, But with MVN is ok

2017-08-04 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16115186#comment-16115186 ] Liang-Chi Hsieh commented on SPARK-21631: - I've tried. {{NOLINT_ON_COMPILE=1 build/sbt "testOnly

[jira] [Created] (SPARK-21644) LocalLimit.maxRows is defined incorrectly

2017-08-04 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-21644: --- Summary: LocalLimit.maxRows is defined incorrectly Key: SPARK-21644 URL: https://issues.apache.org/jira/browse/SPARK-21644 Project: Spark Issue Type: Bug

[jira] [Comment Edited] (SPARK-21067) Thrift Server - CTAS fail with Unable to move source

2017-08-04 Thread xinzhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16108303#comment-16108303 ] xinzhang edited comment on SPARK-21067 at 8/5/17 1:13 AM: -- Hi srowen, could you

[jira] [Issue Comment Deleted] (SPARK-21067) Thrift Server - CTAS fail with Unable to move source

2017-08-04 Thread xinzhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xinzhang updated SPARK-21067: - Comment: was deleted (was: Hi Reynold Xin, I am looking forward to your reply [~rxin]) > Thrift Server -

[jira] [Commented] (SPARK-19116) LogicalPlan.statistics.sizeInBytes wrong for trivial parquet file

2017-08-04 Thread Shea Parkes (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16115176#comment-16115176 ] Shea Parkes commented on SPARK-19116: - Apologies for not responding earlier. I'm struggling to

[jira] [Commented] (SPARK-18838) High latency of event processing for large jobs

2017-08-04 Thread Jason Dunkelberger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16115147#comment-16115147 ] Jason Dunkelberger commented on SPARK-18838: I saw a couple of PRs to the same effect, but

[jira] [Commented] (SPARK-18838) High latency of event processing for large jobs

2017-08-04 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16115139#comment-16115139 ] Marcelo Vanzin commented on SPARK-18838: I think the main issue with a blocking bus is that you'd

[jira] [Commented] (SPARK-18838) High latency of event processing for large jobs

2017-08-04 Thread Jason Dunkelberger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16115129#comment-16115129 ] Jason Dunkelberger commented on SPARK-18838: I cut a PR here which just makes this queue

[jira] [Assigned] (SPARK-21374) Reading globbed paths from S3 into DF doesn't work if filesystem caching is disabled

2017-08-04 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reassigned SPARK-21374: --- Assignee: Andrey Taptunov > Reading globbed paths from S3 into DF doesn't work if filesystem

[jira] [Commented] (SPARK-8288) ScalaReflection should also try apply methods defined in companion objects when inferring schema from a Product type

2017-08-04 Thread Drew Robb (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16115036#comment-16115036 ] Drew Robb commented on SPARK-8288: -- An additional fix beyond my PR would be needed to handle reading this

[jira] [Commented] (SPARK-21643) LR dataset worked in Spark 1.6.3, 2.0.2 stopped working in 2.1.0 onward

2017-08-04 Thread Thomas Kwan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16115002#comment-16115002 ] Thomas Kwan commented on SPARK-21643: - For 1.6, I used the following test code in spark-shell

[jira] [Created] (SPARK-21643) LR dataset worked in Spark 1.6.3, 2.0.2 stopped working in 2.1.0 onward

2017-08-04 Thread Thomas Kwan (JIRA)
Thomas Kwan created SPARK-21643: --- Summary: LR dataset worked in Spark 1.6.3, 2.0.2 stopped working in 2.1.0 onward Key: SPARK-21643 URL: https://issues.apache.org/jira/browse/SPARK-21643 Project: Spark

[jira] [Commented] (SPARK-21631) Building Spark with SBT unsuccessful when source code in Mllib is modified, But with MVN is ok

2017-08-04 Thread Sean Wong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114895#comment-16114895 ] Sean Wong commented on SPARK-21631: --- Set a system environment variable NOLINT_ON_COMPILE is

[jira] [Commented] (SPARK-21631) Building Spark with SBT unsuccessful when source code in Mllib is modified, But with MVN is ok

2017-08-04 Thread Sean Wong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114893#comment-16114893 ] Sean Wong commented on SPARK-21631: --- I used to modify the source code like "import..." in Spark 1.6.1

[jira] [Commented] (SPARK-18683) REST APIs for standalone Master、Workers and Applications

2017-08-04 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114806#comment-16114806 ] Bryan Cutler commented on SPARK-18683: -- Sorry for the late response. From what I remember about

[jira] [Comment Edited] (SPARK-21631) Building Spark with SBT unsuccessful when source code in Mllib is modified, But with MVN is ok

2017-08-04 Thread Sean Wong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114693#comment-16114693 ] Sean Wong edited comment on SPARK-21631 at 8/4/17 6:14 PM: --- I added

[jira] [Commented] (SPARK-19116) LogicalPlan.statistics.sizeInBytes wrong for trivial parquet file

2017-08-04 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114751#comment-16114751 ] Andrew Ash commented on SPARK-19116: [~shea.parkes] does this answer your question? >

[jira] [Commented] (SPARK-15799) Release SparkR on CRAN

2017-08-04 Thread Brendan Dwyer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114733#comment-16114733 ] Brendan Dwyer commented on SPARK-15799: --- [~felixcheung] awesome! Looking forward to this. >

[jira] [Commented] (SPARK-21595) introduction of spark.sql.windowExec.buffer.spill.threshold in spark 2.2 breaks existing workflow

2017-08-04 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114694#comment-16114694 ] Tejas Patil commented on SPARK-21595: - [~sreiling] Spilling will happen only when _both_ these are

[jira] [Commented] (SPARK-21631) Building Spark with SBT unsuccessful when source code in Mllib is modified, But with MVN is ok

2017-08-04 Thread Sean Wong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114693#comment-16114693 ] Sean Wong commented on SPARK-21631: --- I added NOLINT_ON_COMPILE in ~/.bashrc: export

[jira] [Commented] (SPARK-21634) Change OneRowRelation from a case object to case class

2017-08-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114689#comment-16114689 ] Reynold Xin commented on SPARK-21634: - Done in https://github.com/apache/spark/pull/18839 > Change

[jira] [Resolved] (SPARK-21634) Change OneRowRelation from a case object to case class

2017-08-04 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-21634. - Resolution: Fixed Fix Version/s: 2.3.0 > Change OneRowRelation from a case object to case class >

[jira] [Commented] (SPARK-20589) Allow limiting task concurrency per stage

2017-08-04 Thread Dhruve Ashar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114628#comment-16114628 ] Dhruve Ashar commented on SPARK-20589: -- I have a patch for this and would like to have some

[jira] [Commented] (SPARK-20589) Allow limiting task concurrency per stage

2017-08-04 Thread Dhruve Ashar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114626#comment-16114626 ] Dhruve Ashar commented on SPARK-20589: -- Spark defines stages based on the shuffle dependencies and

[jira] [Commented] (SPARK-21631) Building Spark with SBT unsuccessful when source code in Mllib is modified, But with MVN is ok

2017-08-04 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114532#comment-16114532 ] Liang-Chi Hsieh commented on SPARK-21631: - I haven't tried it, but from the build code, it seems you

[jira] [Updated] (SPARK-21642) Use FQDN for DRIVER_HOST_ADDRESS instead of ip address

2017-08-04 Thread Aki Tanaka (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aki Tanaka updated SPARK-21642: --- Description: In the current implementation, the IP address of the driver host is set to DRIVER_HOST_ADDRESS

[jira] [Commented] (SPARK-21631) Building Spark with SBT unsuccessful when source code in Mllib is modified, But with MVN is ok

2017-08-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114525#comment-16114525 ] Sean Owen commented on SPARK-21631: --- I don't know that the Maven build itself runs Scala style checks,

[jira] [Commented] (SPARK-21631) Building Spark with SBT unsuccessful when source code in Mllib is modified, But with MVN is ok

2017-08-04 Thread Sean Wong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114518#comment-16114518 ] Sean Wong commented on SPARK-21631: --- You are right. It is a style thing. Because I added the import at

[jira] [Updated] (SPARK-21642) Use FQDN for DRIVER_HOST_ADDRESS instead of ip address

2017-08-04 Thread Aki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aki updated SPARK-21642: Description: In the current implementation, the IP address of the driver host is set to DRIVER_HOST_ADDRESS [1]. This

[jira] [Updated] (SPARK-21642) Use FQDN for DRIVER_HOST_ADDRESS instead of ip address

2017-08-04 Thread Aki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aki updated SPARK-21642: Description: In the current implementation, the IP address of the driver host is set to DRIVER_HOST_ADDRESS [1]. This

[jira] [Created] (SPARK-21642) Use FQDN for DRIVER_HOST_ADDRESS instead of ip address

2017-08-04 Thread Aki (JIRA)
Aki created SPARK-21642: --- Summary: Use FQDN for DRIVER_HOST_ADDRESS instead of ip address Key: SPARK-21642 URL: https://issues.apache.org/jira/browse/SPARK-21642 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-20853) spark.ui.reverseProxy=true leads to hanging communication to master

2017-08-04 Thread Trevor McKay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114436#comment-16114436 ] Trevor McKay commented on SPARK-20853: -- @Josh Bacon, Hi Josh, we tracked this down independently

[jira] [Commented] (SPARK-21618) http(s) not accepted in spark-submit jar uri

2017-08-04 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114379#comment-16114379 ] Steve Loughran commented on SPARK-21618: yes, and that 2.9+ feature breaks things, because when

[jira] [Resolved] (SPARK-21620) Add metrics url in spark web ui.

2017-08-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-21620. --- Resolution: Won't Fix > Add metrics url in spark web ui. > > >

[jira] [Comment Edited] (SPARK-21640) Method mode with String parameters within DataFrameWriter is error prone

2017-08-04 Thread Alberto (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114317#comment-16114317 ] Alberto edited comment on SPARK-21640 at 8/4/17 12:49 PM: -- Yes, that's it

[jira] [Commented] (SPARK-21640) Method mode with String parameters within DataFrameWriter is error prone

2017-08-04 Thread Alberto (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114317#comment-16114317 ] Alberto commented on SPARK-21640: - Yes, that's it, [~srowen]. I don't like to use strings to do such things

[jira] [Comment Edited] (SPARK-21640) Method mode with String parameters within DataFrameWriter is error prone

2017-08-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114311#comment-16114311 ] Sean Owen edited comment on SPARK-21640 at 8/4/17 12:43 PM: -Why would you

[jira] [Updated] (SPARK-21640) Method mode with String parameters within DataFrameWriter is error prone

2017-08-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-21640: -- Priority: Trivial (was: Major) > Method mode with String parameters within DataFrameWriter is error

[jira] [Updated] (SPARK-21641) Combining windowing (groupBy) and mapGroupsWithState (groupByKey) in Spark Structured Streaming

2017-08-04 Thread Tudor Miu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tudor Miu updated SPARK-21641: -- Description: Given a stream of timestamped data with watermarking, there seems to be no way to

[jira] [Created] (SPARK-21641) Combining windowing (groupBy) and mapGroupsWithState (groupByKey) in Spark Structured Streaming

2017-08-04 Thread Tudor Miu (JIRA)
Tudor Miu created SPARK-21641: - Summary: Combining windowing (groupBy) and mapGroupsWithState (groupByKey) in Spark Structured Streaming Key: SPARK-21641 URL: https://issues.apache.org/jira/browse/SPARK-21641

[jira] [Commented] (SPARK-21640) Method mode with String parameters within DataFrameWriter is error prone

2017-08-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114311#comment-16114311 ] Sean Owen commented on SPARK-21640: --- Why would you lower-case it? You're saying that doesn't work, and

[jira] [Created] (SPARK-21640) Method mode with String parameters within DataFrameWriter is error prone

2017-08-04 Thread Alberto (JIRA)
Alberto created SPARK-21640: --- Summary: Method mode with String parameters within DataFrameWriter is error prone Key: SPARK-21640 URL: https://issues.apache.org/jira/browse/SPARK-21640 Project: Spark

[jira] [Commented] (SPARK-21618) http(s) not accepted in spark-submit jar uri

2017-08-04 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114291#comment-16114291 ] Saisai Shao commented on SPARK-21618: - Hi Steve, I'm not quite following your comments. You mean that

[jira] [Comment Edited] (SPARK-21595) introduction of spark.sql.windowExec.buffer.spill.threshold in spark 2.2 breaks existing workflow

2017-08-04 Thread Stephan Reiling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114255#comment-16114255 ] Stephan Reiling edited comment on SPARK-21595 at 8/4/17 11:19 AM: -- I

[jira] [Commented] (SPARK-21595) introduction of spark.sql.windowExec.buffer.spill.threshold in spark 2.2 breaks existing workflow

2017-08-04 Thread Stephan Reiling (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114255#comment-16114255 ] Stephan Reiling commented on SPARK-21595: - I have tried out a couple of settings for

[jira] [Commented] (SPARK-21618) http(s) not accepted in spark-submit jar uri

2017-08-04 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114227#comment-16114227 ] Steve Loughran commented on SPARK-21618: Thinking about this some more: what's happening in

[jira] [Resolved] (SPARK-21205) pmod(number, 0) should be null

2017-08-04 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-21205. --- Resolution: Fixed Assignee: Yuming Wang Fix Version/s: 2.3.0 >

[jira] [Commented] (SPARK-21402) Java encoders - switch fields on collectAsList

2017-08-04 Thread Tom (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114179#comment-16114179 ] Tom commented on SPARK-21402: - Additional comment when there are multiple datatypes which are not easily

[jira] [Resolved] (SPARK-21639) Getting an error while installing spark on windows

2017-08-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-21639. --- Resolution: Invalid JIRA isn't really a place for help questions. It sounds like you have some

[jira] [Created] (SPARK-21639) Getting an error while installing spark on windows

2017-08-04 Thread sandhya harane (JIRA)
sandhya harane created SPARK-21639: -- Summary: Getting an error while installing spark on windows Key: SPARK-21639 URL: https://issues.apache.org/jira/browse/SPARK-21639 Project: Spark Issue

[jira] [Commented] (SPARK-21639) Getting an error while installing spark on windows

2017-08-04 Thread sandhya harane (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114176#comment-16114176 ] sandhya harane commented on SPARK-21639: Can anyone please help here > Getting an error while

[jira] [Resolved] (SPARK-21632) There is no need to make attempts for createDirectory if the dir had existed

2017-08-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-21632. --- Resolution: Not A Problem > There is no need to make attempts for createDirectory if the dir had

[jira] [Updated] (SPARK-21636) Several configurations which only are used in unit tests should be removed

2017-08-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-21636: -- Priority: Minor (was: Critical) > Several configurations which only are used in unit tests should be

[jira] [Resolved] (SPARK-21636) Several configurations which only are used in unit tests should be removed

2017-08-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-21636. --- Resolution: Not A Problem > Several configurations which only are used in unit tests should be

[jira] [Closed] (SPARK-21627) analyze hive table compute stats for columns with mixed case exception

2017-08-04 Thread Bogdan Raducanu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21627?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bogdan Raducanu closed SPARK-21627. --- Resolution: Duplicate > analyze hive table compute stats for columns with mixed case

[jira] [Commented] (SPARK-21627) analyze hive table compute stats for columns with mixed case exception

2017-08-04 Thread Bogdan Raducanu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21627?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114163#comment-16114163 ] Bogdan Raducanu commented on SPARK-21627: - You're right, it's fixed. > analyze hive table

[jira] [Commented] (SPARK-21638) Warning message of RF is not accurate

2017-08-04 Thread Peng Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114156#comment-16114156 ] Peng Meng commented on SPARK-21638: --- The first data should - nodeMemUsage > Warning message of RF is

[jira] [Commented] (SPARK-21638) Warning message of RF is not accurate

2017-08-04 Thread Peng Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114153#comment-16114153 ] Peng Meng commented on SPARK-21638: --- In the example warning message, the split node should be 2621; >

[jira] [Resolved] (SPARK-21570) File __spark_libs__XXX.zip does not exist on networked file system w/ yarn

2017-08-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-21570. --- Resolution: Not A Problem > File __spark_libs__XXX.zip does not exist on networked file system w/

[jira] [Resolved] (SPARK-21586) Read CSV (SQL Context) Doesnt ignore delimiters within special types of quotes, other special characters

2017-08-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-21586. --- Resolution: Invalid > Read CSV (SQL Context) Doesnt ignore delimiters within special types of >

[jira] [Resolved] (SPARK-21547) Spark cleaner cost too many time

2017-08-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-21547. --- Resolution: Duplicate > Spark cleaner cost too many time > > >

[jira] [Commented] (SPARK-21638) Warning message of RF is not accurate

2017-08-04 Thread Peng Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114133#comment-16114133 ] Peng Meng commented on SPARK-21638: --- I will be back home now, will answer your question next week.

[jira] [Commented] (SPARK-21638) Warning message of RF is not accurate

2017-08-04 Thread Peng Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114132#comment-16114132 ] Peng Meng commented on SPARK-21638: --- This is because "we not add the node to mutableNodesForGroup, but

[jira] [Updated] (SPARK-21638) Warning message of RF is not accurate

2017-08-04 Thread Peng Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peng Meng updated SPARK-21638: -- Description: When training an RF model, there are many warning messages like this: {quote}WARN RandomForest:

[jira] [Commented] (SPARK-21638) Warning message of RF is not accurate

2017-08-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114126#comment-16114126 ] Sean Owen commented on SPARK-21638: --- What's inaccurate and why isn't it necessary? Please start with

[jira] [Updated] (SPARK-21638) Warning message of RF is not accurate

2017-08-04 Thread Peng Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peng Meng updated SPARK-21638: -- Description: When training an RF model, there are many warning messages like this: {quote}WARN RandomForest:

[jira] [Created] (SPARK-21638) Warning message of RF is not accurate

2017-08-04 Thread Peng Meng (JIRA)
Peng Meng created SPARK-21638: - Summary: Warning message of RF is not accurate Key: SPARK-21638 URL: https://issues.apache.org/jira/browse/SPARK-21638 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-21633) Unary Transformer in Python

2017-08-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-21633. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 18746

[jira] [Assigned] (SPARK-21633) Unary Transformer in Python

2017-08-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-21633: - Assignee: Ajay Saini > Unary Transformer in Python >

[jira] [Updated] (SPARK-21633) Unary Transformer in Python

2017-08-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21633: -- Shepherd: Joseph K. Bradley > Unary Transformer in Python >

[jira] [Assigned] (SPARK-21330) Bad partitioning does not allow to read a JDBC table with extreme values on the partition column

2017-08-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-21330: - Assignee: Andrew Ray > Bad partitioning does not allow to read a JDBC table with extreme values

[jira] [Resolved] (SPARK-21330) Bad partitioning does not allow to read a JDBC table with extreme values on the partition column

2017-08-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-21330. --- Resolution: Fixed Fix Version/s: 2.1.2 2.3.0 2.2.1

[jira] [Commented] (SPARK-21631) Building Spark with SBT unsuccessful when source code in Mllib is modified, But with MVN is ok

2017-08-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114082#comment-16114082 ] Sean Owen commented on SPARK-21631: --- Oh I see it is a style thing. OK, well, it's probably because the

[jira] [Created] (SPARK-21637) `hive.metastore.warehouse` in --hiveconf is not respected

2017-08-04 Thread Kent Yao (JIRA)
Kent Yao created SPARK-21637: Summary: `hive.metastore.warehouse` in --hiveconf is not respected Key: SPARK-21637 URL: https://issues.apache.org/jira/browse/SPARK-21637 Project: Spark Issue

[jira] [Commented] (SPARK-21630) Pmod should not throw a divide by zero exception

2017-08-04 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114078#comment-16114078 ] Herman van Hovell commented on SPARK-21630: --- Yeah it is. Thanks for pointing this out. > Pmod

[jira] [Closed] (SPARK-21630) Pmod should not throw a divide by zero exception

2017-08-04 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell closed SPARK-21630. - Resolution: Duplicate > Pmod should not throw a divide by zero exception >

[jira] [Commented] (SPARK-21453) Cached Kafka consumer may be closed too early

2017-08-04 Thread Pablo Panero (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114071#comment-16114071 ] Pablo Panero commented on SPARK-21453: -- Yes! The job has been launched in debug mode. I'll send the

[jira] [Commented] (SPARK-21629) OR nullability is incorrect

2017-08-04 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114056#comment-16114056 ] Takeshi Yamamuro commented on SPARK-21629: -- oh, yea. I see. > OR nullability is incorrect >

[jira] [Comment Edited] (SPARK-21631) Building Spark with SBT unsuccessful when source code in Mllib is modified, But with MVN is ok

2017-08-04 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114044#comment-16114044 ] Liang-Chi Hsieh edited comment on SPARK-21631 at 8/4/17 7:27 AM: - I can

[jira] [Comment Edited] (SPARK-21631) Building Spark with SBT unsuccessful when source code in Mllib is modified, But with MVN is ok

2017-08-04 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114044#comment-16114044 ] Liang-Chi Hsieh edited comment on SPARK-21631 at 8/4/17 7:27 AM: - I can

[jira] [Commented] (SPARK-21631) Building Spark with SBT unsuccessful when source code in Mllib is modified, But with MVN is ok

2017-08-04 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114044#comment-16114044 ] Liang-Chi Hsieh commented on SPARK-21631: - I can reproduce the same error by violating Scala style

[jira] [Commented] (SPARK-21629) OR nullability is incorrect

2017-08-04 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114041#comment-16114041 ] Herman van Hovell commented on SPARK-21629: --- Thanks for doing this. The first sequence should

[jira] [Assigned] (SPARK-21254) History UI: Taking over 1 minute for initial page display

2017-08-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-21254: - Assignee: Dmitry Parfenchik > History UI: Taking over 1 minute for initial page display >

[jira] [Resolved] (SPARK-21254) History UI: Taking over 1 minute for initial page display

2017-08-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-21254. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 18783

[jira] [Resolved] (SPARK-21631) Building Spark with SBT unsuccessful when source code in Mllib is modified, But with MVN is ok

2017-08-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-21631. --- Resolution: Not A Problem Target Version/s: (was: 2.2.0) I see no evidence it's a style

[jira] [Commented] (SPARK-21453) Cached Kafka consumer may be closed too early

2017-08-04 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114036#comment-16114036 ] Shixiong Zhu commented on SPARK-21453: -- I meant the exception in the JIRA description which looks

[jira] [Commented] (SPARK-21626) The short-circuit local reads feature cannot be used because libhadoop cannot be loaded.

2017-08-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114035#comment-16114035 ] Sean Owen commented on SPARK-21626: --- This has nothing to do with Spark itself. > The short-circuit

[jira] [Commented] (SPARK-21453) Cached Kafka consumer may be closed too early

2017-08-04 Thread Pablo Panero (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114033#comment-16114033 ] Pablo Panero commented on SPARK-21453: -- [~zsxwing] Yes, I definitely can. No worries, I just did

[jira] [Comment Edited] (SPARK-21453) Cached Kafka consumer may be closed too early

2017-08-04 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114030#comment-16114030 ] Shixiong Zhu edited comment on SPARK-21453 at 8/4/17 7:02 AM: -- Could you

[jira] [Commented] (SPARK-21453) Cached Kafka consumer may be closed too early

2017-08-04 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114030#comment-16114030 ] Shixiong Zhu commented on SPARK-21453: -- Could you increase "spark.kafka.producer.cache.timeout" to

[jira] [Commented] (SPARK-21453) Cached Kafka consumer may be closed too early

2017-08-04 Thread Pablo Panero (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114025#comment-16114025 ] Pablo Panero commented on SPARK-21453: -- [~zsxwing] Yes, I understand. However due to the verbosity

[jira] [Commented] (SPARK-21453) Cached Kafka consumer may be closed too early

2017-08-04 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114020#comment-16114020 ] Shixiong Zhu commented on SPARK-21453: -- [~ppanero] Could you provide all logs? I need the logs

[jira] [Commented] (SPARK-21453) Cached Kafka consumer may be closed too early

2017-08-04 Thread Pablo Panero (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16114015#comment-16114015 ] Pablo Panero commented on SPARK-21453: -- [~zsxwing] {code} 17/07/26 13:28:32 ERROR Executor:

[jira] [Updated] (SPARK-21636) Several configurations which only are used in unit tests should be removed

2017-08-04 Thread liuzhaokun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] liuzhaokun updated SPARK-21636: --- Summary: Several configurations which only are used in unit tests should be removed (was: Several

[jira] [Created] (SPARK-21636) Several configuration which only are used in unit tests should be removed

2017-08-04 Thread liuzhaokun (JIRA)
liuzhaokun created SPARK-21636: -- Summary: Several configuration which only are used in unit tests should be removed Key: SPARK-21636 URL: https://issues.apache.org/jira/browse/SPARK-21636 Project:

[jira] [Commented] (SPARK-21631) Building Spark with SBT unsuccessful when source code in Mllib is modified, But with MVN is ok

2017-08-04 Thread Sean Wong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16113992#comment-16113992 ] Sean Wong commented on SPARK-21631: --- not compliant with Spark code style? But building with maven is ok