[jira] [Commented] (SPARK-21479) Outer join filter pushdown in null supplying table when condition is on one of the joined columns

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440475#comment-16440475 ] Apache Spark commented on SPARK-21479: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Commented] (SPARK-23564) the optimized logical plan about Left anti join should be further optimization

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440476#comment-16440476 ] Apache Spark commented on SPARK-23564: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2018-04-17 Thread zuotingbing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440535#comment-16440535 ] zuotingbing edited comment on SPARK-15544 at 4/17/18 7:47 AM: -- cc [~vanzin] 

[jira] [Comment Edited] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2018-04-17 Thread zuotingbing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440535#comment-16440535 ] zuotingbing edited comment on SPARK-15544 at 4/17/18 7:49 AM: -- cc [~vanzin]  

[jira] [Comment Edited] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2018-04-17 Thread zuotingbing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440535#comment-16440535 ] zuotingbing edited comment on SPARK-15544 at 4/17/18 7:49 AM: -- cc [~vanzin]  

[jira] [Comment Edited] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2018-04-17 Thread zuotingbing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440535#comment-16440535 ] zuotingbing edited comment on SPARK-15544 at 4/17/18 7:48 AM: -- cc [~vanzin] 

[jira] [Comment Edited] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2018-04-17 Thread zuotingbing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440535#comment-16440535 ] zuotingbing edited comment on SPARK-15544 at 4/17/18 7:48 AM: -- cc [~vanzin] 

[jira] [Assigned] (SPARK-23918) High-order function: array_min(x) → x

2018-04-17 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin reassigned SPARK-23918: - Assignee: Marco Gaido > High-order function: array_min(x) → x >

[jira] [Comment Edited] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2018-04-17 Thread zuotingbing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440535#comment-16440535 ] zuotingbing edited comment on SPARK-15544 at 4/17/18 7:45 AM: -- cc [~vanzin] 

[jira] [Comment Edited] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2018-04-17 Thread zuotingbing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440535#comment-16440535 ] zuotingbing edited comment on SPARK-15544 at 4/17/18 8:01 AM: -- cc [~vanzin]  

[jira] [Assigned] (SPARK-23998) It may be better to add @transient to field 'taskMemoryManager' in class Task, for it is only be set and used in executor side

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23998: Assignee: Apache Spark > It may be better to add @transient to field 'taskMemoryManager'

[jira] [Commented] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2018-04-17 Thread zuotingbing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440511#comment-16440511 ] zuotingbing commented on SPARK-15544: - The same issue still occurs in spark 2.3.0.   see 

[jira] [Comment Edited] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2018-04-17 Thread zuotingbing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440535#comment-16440535 ] zuotingbing edited comment on SPARK-15544 at 4/17/18 7:51 AM: -- cc [~vanzin]  

[jira] [Comment Edited] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2018-04-17 Thread zuotingbing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440535#comment-16440535 ] zuotingbing edited comment on SPARK-15544 at 4/17/18 8:02 AM: -- cc [~vanzin]  

[jira] [Comment Edited] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2018-04-17 Thread zuotingbing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440535#comment-16440535 ] zuotingbing edited comment on SPARK-15544 at 4/17/18 7:43 AM: -- cc [~vanzin] 

[jira] [Comment Edited] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2018-04-17 Thread zuotingbing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440535#comment-16440535 ] zuotingbing edited comment on SPARK-15544 at 4/17/18 7:44 AM: -- cc [~vanzin] 

[jira] [Comment Edited] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2018-04-17 Thread zuotingbing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440535#comment-16440535 ] zuotingbing edited comment on SPARK-15544 at 4/17/18 7:43 AM: -- cc [~vanzin] 

[jira] [Commented] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2018-04-17 Thread zuotingbing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440535#comment-16440535 ] zuotingbing commented on SPARK-15544: - cc [~vanzin] > Bouncing Zookeeper node causes Active spark

[jira] [Comment Edited] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2018-04-17 Thread zuotingbing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440535#comment-16440535 ] zuotingbing edited comment on SPARK-15544 at 4/17/18 7:53 AM: -- cc [~vanzin]  

[jira] [Resolved] (SPARK-23918) High-order function: array_min(x) → x

2018-04-17 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin resolved SPARK-23918. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21025

[jira] [Commented] (SPARK-23998) It may be better to add @transient to field 'taskMemoryManager' in class Task, for it is only be set and used in executor side

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440598#comment-16440598 ] Apache Spark commented on SPARK-23998: -- User 'eatoncys' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23998) It may be better to add @transient to field 'taskMemoryManager' in class Task, for it is only be set and used in executor side

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23998: Assignee: (was: Apache Spark) > It may be better to add @transient to field

[jira] [Created] (SPARK-23998) It may be better to add @transient to field 'taskMemoryManager' in class Task, for it is only be set and used in executor side

2018-04-17 Thread eaton (JIRA)
eaton created SPARK-23998: - Summary: It may be better to add @transient to field 'taskMemoryManager' in class Task, for it is only be set and used in executor side Key: SPARK-23998 URL:

[jira] [Assigned] (SPARK-23687) Add MemoryStream

2018-04-17 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das reassigned SPARK-23687: - Assignee: Jose Torres > Add MemoryStream > > > Key:

[jira] [Resolved] (SPARK-23687) Add MemoryStream

2018-04-17 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-23687. --- Resolution: Done Fix Version/s: 2.4.0 > Add MemoryStream > > >

[jira] [Commented] (SPARK-22371) dag-scheduler-event-loop thread stopped with error Attempted to access garbage collected accumulator 5605982

2018-04-17 Thread Andrew Clegg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440683#comment-16440683 ] Andrew Clegg commented on SPARK-22371: -- Another data point -- I've seen this happen (in 2.3.0)

[jira] [Comment Edited] (SPARK-22371) dag-scheduler-event-loop thread stopped with error Attempted to access garbage collected accumulator 5605982

2018-04-17 Thread Andrew Clegg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440683#comment-16440683 ] Andrew Clegg edited comment on SPARK-22371 at 4/17/18 9:53 AM: --- Another

[jira] [Created] (SPARK-24000) S3A: Create Table should fail on invalid AK/SK

2018-04-17 Thread Brahma Reddy Battula (JIRA)
Brahma Reddy Battula created SPARK-24000: Summary: S3A: Create Table should fail on invalid AK/SK Key: SPARK-24000 URL: https://issues.apache.org/jira/browse/SPARK-24000 Project: Spark

[jira] [Assigned] (SPARK-23747) Add EpochCoordinator unit tests

2018-04-17 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das reassigned SPARK-23747: - Assignee: Jose Torres > Add EpochCoordinator unit tests >

[jira] [Resolved] (SPARK-23747) Add EpochCoordinator unit tests

2018-04-17 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-23747. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 20983

[jira] [Created] (SPARK-23999) Spark SQL shell is a Stable one ? Can we use Spark SQL shell in our production environment?

2018-04-17 Thread Prabhu Bentick (JIRA)
Prabhu Bentick created SPARK-23999: -- Summary: Spark SQL shell is a Stable one ? Can we use Spark SQL shell in our production environment? Key: SPARK-23999 URL: https://issues.apache.org/jira/browse/SPARK-23999

[jira] [Resolved] (SPARK-23835) When Dataset.as converts column from nullable to non-nullable type, null Doubles are converted silently to -1

2018-04-17 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-23835. - Resolution: Fixed Assignee: Marco Gaido Fix Version/s: 2.4.0

[jira] [Assigned] (SPARK-23948) Trigger mapstage's job listener in submitMissingTasks

2018-04-17 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid reassigned SPARK-23948: Assignee: jin xing > Trigger mapstage's job listener in submitMissingTasks >

[jira] [Resolved] (SPARK-23948) Trigger mapstage's job listener in submitMissingTasks

2018-04-17 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid resolved SPARK-23948. -- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21019

[jira] [Updated] (SPARK-24001) Multinode cluster

2018-04-17 Thread Direselign (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Direselign updated SPARK-24001: --- Attachment: Screenshot from 2018-04-17 22-47-39.png > Multinode cluster > -- > >

[jira] [Commented] (SPARK-24000) S3A: Create Table should fail on invalid AK/SK

2018-04-17 Thread Brahma Reddy Battula (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440841#comment-16440841 ] Brahma Reddy Battula commented on SPARK-24000: -- Discussed [~ste...@apache.org] with offline,

[jira] [Commented] (SPARK-23948) Trigger mapstage's job listener in submitMissingTasks

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440887#comment-16440887 ] Apache Spark commented on SPARK-23948: -- User 'squito' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-22371) dag-scheduler-event-loop thread stopped with error Attempted to access garbage collected accumulator 5605982

2018-04-17 Thread Ben Doerr (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16402271#comment-16402271 ] Ben Doerr edited comment on SPARK-22371 at 4/17/18 2:04 PM: We've seen this

[jira] [Commented] (SPARK-23963) Queries on text-based Hive tables grow disproportionately slower as the number of columns increase

2018-04-17 Thread Ruslan Dautkhanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440942#comment-16440942 ] Ruslan Dautkhanov commented on SPARK-23963: --- Thanks a lot [~bersprockets]  Would it be

[jira] [Updated] (SPARK-23888) speculative task should not run on a given host where another attempt is already running on

2018-04-17 Thread wuyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wuyi updated SPARK-23888: - Description:   There's a bug in: {code:java} /** Check whether a task is currently running an attempt on a

[jira] [Assigned] (SPARK-22676) Avoid iterating all partition paths when spark.sql.hive.verifyPartitionPath=true

2018-04-17 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-22676: --- Assignee: jin xing > Avoid iterating all partition paths when >

[jira] [Resolved] (SPARK-23875) Create IndexedSeq wrapper for ArrayData

2018-04-17 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-23875. --- Resolution: Fixed Assignee: Liang-Chi Hsieh Fix Version/s: 2.4.0 >

[jira] [Commented] (SPARK-15703) Make ListenerBus event queue size configurable

2018-04-17 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16440855#comment-16440855 ] Thomas Graves commented on SPARK-15703: --- this Jira is purely making the size of the event queue

[jira] [Commented] (SPARK-23206) Additional Memory Tuning Metrics

2018-04-17 Thread Edwina Lu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441006#comment-16441006 ] Edwina Lu commented on SPARK-23206: --- [~assia6], could you please try the new link,

[jira] [Resolved] (SPARK-23999) Spark SQL shell is a Stable one ? Can we use Spark SQL shell in our production environment?

2018-04-17 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-23999. Resolution: Invalid Fix Version/s: (was: 2.3.0) (was:

[jira] [Assigned] (SPARK-23986) CompileException when using too many avg aggregation after joining

2018-04-17 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-23986: --- Assignee: Marco Gaido > CompileException when using too many avg aggregation after joining

[jira] [Resolved] (SPARK-23986) CompileException when using too many avg aggregation after joining

2018-04-17 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-23986. - Resolution: Fixed Fix Version/s: 2.3.1 2.4.0 Issue resolved by pull

[jira] [Updated] (SPARK-24002) Task not serializable caused by org.apache.parquet.io.api.Binary$ByteBufferBackedBinary.getBytes

2018-04-17 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-24002: Description: {code} java.lang.IllegalArgumentException at

[jira] [Created] (SPARK-24002) Task not serializable caused by org.apache.parquet.io.api.Binary$ByteBufferBackedBinary.getBytes

2018-04-17 Thread Xiao Li (JIRA)
Xiao Li created SPARK-24002: --- Summary: Task not serializable caused by org.apache.parquet.io.api.Binary$ByteBufferBackedBinary.getBytes Key: SPARK-24002 URL: https://issues.apache.org/jira/browse/SPARK-24002

[jira] [Updated] (SPARK-24002) Task not serializable caused by org.apache.parquet.io.api.Binary$ByteBufferBackedBinary.getBytes

2018-04-17 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-24002: Description: Having two queries one is a 1000-line SQL query and a 3000-line SQL query. Need to run at

[jira] [Assigned] (SPARK-24002) Task not serializable caused by org.apache.parquet.io.api.Binary$ByteBufferBackedBinary.getBytes

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24002: Assignee: Xiao Li (was: Apache Spark) > Task not serializable caused by >

[jira] [Assigned] (SPARK-24002) Task not serializable caused by org.apache.parquet.io.api.Binary$ByteBufferBackedBinary.getBytes

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24002: Assignee: Apache Spark (was: Xiao Li) > Task not serializable caused by >

[jira] [Commented] (SPARK-24002) Task not serializable caused by org.apache.parquet.io.api.Binary$ByteBufferBackedBinary.getBytes

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441071#comment-16441071 ] Apache Spark commented on SPARK-24002: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23997) Configurable max number of buckets

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23997: Assignee: (was: Apache Spark) > Configurable max number of buckets >

[jira] [Commented] (SPARK-23997) Configurable max number of buckets

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441146#comment-16441146 ] Apache Spark commented on SPARK-23997: -- User 'ferdonline' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23997) Configurable max number of buckets

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23997: Assignee: Apache Spark > Configurable max number of buckets >

[jira] [Assigned] (SPARK-24007) EqualNullSafe for FloatType and DoubleType might generate a wrong result by codegen.

2018-04-17 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reassigned SPARK-24007: --- Assignee: Takuya Ueshin > EqualNullSafe for FloatType and DoubleType might generate a wrong result

[jira] [Commented] (SPARK-23340) Upgrade Apache ORC to 1.4.3

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441926#comment-16441926 ] Apache Spark commented on SPARK-23340: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Created] (SPARK-24006) ExecutorAllocationManager.onExecutorAdded is an O(n) operation

2018-04-17 Thread Xianjin YE (JIRA)
Xianjin YE created SPARK-24006: -- Summary: ExecutorAllocationManager.onExecutorAdded is an O(n) operation Key: SPARK-24006 URL: https://issues.apache.org/jira/browse/SPARK-24006 Project: Spark

[jira] [Created] (SPARK-24007) EqualNullSafe for FloatType and DoubleType might generate a wrong result by codegen.

2018-04-17 Thread Takuya Ueshin (JIRA)
Takuya Ueshin created SPARK-24007: - Summary: EqualNullSafe for FloatType and DoubleType might generate a wrong result by codegen. Key: SPARK-24007 URL: https://issues.apache.org/jira/browse/SPARK-24007

[jira] [Updated] (SPARK-24006) ExecutorAllocationManager.onExecutorAdded is an O(n) operation

2018-04-17 Thread Xianjin YE (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianjin YE updated SPARK-24006: --- Description: The ExecutorAllocationManager.onExecutorAdded is an O(n) operations, I believe it will

[jira] [Created] (SPARK-24008) SQL/Hive Context fails with NullPointerException

2018-04-17 Thread Prabhu Joseph (JIRA)
Prabhu Joseph created SPARK-24008: - Summary: SQL/Hive Context fails with NullPointerException Key: SPARK-24008 URL: https://issues.apache.org/jira/browse/SPARK-24008 Project: Spark Issue

[jira] [Updated] (SPARK-24008) SQL/Hive Context fails with NullPointerException

2018-04-17 Thread Prabhu Joseph (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prabhu Joseph updated SPARK-24008: -- Attachment: Repro > SQL/Hive Context fails with NullPointerException >

[jira] [Resolved] (SPARK-24001) Multinode cluster

2018-04-17 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao resolved SPARK-24001. - Resolution: Invalid > Multinode cluster > -- > > Key:

[jira] [Commented] (SPARK-24001) Multinode cluster

2018-04-17 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441918#comment-16441918 ] Saisai Shao commented on SPARK-24001: - Question should go to mail list. > Multinode cluster >

[jira] [Updated] (SPARK-24007) EqualNullSafe for FloatType and DoubleType might generate a wrong result by codegen.

2018-04-17 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-24007: Labels: correctness (was: ) > EqualNullSafe for FloatType and DoubleType might generate a wrong result by

[jira] [Commented] (SPARK-23843) Deploy yarn meets incorrect LOCALIZED_CONF_DIR

2018-04-17 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441930#comment-16441930 ] Saisai Shao commented on SPARK-23843: - I think this issue is due to your "new Hadoop-compatible

[jira] [Commented] (SPARK-23984) PySpark Bindings for K8S

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441921#comment-16441921 ] Apache Spark commented on SPARK-23984: -- User 'ifilonenko' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23984) PySpark Bindings for K8S

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23984: Assignee: (was: Apache Spark) > PySpark Bindings for K8S > >

[jira] [Assigned] (SPARK-23984) PySpark Bindings for K8S

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23984: Assignee: Apache Spark > PySpark Bindings for K8S > > >

[jira] [Commented] (SPARK-23989) When using `SortShuffleWriter`, the data will be overwritten

2018-04-17 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441906#comment-16441906 ] Saisai Shao commented on SPARK-23989: - Please provide a reproducible case. Did you reuse the object

[jira] [Commented] (SPARK-23830) Spark on YARN in cluster deploy mode fail with NullPointerException when a Spark application is a Scala class not object

2018-04-17 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441919#comment-16441919 ] Saisai Shao commented on SPARK-23830: - What is the reason to use {{class}} instead of {{object}},

[jira] [Resolved] (SPARK-22676) Avoid iterating all partition paths when spark.sql.hive.verifyPartitionPath=true

2018-04-17 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-22676. - Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 19868

[jira] [Created] (SPARK-24001) Multinode cluster

2018-04-17 Thread Direselign (JIRA)
Direselign created SPARK-24001: -- Summary: Multinode cluster Key: SPARK-24001 URL: https://issues.apache.org/jira/browse/SPARK-24001 Project: Spark Issue Type: Bug Components: PySpark

[jira] [Created] (SPARK-24004) Tests of from_json for MapType

2018-04-17 Thread Maxim Gekk (JIRA)
Maxim Gekk created SPARK-24004: -- Summary: Tests of from_json for MapType Key: SPARK-24004 URL: https://issues.apache.org/jira/browse/SPARK-24004 Project: Spark Issue Type: Test

[jira] [Commented] (SPARK-24004) Tests of from_json for MapType

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441369#comment-16441369 ] Apache Spark commented on SPARK-24004: -- User 'MaxGekk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-24004) Tests of from_json for MapType

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24004?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24004: Assignee: Apache Spark > Tests of from_json for MapType > --

[jira] [Assigned] (SPARK-24004) Tests of from_json for MapType

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24004?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24004: Assignee: (was: Apache Spark) > Tests of from_json for MapType >

[jira] [Commented] (SPARK-15784) Add Power Iteration Clustering to spark.ml

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441448#comment-16441448 ] Apache Spark commented on SPARK-15784: -- User 'jkbradley' has created a pull request for this issue:

[jira] [Commented] (SPARK-15784) Add Power Iteration Clustering to spark.ml

2018-04-17 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441353#comment-16441353 ] Miao Wang commented on SPARK-15784: --- [~josephkb] You can start the new PR now. :) > Add Power

[jira] [Commented] (SPARK-23963) Queries on text-based Hive tables grow disproportionately slower as the number of columns increase

2018-04-17 Thread Bruce Robbins (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441378#comment-16441378 ] Bruce Robbins commented on SPARK-23963: --- [~Tagar] Yes, although I am a little fuzzy on the process

[jira] [Updated] (SPARK-8799) OneVsRestModel should extend ClassificationModel

2018-04-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-8799: - Target Version/s: 3.0.0 > OneVsRestModel should extend ClassificationModel >

[jira] [Assigned] (SPARK-24003) Add support to provide spark.executor.extraJavaOptions in terms of App Id and/or Executor Id's

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24003: Assignee: (was: Apache Spark) > Add support to provide

[jira] [Commented] (SPARK-24003) Add support to provide spark.executor.extraJavaOptions in terms of App Id and/or Executor Id's

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441328#comment-16441328 ] Apache Spark commented on SPARK-24003: -- User 'devaraj-kavali' has created a pull request for this

[jira] [Assigned] (SPARK-24003) Add support to provide spark.executor.extraJavaOptions in terms of App Id and/or Executor Id's

2018-04-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24003: Assignee: Apache Spark > Add support to provide spark.executor.extraJavaOptions in terms

[jira] [Assigned] (SPARK-21741) Python API for DataFrame-based multivariate summarizer

2018-04-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-21741: - Assignee: Weichen Xu > Python API for DataFrame-based multivariate summarizer >

[jira] [Resolved] (SPARK-21741) Python API for DataFrame-based multivariate summarizer

2018-04-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-21741. --- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 20695

[jira] [Created] (SPARK-24003) Add support to provide spark.executor.extraJavaOptions in terms of App Id and/or Executor Id's

2018-04-17 Thread Devaraj K (JIRA)
Devaraj K created SPARK-24003: - Summary: Add support to provide spark.executor.extraJavaOptions in terms of App Id and/or Executor Id's Key: SPARK-24003 URL: https://issues.apache.org/jira/browse/SPARK-24003

[jira] [Updated] (SPARK-22884) ML test for StructuredStreaming: spark.ml.clustering

2018-04-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-22884: -- Shepherd: Joseph K. Bradley > ML test for StructuredStreaming: spark.ml.clustering >

[jira] [Updated] (SPARK-8799) OneVsRestModel should extend ClassificationModel

2018-04-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-8799: - Shepherd: Joseph K. Bradley > OneVsRestModel should extend ClassificationModel >

[jira] [Commented] (SPARK-8799) OneVsRestModel should extend ClassificationModel

2018-04-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441207#comment-16441207 ] Joseph K. Bradley commented on SPARK-8799: -- The missing functionality was added in [SPARK-9312],

[jira] [Commented] (SPARK-21063) Spark return an empty result from remote hadoop cluster

2018-04-17 Thread Carlos Bribiescas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441257#comment-16441257 ] Carlos Bribiescas commented on SPARK-21063: --- Any update or workarounds for this? > Spark

[jira] [Commented] (SPARK-23933) High-order function: map(array, array) → map<K,V>

2018-04-17 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441268#comment-16441268 ] Kazuaki Ishizaki commented on SPARK-23933: -- ping [~smilegator] > High-order function:

[jira] [Commented] (SPARK-18057) Update structured streaming kafka from 0.10.0.1 to 1.1.0

2018-04-17 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441606#comment-16441606 ] Cody Koeninger commented on SPARK-18057: Out of curiosity, was that a compacted topic? >

[jira] [Updated] (SPARK-23948) Trigger mapstage's job listener in submitMissingTasks

2018-04-17 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-23948: - Fix Version/s: 2.3.1 > Trigger mapstage's job listener in submitMissingTasks >

[jira] [Created] (SPARK-24005) Remove usage of Scala’s parallel collection

2018-04-17 Thread Xiao Li (JIRA)
Xiao Li created SPARK-24005: --- Summary: Remove usage of Scala’s parallel collection Key: SPARK-24005 URL: https://issues.apache.org/jira/browse/SPARK-24005 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-18057) Update structured streaming kafka from 0.10.0.1 to 1.1.0

2018-04-17 Thread Jordan Moore (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441485#comment-16441485 ] Jordan Moore commented on SPARK-18057: -- Hi all, chiming in here to point out a production issue we

[jira] [Assigned] (SPARK-22968) java.lang.IllegalStateException: No current assignment for partition kssh-2

2018-04-17 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cody Koeninger reassigned SPARK-22968: -- Assignee: Saisai Shao > java.lang.IllegalStateException: No current assignment for

[jira] [Resolved] (SPARK-22968) java.lang.IllegalStateException: No current assignment for partition kssh-2

2018-04-17 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cody Koeninger resolved SPARK-22968. Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21038

[jira] [Commented] (SPARK-18057) Update structured streaming kafka from 0.10.0.1 to 1.1.0

2018-04-17 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441793#comment-16441793 ] Cody Koeninger commented on SPARK-18057: Just adding the extra dependency on 0.11 probably won't

  1   2   >