[jira] [Updated] (SPARK-26845) Avro from_avro to_avro roundtrip fails if data type is string

2019-02-07 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi updated SPARK-26845: -- Description: I was playing with AvroFunctionsSuite and created a situation where test fails w

[jira] [Updated] (SPARK-26845) Avro from_avro to_avro roundtrip fails if data type is string

2019-02-07 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi updated SPARK-26845: -- Description: I was playing with AvroFunctionsSuite and creates a situation where test fails w

[jira] [Created] (SPARK-26845) Avro from_avro to_avro roundtrip fails if data type is string

2019-02-07 Thread Gabor Somogyi (JIRA)
Gabor Somogyi created SPARK-26845: - Summary: Avro from_avro to_avro roundtrip fails if data type is string Key: SPARK-26845 URL: https://issues.apache.org/jira/browse/SPARK-26845 Project: Spark

[jira] [Commented] (SPARK-26842) java.lang.IllegalArgumentException: Unsupported class file major version 55

2019-02-07 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16762749#comment-16762749 ] Gabor Somogyi commented on SPARK-26842: --- Please see the response provided by [~kab

[jira] [Commented] (SPARK-23685) Spark Structured Streaming Kafka 0.10 Consumer Can't Handle Non-consecutive Offsets (i.e. Log Compaction)

2019-02-07 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16762756#comment-16762756 ] Gabor Somogyi commented on SPARK-23685: --- [~sindiri] gentle ping. > Spark Structur

[jira] [Commented] (SPARK-26842) java.lang.IllegalArgumentException: Unsupported class file major version 55

2019-02-07 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16762754#comment-16762754 ] Gabor Somogyi commented on SPARK-26842: --- Have you tried it out [~ranjit_hande]? >

[jira] [Commented] (SPARK-26842) java.lang.IllegalArgumentException: Unsupported class file major version 55

2019-02-07 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16762753#comment-16762753 ] Gabor Somogyi commented on SPARK-26842: --- {quote}so you may need to use lower versi

[jira] [Commented] (SPARK-24284) java.util.NoSuchElementException in Spark Streaming with Kafka

2019-02-05 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16760769#comment-16760769 ] Gabor Somogyi commented on SPARK-24284: --- [~ujjalsatpa...@gmail.com] On 1.6.3 Cache

[jira] [Commented] (SPARK-26825) Spark Structure Streaming job failing when submitted in cluster mode

2019-02-05 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16760777#comment-16760777 ] Gabor Somogyi commented on SPARK-26825: --- There is another PR from me which modifie

[jira] [Commented] (SPARK-24284) java.util.NoSuchElementException in Spark Streaming with Kafka

2019-02-05 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16760775#comment-16760775 ] Gabor Somogyi commented on SPARK-24284: --- This code part has been rewritten in 2.4.

[jira] [Commented] (SPARK-26825) Spark Structure Streaming job failing when submitted in cluster mode

2019-02-05 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16760702#comment-16760702 ] Gabor Somogyi commented on SPARK-26825: --- [~asdaraujo] excellent analysis! One mino

[jira] [Commented] (SPARK-23685) Spark Structured Streaming Kafka 0.10 Consumer Can't Handle Non-consecutive Offsets (i.e. Log Compaction)

2019-02-01 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16758351#comment-16758351 ] Gabor Somogyi commented on SPARK-23685: --- [~sindiri] We've tried to reproduce the i

[jira] [Commented] (SPARK-26783) Kafka parameter documentation doesn't match with the reality (upper/lowercase)

2019-02-01 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16758348#comment-16758348 ] Gabor Somogyi commented on SPARK-26783: --- [~zsxwing] [~kabhwan] The more I'm playin

[jira] [Updated] (SPARK-26734) StackOverflowError on WAL serialization caused by large receivedBlockQueue

2019-02-01 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi updated SPARK-26734: -- Component/s: DStreams > StackOverflowError on WAL serialization caused by large receivedBlockQ

[jira] [Commented] (SPARK-25136) unable to use HDFS checkpoint directories after driver restart

2019-01-31 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16757199#comment-16757199 ] Gabor Somogyi commented on SPARK-25136: --- [~kerbylane] did you have time to check i

[jira] [Comment Edited] (SPARK-26783) Kafka parameter documentation doesn't match with the reality (upper/lowercase)

2019-01-31 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16756994#comment-16756994 ] Gabor Somogyi edited comment on SPARK-26783 at 1/31/19 8:14 AM: --

[jira] [Commented] (SPARK-26783) Kafka parameter documentation doesn't match with the reality (upper/lowercase)

2019-01-31 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16756994#comment-16756994 ] Gabor Somogyi commented on SPARK-26783: --- cc [~joseph.torres] [~kabhwan] [~LI,Xiao

[jira] [Updated] (SPARK-26783) Kafka parameter documentation doesn't match with the reality (upper/lowercase)

2019-01-30 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi updated SPARK-26783: -- Priority: Minor (was: Major) > Kafka parameter documentation doesn't match with the reality (

[jira] [Commented] (SPARK-23685) Spark Structured Streaming Kafka 0.10 Consumer Can't Handle Non-consecutive Offsets (i.e. Log Compaction)

2019-01-30 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16756104#comment-16756104 ] Gabor Somogyi commented on SPARK-23685: --- Filed SPARK-26783. > Spark Structured St

[jira] [Created] (SPARK-26783) Kafka parameter documentation doesn't match with the reality (upper/lowercase)

2019-01-30 Thread Gabor Somogyi (JIRA)
Gabor Somogyi created SPARK-26783: - Summary: Kafka parameter documentation doesn't match with the reality (upper/lowercase) Key: SPARK-26783 URL: https://issues.apache.org/jira/browse/SPARK-26783 Proj

[jira] [Comment Edited] (SPARK-23685) Spark Structured Streaming Kafka 0.10 Consumer Can't Handle Non-consecutive Offsets (i.e. Log Compaction)

2019-01-30 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16756077#comment-16756077 ] Gabor Somogyi edited comment on SPARK-23685 at 1/30/19 1:19 PM: --

[jira] [Resolved] (SPARK-23685) Spark Structured Streaming Kafka 0.10 Consumer Can't Handle Non-consecutive Offsets (i.e. Log Compaction)

2019-01-30 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi resolved SPARK-23685. --- Resolution: Information Provided > Spark Structured Streaming Kafka 0.10 Consumer Can't Hand

[jira] [Comment Edited] (SPARK-23685) Spark Structured Streaming Kafka 0.10 Consumer Can't Handle Non-consecutive Offsets (i.e. Log Compaction)

2019-01-30 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16756077#comment-16756077 ] Gabor Somogyi edited comment on SPARK-23685 at 1/30/19 1:22 PM: --

[jira] [Commented] (SPARK-23685) Spark Structured Streaming Kafka 0.10 Consumer Can't Handle Non-consecutive Offsets (i.e. Log Compaction)

2019-01-30 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16756077#comment-16756077 ] Gabor Somogyi commented on SPARK-23685: --- Comment from [~sindiri] on the PR: {quote

[jira] [Commented] (SPARK-26766) Remove the list of filesystems from HadoopDelegationTokenProvider.obtainDelegationTokens

2019-01-30 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16755934#comment-16755934 ] Gabor Somogyi commented on SPARK-26766: --- Considering the size of the hadoopFSsToAc

[jira] [Updated] (SPARK-26772) YARNHadoopDelegationTokenManager should load ServiceCredentialProviders independently

2019-01-29 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi updated SPARK-26772: -- Description: YARNHadoopDelegationTokenManager now loads ServiceCredentialProviders in one ste

[jira] [Updated] (SPARK-26772) YARNHadoopDelegationTokenManager should load ServiceCredentialProviders independently

2019-01-29 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi updated SPARK-26772: -- Component/s: (was: Spark Core) YARN > YARNHadoopDelegationTokenManager sh

[jira] [Created] (SPARK-26772) YARNHadoopDelegationTokenManager should load ServiceCredentialProviders independently

2019-01-29 Thread Gabor Somogyi (JIRA)
Gabor Somogyi created SPARK-26772: - Summary: YARNHadoopDelegationTokenManager should load ServiceCredentialProviders independently Key: SPARK-26772 URL: https://issues.apache.org/jira/browse/SPARK-26772

[jira] [Updated] (SPARK-26766) Remove the list of filesystems from HadoopDelegationTokenProvider.obtainDelegationTokens

2019-01-29 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi updated SPARK-26766: -- Priority: Minor (was: Major) > Remove the list of filesystems from > HadoopDelegationTokenPr

[jira] [Commented] (SPARK-26766) Remove the list of filesystems from HadoopDelegationTokenProvider.obtainDelegationTokens

2019-01-29 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16754865#comment-16754865 ] Gabor Somogyi commented on SPARK-26766: --- [~vanzin] I was thinking about your [sug

[jira] [Created] (SPARK-26766) Remove the list of filesystems from HadoopDelegationTokenProvider.obtainDelegationTokens

2019-01-29 Thread Gabor Somogyi (JIRA)
Gabor Somogyi created SPARK-26766: - Summary: Remove the list of filesystems from HadoopDelegationTokenProvider.obtainDelegationTokens Key: SPARK-26766 URL: https://issues.apache.org/jira/browse/SPARK-26766

[jira] [Commented] (SPARK-26718) structured streaming fetched wrong current offset from kafka

2019-01-24 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16751706#comment-16751706 ] Gabor Somogyi commented on SPARK-26718: --- +1 on [~kabhwan] suggestion > structured

[jira] [Commented] (SPARK-26389) temp checkpoint folder at executor should be deleted on graceful shutdown

2019-01-24 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16751082#comment-16751082 ] Gabor Somogyi commented on SPARK-26389: --- I've lowered the prio and will file a PR

[jira] [Updated] (SPARK-26389) temp checkpoint folder at executor should be deleted on graceful shutdown

2019-01-24 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi updated SPARK-26389: -- Priority: Minor (was: Major) > temp checkpoint folder at executor should be deleted on gracef

[jira] [Commented] (SPARK-26389) temp checkpoint folder at executor should be deleted on graceful shutdown

2019-01-23 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16750223#comment-16750223 ] Gabor Somogyi commented on SPARK-26389: --- Good to hear with HDFS it's working. Pro

[jira] [Updated] (SPARK-26668) One Kafka broker serve is down,the spark streaming start consuming delay

2019-01-23 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi updated SPARK-26668: -- Component/s: DStreams > One Kafka broker serve is down,the spark streaming start consuming del

[jira] [Commented] (SPARK-26649) Noop Streaming Sink using DSV2

2019-01-22 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26649?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16748887#comment-16748887 ] Gabor Somogyi commented on SPARK-26649: --- Started to work on this. > Noop Streamin

[jira] [Commented] (SPARK-26668) One Kafka broker serve is down,the spark streaming start consuming delay

2019-01-22 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16748809#comment-16748809 ] Gabor Somogyi commented on SPARK-26668: --- [~quanyou.chang] Is it DStreams or Strctu

[jira] [Comment Edited] (SPARK-26668) One Kafka broker serve is down,the spark streaming start consuming delay

2019-01-22 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16748809#comment-16748809 ] Gabor Somogyi edited comment on SPARK-26668 at 1/22/19 3:03 PM: --

[jira] [Updated] (SPARK-26649) Noop Streaming Sink using DSV2

2019-01-22 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi updated SPARK-26649: -- Issue Type: New Feature (was: Bug) > Noop Streaming Sink using DSV2 > ---

[jira] [Created] (SPARK-26686) Remove unnecessary KafkaSourceProvider parameter lowercase conversion

2019-01-22 Thread Gabor Somogyi (JIRA)
Gabor Somogyi created SPARK-26686: - Summary: Remove unnecessary KafkaSourceProvider parameter lowercase conversion Key: SPARK-26686 URL: https://issues.apache.org/jira/browse/SPARK-26686 Project: Spar

[jira] [Commented] (SPARK-26649) Noop Streaming Sink using DSV2

2019-01-21 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26649?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16747830#comment-16747830 ] Gabor Somogyi commented on SPARK-26649: --- Just wondering why is it a bug? > Noop S

[jira] [Created] (SPARK-26592) Kafka delegation token doesn't support proxy user

2019-01-10 Thread Gabor Somogyi (JIRA)
Gabor Somogyi created SPARK-26592: - Summary: Kafka delegation token doesn't support proxy user Key: SPARK-26592 URL: https://issues.apache.org/jira/browse/SPARK-26592 Project: Spark Issue Typ

[jira] [Commented] (SPARK-26592) Kafka delegation token doesn't support proxy user

2019-01-10 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16739681#comment-16739681 ] Gabor Somogyi commented on SPARK-26592: --- Here is the KIP: https://cwiki.apache.or

[jira] [Commented] (SPARK-26385) YARN - Spark Stateful Structured streaming HDFS_DELEGATION_TOKEN not found in cache

2019-01-10 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16739679#comment-16739679 ] Gabor Somogyi commented on SPARK-26385: --- [~stud3nt] are you guys using dynamic all

[jira] [Updated] (SPARK-26592) Kafka delegation token doesn't support proxy user

2019-01-10 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi updated SPARK-26592: -- Description: Kafka is not yet support to obtain delegation token with proxy user. It has to be

[jira] [Resolved] (SPARK-23636) [SPARK 2.2] | Kafka Consumer | KafkaUtils.createRDD throws Exception - java.util.ConcurrentModificationException: KafkaConsumer is not safe for multi-threaded access

2019-01-07 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi resolved SPARK-23636. --- Resolution: Fixed Fix Version/s: 2.4.0 Please reopen if problem re-appears. > [SPARK

[jira] [Commented] (SPARK-26254) Move delegation token providers into a separate project

2019-01-04 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16734393#comment-16734393 ] Gabor Somogyi commented on SPARK-26254: --- [~hyukjin.kwon] In my last comment right

[jira] [Commented] (SPARK-26254) Move delegation token providers into a separate project

2019-01-04 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16733956#comment-16733956 ] Gabor Somogyi commented on SPARK-26254: --- ping [~vanzin] > Move delegation token p

[jira] [Closed] (SPARK-26359) Spark checkpoint restore fails after query restart

2019-01-04 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi closed SPARK-26359. - > Spark checkpoint restore fails after query restart > -

[jira] [Resolved] (SPARK-26359) Spark checkpoint restore fails after query restart

2019-01-04 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi resolved SPARK-26359. --- Resolution: Information Provided > Spark checkpoint restore fails after query restart >

[jira] [Commented] (SPARK-26385) YARN - Spark Stateful Structured streaming HDFS_DELEGATION_TOKEN not found in cache

2019-01-03 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16733318#comment-16733318 ] Gabor Somogyi commented on SPARK-26385: --- Yeah and additional logs would be also go

[jira] [Commented] (SPARK-26359) Spark checkpoint restore fails after query restart

2019-01-03 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16733209#comment-16733209 ] Gabor Somogyi commented on SPARK-26359: --- [~Tint] did the suggested workaround work

[jira] [Commented] (SPARK-26389) temp checkpoint folder at executor should be deleted on graceful shutdown

2019-01-03 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16733201#comment-16733201 ] Gabor Somogyi commented on SPARK-26389: --- Temp checkpoint can be used in one-node s

[jira] [Updated] (SPARK-26434) disallow ADAPTIVE_EXECUTION_ENABLED&CBO_ENABLED in org.apache.spark.sql.execution.streaming.StreamExecution#runStream, but logWarning in org.apache.spark.sql.streaming.S

2019-01-03 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi updated SPARK-26434: -- Affects Version/s: (was: 2.4.0) 3.0.0 > disallow ADAPTIVE_EXECUTION

[jira] [Commented] (SPARK-26389) temp checkpoint folder at executor should be deleted on graceful shutdown

2018-12-19 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16725418#comment-16725418 ] Gabor Somogyi commented on SPARK-26389: --- {quote}can't used to recovery{quote} What

[jira] [Commented] (SPARK-26389) temp checkpoint folder at executor should be deleted on graceful shutdown

2018-12-19 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16725413#comment-16725413 ] Gabor Somogyi commented on SPARK-26389: --- spark.streaming.stopGracefullyOnShutdown

[jira] [Comment Edited] (SPARK-26415) Mark StreamSinkProvider and StreamSourceProvider as stable

2018-12-19 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16725395#comment-16725395 ] Gabor Somogyi edited comment on SPARK-26415 at 12/19/18 10:30 PM:

[jira] [Commented] (SPARK-26415) Mark StreamSinkProvider and StreamSourceProvider as stable

2018-12-19 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16725395#comment-16725395 ] Gabor Somogyi commented on SPARK-26415: --- One minor thing: AFAIK targer version sho

[jira] [Commented] (SPARK-26396) Kafka consumer cache overflow since 2.4.x

2018-12-19 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16725393#comment-16725393 ] Gabor Somogyi commented on SPARK-26396: --- Should be. Related the jira the describe

[jira] [Comment Edited] (SPARK-26396) Kafka consumer cache overflow since 2.4.x

2018-12-19 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16725063#comment-16725063 ] Gabor Somogyi edited comment on SPARK-26396 at 12/19/18 2:48 PM: -

[jira] [Comment Edited] (SPARK-26396) Kafka consumer cache overflow since 2.4.x

2018-12-19 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16725063#comment-16725063 ] Gabor Somogyi edited comment on SPARK-26396 at 12/19/18 2:50 PM: -

[jira] [Comment Edited] (SPARK-26396) Kafka consumer cache overflow since 2.4.x

2018-12-19 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16725063#comment-16725063 ] Gabor Somogyi edited comment on SPARK-26396 at 12/19/18 2:49 PM: -

[jira] [Commented] (SPARK-26396) Kafka consumer cache overflow since 2.4.x

2018-12-19 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16725063#comment-16725063 ] Gabor Somogyi commented on SPARK-26396: --- Number of executors can be set with {{--n

[jira] [Comment Edited] (SPARK-26396) Kafka consumer cache overflow since 2.4.x

2018-12-19 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16725021#comment-16725021 ] Gabor Somogyi edited comment on SPARK-26396 at 12/19/18 2:07 PM: -

[jira] [Commented] (SPARK-26396) Kafka consumer cache overflow since 2.4.x

2018-12-19 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16725021#comment-16725021 ] Gabor Somogyi commented on SPARK-26396: --- [~Tint] seems like you're trying to scale

[jira] [Commented] (SPARK-26359) Spark checkpoint restore fails after query restart

2018-12-14 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16721251#comment-16721251 ] Gabor Somogyi commented on SPARK-26359: --- There is also a possibility (apart from d

[jira] [Created] (SPARK-26371) Increase Kafka ConfigUpdater test coverage

2018-12-14 Thread Gabor Somogyi (JIRA)
Gabor Somogyi created SPARK-26371: - Summary: Increase Kafka ConfigUpdater test coverage Key: SPARK-26371 URL: https://issues.apache.org/jira/browse/SPARK-26371 Project: Spark Issue Type: Impr

[jira] [Commented] (SPARK-26359) Spark checkpoint restore fails after query restart

2018-12-14 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16721195#comment-16721195 ] Gabor Somogyi commented on SPARK-26359: --- Yeah, after the recovery it's advised to

[jira] [Commented] (SPARK-26359) Spark checkpoint restore fails after query restart

2018-12-13 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16720558#comment-16720558 ] Gabor Somogyi commented on SPARK-26359: --- As I see the main issue appears because S

[jira] [Comment Edited] (SPARK-26254) Move delegation token providers into a separate project

2018-12-13 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16720241#comment-16720241 ] Gabor Somogyi edited comment on SPARK-26254 at 12/13/18 3:29 PM: -

[jira] [Commented] (SPARK-26254) Move delegation token providers into a separate project

2018-12-13 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16720241#comment-16720241 ] Gabor Somogyi commented on SPARK-26254: --- {quote}There was concern about using Serv

[jira] [Commented] (SPARK-25136) unable to use HDFS checkpoint directories after driver restart

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16718976#comment-16718976 ] Gabor Somogyi commented on SPARK-25136: --- Without knowing the exact root cause I wo

[jira] [Commented] (SPARK-25136) unable to use HDFS checkpoint directories after driver restart

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16718784#comment-16718784 ] Gabor Somogyi commented on SPARK-25136: --- As I see from the logs you've chosen the

[jira] [Commented] (SPARK-25136) unable to use HDFS checkpoint directories after driver restart

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16718747#comment-16718747 ] Gabor Somogyi commented on SPARK-25136: --- [~kerbylane] how do you set checkpoint lo

[jira] [Commented] (SPARK-25136) unable to use HDFS checkpoint directories after driver restart

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16718743#comment-16718743 ] Gabor Somogyi commented on SPARK-25136: --- Yeah, the HDFS thing is different. > una

[jira] [Commented] (SPARK-25136) unable to use HDFS checkpoint directories after driver restart

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16718727#comment-16718727 ] Gabor Somogyi commented on SPARK-25136: --- We've tested S3 as a checkpoint directory

[jira] [Commented] (SPARK-26302) retainedBatches configuration can eat up memory on driver

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16718663#comment-16718663 ] Gabor Somogyi commented on SPARK-26302: --- Code most of the time not read by users s

[jira] [Commented] (SPARK-19888) Seeing offsets not resetting even when reset policy is configured explicitly

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16718645#comment-16718645 ] Gabor Somogyi commented on SPARK-19888: --- [~jrmiller] SPARK-19185 resolved on 2.4.0

[jira] [Closed] (SPARK-19185) ConcurrentModificationExceptions with CachedKafkaConsumers when Windowing

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi closed SPARK-19185. - > ConcurrentModificationExceptions with CachedKafkaConsumers when Windowing > --

[jira] [Commented] (SPARK-23526) KafkaMicroBatchV2SourceSuite.ensure stream-stream self-join generates only one offset in offset log

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16718644#comment-16718644 ] Gabor Somogyi commented on SPARK-23526: --- [~cloud_fan] this should be resolved in S

[jira] [Commented] (SPARK-23636) [SPARK 2.2] | Kafka Consumer | KafkaUtils.createRDD throws Exception - java.util.ConcurrentModificationException: KafkaConsumer is not safe for multi-threaded access

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16718622#comment-16718622 ] Gabor Somogyi commented on SPARK-23636: --- [~mcdeepak] It should be resolved in SPAR

[jira] [Updated] (SPARK-23636) [SPARK 2.2] | Kafka Consumer | KafkaUtils.createRDD throws Exception - java.util.ConcurrentModificationException: KafkaConsumer is not safe for multi-threaded access

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi updated SPARK-23636: -- Component/s: (was: Structured Streaming) DStreams > [SPARK 2.2] | Kafka C

[jira] [Closed] (SPARK-22606) There may be two or more tasks in one executor will use the same kafka consumer at the same time, then it will throw an exception: "KafkaConsumer is not safe for multi-th

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi closed SPARK-22606. - > There may be two or more tasks in one executor will use the same kafka > consumer at the same tim

[jira] [Resolved] (SPARK-23663) Spark Streaming Kafka 010 , fails with "java.util.ConcurrentModificationException: KafkaConsumer is not safe for multi-threaded access"

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi resolved SPARK-23663. --- Resolution: Duplicate Fix Version/s: 2.4.0 This should be fixed in SPARK-19185. > Sp

[jira] [Closed] (SPARK-23663) Spark Streaming Kafka 010 , fails with "java.util.ConcurrentModificationException: KafkaConsumer is not safe for multi-threaded access"

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi closed SPARK-23663. - > Spark Streaming Kafka 010 , fails with > "java.util.ConcurrentModificationException: KafkaConsume

[jira] [Resolved] (SPARK-22606) There may be two or more tasks in one executor will use the same kafka consumer at the same time, then it will throw an exception: "KafkaConsumer is not safe for multi-

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi resolved SPARK-22606. --- Resolution: Duplicate Fix Version/s: 2.4.0 > There may be two or more tasks in one ex

[jira] [Reopened] (SPARK-22606) There may be two or more tasks in one executor will use the same kafka consumer at the same time, then it will throw an exception: "KafkaConsumer is not safe for multi-

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi reopened SPARK-22606: --- > There may be two or more tasks in one executor will use the same kafka > consumer at the same

[jira] [Issue Comment Deleted] (SPARK-22606) There may be two or more tasks in one executor will use the same kafka consumer at the same time, then it will throw an exception: "KafkaConsumer is not sa

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi updated SPARK-22606: -- Comment: was deleted (was: This should be resolved in SPARK-19185, closing.) > There may be t

[jira] [Resolved] (SPARK-22606) There may be two or more tasks in one executor will use the same kafka consumer at the same time, then it will throw an exception: "KafkaConsumer is not safe for multi-

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi resolved SPARK-22606. --- Resolution: Won't Fix This should be resolved in SPARK-19185. > There may be two or more ta

[jira] [Commented] (SPARK-22606) There may be two or more tasks in one executor will use the same kafka consumer at the same time, then it will throw an exception: "KafkaConsumer is not safe for multi

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16718599#comment-16718599 ] Gabor Somogyi commented on SPARK-22606: --- This should be resolved in SPARK-19185, c

[jira] [Comment Edited] (SPARK-19185) ConcurrentModificationExceptions with CachedKafkaConsumers when Windowing

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16718595#comment-16718595 ] Gabor Somogyi edited comment on SPARK-19185 at 12/12/18 8:27 AM: -

[jira] [Commented] (SPARK-19185) ConcurrentModificationExceptions with CachedKafkaConsumers when Windowing

2018-12-12 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16718595#comment-16718595 ] Gabor Somogyi commented on SPARK-19185: --- There are many related issues open but th

[jira] [Commented] (SPARK-26302) retainedBatches configuration can eat up memory on driver

2018-12-11 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16717274#comment-16717274 ] Gabor Somogyi commented on SPARK-26302: --- Changed the title to reflect the issue mo

[jira] [Updated] (SPARK-26302) retainedBatches configuration can eat up memory on driver

2018-12-11 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi updated SPARK-26302: -- Summary: retainedBatches configuration can eat up memory on driver (was: retainedBatches conf

[jira] [Comment Edited] (SPARK-26302) retainedBatches configuration can cause memory leak

2018-12-10 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16715124#comment-16715124 ] Gabor Somogyi edited comment on SPARK-26302 at 12/10/18 5:12 PM: -

[jira] [Commented] (SPARK-26302) retainedBatches configuration can cause memory leak

2018-12-10 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16715124#comment-16715124 ] Gabor Somogyi commented on SPARK-26302: --- > can cause memory leak Is it really memo

[jira] [Commented] (SPARK-26302) retainedBatches configuration can cause memory leak

2018-12-10 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16715109#comment-16715109 ] Gabor Somogyi commented on SPARK-26302: --- I think the same applies to all *spark.ui

[jira] [Updated] (SPARK-26302) retainedBatches configuration can cause memory leak

2018-12-10 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi updated SPARK-26302: -- Component/s: DStreams > retainedBatches configuration can cause memory leak >

[jira] [Updated] (SPARK-26322) Simplify kafka delegation token sasl.mechanism configuration

2018-12-10 Thread Gabor Somogyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Somogyi updated SPARK-26322: -- Description: When Kafka delegation token obtained, SCRAM sasl.mechanism has to be configured

<    1   2   3   4   5   6   7   >