Re: KStream/KTable prioritization/initial offset

2016-03-25 Thread Greg Fodor
need to be able to not > commit offsets against the input used to populate the state store so that > it will always get reprocessed and you need to be able to disable the > changelog. > > -Jay > > On Fri, Mar 25, 2016 at 11:55 AM, Greg Fodor <gfo...@gmail.com> wrote: > >>

KStream/KTable prioritization/initial offset

2016-03-24 Thread Greg Fodor
Really digging Kafka Streams so far, nice work all. I'm interested in being able to materialize one or more KTables in full before the rest of the topology begins processing messages. This seems fundamentally useful since it allows you to get your database tables replicated up off the change

Re: KStream/KTable prioritization/initial offset

2016-03-25 Thread Greg Fodor
thers. > > On Thu, Mar 24, 2016 at 6:57 PM, Greg Fodor <gfo...@gmail.com> wrote: > > > Really digging Kafka Streams so far, nice work all. I'm interested in > > being able to materialize one or more KTables in full before the rest > > of the topology begins processi

Re: Kafka Streams file handle leak

2016-04-25 Thread Greg Fodor
create a new JIRA for this and change the RB > title with the JIRA number? > > Thanks! > > Guozhang > > On Mon, Apr 25, 2016 at 12:13 AM, Greg Fodor <gfo...@gmail.com> wrote: > >> Apologies for a second posting to the list, but the ASF JIRA is >> prev

Re: Kafka missing from ASF Jira?

2016-04-25 Thread Greg Fodor
hemselves. >> >> Anyways, have just added you to the contributor list. >> >> >> Guozhang >> >> On Mon, Apr 25, 2016 at 9:14 AM, Greg Fodor <gfo...@gmail.com> wrote: >> >> > If I go there, and hit "Create", the Project List at th

Re: Kafka missing from ASF Jira?

2016-04-25 Thread Greg Fodor
apache.org/jira/browse/KAFKA/?selectedTab=com.atlassian.jira.jira-projects-plugin:summary-panel > > Thanks, > Liquan > > On Mon, Apr 25, 2016 at 12:04 AM, Greg Fodor <gfo...@gmail.com> wrote: > >> I am trying to file a bug, but when I go to create a ticket on the

Kafka Streams file handle leak

2016-04-25 Thread Greg Fodor
Apologies for a second posting to the list, but the ASF JIRA is preventing me from creating KAFKA- tickets so I wanted to report this issue. This seems to be a critical bug with KStreams, the .lock files in the state store directories do not seem to be having their file handles freed (despite the

Kafka missing from ASF Jira?

2016-04-25 Thread Greg Fodor
I am trying to file a bug, but when I go to create a ticket on the ASF Jira, Kafka is not visible in the list of projects in the first field of the ticket. I see the hundred+ other Apache projects, but no Kafka :(

Re: [Streams] Reason for swallowing KafkaExceptions?

2016-05-12 Thread Greg Fodor
that there >> are some cases where KafkaExceptions are not logged in log4j yet before it >> pop up to the StreamThread. I will file a JIRA for this issue. >> >> >> Guozhang >> >> On Thu, May 12, 2016 at 5:57 PM, Greg Fodor <gfo...@gmail.com> wrote

Re: [Streams] Reason for swallowing KafkaExceptions?

2016-05-12 Thread Greg Fodor
, Guozhang Wang <wangg...@gmail.com> wrote: > Greg, > > Could you post the output from stdout when running the console as well? > > > Guozhang > > On Thu, May 12, 2016 at 4:52 PM, Greg Fodor <gfo...@gmail.com> wrote: > >> We noticed that some e

[Streams] Reason for swallowing KafkaExceptions?

2016-05-12 Thread Greg Fodor
We noticed that some errors were happening in one of our KafkaStreams jobs but they were not appearing in our logs or being sent to our error reporting service (Airbrake) -- they only became visible on stdout when running from the console. I believe the reason is because of this explicit

Re: best way to duplicate a KStream?

2016-04-14 Thread Greg Fodor
gt; > stream2 = stream1.map(/*set key*/).through("topic2").aggregateByKey(...); > > stream3 = stream1.map(/*a different > key*/).through("topci3").aggregateByKey(...); > > ----- > > Guozhang > > > On Thu, Apr 14, 2016 at 10:05 AM, Greg Fodo

best way to duplicate a KStream?

2016-04-14 Thread Greg Fodor
Hi all, I'm working on a Kafka Streaming job and I've hit a case I didn't see straightforward support for. I have an input stream topic I'd like to perform two separate aggregations on, but building up two parallel sub-topologies with the same source topic at the source results in an error: "Topic

Store flushing on commit.interval.ms from KIP-63 introduces aggregation latency

2016-10-09 Thread Greg Fodor
I'm taking 0.10.1 for a spin on our existing Kafka Streams jobs and I'm hitting what seems to be a serious issue (at least, for us) with the changes brought about in KIP-63. In our job, we have a number of steps in the topology where we perform a repartition and aggregation on topics that require

Re: Store flushing on commit.interval.ms from KIP-63 introduces aggregation latency

2016-10-09 Thread Greg Fodor
that if the records are forwarded immediately it resolves the issue I am seeing. Having it be done on a per-KGroupedStream basis would provide maximum flexibility. On Sun, Oct 9, 2016 at 1:06 AM, Greg Fodor <gfo...@gmail.com> wrote: > I'm taking 0.10.1 for a spin on our existing Kafka Streams jobs

Re: Store flushing on commit.interval.ms from KIP-63 introduces aggregation latency

2016-10-10 Thread Greg Fodor
cache size to 0. That should > give you the old behaviour: > streamsConfiguration.put(StreamsConfig.CACHE_MAX_BYTES_BUFFERING_CONFIG, > 0L); > > Your PR is an alternative, but it requires changing the APIs and would > require a KIP. > > Thanks > Eno > > > On 9 Oct 2

Re: Store flushing on commit.interval.ms from KIP-63 introduces aggregation latency

2016-10-12 Thread Greg Fodor
peaking you are saving those many serde calls per store). But if you do > observe significant performance degradation I'd personally love to learn > more and help on that end. > > > Guozhang > > > > > > On Tue, Oct 11, 2016 at 10:10 AM, Greg Fodor <gfo...@gmai

Re: [ANNOUNCE] Apache Kafka 0.10.1.0 Released

2016-10-20 Thread Greg Fodor
;> > > afka_2.11-0.10.1.0.tgz >> > > https://www.apache.org/dyn/closer.cgi?path=/kafka/0.10.1.0/k >> > > afka_2.10-0.10.1.0.tgz >> > > >> > > Thanks to the 115 contributors on this release! >> > > >> > > Alex Gliks

Re: [jira] [Commented] (KAFKA-4113) Allow KTable bootstrap

2016-10-21 Thread Greg Fodor
gracefully shut down our jobs in certain cases so there's a chance that might be related. Anyhow, now that I have a better understanding of things I will be able to investigate if we experience missing keys in the future, thanks! On Oct 20, 2016 2:08 PM, "Greg Fodor (JIRA)" <j...@apac

Re: Store flushing on commit.interval.ms from KIP-63 introduces aggregation latency

2016-10-11 Thread Greg Fodor
sdb/wiki/Basic-Operations#write-buffer > <https://github.com/facebook/rocksdb/wiki/Basic-Operations#write-buffer> > > They can override these settings by creating an impl of RocksDBConfigSetter > and setting StreamsConfig.ROCKSDB_CONFIG_SETTER_CLASS_CONFIG in Kafka Streams. > &g

Re: Store flushing on commit.interval.ms from KIP-63 introduces aggregation latency

2016-10-09 Thread Greg Fodor
JIRA opened here: https://issues.apache.org/jira/browse/KAFKA-4281 On Sun, Oct 9, 2016 at 2:02 AM, Greg Fodor <gfo...@gmail.com> wrote: > I went ahead and did some more testing, and it feels to me one option > for resolving this issue is having a method on KGroupedStream which &

Re: Dealing with noisy timestamps in kafka streams + 10.1

2017-07-07 Thread Greg Fodor
topic while allowing CreateTime timestamps to land on the other topics. On Thu, Jul 6, 2017 at 9:32 PM, Greg Fodor <gfo...@gmail.com> wrote: > I managed to answer some of my own questions :) > > For future google'ers: > > to deal with retention.ms see https://issues.apache.org

Re: Possible deadlock during shutdown in setState in kafka streams 10.2

2017-07-07 Thread Greg Fodor
t; Damian > > On Fri, 7 Jul 2017 at 10:36 Greg Fodor <gfo...@gmail.com> wrote: > > > I'm running a 10.2 job across 5 nodes with 32 stream threads on each node > > and find that when gracefully shutdown all of them at once via an ansible > > scripts, some of the nodes

Consumer group suggestion: minimum consumer count

2017-07-10 Thread Greg Fodor
With Kafka Streams, it's common to spin up and shut down clusters of consumers by performing a graceful shutdown and restart during a deploy. One thing we've been running into is that during the startup and shutdown of a kafka streams clusters you often can have multiple rebalances as the consumer

Re: Consumer group suggestion: minimum consumer count

2017-07-10 Thread Greg Fodor
Apologies, just discovered KIP-134, which would address our specific problem directly via delaying consumer join. On Mon, Jul 10, 2017 at 4:53 PM, Greg Fodor <gfo...@gmail.com> wrote: > With Kafka Streams, it's common to spin up and shut down clusters of > consumers by performin

Contention on CachedSchemaRegistryClient#getBy*

2017-07-10 Thread Greg Fodor
We've noticed that there's a lot of contention on get/set of schemas by ID in CachedSchemaRegistryClient if that object is shared across threads. All of the methods on CachedSchemaRegistryClient are synchronized and getBySubjectAndId is called when deserializing a record:

Re: [VOTE] KIP-134: Delay initial consumer group rebalance

2017-07-10 Thread Greg Fodor
Found this thread after posting an alternative idea after we starting hitting this issue ourselves for a job that has a lot of state stores and topic partitions. My suggestion was to have consumer groups have a configurable minimum member count before consumption begins, but that has its own trade

Dealing with noisy timestamps in kafka streams + 10.1

2017-07-06 Thread Greg Fodor
Hey all, we are currently working on migrating our system to kafka 10.2 from 10.0 and one thing that we have hit that I wanted some advice on is dealing with the new log retention/rolling semantics that are based on timestamps. We send telemetry data from installed clients into kafka via kafka

Re: RocksDB flushing issue on 0.10.2 streams

2017-07-06 Thread Greg Fodor
threads per host. - Peak TPS seems to be approximately 5k-10k tuples/sec per node. We're reprocessing historical data in kafka. On Thu, Jul 6, 2017 at 10:45 AM, Greg Fodor <gfo...@gmail.com> wrote: > That's great news, thanks! > > On Thu, Jul 6, 2017 at 6:18 AM, Damian Guy <damian...

Possible deadlock during shutdown in setState in kafka streams 10.2

2017-07-07 Thread Greg Fodor
I'm running a 10.2 job across 5 nodes with 32 stream threads on each node and find that when gracefully shutdown all of them at once via an ansible scripts, some of the nodes end up freezing -- at a glance the attached thread dump implies a deadlock between stream threads trying to update their

Re: RocksDB flushing issue on 0.10.2 streams

2017-07-04 Thread Greg Fodor
.2.1). You can also try > setting `state.cleanup.delay.ms` to a higher value (default is 10 > minutes), > to try and avoid it happening during a rebalance (I know this isn't a fix, > but will make it less likely to happen). > > Thanks, > Damian > > On Tue, 4 Jul 2017 at 12:43 Gre

Re: RocksDB flushing issue on 0.10.2 streams

2017-07-04 Thread Greg Fodor
Also, I am on 0.10.2.1, so poll interval was already set to MAX_VALUE. On Tue, Jul 4, 2017 at 11:28 AM, Greg Fodor <gfo...@gmail.com> wrote: > I've nuked the nodes this happened on, but the job had been running for > about 5-10 minutes across 5 nodes before this happened. Does

Re: RocksDB flushing issue on 0.10.2 streams

2017-07-04 Thread Greg Fodor
Upon another run, I see the same error occur during a rebalance, so either my log was showing a rebalance or there is a shared underlying issue with state stores. On Tue, Jul 4, 2017 at 11:35 AM, Greg Fodor <gfo...@gmail.com> wrote: > Also, I am on 0.10.2.1, so poll interval was al

Re: RocksDB flushing issue on 0.10.2 streams

2017-07-05 Thread Greg Fodor
, this does not seem to be happening during a rebalancing event, but after the cluster is stable. On Tue, Jul 4, 2017 at 12:29 PM, Greg Fodor <gfo...@gmail.com> wrote: > Upon another run, I see the same error occur during a rebalance, so either > my log was showing a rebalance or there

Re: RocksDB flushing issue on 0.10.2 streams

2017-07-05 Thread Greg Fodor
, Greg Fodor <gfo...@gmail.com> wrote: > The issue I am hitting is not the directory locking issues we've seen in > the past. The issue seems to be, as you mentioned, that the state dir is > getting deleted by the store cleanup process, but there are still tasks > running that a

Re: RocksDB flushing issue on 0.10.2 streams

2017-07-06 Thread Greg Fodor
at 23:23 Damian Guy <damian@gmail.com> wrote: > > > >> Thanks Greg. I'll look into it more tomorrow. Just finding it difficult > >> to reproduce in a test. > >> Thanks for providing the sequence, gives me something to try and repo. > >> Appreciated. &

Re: Dealing with noisy timestamps in kafka streams + 10.1

2017-07-06 Thread Greg Fodor
://issues.apache.org/jira/browse/KAFKA-5344 In our case, we can live with setting the retention.ms during backfills. Still would like to know if there are any better practices for dealing with mis-stamped records during backills w/ state store topics. On Thu, Jul 6, 2017 at 12:32 PM, Greg Fodor <gfo...@gmail.

Re: RocksDB flushing issue on 0.10.2 streams

2017-07-05 Thread Greg Fodor
g used as the lock will fail. > > > > > > On Wed, 5 Jul 2017 at 08:15 Greg Fodor <gfo...@gmail.com> wrote: > > > >> I can report that setting state.cleanup.delay.ms to a very large value > >> (effectively disabling it) works around the issue. It seems

[jira] [Commented] (KAFKA-3758) KStream job fails to recover after Kafka broker stopped

2016-06-02 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15312912#comment-15312912 ] Greg Fodor commented on KAFKA-3758: --- Hey, we're running 16 threads -- for this job we have 25 topics

[jira] [Commented] (KAFKA-3758) KStream job fails to recover after Kafka broker stopped

2016-06-02 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15312913#comment-15312913 ] Greg Fodor commented on KAFKA-3758: --- also, if we did not run at an elevated number of threads, we were

[jira] [Created] (KAFKA-3785) Fetcher spending unnecessary time during metrics recording

2016-06-02 Thread Greg Fodor (JIRA)
Greg Fodor created KAFKA-3785: - Summary: Fetcher spending unnecessary time during metrics recording Key: KAFKA-3785 URL: https://issues.apache.org/jira/browse/KAFKA-3785 Project: Kafka Issue

[jira] [Commented] (KAFKA-3770) KStream job should be able to specify linger.ms

2016-06-02 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313434#comment-15313434 ] Greg Fodor commented on KAFKA-3770: --- Cut a new PR since I screwed up the rebase on the previous one

[jira] [Updated] (KAFKA-3785) Fetcher spending unnecessary time during metrics recording

2016-06-02 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Fodor updated KAFKA-3785: -- Component/s: consumer > Fetcher spending unnecessary time during metrics record

[jira] [Commented] (KAFKA-3785) Fetcher spending unnecessary time during metrics recording

2016-06-02 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313461#comment-15313461 ] Greg Fodor commented on KAFKA-3785: --- https://github.com/apache/kafka/pull/1464 > Fetcher spend

[jira] [Commented] (KAFKA-3769) KStream job spending 60% of time writing metrics

2016-06-12 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15326778#comment-15326778 ] Greg Fodor commented on KAFKA-3769: --- Discussion/resolution moved to: https://issues.apache.org/jira

[jira] [Commented] (KAFKA-3811) Introduce Kafka Streams metrics recording levels

2016-06-10 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15324847#comment-15324847 ] Greg Fodor commented on KAFKA-3811: --- I've also attached a screenshot of YourKit of the relevant call

[jira] [Updated] (KAFKA-3811) Introduce Kafka Streams metrics recording levels

2016-06-10 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Fodor updated KAFKA-3811: -- Attachment: screenshot-1.png > Introduce Kafka Streams metrics recording lev

[jira] [Updated] (KAFKA-3811) Introduce Kafka Streams metrics recording levels

2016-06-10 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Fodor updated KAFKA-3811: -- Attachment: Muon-Snapshot.zip > Introduce Kafka Streams metrics recording lev

[jira] [Updated] (KAFKA-3811) Introduce Kafka Streams metrics recording levels

2016-06-10 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Fodor updated KAFKA-3811: -- Attachment: Muon-latency.zip > Introduce Kafka Streams metrics recording lev

[jira] [Commented] (KAFKA-3811) Introduce Kafka Streams metrics recording levels

2016-06-10 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15324838#comment-15324838 ] Greg Fodor commented on KAFKA-3811: --- Hey [~aartigupta], I ran an attached yourkit profiler to one of our

[jira] [Commented] (KAFKA-3811) Introduce Kafka Streams metrics recording levels

2016-06-10 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15324869#comment-15324869 ] Greg Fodor commented on KAFKA-3811: --- Also, I've attached a screenshot + snapshot of a second run where I

[jira] [Updated] (KAFKA-3811) Introduce Kafka Streams metrics recording levels

2016-06-10 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Fodor updated KAFKA-3811: -- Attachment: screenshot-latency.png > Introduce Kafka Streams metrics recording lev

[jira] [Updated] (KAFKA-3811) Introduce Kafka Streams metrics recording levels

2016-06-09 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Fodor updated KAFKA-3811: -- Summary: Introduce Kafka Streams metrics recording levels (was: Introduce KStream metrics recording

[jira] [Created] (KAFKA-3811) Introduce KStream metrics recording levels

2016-06-09 Thread Greg Fodor (JIRA)
Greg Fodor created KAFKA-3811: - Summary: Introduce KStream metrics recording levels Key: KAFKA-3811 URL: https://issues.apache.org/jira/browse/KAFKA-3811 Project: Kafka Issue Type: Improvement

[jira] [Reopened] (KAFKA-3758) KStream job fails to recover after Kafka broker stopped

2016-05-28 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Fodor reopened KAFKA-3758: --- > KStream job fails to recover after Kafka broker stop

[jira] [Commented] (KAFKA-3758) KStream job fails to recover after Kafka broker stopped

2016-05-28 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305501#comment-15305501 ] Greg Fodor commented on KAFKA-3758: --- Ah yes this looks like the same issue, thanks! > KStream job fa

[jira] [Resolved] (KAFKA-3758) KStream job fails to recover after Kafka broker stopped

2016-05-28 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Fodor resolved KAFKA-3758. --- Resolution: Duplicate > KStream job fails to recover after Kafka broker stop

[jira] [Commented] (KAFKA-3758) KStream job fails to recover after Kafka broker stopped

2016-05-28 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305503#comment-15305503 ] Greg Fodor commented on KAFKA-3758: --- Oh, actually, I'm not so sure. This was not during an unclean

[jira] [Created] (KAFKA-3770) KStream job should be able to specify linger.ms

2016-05-30 Thread Greg Fodor (JIRA)
Greg Fodor created KAFKA-3770: - Summary: KStream job should be able to specify linger.ms Key: KAFKA-3770 URL: https://issues.apache.org/jira/browse/KAFKA-3770 Project: Kafka Issue Type

[jira] [Commented] (KAFKA-3770) KStream job should be able to specify linger.ms

2016-05-30 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15306371#comment-15306371 ] Greg Fodor commented on KAFKA-3770: --- https://github.com/apache/kafka/pull/1448 > KStream job sho

[jira] [Commented] (KAFKA-3769) KStream job spending 60% of time writing metrics

2016-05-30 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15306306#comment-15306306 ] Greg Fodor commented on KAFKA-3769: --- https://github.com/apache/kafka/pull/1447 > KStream job spend

[jira] [Commented] (KAFKA-3769) KStream job spending 60% of time writing metrics

2016-05-30 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15306318#comment-15306318 ] Greg Fodor commented on KAFKA-3769: --- Actually, additionally it looks like the code path for fetching

[jira] [Created] (KAFKA-3769) KStream job spending 60% of time writing metrics

2016-05-30 Thread Greg Fodor (JIRA)
Greg Fodor created KAFKA-3769: - Summary: KStream job spending 60% of time writing metrics Key: KAFKA-3769 URL: https://issues.apache.org/jira/browse/KAFKA-3769 Project: Kafka Issue Type: Bug

[jira] [Commented] (KAFKA-3769) KStream job spending 60% of time writing metrics

2016-05-30 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15306381#comment-15306381 ] Greg Fodor commented on KAFKA-3769: --- Consider the PR a first pass, please advise on how we may want

[jira] [Commented] (KAFKA-3769) KStream job spending 60% of time writing metrics

2016-05-30 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15306321#comment-15306321 ] Greg Fodor commented on KAFKA-3769: --- It seems it might be desirable to have a way to just flip off some

[jira] [Commented] (KAFKA-3758) KStream job fails to recover after Kafka broker stopped

2016-05-31 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15308306#comment-15308306 ] Greg Fodor commented on KAFKA-3758: --- No, the kstream job was running across 2 servers, and the kafka

[jira] [Created] (KAFKA-3758) KStream job fails to recover after Kafka broker stopped

2016-05-26 Thread Greg Fodor (JIRA)
Greg Fodor created KAFKA-3758: - Summary: KStream job fails to recover after Kafka broker stopped Key: KAFKA-3758 URL: https://issues.apache.org/jira/browse/KAFKA-3758 Project: Kafka Issue Type

[jira] [Updated] (KAFKA-3758) KStream job fails to recover after Kafka broker stopped

2016-05-26 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Fodor updated KAFKA-3758: -- Attachment: muon.log.1.gz > KStream job fails to recover after Kafka broker stop

[jira] [Commented] (KAFKA-3758) KStream job fails to recover after Kafka broker stopped

2016-05-26 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15302542#comment-15302542 ] Greg Fodor commented on KAFKA-3758: --- Also, the log is truncated at the top to the point where we shut

[jira] [Commented] (KAFKA-3769) KStream job spending 60% of time writing metrics

2016-06-01 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15311179#comment-15311179 ] Greg Fodor commented on KAFKA-3769: --- Thanks Jay! Guozhang, what are your thoughts on instead of trying

[jira] [Comment Edited] (KAFKA-3360) Add a protocol page/section to the official Kafka documentation

2016-03-19 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198422#comment-15198422 ] Greg Fodor edited comment on KAFKA-3360 at 3/16/16 11:39 PM: - Apologies as I

[jira] [Commented] (KAFKA-3542) Add "repartition (+ join)" operations to streams

2016-04-11 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15236331#comment-15236331 ] Greg Fodor commented on KAFKA-3542: --- I might just not be understanding, but a simple example is if I

[jira] [Comment Edited] (KAFKA-3544) Missing topics on startup

2016-04-11 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15236512#comment-15236512 ] Greg Fodor edited comment on KAFKA-3544 at 4/12/16 3:05 AM: Not sure

[jira] [Commented] (KAFKA-3544) Missing topics on startup

2016-04-11 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15236512#comment-15236512 ] Greg Fodor commented on KAFKA-3544: --- Not sure of the best way to share the topology. Here's the relevant

[jira] [Comment Edited] (KAFKA-3544) Missing topics on startup

2016-04-11 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15236512#comment-15236512 ] Greg Fodor edited comment on KAFKA-3544 at 4/12/16 3:04 AM: Not sure

[jira] [Comment Edited] (KAFKA-3544) Missing topics on startup

2016-04-11 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15236512#comment-15236512 ] Greg Fodor edited comment on KAFKA-3544 at 4/12/16 3:04 AM: Not sure

[jira] [Commented] (KAFKA-3543) Allow a variant of transform() which can emit multiple values

2016-04-11 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15236339#comment-15236339 ] Greg Fodor commented on KAFKA-3543: --- Ah interesting, my assumption was that the items emitted from

[jira] [Commented] (KAFKA-3542) Add "repartition (+ join)" operations to streams

2016-04-11 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15236417#comment-15236417 ] Greg Fodor commented on KAFKA-3542: --- great. feel free to close. > Add "repartition (+ join)&quo

[jira] [Commented] (KAFKA-3542) Add "repartition (+ join)" operations to streams

2016-04-11 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15236374#comment-15236374 ] Greg Fodor commented on KAFKA-3542: --- Right, this map approach is what I am doing right now before all

[jira] [Commented] (KAFKA-3543) Allow a variant of transform() which can emit multiple values

2016-04-11 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15236427#comment-15236427 ] Greg Fodor commented on KAFKA-3543: --- Just something akin to a flatTransform where the output

[jira] [Commented] (KAFKA-3542) Add "repartition (+ join)" operations to streams

2016-04-11 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15236343#comment-15236343 ] Greg Fodor commented on KAFKA-3542: --- Ah, I may understand what you're getting at here -- to do

[jira] [Commented] (KAFKA-3543) Allow a variant of transform() which can emit multiple values

2016-04-11 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15236375#comment-15236375 ] Greg Fodor commented on KAFKA-3543: --- That makes sense, thanks! It may be useful to provide a helper

[jira] [Updated] (KAFKA-3543) Allow a variant of transform() which can emit multiple values

2016-04-11 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Fodor updated KAFKA-3543: -- Summary: Allow a variant of transform() which can emit multiple values (was: Allow a variant

[jira] [Created] (KAFKA-3543) Allow a variant of transform() which allows emitting multiple values

2016-04-11 Thread Greg Fodor (JIRA)
Greg Fodor created KAFKA-3543: - Summary: Allow a variant of transform() which allows emitting multiple values Key: KAFKA-3543 URL: https://issues.apache.org/jira/browse/KAFKA-3543 Project: Kafka

[jira] [Comment Edited] (KAFKA-3543) Allow a variant of transform() which can emit multiple values

2016-04-11 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15236121#comment-15236121 ] Greg Fodor edited comment on KAFKA-3543 at 4/11/16 10:40 PM: - Also note

[jira] [Created] (KAFKA-3544) Missing topics on startup

2016-04-11 Thread Greg Fodor (JIRA)
Greg Fodor created KAFKA-3544: - Summary: Missing topics on startup Key: KAFKA-3544 URL: https://issues.apache.org/jira/browse/KAFKA-3544 Project: Kafka Issue Type: Bug Components

[jira] [Created] (KAFKA-3545) Generalized Serdes for List/Map

2016-04-11 Thread Greg Fodor (JIRA)
Greg Fodor created KAFKA-3545: - Summary: Generalized Serdes for List/Map Key: KAFKA-3545 URL: https://issues.apache.org/jira/browse/KAFKA-3545 Project: Kafka Issue Type: Improvement

[jira] [Created] (KAFKA-3542) Add "repartition (+ join)" operations to streams

2016-04-11 Thread Greg Fodor (JIRA)
Greg Fodor created KAFKA-3542: - Summary: Add "repartition (+ join)" operations to streams Key: KAFKA-3542 URL: https://issues.apache.org/jira/browse/KAFKA-3542 Project: Kafka

[jira] [Commented] (KAFKA-3360) Add a protocol page/section to the official Kafka documentation

2016-03-19 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15198422#comment-15198422 ] Greg Fodor commented on KAFKA-3360: --- Apologies as I did not really know the best place to put

[jira] [Created] (KAFKA-3619) State lock file handle leaks

2016-04-25 Thread Greg Fodor (JIRA)
Greg Fodor created KAFKA-3619: - Summary: State lock file handle leaks Key: KAFKA-3619 URL: https://issues.apache.org/jira/browse/KAFKA-3619 Project: Kafka Issue Type: Bug Components

[jira] [Updated] (KAFKA-3619) State lock file handle leaks

2016-04-25 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Fodor updated KAFKA-3619: -- Description: The .lock files in the state store directories do not seem to be having their file

[jira] [Commented] (KAFKA-3745) Consider adding join key to ValueJoiner interface

2016-05-23 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15297477#comment-15297477 ] Greg Fodor commented on KAFKA-3745: --- Yep, I admit this is definitely not the most common case. But when

[jira] [Commented] (KAFKA-3745) Consider adding join key to ValueJoiner interface

2016-05-23 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15297250#comment-15297250 ] Greg Fodor commented on KAFKA-3745: --- sure. we are left joining a KTable against a KStream. The entry

[jira] [Created] (KAFKA-3745) Consider adding join key to ValueJoiner interface

2016-05-22 Thread Greg Fodor (JIRA)
Greg Fodor created KAFKA-3745: - Summary: Consider adding join key to ValueJoiner interface Key: KAFKA-3745 URL: https://issues.apache.org/jira/browse/KAFKA-3745 Project: Kafka Issue Type

[jira] [Commented] (KAFKA-3745) Consider adding join key to ValueJoiner interface

2016-05-23 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15297733#comment-15297733 ] Greg Fodor commented on KAFKA-3745: --- Yes, the join key needs to be added to the final joined record. I

[jira] [Commented] (KAFKA-3745) Consider adding join key to ValueJoiner interface

2016-05-24 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15297745#comment-15297745 ] Greg Fodor commented on KAFKA-3745: --- Actually this seems like a good enough solution, I forgot

[jira] [Commented] (KAFKA-3544) Missing topics on startup

2016-04-14 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15240723#comment-15240723 ] Greg Fodor commented on KAFKA-3544: --- Ah this makes sense. Fortunately thanks to the other responses

[jira] [Resolved] (KAFKA-3544) Missing topics on startup

2016-04-14 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Fodor resolved KAFKA-3544. --- Resolution: Not A Problem > Missing topics on star

[jira] [Commented] (KAFKA-3752) Provide a way for KStreams to recover from unclean shutdown

2016-08-10 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15416351#comment-15416351 ] Greg Fodor commented on KAFKA-3752: --- Oh apologies for mis-reading the ticket, but in our case it's

[jira] [Commented] (KAFKA-3752) Provide a way for KStreams to recover from unclean shutdown

2016-08-10 Thread Greg Fodor (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15415664#comment-15415664 ] Greg Fodor commented on KAFKA-3752: --- Hey [~guozhang], we're hitting this issue as well during

  1   2   >