Re: 0.8.2 How do I suppress INFO Closing socket connection to

2016-01-13 Thread Dillian Murphey
Oh, great, I get it. Thank you.

On Mon, Jan 4, 2016 at 6:26 PM, tao xiao <xiaotao...@gmail.com> wrote:

> You can bump the log level to warn for a particular class
>
> log4j.logger.kafka.network.Processor=WARN
>
> On Tue, 5 Jan 2016 at 08:33 Dillian Murphey <crackshotm...@gmail.com>
> wrote:
>
> > Constant spam of this INFO on my log.
> >
> > [2016-01-05 00:31:15,887] INFO Closing socket connection to /10.9.255.67
> .
> > (kafka.network.Processor)
> > [2016-01-05 00:31:15,917] INFO Closing socket connection to /10.9.255.67
> .
> > (kafka.network.Proces
> >
> > Anyone know what I need to modify (assuming log4j.properties) to disable
> > this but still be informed of issues?
> >
> > Thank you!
> >
>


Re: Frequent ZK session timeouts

2016-01-12 Thread Dillian Murphey
[2016-01-12 <http://airmail.calendar/2016-01-12%2012:00:00%20PST>
22:16:59,629 <http://airmail.calendar/2016-01-12%2022:16:59%20PST>] TRACE
[Controller 925537]: leader imbalance ratio for broker 925537 is 0.00
(kafka.controller.KafkaController)

[2016-01-12 <http://airmail.calendar/2016-01-12%2012:00:00%20PST>
22:21:07,167 <http://airmail.calendar/2016-01-12%2022:21:07%20PST>] INFO
[SessionExpirationListener on 925537], ZK expired; shut down all controller
components and try to re-elect
(kafka.controller.KafkaController$SessionExpirationListener)

[2016-01-12 <http://airmail.calendar/2016-01-12%2012:00:00%20PST>
22:21:07,167 <http://airmail.calendar/2016-01-12%2022:21:07%20PST>] INFO
[delete-topics-thread-925537], Shutting down
(kafka.controller.TopicDeletionManager$DeleteTopicsThread)

[2016-01-12 <http://airmail.calendar/2016-01-12%2012:00:00%20PST>
22:21:07,169 <http://airmail.calendar/2016-01-12%2022:21:07%20PST>] INFO
[delete-topics-thread-925537], Shutdown completed
(kafka.controller.TopicDeletionManager$DeleteTopicsThread)

[2016-01-12 <http://airmail.calendar/2016-01-12%2012:00:00%20PST>
22:21:07,169 <http://airmail.calendar/2016-01-12%2022:21:07%20PST>] INFO
[delete-topics-thread-925537], Stopped
 (kafka.controller.TopicDeletionManager$Del

 This occurs very frequently, even after clean slating kafka.  This is
something that never occurs in our production env. I've read here and there
that it could be a GC issue? Here is the tail end of recent GC log.


20534K(8354560K), 52.5293140 secs] [Times: user=209.09 sys=0.06, real=52.53
secs]

2016-01-11T23:16:05.149+: 784.219: [GC 784.219: [ParNew:
274263K->1685K(306688K), 54.8993730 secs] 793174K->520803K(8354560K),
54.8994450 secs] [Times: user=218.86 sys=0.03, real=54.90 secs]

2016-01-11T23:17:01.095+: 840.165: [GC 840.165: [ParNew:
274325K->1896K(306688K), 56.4208930 secs] 793443K->521139K(8354560K),
56.4209750 secs] [Times: user=224.88 sys=0.05, real=56.42 secs]

2016-01-11T23:17:59.024+: 898.093: [GC 898.093: [ParNew:
274536K->1705K(306688K), 58.1100630 secs] 793779K->521093K(8354560K),
58.1101400 secs] [Times: user=231.75 sys=0.05, real=58.12 secs]

2016-01-11T23:18:58.240+: 957.310: [GC 957.310: [ParNew:
274345K->1483K(306688K), 64.2820420 secs] 793733K->521047K(8354560K),
64.2821180 secs] [Times: user=241.93 sys=0.06, real=64.28 secs]

2016-01-11T23:20:03.571+: 1022.640: [GC 1022.640: [ParNew:
274123K->1379K(306688K), 61.5305280 secs] 793687K->521097K(8354560K),
61.5305990 secs] [Times: user=245.72 sys=0.01, real=61.53 secs]

2016-01-11T23:21:06.194+: 1085.263: [GC 1085.263: [ParNew:
274019K->1508K(306688K), 63.4433440 secs] 793737K->521372K(8354560K),
63.4434240 secs] [Times: user=253.33 sys=0.02, real=63.44 secs]

2016-01-11T23:22:10.413+: 1149.482: [GC 1149.483: [ParNew:
274148K->1313K(306688K), 65.6956010 secs] 794012K->521330K(8354560K),
65.6956660 secs] [Times: user=262.01 sys=0.05, real=65.69 secs]

Heap

 par new generation   total 306688K, used 132112K [0x0005f5a0,
0x00060a6c, 0x00060a6c)

  eden space 272640K,  47% used [0x0005f5a0, 0x0005fd9bbba0,
0x00060644)

  from space 34048K,   3% used [0x00060644, 0x0006065884a8,
0x00060858)

  to   space 34048K,   0% used [0x00060858, 0x00060858,
0x00060a6c)

 concurrent mark-sweep generation total 8047872K, used 520016K
[0x00060a6c, 0x0007f5a0, 0x0007f5a0)

 concurrent-mark-sweep perm gen total 38760K, used 25768K
[0x0007f5a0, 0x0007f7fda000, 0x0008)



On Tue, Jan 12, 2016 at 6:34 PM, Mayuresh Gharat <gharatmayures...@gmail.com
> wrote:

> Can you paste the logs?
>
> Thanks,
>
> Mayuresh
>
> On Tue, Jan 12, 2016 at 4:58 PM, Dillian Murphey <crackshotm...@gmail.com>
> wrote:
>
> > Possibly running more stable with 1.7 JVM.
> >
> > Can someone explain the Zookeeper session?  SHould it never expire,
> unless
> > the broker becomes unresponsive?  I set a massive timeout value in the
> > broker config far beyond the amount of time I see the zk expiration. Is
> > this entirely on the kafka side, or could zookeeper be doing something?
> > From my zk logs I didn't see anything unusual, just exceptions as a
> result
> > of the zk session expiring (my guess).
> >
> > tnx
> >
> > On Tue, Jan 12, 2016 at 3:05 PM, Dillian Murphey <
> crackshotm...@gmail.com>
> > wrote:
> >
> > > Our 2 node kafka cluster has become unhealthy.  We're running zookeeper
> > as
> > > a 3 node system, which very light load.
> > >
> > > What seems to be happening is in the controller log we get a ZK session
> > > expire message, and in the process of re-assigning the lead

Frequent ZK session timeouts

2016-01-12 Thread Dillian Murphey
Our 2 node kafka cluster has become unhealthy.  We're running zookeeper as
a 3 node system, which very light load.

What seems to be happening is in the controller log we get a ZK session
expire message, and in the process of re-assigning the leader for the
partitions (if I'm understanding this right, please correct me), the broker
goes offline and it interrupts our applications that are publishing
messages.

We don't see this in production, and kafka has been stable for months,
since september.

I've searched a lot and found some similiar complaints but no real
solutions.

I'm running 0.8.2 and JVM 1.6.X on ubuntu.

Thanks for any ideas at all.


0.8.2 How do I suppress INFO Closing socket connection to

2016-01-04 Thread Dillian Murphey
Constant spam of this INFO on my log.

[2016-01-05 00:31:15,887] INFO Closing socket connection to /10.9.255.67.
(kafka.network.Processor)
[2016-01-05 00:31:15,917] INFO Closing socket connection to /10.9.255.67.
(kafka.network.Proces

Anyone know what I need to modify (assuming log4j.properties) to disable
this but still be informed of issues?

Thank you!


Find current kafka memory usage

2016-01-04 Thread Dillian Murphey
I was running out of heap space for my kafka broker. Is there a way I can
check how much memory kafka is using so I can alert myself if it is
reaching the max heap size?  Default is 1GB.

Thanks


Re: Increasing replication factor reliable?

2015-11-26 Thread Dillian Murphey
Alright, thank you all. Appreciate it.

Cheers

On Wed, Nov 25, 2015 at 10:50 PM, Gaurav Agarwal <gaurav130...@gmail.com>
wrote:

> So u have two nodes running where you want to increase the replication
> factor 2 because of fault tolerance. That won't be a problem
> On Nov 25, 2015 6:26 AM, "Dillian Murphey" <crackshotm...@gmail.com>
> wrote:
>
> > Is it safe to run this on an active production topic?  A topic was
> created
> > without a replication factor of 2 and I want to increase it from 1 to 2
> to
> > have fault tolerance.
> >
> >
> >
> http://kafka.apache.org/documentation.html#basic_ops_increase_replication_factor
> >
>


Re: Is re-partition hitless process?

2015-11-24 Thread Dillian Murphey
Not adding.  Taking some of the partitions from one kafka server and
spreading them to another.

On Mon, Nov 23, 2015 at 5:40 PM, Gwen Shapira <g...@confluent.io> wrote:

> By re-partition you mean adding partitions to an existing topics?
>
> There are two things to note in that case:
> 1. It is "hitless" because all it does is create new partitions where
> future records can go, it does not actually move data around.
> 2. You could be "hit" if your consumer code assumes that all data for a
> specific key will be in the same partition. Because all future keys may get
> assigned to different partitions than what they had before adding
> partitions.
>
> Hope this helps.
>
> Gwen
>
> On Mon, Nov 23, 2015 at 9:38 AM, Dillian Murphey <crackshotm...@gmail.com>
> wrote:
>
> > Can I do this on a production system and not have downtime?  I'm using
> > kafkamanager to make this easier, but it's just running the re-partition
> > task.
> >
>


Re: Is re-partition hitless process?

2015-11-24 Thread Dillian Murphey
I see, thanks.  Yes, re-assignment. Got my terminology off.

On Tue, Nov 24, 2015 at 4:45 PM, Gwen Shapira <g...@confluent.io> wrote:

> ah, re-assignment!
>
> If your re-assignment involves moving leaders to other servers, there will
> be a tiny downtime when the new leader election happens.
> Otherwise the main risk is copying the partition over to the new node - it
> can use lots of network and IO. On a busy system we recommend scripting
> this to do very few (maybe one) partition at a time.
>
> On Tue, Nov 24, 2015 at 4:42 PM, Dillian Murphey <crackshotm...@gmail.com>
> wrote:
>
> > Not adding.  Taking some of the partitions from one kafka server and
> > spreading them to another.
> >
> > On Mon, Nov 23, 2015 at 5:40 PM, Gwen Shapira <g...@confluent.io> wrote:
> >
> > > By re-partition you mean adding partitions to an existing topics?
> > >
> > > There are two things to note in that case:
> > > 1. It is "hitless" because all it does is create new partitions where
> > > future records can go, it does not actually move data around.
> > > 2. You could be "hit" if your consumer code assumes that all data for a
> > > specific key will be in the same partition. Because all future keys may
> > get
> > > assigned to different partitions than what they had before adding
> > > partitions.
> > >
> > > Hope this helps.
> > >
> > > Gwen
> > >
> > > On Mon, Nov 23, 2015 at 9:38 AM, Dillian Murphey <
> > crackshotm...@gmail.com>
> > > wrote:
> > >
> > > > Can I do this on a production system and not have downtime?  I'm
> using
> > > > kafkamanager to make this easier, but it's just running the
> > re-partition
> > > > task.
> > > >
> > >
> >
>


Increasing replication factor reliable?

2015-11-24 Thread Dillian Murphey
Is it safe to run this on an active production topic?  A topic was created
without a replication factor of 2 and I want to increase it from 1 to 2 to
have fault tolerance.

http://kafka.apache.org/documentation.html#basic_ops_increase_replication_factor


Is re-partition hitless process?

2015-11-23 Thread Dillian Murphey
Can I do this on a production system and not have downtime?  I'm using
kafkamanager to make this easier, but it's just running the re-partition
task.


Question on re-partition

2015-11-18 Thread Dillian Murphey
I want change the partition assignment to spread the partitions across two
machines, since machine #1 is getting full on disk space.

I have kafka manager to make this easy. Is there any downtime to
re-assigning partitions? I assume kafka builds up the new partitions and
then does a hit-less switch over.

Thanks for any info.


Occasional Time Outs - GC problem or...?

2015-10-27 Thread Dillian Murphey
Just poking around our amazon kafka instance. Occasionally one of our
applications gets a kafka time-out.  We want to guess it is a momentary
network issue with aws, however, I noticed the GC log is updated very
frequently.  Could the GC be causing a problem? How do I determine if it is
causing a problem which could lead to a temporary socket failure to kafka?

Thanks for any ideas.


Re: Keeping Zookeeper and Kafka Server Up

2015-06-17 Thread Dillian Murphey
supervisord is pretty easy to use.  Netflix Exhibitor will manage this all
for zookeeper, if you want to try that tool.

On Wed, Jun 17, 2015 at 7:03 AM, Kashyap Mhaisekar kashya...@gmail.com
wrote:

 We use supervisord for this. It ensures that the processes are always up
 and running.

 Thanks
 Kashyap

 On Wednesday, June 17, 2015, Shayne S shaynest...@gmail.com wrote:

  kafka-server-start.sh has a -daemon option, but I don't think Zookeeper
 has
  it.
 
  On Tue, Jun 16, 2015 at 11:32 PM, Su She suhsheka...@gmail.com
  javascript:; wrote:
 
   It seems like nohup has solved this issue, even when the putty window
   becomes inactive the processes are still running (I din't need to
   interact with them). I might look into using screen or tmux as a long
   term solution.
  
   Thanks Terry and Mike!
  
   Best,
  
   Su
  
  
   On Tue, Jun 16, 2015 at 3:42 PM, Terry Bates terryjba...@gmail.com
  javascript:;
   wrote:
Greetings,
   
nohup does the trick, as Mr. Bridge has shared. If you seem to want
 to
   run
these and still have some interactivity with
the services, consider using screen or tmux as these will enable
  you
   to
run these programs in foreground, have added
windows you can use to access shell, tail logs, and so on, and enable
  you
to disconnect from the session, but still have
these sessions available for re-attachment.
   
In addition, I using runit for service supervision may enable you
 to
   keep
daemons running, but if your services are dying
you may need to introspect more deeply on the root cause versus
 working
around it by restarting them.
   
   
*Terry Bates*
   
*Email: *terryjba...@gmail.com javascript:;
*Phone: (*412) 215-0881
*Skype*: terryjbates
*GitHub*: https://github.com/terryjbates
*Linkedin*: http://www.linkedin.com/in/terryjbates/
   
   
On Tue, Jun 16, 2015 at 3:30 PM, Mike Bridge m...@bridgecanada.com
  javascript:;
   wrote:
   
Have you tried using nohup
   
nohup bin/zookeeper-server-start.sh config/zookeeper.properties
 
nohup bin/kafka-server-start.sh config/server.properties 
   
   
On Tue, Jun 16, 2015 at 3:21 PM, Su She suhsheka...@gmail.com
  javascript:; wrote:
   
 Hello Everyone,

 I'm wondering how to keep Zookeeper and Kafka Server up even when
 my
 SSH (using putty) becomes inactive. I've tried running it in the
 background (using ), but it seems like it stops sometimes after a
 couple hours or so and I'll have to restart zookeeper and/or the
  kafka
 server.

 The only remediation i've found is to export TMOUT=[big number],
 but
 there must be another solution.

 Thank you!

 Best,

 Su

   
  
 



Re: Increased replication factor. Replication didn't happen!

2015-06-12 Thread Dillian Murphey
Thank you Adam, that helps me know what to look for.

On Thu, Jun 11, 2015 at 12:49 PM, Adam Dubiel dubiel.a...@gmail.com wrote:

 I just tried it out on my 0.8.2 cluster and it worked just fine - the ISR
 grew, replica factor changed and data was physically moved to new brokers.
 Was there not output/no logs? I see things like

 INFO Created log for partition [topicName,7] in /opt/kafka/ with
 properties {.. some json}

 in server.log on new brokers.

 2015-06-11 21:02 GMT+02:00 Dillian Murphey crackshotm...@gmail.com:

   Oh, hmm.  There is even documentation on it:
 
 
 
 http://kafka.apache.org/documentation.html#basic_ops_increase_replication_factor
 
  Having a difficult time sifting through the logs.  Is this not a common
  operation?  Do users normally just delete the topic and create a new
 one??
 
 
 
  On Wed, Jun 10, 2015 at 8:20 PM, Shady Xu shad...@gmail.com wrote:
 
   Right now, Kafka topics do not support changing replication factor or
   partition number after creation. The  kafka-reassign-partitions.sh tool
  can
   only reassign existent partitions.
  
   2015-06-11 9:31 GMT+08:00 Gwen Shapira gshap...@cloudera.com:
  
What do the logs show?
   
On Wed, Jun 10, 2015 at 5:07 PM, Dillian Murphey
crackshotm...@gmail.com wrote:
 Ran this:

 $KAFKA_HOME/bin/kafka-reassign-partitions.sh

 But Kafka did not actually do the replication. Topic description
  shows
the
 right numbers, but it just didn't replicate.

 What's wrong, and how do I trigger the replication to occur??

 I'm running 0.8.2.0

 thanks
   
  
 



Noobie question about consumers and ordering of messages

2015-06-12 Thread Dillian Murphey
So I just realized that when creating multiple partitions for a topic, it
is the responsibility of the producer and the consumer to  decide where to
put data.

So if I am a consumer, how do I know what partition to read data from?
What if the producer only put data in 1 partition out of 50.

And so there isn't any order, here, right? Only order per partition, but
there is no concept of consuming messages in the order they were produced,
correct?

Thanks. Just struggling on the practical aspects now that I've moved past
some of the administration tasks.


Re: Increased replication factor. Replication didn't happen!

2015-06-11 Thread Dillian Murphey
 Oh, hmm.  There is even documentation on it:

http://kafka.apache.org/documentation.html#basic_ops_increase_replication_factor

Having a difficult time sifting through the logs.  Is this not a common
operation?  Do users normally just delete the topic and create a new one??



On Wed, Jun 10, 2015 at 8:20 PM, Shady Xu shad...@gmail.com wrote:

 Right now, Kafka topics do not support changing replication factor or
 partition number after creation. The  kafka-reassign-partitions.sh tool can
 only reassign existent partitions.

 2015-06-11 9:31 GMT+08:00 Gwen Shapira gshap...@cloudera.com:

  What do the logs show?
 
  On Wed, Jun 10, 2015 at 5:07 PM, Dillian Murphey
  crackshotm...@gmail.com wrote:
   Ran this:
  
   $KAFKA_HOME/bin/kafka-reassign-partitions.sh
  
   But Kafka did not actually do the replication. Topic description shows
  the
   right numbers, but it just didn't replicate.
  
   What's wrong, and how do I trigger the replication to occur??
  
   I'm running 0.8.2.0
  
   thanks
 



Increased replication factor. Replication didn't happen!

2015-06-10 Thread Dillian Murphey
Ran this:

$KAFKA_HOME/bin/kafka-reassign-partitions.sh

But Kafka did not actually do the replication. Topic description shows the
right numbers, but it just didn't replicate.

What's wrong, and how do I trigger the replication to occur??

I'm running 0.8.2.0

thanks


Re: Waiting for replication factor to take effect...?

2015-05-28 Thread Dillian Murphey
Nothing on this?  Someone must surely have a need to add brokers and
increase the replication factor.  I waited a day and I still do not see
in-sync replicas.

:(



On Wed, May 27, 2015 at 5:51 PM, Dillian Murphey crackshotm...@gmail.com
wrote:

 Hi Ho,

 I'm trying to increase my replication factor from 1 to 2.

 I used the tool

 kafka-reassign-partitions.sh

 I see the replica factor change, but my replicas are not syncing up. There
 is pretty much no data in this topic. Will this replication get triggered
 at some point??

 root@b3b4b5d71b48:/opt/kafka_2.10-0.8.2.1#
 $KAFKA_HOME/bin/kafka-topics.sh --describe --zookeeper $ZK --topic mytopic
 Topic:mytopic PartitionCount:2 ReplicationFactor:2 Configs:
 Topic: mytopic Partition: 0 Leader: 925537 Replicas: 9255216,925537 Isr:
 925537
 Topic: mytopic Partition: 1 Leader: 9255216 Replicas: 925537,9255216 Isr:
 9255216


 Thanks for any comments.



Waiting for replication factor to take effect...?

2015-05-27 Thread Dillian Murphey
Hi Ho,

I'm trying to increase my replication factor from 1 to 2.

I used the tool

kafka-reassign-partitions.sh

I see the replica factor change, but my replicas are not syncing up. There
is pretty much no data in this topic. Will this replication get triggered
at some point??

root@b3b4b5d71b48:/opt/kafka_2.10-0.8.2.1# $KAFKA_HOME/bin/kafka-topics.sh
--describe --zookeeper $ZK --topic mytopic
Topic:mytopic PartitionCount:2 ReplicationFactor:2 Configs:
Topic: mytopic Partition: 0 Leader: 925537 Replicas: 9255216,925537 Isr:
925537
Topic: mytopic Partition: 1 Leader: 9255216 Replicas: 925537,9255216 Isr:
9255216


Thanks for any comments.


Delete topic pending

2015-05-18 Thread Dillian Murphey
If a broker doesn't have the topic, and I run delete topic, that topic will
be in a pending delete state forever.

What am I doing wrong here?

Also, what if I have data loss and I just want to delete the dang topic
form zookeeper directly with non of this pending stuff.

Thanks


Re: Kafka behind AWS ELB

2015-05-04 Thread Dillian Murphey
I'm interested in this topic as well.  If you put kafka brokers inside an
autoscaling group, then AWS will automatically add brokers if demand
increases, and the ELB will automatically round-robin across all of your
kafka instances.  So in your config files and code, you only need to
provide a single DNS name (the load balancer). You don't need to specify
all your kafka brokers inside your config file.  If a broker dies, the ELB
will only route to healthy nodes.

So you get a lot of robustness, scalability, and fault-tolerance by using
the AWS services. Kafka Brokers will automatically load balance, but the
question is whether it is ok to put all your brokers behind an ELB and
expect the system to work properly.

What alternatives are there to dynamic/scalable broker clusters?  I don't
want to have to modify my config files or code if I add more brokers, and I
want to be able to handle a broker going down. So these are the reasons AWS
questions like this come up.

Thanks for any comments too. :)




On Mon, May 4, 2015 at 9:03 AM, Mayuresh Gharat gharatmayures...@gmail.com
wrote:

 Ok. You can deploy kafka in AWS. You can have brokers on AWS servers.
 Kafka is not a push system. So you will need someone writing to kafka and
 consuming from kafka. It will work. My suggestion will be to try it out on
 a smaller instance in AWS and see the effects.

 As I do not know the actual use case about why you want to use kafka for, I
 cannot comment on whether it will work for you personalized use case.

 Thanks,

 Mayuresh

 On Mon, May 4, 2015 at 8:55 AM, Chandrashekhar Kotekar 
 shekhar.kote...@gmail.com wrote:

  I am sorry but I cannot reveal those details due to confidentiality
 issues.
  I hope you understand.
 
 
  Regards,
  Chandrash3khar Kotekar
  Mobile - +91 8600011455
 
  On Mon, May 4, 2015 at 9:18 PM, Mayuresh Gharat 
  gharatmayures...@gmail.com
  wrote:
 
   Hi Chandrashekar,
  
   Can you please elaborate the use case for Kafka here, like how you are
   planning to use it.
  
  
   Thanks,
  
   Mayuresh
  
   On Sat, May 2, 2015 at 9:08 PM, Chandrashekhar Kotekar 
   shekhar.kote...@gmail.com wrote:
  
Hi,
   
I am new to Apache Kafka. I have played with it on my laptop.
   
I want to use Kafka in AWS. Currently we have tomcat web servers
 based
   REST
API. We want to replace REST API with Apache Kafka, web servers are
   behind
ELB.
   
I would like to know if we can keep Kafka brokers behind ELB? Will it
   work?
   
Regards,
Chandrash3khar Kotekar
Mobile - +91 8600011455
   
  
  
  
   --
   -Regards,
   Mayuresh R. Gharat
   (862) 250-7125
  
 



 --
 -Regards,
 Mayuresh R. Gharat
 (862) 250-7125



Re: Kafka brokers behind AWS ELB

2015-05-04 Thread Dillian Murphey
Can you clarify what is meant by using the ELB as a metadata broker?

Let's say I have a python script that sends messages into kafka. Do I have
to provide a list of brokers or can I use the ELB DNS alias that will
round-robin to all my healthy brokers?  Much easier to use the ELB instead
of hard-coding server names or IPs. And when the broker list expands or
shrinks, I have to them edit my python code?

Thanks for any help.

On Sun, May 3, 2015 at 11:05 AM, Jiangjie Qin j...@linkedin.com.invalid
wrote:

 Yes.

 On 5/3/15, 10:57 AM, Chandrashekhar Kotekar shekhar.kote...@gmail.com
 wrote:

 Thanks a lot for the information. So basically each broker needs to have
 his own public IP address?
 
 
 Regards,
 Chandrash3khar Kotekar
 Mobile - +91 8600011455
 
 On Sun, May 3, 2015 at 11:09 PM, Jiangjie Qin j...@linkedin.com.invalid
 wrote:
 
  You can use ELB address as metadata broker list. But you still need to
  allow direct traffic from clients to a brokers because clients need to
  talk to a specific broker who has the interested partition.
 
  Jiangjie (Becket) Qin
 
  On 5/2/15, 11:10 AM, Chandrashekhar Kotekar
 shekhar.kote...@gmail.com
  wrote:
 
  Hi,
  
  I am new to Apache Kafka. I have played with it on my laptop.
  
  I want to use Kafka in AWS. Currently we have tomcat web servers based
  REST
  API. We want to replace REST API with Apache Kafka, web servers are
 behind
  ELB.
  
  I would like to know if we can keep Kafka brokers behind ELB? Will it
  work?
  
  Regards,
  Chandrash3khar Kotekar
  Mobile - +91 8600011455
 
 




Re: Topic missing Leader and Isr

2015-05-04 Thread Dillian Murphey
Thank you, Buntu.

On Sat, May 2, 2015 at 1:16 PM, Buntu Dev buntu...@gmail.com wrote:

 This seems to be an issue thats fixed in 0.8.2 and after upgrading the
 kafka cluster to 0.8.2 this issue was resolved.
  https://issues.apache.org/jira/browse/KAFKA-1738

 https://issues.apache.org/jira/browse/KAFKA-1738

 On Fri, May 1, 2015 at 2:58 PM, Dillian Murphey crackshotm...@gmail.com
 wrote:

  Hey, not a help, but I just posted the same question.  I'm noticing the
  same thing.  Where is your hardware? Just curious. I'm running my
  kafka/zookeeper on AWS.
 
  On Tue, Apr 28, 2015 at 10:23 AM, Buntu Dev buntu...@gmail.com wrote:
 
   Also note that the metadata for the topic is missing. I tried creating
  few
   more topics and all have the same issue.
  
   Using the Kafka console producer on the topic, I see these error
 messages
   indicating the missing metadata:
  
   WARN Error while fetching metadata [{TopicMetadata for topic my-topic
 -
   No partition metadata for topic my-topic due to
   kafka.common.UnknownException}] for topic [my-topic]: class
   kafka.common.UnknownException  (kafka.producer.BrokerPartitionInfo)
  
 



Re: Topic missing Leader and Isr

2015-05-02 Thread Dillian Murphey
Oh let me further add.  I believe there is no leader/isr when I've
re-created a deleted topic.  If I pick a topic name never seen before I get
a leader/isr.  Were you deleting topics?  Deleting topics is also a good
test of your system.  FYI:

$KAFKA_HOME/bin/kafka-run-class.sh kafka.admin.DeleteTopicCommand
--zookeeper $ZK --topic test



On Fri, May 1, 2015 at 2:58 PM, Dillian Murphey crackshotm...@gmail.com
wrote:

 Hey, not a help, but I just posted the same question.  I'm noticing the
 same thing.  Where is your hardware? Just curious. I'm running my
 kafka/zookeeper on AWS.

 On Tue, Apr 28, 2015 at 10:23 AM, Buntu Dev buntu...@gmail.com wrote:

 Also note that the metadata for the topic is missing. I tried creating few
 more topics and all have the same issue.

 Using the Kafka console producer on the topic, I see these error messages
 indicating the missing metadata:

 WARN Error while fetching metadata [{TopicMetadata for topic my-topic -
 No partition metadata for topic my-topic due to
 kafka.common.UnknownException}] for topic [my-topic]: class
 kafka.common.UnknownException  (kafka.producer.BrokerPartitionInfo)





Re: Topic missing Leader and Isr

2015-05-02 Thread Dillian Murphey
Hey, not a help, but I just posted the same question.  I'm noticing the
same thing.  Where is your hardware? Just curious. I'm running my
kafka/zookeeper on AWS.

On Tue, Apr 28, 2015 at 10:23 AM, Buntu Dev buntu...@gmail.com wrote:

 Also note that the metadata for the topic is missing. I tried creating few
 more topics and all have the same issue.

 Using the Kafka console producer on the topic, I see these error messages
 indicating the missing metadata:

 WARN Error while fetching metadata [{TopicMetadata for topic my-topic -
 No partition metadata for topic my-topic due to
 kafka.common.UnknownException}] for topic [my-topic]: class
 kafka.common.UnknownException  (kafka.producer.BrokerPartitionInfo)



Re: Leaderless topics

2015-05-01 Thread Dillian Murphey
kafka_2.9.2-0.8.1.1

Thanks, Gwen.

On Thu, Apr 30, 2015 at 5:34 PM, Gwen Shapira gshap...@cloudera.com wrote:

 Which Kafka version are you using?

 On Thu, Apr 30, 2015 at 4:11 PM, Dillian Murphey crackshotm...@gmail.com
 wrote:

  Scenerio with 1 node broker, and 3 node zookeeper ensemble.
 
  1) Create topic
  2) Delete topic
  3) Re-create with same name
 
  I'm noticing this recreation gives me Leader: non, and Isr: as empty.
 
  Any ideas what the deal is here?
 
  I googled around and not being an experienced kafka admin, someone said
 to
  delete the /controller entry in zk. This appears to fix the problem on
  existing topics that show no leader.
 
  Is it ok to do this? What am I doing by deleting /controller? Is there a
  better way?
 
  Thanks for any advice, and your time of course.
 



Sometimes I don't get a leader with 1 broker

2015-04-30 Thread Dillian Murphey
Running a 1 broker system.  I had some issues with the system but got it
working. I've deleted the topic I had trouble with and re-created it.

But describing shows no leader, not producer/consumption works on it.

I create a brand new topic with a name I never used before and I get a
leader.  I think I sometimes get a leader and sometimes don't. Not sure.

controller log is fine.

Is this normal?


Kafka still aware of old zookeeper nodes

2015-04-30 Thread Dillian Murphey
I had 3 zookeeper nodes. I added 3 new ones and shut down the old 3.

The server.log shows Closing socket connection error to the old IPs. I
rebooted the kafka server entirely but it still somehow seems aware of
these servers.

Any ideas what's up?


Delete topic / Recreate = No leader

2015-04-30 Thread Dillian Murphey
I am trying to reproduce this. But if I create a topic, then delete it,
then re-create it, no leader is getting assigned.

I can still produce/consume messages (via command line, basic testing).

Is there some additional cleanup I need to do?

Thanks for your time!


Re: Kafka still aware of old zookeeper nodes

2015-04-30 Thread Dillian Murphey
Not sure if this is the best way to do this, but my zookeeper.connect is
set to a DNS alias which points to a load balancer for 3 zookeeper nodes.

I was trying this to see if I could have the kafka config dynamic and allow
me to change/scale whatever I wanted with zookeeper and not have to ever
mess with the config for kafka.

Thanks for your comments.

On Thu, Apr 30, 2015 at 11:35 AM, svante karlsson s...@csi.se wrote:

 Have you changed

 zookeeper.connect=

 in server.properties.

 A better procedure for replacing zookeeper nodes would be to shutdown one
 and install the new one with the same ip. This can easily be done to a
 running cluster.

 /svante

 2015-04-30 20:08 GMT+02:00 Dillian Murphey crackshotm...@gmail.com:

  I had 3 zookeeper nodes. I added 3 new ones and shut down the old 3.
 
  The server.log shows Closing socket connection error to the old IPs. I
  rebooted the kafka server entirely but it still somehow seems aware of
  these servers.
 
  Any ideas what's up?
 



Leaderless topics

2015-04-30 Thread Dillian Murphey
Scenerio with 1 node broker, and 3 node zookeeper ensemble.

1) Create topic
2) Delete topic
3) Re-create with same name

I'm noticing this recreation gives me Leader: non, and Isr: as empty.

Any ideas what the deal is here?

I googled around and not being an experienced kafka admin, someone said to
delete the /controller entry in zk. This appears to fix the problem on
existing topics that show no leader.

Is it ok to do this? What am I doing by deleting /controller? Is there a
better way?

Thanks for any advice, and your time of course.


Re: Poor performance running performance test

2015-01-28 Thread Dillian Murphey
You could be right Ewen. I was starting to wonder about the load balancer
too. Is using a load balancer a bad idea? How else do users know which
kafka broker to connect to?

I'm using one of the IPs directly and I don't see that error. I am seeing
an occasional connection refused. What the heck. Maybe this is another aws
specific thing.

OR, I am running kafka brokers in a docker container. I think I will remove
the docker component and see if that makes a difference.

Thanks for the reply. bow


question on the mailing list

2015-01-28 Thread Dillian Murphey
Hi all,

Sorry for asking, but is there some easier way to use the mailing list?
Maybe a tool which makes reading and replying to messages more like google
groups?  I like the hadoop searcher, but the UI on that is really bad.

tnx


Re: Consuming Kafka Messages Inside of EC2 Instances

2015-01-28 Thread Dillian Murphey
Am I understanding your question correctly... You're asking how do you
establish connectivity to an instance in a private subnet from the outside
world?  Are you thinking in terms of zookeeper or just general aws network
connectivity?

On Wed, Jan 28, 2015 at 11:03 AM, Su She suhsheka...@gmail.com wrote:

 Hello All,

 I have set up a cluster of EC2 instances using this method:


 http://blogs.aws.amazon.com/bigdata/post/Tx2D0J7QOVRJBRX/Deploying-Cloudera-s-Enterprise-Data-Hub-on-AWS

 As you can see the instances are w/in a private subnet. I was wondering if
 anyone has any advice on how I can set up a Kafka zookeeper/server on an
 instance that receives messages from a Kafka Producer outside of the
 private subnet. I have tried using the cluster launcher, but I feel like it
 is not a best practice and only a temporary situation.

 Thank you for the help!

 Best,

 Su



WARN Error in I/O with NetworkReceive.readFrom(NetworkReceive.java

2015-01-28 Thread Dillian Murphey
Running the performance test. What is the nature of this error??  I'm
running a very high end cluster on aws. Tried this even within the same
subnet on aws.

bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance
topic9 5000 100 -1 acks=1 bootstrap.servers=$IP:9092
buffer.memory=67108864 batch.size=8196



2015-01-28 16:32:22,178] WARN Error in I/O with /IP ADDR
(org.apache.kafka.common.network.Selector)
java.io.EOFException
at
org.apache.kafka.common.network.NetworkReceive.readFrom(NetworkReceive.java:62)
at org.apache.kafka.common.network.Selector.poll(Selector.java:248)
at org.apache.kafka.clients.NetworkClient.poll(NetworkClient.java:192)
at org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:191)
at org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:122)
at java.lang.Thread.run(Thread.java:745)
65910 records sent, 13158.3 records/sec (1.25 MB/sec), 38525.9 ms avg
latency, 39478.0 max latency.

Thanks for any ideas


Poor performance running performance test

2015-01-27 Thread Dillian Murphey
I'm a new user/admin to kafka. I'm running a 3 node ZK and a 6 brokers on
aws.

The performance I'm seeing is shockingly bad. I need some advice!

bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance
test2 5000 100 -1 acks=1 bootstrap.servers=5myloadbalancer:9092
buffer.memory=67108864 batch.size=8196




6097 records sent, 13198.3 records/sec (1.26 MB/sec), 2098.0 ms avg
latency, 4306.0 max latency.
71695 records sent, 14339.0 records/sec (1.37 MB/sec), 6658.1 ms avg
latency, 9053.0 max latency.
65195 records sent, 13028.6 records/sec (1.24 MB/sec), 11504.0 ms avg
latency, 13809.0 max latency.
71955 records sent, 14391.0 records/sec (1.37 MB/sec), 16137.4 ms avg
latency, 18541.0 max latency.

Thanks for any help!


Re: Poor performance running performance test

2015-01-27 Thread Dillian Murphey
I was running the performance command from a virtual box server, so that
seems like it was part of the problem.  I'm getting better results running
this on a server on aws, but that's kind of expected.  Can you look at
these results, and comment on the occasional warning I see?  I appreciate
it!

1220375 records sent, 243928.6 records/sec (23.26 MB/sec), 2111.5 ms avg
latency, 4435.0 max latency.
1195090 records sent, 239018.0 records/sec (22.79 MB/sec), 2203.1 ms avg
latency, 4595.0 max latency.
1257165 records sent, 251433.0 records/sec (23.98 MB/sec), 2172.6 ms avg
latency, 4525.0 max latency.
1230981 records sent, 246196.2 records/sec (23.48 MB/sec), 2173.5 ms avg
latency, 4465.0 max latency.
[2015-01-28 07:19:07,274] WARN Error in I/O with
myawsloadbalancer(org.apache.kafka.common.network.Selector)
java.io.EOFException
at
org.apache.kafka.common.network.NetworkReceive.readFrom(NetworkReceive.java:62)
at org.apache.kafka.common.network.Selector.poll(Selector.java:248)
at org.apache.kafka.clients.NetworkClient.poll(NetworkClient.java:192)
at org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:191)
at org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:122)
at java.lang.Thread.run(Thread.java:745)
1090689 records sent, 218137.8 records/sec (20.80 MB/sec), 2413.6 ms avg
latency, 4829.0 max latency.

On Tue, Jan 27, 2015 at 7:37 PM, Ewen Cheslack-Postava e...@confluent.io
wrote:

 Where are you running ProducerPerformance in relation to ZK and the Kafka
 brokers? You should definitely see much higher performance than this.

 A couple of other things I can think of that might be going wrong: Are all
 your VMs in the same AZ? Are you storing Kafka data in EBS or local
 ephemeral storage? If EBS, have you provisioned enough IOPS.


 On Tue, Jan 27, 2015 at 4:29 PM, Dillian Murphey crackshotm...@gmail.com
 wrote:

  I'm a new user/admin to kafka. I'm running a 3 node ZK and a 6 brokers on
  aws.
 
  The performance I'm seeing is shockingly bad. I need some advice!
 
  bin/kafka-run-class.sh org.apache.kafka.clients.tools.ProducerPerformance
  test2 5000 100 -1 acks=1 bootstrap.servers=5myloadbalancer:9092
  buffer.memory=67108864 batch.size=8196
 
 
 
 
  6097 records sent, 13198.3 records/sec (1.26 MB/sec), 2098.0 ms avg
  latency, 4306.0 max latency.
  71695 records sent, 14339.0 records/sec (1.37 MB/sec), 6658.1 ms avg
  latency, 9053.0 max latency.
  65195 records sent, 13028.6 records/sec (1.24 MB/sec), 11504.0 ms avg
  latency, 13809.0 max latency.
  71955 records sent, 14391.0 records/sec (1.37 MB/sec), 16137.4 ms avg
  latency, 18541.0 max latency.
 
  Thanks for any help!
 



 --
 Thanks,
 Ewen



Command to list my brokers

2015-01-17 Thread Dillian Murphey
Hi all,

I just want a way to query all of my brokers to see if they're all
connected and online, without creating a topic.  Or is creating a topic the
best way to verify all my brokers are up and running???

Thanks


kafka cluster on aws

2015-01-14 Thread Dillian Murphey
I can't seem to find much information to help me (being green to kafka) on
setting up a cluster on aws. Does anyone have any sources?

The question I have off the bat is, what methods have already been explored
to generate a unique broker id? If I spin up a new server, do I just need
to maintain my own broker-id list somewhere so I don't re-use an already
allocated broker id?

Also, I read an article about a broker going down and requiring a new
broker be spun up with the same id. Is this also something I need to
implement?

I want to setup a kafka auto-scaling group on AWS, so I can add brokers at
well or based on load. It doesn't seem too complicated, or maybe I'm too
green to see it, but I don't want to re-invent everything myself.

I know Loggly uses AWS/Kafka, so I'm hunting for more details on that too.

Thanks for any help


Re: kafka cluster on aws

2015-01-14 Thread Dillian Murphey
Thanks for the comments.  Hey Joe, I'm looking at your project now. I'm
going to give it a try.


Re: kafka cluster on aws

2015-01-14 Thread Dillian Murphey
Trying to understand the docs.  Can I just use the docker image and run the
minotaur command from there?  I don't understand the Basion SSH stuff. Do I
need that? I just want a quick start for right now. Also, not sure where I
get the ENVIRONMENT.key.

Any extra help is greatly appreciated. You can email directly. Thanks!

On Wed, Jan 14, 2015 at 12:09 PM, Joe Stein joe.st...@stealth.ly wrote:

 We have an open source framework you can use to spin up Kafka (any version
 or even any build you want) clusters (and Zookeeper) with CloudFormation on
 AWS https://github.com/stealthly/minotaur

 It is very nice/handy you basically specify your instance types, counts,
 versions of code, etc and hit a enter
 https://github.com/stealthly/minotaur/tree/master/labs/kafka e.g.

 ./minotaur.py lab deploy kafka -e bdoss-dev -d testing -r us-east-1 -z
 us-east-1a -k http://example.com/kafka.tar.gz -n 3 -i m1.small

 There is some setup for the bastion host (

 https://github.com/stealthly/minotaur/tree/master/infrastructure/aws/bastion
 )
 and supervisor (
 https://github.com/stealthly/minotaur/tree/master/supervisor)
 and after that it is really nice and easy.

 /***
  Joe Stein
  Founder, Principal Consultant
  Big Data Open Source Security LLC
  http://www.stealth.ly
  Twitter: @allthingshadoop http://www.twitter.com/allthingshadoop
 /

 On Wed, Jan 14, 2015 at 2:54 PM, Joseph Lawson jlaw...@roomkey.com
 wrote:

  We have a separate daemon process that assigns EIPs to servers when they
  startup in an autoscaling group based off of an autoscaling message.  So
  for a cluster of 3 we have 3 EIPs. Then we inject the EIPs into startup
  script for Kafka which checks to see if it has one of the EIPs and
 assigns
  itself the index of that IP so in the list:
  10.0.0.1 10.0.0.2 10.0.0.3
 
  1 is broker 0, 2 is broker 1 and 3 is broker 2.  All this is injected via
  cloudformation and then we have a mod value so if we want to spin brokers
  in the same group we do mod 1,2 and get brokers mod * 3 + index to
  determine which is in the group. (the EIPs are different as it is a
  different cloudformation)
 
  For redundancy make sure you run at least two that have full replicas of
  all other partitions.  We run replication factor of 3 with three
 instances
  so if any goes down the other two bring it back in sync once a fresh
 server
  spins in the autoscaling group.
 
  
  From: Dillian Murphey crackshotm...@gmail.com
  Sent: Wednesday, January 14, 2015 2:42 PM
  To: users@kafka.apache.org
  Subject: kafka cluster on aws
 
  I can't seem to find much information to help me (being green to kafka)
 on
  setting up a cluster on aws. Does anyone have any sources?
 
  The question I have off the bat is, what methods have already been
 explored
  to generate a unique broker id? If I spin up a new server, do I just need
  to maintain my own broker-id list somewhere so I don't re-use an already
  allocated broker id?
 
  Also, I read an article about a broker going down and requiring a new
  broker be spun up with the same id. Is this also something I need to
  implement?
 
  I want to setup a kafka auto-scaling group on AWS, so I can add brokers
 at
  well or based on load. It doesn't seem too complicated, or maybe I'm too
  green to see it, but I don't want to re-invent everything myself.
 
  I know Loggly uses AWS/Kafka, so I'm hunting for more details on that
 too.
 
  Thanks for any help