[jira] [Comment Edited] (KAFKA-1533) transient unit test failure in ProducerFailureHandlingTest

2014-07-29 Thread David Corley (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-1533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14077511#comment-14077511
 ] 

David Corley edited comment on KAFKA-1533 at 7/29/14 8:17 AM:
--

Hey Jun, I can confirm the test is now passing. However your patch isn't 
converting the InetAddress to a string representation of the IP. It just needs 
to be updated to use:
{{getInetAddress().getHostAddress()}}



was (Author: heavydawson):
Hey Jun, I can confirm the test is now passing. However your patch isn't 
converting the InetAddress to a string representation of the IP. It just needs 
to be updated to use:
getInetAddress().getHostAddress()


 transient unit test failure in ProducerFailureHandlingTest
 --

 Key: KAFKA-1533
 URL: https://issues.apache.org/jira/browse/KAFKA-1533
 Project: Kafka
  Issue Type: Bug
  Components: core
Affects Versions: 0.8.2
Reporter: Jun Rao
Assignee: Guozhang Wang
 Fix For: 0.8.2

 Attachments: KAFKA-1533.patch, KAFKA-1533.patch, KAFKA-1533.patch, 
 KAFKA-1533_2014-07-21_15:45:58.patch, kafka.threads, stack.out


 Occasionally, saw the test hang on tear down. The following is the stack 
 trace.
 Test worker prio=5 tid=7f9246956000 nid=0x10e078000 in Object.wait() 
 [10e075000]
java.lang.Thread.State: WAITING (on object monitor)
 at java.lang.Object.wait(Native Method)
 - waiting on 7f4e69578 (a org.apache.zookeeper.ClientCnxn$Packet)
 at java.lang.Object.wait(Object.java:485)
 at org.apache.zookeeper.ClientCnxn.submitRequest(ClientCnxn.java:1344)
 - locked 7f4e69578 (a org.apache.zookeeper.ClientCnxn$Packet)
 at org.apache.zookeeper.ZooKeeper.delete(ZooKeeper.java:732)
 at org.I0Itec.zkclient.ZkConnection.delete(ZkConnection.java:91)
 at org.I0Itec.zkclient.ZkClient$8.call(ZkClient.java:720)
 at org.I0Itec.zkclient.ZkClient.retryUntilConnected(ZkClient.java:675)
 at org.I0Itec.zkclient.ZkClient.delete(ZkClient.java:716)
 at kafka.utils.ZkUtils$.deletePath(ZkUtils.scala:416)
 at kafka.utils.ZkUtils$.deregisterBrokerInZk(ZkUtils.scala:184)
 at kafka.server.KafkaHealthcheck.shutdown(KafkaHealthcheck.scala:50)
 at 
 kafka.server.KafkaServer$$anonfun$shutdown$2.apply$mcV$sp(KafkaServer.scala:243)
 at kafka.utils.Utils$.swallow(Utils.scala:172)
 at kafka.utils.Logging$class.swallowWarn(Logging.scala:92)
 at kafka.utils.Utils$.swallowWarn(Utils.scala:45)
 at kafka.utils.Logging$class.swallow(Logging.scala:94)
 at kafka.utils.Utils$.swallow(Utils.scala:45)
 at kafka.server.KafkaServer.shutdown(KafkaServer.scala:243)
 at 
 kafka.api.ProducerFailureHandlingTest.tearDown(ProducerFailureHandlingTest.scala:90)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (KAFKA-1533) transient unit test failure in ProducerFailureHandlingTest

2014-07-29 Thread David Corley (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-1533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14077511#comment-14077511
 ] 

David Corley commented on KAFKA-1533:
-

Hey Jun, I can confirm the test is now passing. However your patch isn't 
converting the InetAddress to a string representation of the IP. It just needs 
to be updated to use:
getInetAddress().getHostAddress()


 transient unit test failure in ProducerFailureHandlingTest
 --

 Key: KAFKA-1533
 URL: https://issues.apache.org/jira/browse/KAFKA-1533
 Project: Kafka
  Issue Type: Bug
  Components: core
Affects Versions: 0.8.2
Reporter: Jun Rao
Assignee: Guozhang Wang
 Fix For: 0.8.2

 Attachments: KAFKA-1533.patch, KAFKA-1533.patch, KAFKA-1533.patch, 
 KAFKA-1533_2014-07-21_15:45:58.patch, kafka.threads, stack.out


 Occasionally, saw the test hang on tear down. The following is the stack 
 trace.
 Test worker prio=5 tid=7f9246956000 nid=0x10e078000 in Object.wait() 
 [10e075000]
java.lang.Thread.State: WAITING (on object monitor)
 at java.lang.Object.wait(Native Method)
 - waiting on 7f4e69578 (a org.apache.zookeeper.ClientCnxn$Packet)
 at java.lang.Object.wait(Object.java:485)
 at org.apache.zookeeper.ClientCnxn.submitRequest(ClientCnxn.java:1344)
 - locked 7f4e69578 (a org.apache.zookeeper.ClientCnxn$Packet)
 at org.apache.zookeeper.ZooKeeper.delete(ZooKeeper.java:732)
 at org.I0Itec.zkclient.ZkConnection.delete(ZkConnection.java:91)
 at org.I0Itec.zkclient.ZkClient$8.call(ZkClient.java:720)
 at org.I0Itec.zkclient.ZkClient.retryUntilConnected(ZkClient.java:675)
 at org.I0Itec.zkclient.ZkClient.delete(ZkClient.java:716)
 at kafka.utils.ZkUtils$.deletePath(ZkUtils.scala:416)
 at kafka.utils.ZkUtils$.deregisterBrokerInZk(ZkUtils.scala:184)
 at kafka.server.KafkaHealthcheck.shutdown(KafkaHealthcheck.scala:50)
 at 
 kafka.server.KafkaServer$$anonfun$shutdown$2.apply$mcV$sp(KafkaServer.scala:243)
 at kafka.utils.Utils$.swallow(Utils.scala:172)
 at kafka.utils.Logging$class.swallowWarn(Logging.scala:92)
 at kafka.utils.Utils$.swallowWarn(Utils.scala:45)
 at kafka.utils.Logging$class.swallow(Logging.scala:94)
 at kafka.utils.Utils$.swallow(Utils.scala:45)
 at kafka.server.KafkaServer.shutdown(KafkaServer.scala:243)
 at 
 kafka.api.ProducerFailureHandlingTest.tearDown(ProducerFailureHandlingTest.scala:90)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (KAFKA-1542) normal IOException in the new producer is logged as ERROR

2014-07-29 Thread David Corley (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-1542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14077741#comment-14077741
 ] 

David Corley commented on KAFKA-1542:
-

Strictly speaking, toString can return either the hostname and the ipaddress 
[http://docs.oracle.com/javase/1.5.0/docs/api/java/net/InetAddress.html#toString()],
 whereas getHostAddress will always be just the IP address. That said, I defer 
to you guys on this. It was you who request the host info, so happy to run with 
your suggestion.



 normal IOException in the new producer is logged as ERROR
 -

 Key: KAFKA-1542
 URL: https://issues.apache.org/jira/browse/KAFKA-1542
 Project: Kafka
  Issue Type: Bug
Affects Versions: 0.8.2
Reporter: Jun Rao
Assignee: David Corley
  Labels: newbie
 Fix For: 0.8.2

 Attachments: KAFKA-1542.patch


 Saw the following error in the log. It seems this can happen if the broker is 
 down. So, this probably should be logged as WARN, instead ERROR.
 2014/07/16 00:12:51.799 [Selector] Error in I/O: 
 java.io.IOException: Connection timed out
 at sun.nio.ch.FileDispatcherImpl.read0(Native Method)
 at sun.nio.ch.SocketDispatcher.read(SocketDispatcher.java:39)
 at sun.nio.ch.IOUtil.readIntoNativeBuffer(IOUtil.java:223)
 at sun.nio.ch.IOUtil.read(IOUtil.java:197)
 at sun.nio.ch.SocketChannelImpl.read(SocketChannelImpl.java:379)
 at 
 org.apache.kafka.common.network.NetworkReceive.readFrom(NetworkReceive.java:60)
 at org.apache.kafka.common.network.Selector.poll(Selector.java:241)
 at org.apache.kafka.clients.NetworkClient.poll(NetworkClient.java:171)
 at 
 org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:174)
 at 
 org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:114)
 at java.lang.Thread.run(Thread.java:744)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (KAFKA-1533) transient unit test failure in ProducerFailureHandlingTest

2014-07-28 Thread David Corley (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-1533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

David Corley updated KAFKA-1533:


Attachment: kafka.threads

Seeing the same issue. Jun, not sure if you attached your own thread dump or a 
copy of mine from the mailing list, but attaching here again per your request.

 transient unit test failure in ProducerFailureHandlingTest
 --

 Key: KAFKA-1533
 URL: https://issues.apache.org/jira/browse/KAFKA-1533
 Project: Kafka
  Issue Type: Bug
  Components: core
Affects Versions: 0.8.2
Reporter: Jun Rao
Assignee: Guozhang Wang
 Fix For: 0.8.2

 Attachments: KAFKA-1533.patch, KAFKA-1533.patch, KAFKA-1533.patch, 
 KAFKA-1533_2014-07-21_15:45:58.patch, kafka.threads, stack.out


 Occasionally, saw the test hang on tear down. The following is the stack 
 trace.
 Test worker prio=5 tid=7f9246956000 nid=0x10e078000 in Object.wait() 
 [10e075000]
java.lang.Thread.State: WAITING (on object monitor)
 at java.lang.Object.wait(Native Method)
 - waiting on 7f4e69578 (a org.apache.zookeeper.ClientCnxn$Packet)
 at java.lang.Object.wait(Object.java:485)
 at org.apache.zookeeper.ClientCnxn.submitRequest(ClientCnxn.java:1344)
 - locked 7f4e69578 (a org.apache.zookeeper.ClientCnxn$Packet)
 at org.apache.zookeeper.ZooKeeper.delete(ZooKeeper.java:732)
 at org.I0Itec.zkclient.ZkConnection.delete(ZkConnection.java:91)
 at org.I0Itec.zkclient.ZkClient$8.call(ZkClient.java:720)
 at org.I0Itec.zkclient.ZkClient.retryUntilConnected(ZkClient.java:675)
 at org.I0Itec.zkclient.ZkClient.delete(ZkClient.java:716)
 at kafka.utils.ZkUtils$.deletePath(ZkUtils.scala:416)
 at kafka.utils.ZkUtils$.deregisterBrokerInZk(ZkUtils.scala:184)
 at kafka.server.KafkaHealthcheck.shutdown(KafkaHealthcheck.scala:50)
 at 
 kafka.server.KafkaServer$$anonfun$shutdown$2.apply$mcV$sp(KafkaServer.scala:243)
 at kafka.utils.Utils$.swallow(Utils.scala:172)
 at kafka.utils.Logging$class.swallowWarn(Logging.scala:92)
 at kafka.utils.Utils$.swallowWarn(Utils.scala:45)
 at kafka.utils.Logging$class.swallow(Logging.scala:94)
 at kafka.utils.Utils$.swallow(Utils.scala:45)
 at kafka.server.KafkaServer.shutdown(KafkaServer.scala:243)
 at 
 kafka.api.ProducerFailureHandlingTest.tearDown(ProducerFailureHandlingTest.scala:90)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


Re: Gradle hanging on unit test run against ProducerFailureHangingTest-testBrokerFailure

2014-07-28 Thread David Corley
Done


On Sun, Jul 27, 2014 at 7:45 PM, Jun Rao jun...@gmail.com wrote:

 David,

 Apache mailing list doesn't seem to allow large attachments. Could you
 attach the stacktrace to the jira KAFKA-1533 (now reopened)?

 Thanks,

 Jun


 On Sun, Jul 27, 2014 at 11:21 AM, David Corley davidcor...@gmail.com
 wrote:

  Nope. It definitely sent. Are there some restrictions on mailing list
  attachments I wonder?
  I'll put it inline here:
  =
  2014-07-25 18:40:15
  Full thread dump Java HotSpot(TM) 64-Bit Server VM (20.65-b04-462 mixed
  mode):
 
  Attach Listener daemon prio=9 tid=7fcfb92a5000 nid=0x11a961000 waiting
 on
  condition []
 java.lang.Thread.State: RUNNABLE
 
  kafka-scheduler-17 daemon prio=5 tid=7fcfbb80c000 nid=0x1387c3000
 waiting
  on condition [1387c2000]
 java.lang.Thread.State: TIMED_WAITING (parking)
   at sun.misc.Unsafe.park(Native Method)
  - parking to wait for  7f53b84d8 (a
  java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
   at
 java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:196)
  at
 
 
 java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:2025)
   at java.util.concurrent.DelayQueue.take(DelayQueue.java:164)
  at
 
 
 java.util.concurrent.ScheduledThreadPoolExecutor$DelayedWorkQueue.take(ScheduledThreadPoolExecutor.java:609)
   at
 
 
 java.util.concurrent.ScheduledThreadPoolExecutor$DelayedWorkQueue.take(ScheduledThreadPoolExecutor.java:602)
  at
 
 
 java.util.concurrent.ThreadPoolExecutor.getTask(ThreadPoolExecutor.java:957)
   at
 
 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:917)
  at java.lang.Thread.run(Thread.java:695)
 
  ReplicaFetcherThread-0-0 prio=5 tid=7fcfbb80b000 nid=0x1385bd000
 runnable
  [1385bb000]
 java.lang.Thread.State: RUNNABLE
  at sun.nio.ch.KQueueArrayWrapper.kevent0(Native Method)
   at sun.nio.ch.KQueueArrayWrapper.poll(KQueueArrayWrapper.java:136)
  at sun.nio.ch.KQueueSelectorImpl.doSelect(KQueueSelectorImpl.java:69)
   at sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:69)
  - locked 7f53b7578 (a sun.nio.ch.Util$2)
   - locked 7f53b7560 (a java.util.Collections$UnmodifiableSet)
  - locked 7f5400668 (a sun.nio.ch.KQueueSelectorImpl)
   at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:80)
  at
 sun.nio.ch.SocketAdaptor$SocketInputStream.read(SocketAdaptor.java:193)
   - locked 7f53b7590 (a java.lang.Object)
  at sun.nio.ch.ChannelInputStream.read(ChannelInputStream.java:86)
   - locked 7f5405530 (a sun.nio.ch.SocketAdaptor$SocketInputStream)
  at
 
 java.nio.channels.Channels$ReadableByteChannelImpl.read(Channels.java:221)
   - locked 7f5407fc0 (a java.lang.Object)
  at kafka.utils.Utils$.read(Utils.scala:380)
  at
 
 
 kafka.network.BoundedByteBufferReceive.readFrom(BoundedByteBufferReceive.scala:54)
   at kafka.network.Receive$class.readCompletely(Transmission.scala:56)
  at
 
 
 kafka.network.BoundedByteBufferReceive.readCompletely(BoundedByteBufferReceive.scala:29)
   at kafka.network.BlockingChannel.receive(BlockingChannel.scala:108)
  at kafka.consumer.SimpleConsumer.liftedTree1$1(SimpleConsumer.scala:71)
   at
 
 
 kafka.consumer.SimpleConsumer.kafka$consumer$SimpleConsumer$$sendRequest(SimpleConsumer.scala:68)
  - locked 7f5407ff0 (a java.lang.Object)
   at
 
 
 kafka.consumer.SimpleConsumer$$anonfun$fetch$1$$anonfun$apply$mcV$sp$1.apply$mcV$sp(SimpleConsumer.scala:112)
  at
 
 
 kafka.consumer.SimpleConsumer$$anonfun$fetch$1$$anonfun$apply$mcV$sp$1.apply(SimpleConsumer.scala:112)
   at
 
 
 kafka.consumer.SimpleConsumer$$anonfun$fetch$1$$anonfun$apply$mcV$sp$1.apply(SimpleConsumer.scala:112)
  at kafka.metrics.KafkaTimer.time(KafkaTimer.scala:33)
   at
 
 
 kafka.consumer.SimpleConsumer$$anonfun$fetch$1.apply$mcV$sp(SimpleConsumer.scala:111)
  at
 
 
 kafka.consumer.SimpleConsumer$$anonfun$fetch$1.apply(SimpleConsumer.scala:111)
   at
 
 
 kafka.consumer.SimpleConsumer$$anonfun$fetch$1.apply(SimpleConsumer.scala:111)
  at kafka.metrics.KafkaTimer.time(KafkaTimer.scala:33)
   at kafka.consumer.SimpleConsumer.fetch(SimpleConsumer.scala:110)
  at
 
 
 kafka.server.AbstractFetcherThread.processFetchRequest(AbstractFetcherThread.scala:96)
   at
  kafka.server.AbstractFetcherThread.doWork(AbstractFetcherThread.scala:88)
  at kafka.utils.ShutdownableThread.run(ShutdownableThread.scala:60)
 
  Controller-0-to-broker-1-send-thread prio=5 tid=7fcfbb809800
  nid=0x1384ba000 waiting on condition [1384b9000]
 java.lang.Thread.State: WAITING (parking)
  at sun.misc.Unsafe.park(Native Method)
   - parking to wait for  7f53eba88 (a
  java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
  at java.util.concurrent.locks.LockSupport.park(LockSupport.java:156)
   at
 
 
 java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.await(AbstractQueuedSynchronizer.java:1987

Re: Gradle hanging on unit test run against ProducerFailureHangingTest-testBrokerFailure

2014-07-27 Thread David Corley
Nope. It definitely sent. Are there some restrictions on mailing list
attachments I wonder?
I'll put it inline here:
=
2014-07-25 18:40:15
Full thread dump Java HotSpot(TM) 64-Bit Server VM (20.65-b04-462 mixed
mode):

Attach Listener daemon prio=9 tid=7fcfb92a5000 nid=0x11a961000 waiting on
condition []
   java.lang.Thread.State: RUNNABLE

kafka-scheduler-17 daemon prio=5 tid=7fcfbb80c000 nid=0x1387c3000 waiting
on condition [1387c2000]
   java.lang.Thread.State: TIMED_WAITING (parking)
 at sun.misc.Unsafe.park(Native Method)
- parking to wait for  7f53b84d8 (a
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
 at java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:196)
at
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:2025)
 at java.util.concurrent.DelayQueue.take(DelayQueue.java:164)
at
java.util.concurrent.ScheduledThreadPoolExecutor$DelayedWorkQueue.take(ScheduledThreadPoolExecutor.java:609)
 at
java.util.concurrent.ScheduledThreadPoolExecutor$DelayedWorkQueue.take(ScheduledThreadPoolExecutor.java:602)
at
java.util.concurrent.ThreadPoolExecutor.getTask(ThreadPoolExecutor.java:957)
 at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:917)
at java.lang.Thread.run(Thread.java:695)

ReplicaFetcherThread-0-0 prio=5 tid=7fcfbb80b000 nid=0x1385bd000 runnable
[1385bb000]
   java.lang.Thread.State: RUNNABLE
at sun.nio.ch.KQueueArrayWrapper.kevent0(Native Method)
 at sun.nio.ch.KQueueArrayWrapper.poll(KQueueArrayWrapper.java:136)
at sun.nio.ch.KQueueSelectorImpl.doSelect(KQueueSelectorImpl.java:69)
 at sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:69)
- locked 7f53b7578 (a sun.nio.ch.Util$2)
 - locked 7f53b7560 (a java.util.Collections$UnmodifiableSet)
- locked 7f5400668 (a sun.nio.ch.KQueueSelectorImpl)
 at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:80)
at sun.nio.ch.SocketAdaptor$SocketInputStream.read(SocketAdaptor.java:193)
 - locked 7f53b7590 (a java.lang.Object)
at sun.nio.ch.ChannelInputStream.read(ChannelInputStream.java:86)
 - locked 7f5405530 (a sun.nio.ch.SocketAdaptor$SocketInputStream)
at
java.nio.channels.Channels$ReadableByteChannelImpl.read(Channels.java:221)
 - locked 7f5407fc0 (a java.lang.Object)
at kafka.utils.Utils$.read(Utils.scala:380)
at
kafka.network.BoundedByteBufferReceive.readFrom(BoundedByteBufferReceive.scala:54)
 at kafka.network.Receive$class.readCompletely(Transmission.scala:56)
at
kafka.network.BoundedByteBufferReceive.readCompletely(BoundedByteBufferReceive.scala:29)
 at kafka.network.BlockingChannel.receive(BlockingChannel.scala:108)
at kafka.consumer.SimpleConsumer.liftedTree1$1(SimpleConsumer.scala:71)
 at
kafka.consumer.SimpleConsumer.kafka$consumer$SimpleConsumer$$sendRequest(SimpleConsumer.scala:68)
- locked 7f5407ff0 (a java.lang.Object)
 at
kafka.consumer.SimpleConsumer$$anonfun$fetch$1$$anonfun$apply$mcV$sp$1.apply$mcV$sp(SimpleConsumer.scala:112)
at
kafka.consumer.SimpleConsumer$$anonfun$fetch$1$$anonfun$apply$mcV$sp$1.apply(SimpleConsumer.scala:112)
 at
kafka.consumer.SimpleConsumer$$anonfun$fetch$1$$anonfun$apply$mcV$sp$1.apply(SimpleConsumer.scala:112)
at kafka.metrics.KafkaTimer.time(KafkaTimer.scala:33)
 at
kafka.consumer.SimpleConsumer$$anonfun$fetch$1.apply$mcV$sp(SimpleConsumer.scala:111)
at
kafka.consumer.SimpleConsumer$$anonfun$fetch$1.apply(SimpleConsumer.scala:111)
 at
kafka.consumer.SimpleConsumer$$anonfun$fetch$1.apply(SimpleConsumer.scala:111)
at kafka.metrics.KafkaTimer.time(KafkaTimer.scala:33)
 at kafka.consumer.SimpleConsumer.fetch(SimpleConsumer.scala:110)
at
kafka.server.AbstractFetcherThread.processFetchRequest(AbstractFetcherThread.scala:96)
 at
kafka.server.AbstractFetcherThread.doWork(AbstractFetcherThread.scala:88)
at kafka.utils.ShutdownableThread.run(ShutdownableThread.scala:60)

Controller-0-to-broker-1-send-thread prio=5 tid=7fcfbb809800
nid=0x1384ba000 waiting on condition [1384b9000]
   java.lang.Thread.State: WAITING (parking)
at sun.misc.Unsafe.park(Native Method)
 - parking to wait for  7f53eba88 (a
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
at java.util.concurrent.locks.LockSupport.park(LockSupport.java:156)
 at
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.await(AbstractQueuedSynchronizer.java:1987)
at
java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:399)
 at
kafka.controller.RequestSendThread.doWork(ControllerChannelManager.scala:121)
at kafka.utils.ShutdownableThread.run(ShutdownableThread.scala:60)

kafka-scheduler-16 daemon prio=5 tid=7fcfbb809000 nid=0x1383b7000 waiting
on condition [1383b6000]
   java.lang.Thread.State: TIMED_WAITING (parking)
at sun.misc.Unsafe.park(Native Method)
 - parking to wait for  7f53b84d8 (a
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
at 

Re: Gradle hanging on unit test run against ProducerFailureHangingTest-testBrokerFailure

2014-07-25 Thread David Corley
Hey Gouzhang,
Yes, I spotted that commit and had updated my working copy to that, but the
test is still hanging. If it's any help, the test looks like it's doing
_something_  as the CPU usage ramps up significantly and stays there until
I kill the process.
/Dave


On Thu, Jul 24, 2014 at 4:10 PM, Guozhang Wang wangg...@gmail.com wrote:

 Hi Dave,

 KAFKA-1533 has just been committed targeting at this issue. Did your update
 on trunk include this commit?

 commit ff05e9b3616a222e29a42f6e8fdf41945a417f41
 Author: Guozhang Wang guw...@linkedin.com
 Date:   Tue Jul 22 14:14:19 2014 -0700

 Guozhang

 kafka-1533; transient unit test failure in ProducerFailureHandlingTest;
 reviewed by Guozhang Wang; reviewed by Jun Rao




 On Thu, Jul 24, 2014 at 5:52 AM, David Corley davidcor...@gmail.com
 wrote:

  Hey all,
  I'm trying my hand at writing some patches for open issues, but I'm
 running
  into issues with running gradlew test.
  It hangs every time when trying to run testBrokerFailure in the
  ProducerFailureHangingTest suite.
 
  It was working for a time, but I updated to trunk HEAD and it's no longer
  working.
  I'm running on OSX with JDK 1.6.0_65.
  I tried increasing the HeapSize for the test target, but hit the same
  issue.
  Running against 3f1a9c4cee778d089d3ec3167555c2b89cdc48bb
 
  Would appreciate any help.
  Regards,
  Dave
 



 --
 -- Guozhang



[jira] [Updated] (KAFKA-1542) normal IOException in the new producer is logged as ERROR

2014-07-25 Thread David Corley (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-1542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

David Corley updated KAFKA-1542:


Attachment: KAFKA-1542.patch

Attaching patch

 normal IOException in the new producer is logged as ERROR
 -

 Key: KAFKA-1542
 URL: https://issues.apache.org/jira/browse/KAFKA-1542
 Project: Kafka
  Issue Type: Bug
Affects Versions: 0.8.2
Reporter: Jun Rao
  Labels: newbie
 Attachments: KAFKA-1542.patch


 Saw the following error in the log. It seems this can happen if the broker is 
 down. So, this probably should be logged as WARN, instead ERROR.
 2014/07/16 00:12:51.799 [Selector] Error in I/O: 
 java.io.IOException: Connection timed out
 at sun.nio.ch.FileDispatcherImpl.read0(Native Method)
 at sun.nio.ch.SocketDispatcher.read(SocketDispatcher.java:39)
 at sun.nio.ch.IOUtil.readIntoNativeBuffer(IOUtil.java:223)
 at sun.nio.ch.IOUtil.read(IOUtil.java:197)
 at sun.nio.ch.SocketChannelImpl.read(SocketChannelImpl.java:379)
 at 
 org.apache.kafka.common.network.NetworkReceive.readFrom(NetworkReceive.java:60)
 at org.apache.kafka.common.network.Selector.poll(Selector.java:241)
 at org.apache.kafka.clients.NetworkClient.poll(NetworkClient.java:171)
 at 
 org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:174)
 at 
 org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:114)
 at java.lang.Thread.run(Thread.java:744)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (KAFKA-1542) normal IOException in the new producer is logged as ERROR

2014-07-25 Thread David Corley (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-1542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

David Corley updated KAFKA-1542:


Status: Patch Available  (was: Open)

 normal IOException in the new producer is logged as ERROR
 -

 Key: KAFKA-1542
 URL: https://issues.apache.org/jira/browse/KAFKA-1542
 Project: Kafka
  Issue Type: Bug
Affects Versions: 0.8.2
Reporter: Jun Rao
  Labels: newbie
 Attachments: KAFKA-1542.patch


 Saw the following error in the log. It seems this can happen if the broker is 
 down. So, this probably should be logged as WARN, instead ERROR.
 2014/07/16 00:12:51.799 [Selector] Error in I/O: 
 java.io.IOException: Connection timed out
 at sun.nio.ch.FileDispatcherImpl.read0(Native Method)
 at sun.nio.ch.SocketDispatcher.read(SocketDispatcher.java:39)
 at sun.nio.ch.IOUtil.readIntoNativeBuffer(IOUtil.java:223)
 at sun.nio.ch.IOUtil.read(IOUtil.java:197)
 at sun.nio.ch.SocketChannelImpl.read(SocketChannelImpl.java:379)
 at 
 org.apache.kafka.common.network.NetworkReceive.readFrom(NetworkReceive.java:60)
 at org.apache.kafka.common.network.Selector.poll(Selector.java:241)
 at org.apache.kafka.clients.NetworkClient.poll(NetworkClient.java:171)
 at 
 org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:174)
 at 
 org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:114)
 at java.lang.Thread.run(Thread.java:744)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (KAFKA-1502) source jar is empty

2014-07-25 Thread David Corley (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-1502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14074158#comment-14074158
 ] 

David Corley commented on KAFKA-1502:
-

Can't reproduce this on trunk or on the 0.8.1 branch. Seems to be resolved.

 source jar is empty
 ---

 Key: KAFKA-1502
 URL: https://issues.apache.org/jira/browse/KAFKA-1502
 Project: Kafka
  Issue Type: Bug
  Components: build
Affects Versions: 0.8.2
Reporter: Jun Rao
Assignee: Joel Koshy
  Labels: newbie

 When doing a local publish, kafka_2.8.0-0.8.1.1-sources.jar only contains the 
 following files.
 META-INF/
 META-INF/MANIFEST.MF
 LICENSE
 NOTICE



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (KAFKA-1542) normal IOException in the new producer is logged as ERROR

2014-07-25 Thread David Corley (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-1542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14074582#comment-14074582
 ] 

David Corley commented on KAFKA-1542:
-

Sure. Will revise the patch to get the address instead.

 normal IOException in the new producer is logged as ERROR
 -

 Key: KAFKA-1542
 URL: https://issues.apache.org/jira/browse/KAFKA-1542
 Project: Kafka
  Issue Type: Bug
Affects Versions: 0.8.2
Reporter: Jun Rao
  Labels: newbie
 Attachments: KAFKA-1542.patch


 Saw the following error in the log. It seems this can happen if the broker is 
 down. So, this probably should be logged as WARN, instead ERROR.
 2014/07/16 00:12:51.799 [Selector] Error in I/O: 
 java.io.IOException: Connection timed out
 at sun.nio.ch.FileDispatcherImpl.read0(Native Method)
 at sun.nio.ch.SocketDispatcher.read(SocketDispatcher.java:39)
 at sun.nio.ch.IOUtil.readIntoNativeBuffer(IOUtil.java:223)
 at sun.nio.ch.IOUtil.read(IOUtil.java:197)
 at sun.nio.ch.SocketChannelImpl.read(SocketChannelImpl.java:379)
 at 
 org.apache.kafka.common.network.NetworkReceive.readFrom(NetworkReceive.java:60)
 at org.apache.kafka.common.network.Selector.poll(Selector.java:241)
 at org.apache.kafka.clients.NetworkClient.poll(NetworkClient.java:171)
 at 
 org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:174)
 at 
 org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:114)
 at java.lang.Thread.run(Thread.java:744)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


Re: Gradle hanging on unit test run against ProducerFailureHangingTest-testBrokerFailure

2014-07-25 Thread David Corley
Sure. Attaching here.



On Fri, Jul 25, 2014 at 4:16 PM, Guozhang Wang wangg...@gmail.com wrote:

 Could you get a thread dump when it hangs?


 On Thu, Jul 24, 2014 at 11:01 PM, David Corley davidcor...@gmail.com
 wrote:

  Hey Gouzhang,
  Yes, I spotted that commit and had updated my working copy to that, but
 the
  test is still hanging. If it's any help, the test looks like it's doing
  _something_  as the CPU usage ramps up significantly and stays there
 until
  I kill the process.
  /Dave
 
 
  On Thu, Jul 24, 2014 at 4:10 PM, Guozhang Wang wangg...@gmail.com
 wrote:
 
   Hi Dave,
  
   KAFKA-1533 has just been committed targeting at this issue. Did your
  update
   on trunk include this commit?
  
   commit ff05e9b3616a222e29a42f6e8fdf41945a417f41
   Author: Guozhang Wang guw...@linkedin.com
   Date:   Tue Jul 22 14:14:19 2014 -0700
  
   Guozhang
  
   kafka-1533; transient unit test failure in
  ProducerFailureHandlingTest;
   reviewed by Guozhang Wang; reviewed by Jun Rao
  
  
  
  
   On Thu, Jul 24, 2014 at 5:52 AM, David Corley davidcor...@gmail.com
   wrote:
  
Hey all,
I'm trying my hand at writing some patches for open issues, but I'm
   running
into issues with running gradlew test.
It hangs every time when trying to run testBrokerFailure in the
ProducerFailureHangingTest suite.
   
It was working for a time, but I updated to trunk HEAD and it's no
  longer
working.
I'm running on OSX with JDK 1.6.0_65.
I tried increasing the HeapSize for the test target, but hit the same
issue.
Running against 3f1a9c4cee778d089d3ec3167555c2b89cdc48bb
   
Would appreciate any help.
Regards,
Dave
   
  
  
  
   --
   -- Guozhang
  
 



 --
 -- Guozhang



[jira] [Comment Edited] (KAFKA-1542) normal IOException in the new producer is logged as ERROR

2014-07-25 Thread David Corley (JIRA)

[ 
https://issues.apache.org/jira/browse/KAFKA-1542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14074582#comment-14074582
 ] 

David Corley edited comment on KAFKA-1542 at 7/25/14 6:07 PM:
--

Hey Jun, the current patch returns the IP address. getHostAddress() returns the 
address as a string, whereas getHostName() would be used if we wanted the 
hostname


was (Author: heavydawson):
Sure. Will revise the patch to get the address instead.

 normal IOException in the new producer is logged as ERROR
 -

 Key: KAFKA-1542
 URL: https://issues.apache.org/jira/browse/KAFKA-1542
 Project: Kafka
  Issue Type: Bug
Affects Versions: 0.8.2
Reporter: Jun Rao
  Labels: newbie
 Attachments: KAFKA-1542.patch


 Saw the following error in the log. It seems this can happen if the broker is 
 down. So, this probably should be logged as WARN, instead ERROR.
 2014/07/16 00:12:51.799 [Selector] Error in I/O: 
 java.io.IOException: Connection timed out
 at sun.nio.ch.FileDispatcherImpl.read0(Native Method)
 at sun.nio.ch.SocketDispatcher.read(SocketDispatcher.java:39)
 at sun.nio.ch.IOUtil.readIntoNativeBuffer(IOUtil.java:223)
 at sun.nio.ch.IOUtil.read(IOUtil.java:197)
 at sun.nio.ch.SocketChannelImpl.read(SocketChannelImpl.java:379)
 at 
 org.apache.kafka.common.network.NetworkReceive.readFrom(NetworkReceive.java:60)
 at org.apache.kafka.common.network.Selector.poll(Selector.java:241)
 at org.apache.kafka.clients.NetworkClient.poll(NetworkClient.java:171)
 at 
 org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:174)
 at 
 org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:114)
 at java.lang.Thread.run(Thread.java:744)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


Gradle hanging on unit test run against ProducerFailureHangingTest-testBrokerFailure

2014-07-24 Thread David Corley
Hey all,
I'm trying my hand at writing some patches for open issues, but I'm running
into issues with running gradlew test.
It hangs every time when trying to run testBrokerFailure in the
ProducerFailureHangingTest suite.

It was working for a time, but I updated to trunk HEAD and it's no longer
working.
I'm running on OSX with JDK 1.6.0_65.
I tried increasing the HeapSize for the test target, but hit the same issue.
Running against 3f1a9c4cee778d089d3ec3167555c2b89cdc48bb

Would appreciate any help.
Regards,
Dave