Re: Storm with Python

2014-05-30 Thread Andrew Montalenti
We decided to start with topology definitions in Clojure because a) that ensures that the topologies can support 100% of Storm's Clojure DSL out-of-the-box and b) that allows easy mixing of Python, Java, Clojure, and even other multi-lang bolts. For example, we plan on producing example topologies

Re: Storm with Python

2014-05-30 Thread Ashu Goel
Andrew, From what I understand, streamparse still requires that the topologies be in Clojure… not entirely sure how this is different from what Storm already provides. I was looking more for a DSL that we could use w/ Python 2.6 and be 100% Python, but it looks like that is not available. -

Re: Storm with Python

2014-05-30 Thread Larry Palmer
Totally agree. I'd switch back to Python in a second if I could. Might be worth taking a look at the pluggable serializer. On Fri, May 30, 2014 at 5:14 PM, Andrew Montalenti wrote: > For one thing, a recently accepted Storm pull request has made this serialization pluggable and someone has al

Optimizing Kafka Stream

2014-05-30 Thread Raphael Hsieh
I am in the process of optimizing my stream. Currently I expect 5,000,000 tuples to come out of my spout per minute. I am trying to beef up my topology in order to process this in real time without falling behind. For some reason my batch size is capping out at 83 thousand tuples. I can't seem to
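As a quick arithmetic check on the figures quoted above (a sketch only: the names below are illustrative, and any link between this number and the batch-size cap is an assumption, not something the thread confirms), the expected spout rate can be converted to a per-second rate:

```python
# Figure quoted in the message above: expected spout output per minute.
TUPLES_PER_MINUTE = 5000000
SECONDS_PER_MINUTE = 60


def tuples_per_second(per_minute):
    """Convert a per-minute tuple rate into a per-second rate."""
    return per_minute / SECONDS_PER_MINUTE


rate = tuples_per_second(TUPLES_PER_MINUTE)
print(f"{rate:,.0f} tuples per second")  # prints "83,333 tuples per second"
```

The result is close to the reported cap of 83 thousand tuples, so it may be worth checking whether the batch is bounded by roughly one second's worth of input.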

RE: Workers constantly restarted due to session timeout

2014-05-30 Thread Michael Dev
Michael R, We don't have GC logging enabled yet. I lean towards agreeing with Derek that it's not the issue, but I will take a look at logging on Monday just to verify. Derek D, Are you certain that nimbus.task.timeout.secs is the correct config? Tracing through the GitHub code it

Bad storm config persisted in zookeeper preventing nimbus from restarting

2014-05-30 Thread Lin Zhao
Hi, We are using Storm 0.9.0.1, and one topology got { "topology.workers", nil} in the config and it crashed nimbus with this stack trace. Then each time we try to restart nimbus it also crashes with the same stack trace. Is ZooKeeper in a bad state? If so, how do we clear it? 2014-05-30 18:25
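One way to avoid submitting a config like the one described above is to check it on the client side first. The following is a minimal sketch under stated assumptions: `validate_topology_workers` is a hypothetical helper, not part of Storm, and it does nothing to clear a bad config that is already persisted in ZooKeeper.

```python
def validate_topology_workers(conf):
    """Return conf['topology.workers'] if it is a positive integer.

    Hypothetical pre-submission check; raises ValueError for a missing,
    None, non-integer, or non-positive value.
    """
    workers = conf.get("topology.workers")
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(workers, bool) or not isinstance(workers, int) or workers < 1:
        raise ValueError(
            "topology.workers must be a positive integer, got %r" % (workers,)
        )
    return workers


# Example: a config with a nil/None worker count is rejected before submission.
try:
    validate_topology_workers({"topology.workers": None})
except ValueError as err:
    print("refusing to submit:", err)
```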

Re: Storm with Python

2014-05-30 Thread Andrew Montalenti
For one thing, a recently accepted Storm pull request has made this serialization pluggable and someone has already implemented a protobuf variety. We plan to investigate alternative serialization options for multilang once we get the other tooling out of the way. For another, it is true the overh

Re: Storm with Python

2014-05-30 Thread Larry Palmer
We had experimented with Storm/Python 6 months ago or so, but found the JSON serialization/deserialization overhead was quite high, on the order of several hundred usec per tuple every time it transitioned from Java to Python or vice versa, limiting total throughput on a 12-core server to around 25
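To get a rough feel for the per-tuple cost described above, the Python side of a JSON round trip can be timed with the standard library. This is a sketch only: the sample message shape is an assumption made for illustration, and the measurement covers just JSON encoding and decoding in Python, not the pipe between processes or the Java side.

```python
import json
import timeit


def json_round_trip_usec(tuple_values, repeats=10000):
    """Average time in microseconds to JSON-encode and decode one message."""
    def round_trip():
        return json.loads(json.dumps(tuple_values))

    total_seconds = timeit.timeit(round_trip, number=repeats)
    return total_seconds / repeats * 1e6


def max_tuples_per_second(overhead_usec):
    """Upper bound on tuples/second for one thread paying overhead_usec per tuple."""
    return 1e6 / overhead_usec


# Hypothetical message, invented for this example.
sample = {"id": "1234", "stream": "default", "task": 3,
          "tuple": ["hello", 42, 3.14]}

usec = json_round_trip_usec(sample)
print("round trip: %.1f usec per tuple" % usec)
print("single-thread ceiling: %.0f tuples/sec" % max_tuples_per_second(usec))
```

For example, an overhead of 200 usec per tuple would cap a single thread at 5,000 tuples per second, before any actual processing is done.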

Re: Fwd: Running word count in Local cluster using Apache Storm

2014-05-30 Thread Harsha
Not sure about Eclipse, but I would recommend importing it as a Maven project from Eclipse. On Fri, May 30, 2014, at 08:19 AM, Neil Shah wrote: Hi, Yes. You are correct. I tried using Storm 0.9.0, but still received same error. Thanks for your help. I will try in Ubuntu and see if it works. C

Re: Fwd: Running word count in Local cluster using Apache Storm

2014-05-30 Thread Neil Shah
Hi, Yes. You are correct. I tried using Storm 0.9.0, but still received the same error. Thanks for your help. I will try in Ubuntu and see if it works. Can you please tell me about the initial error that I got? What I did was create a separate Maven project in Eclipse. I copied corresponding files

Re: Fwd: Running word count in Local cluster using Apache Storm

2014-05-30 Thread Harsha
From the logs it seems to me the issue is with ZooKeeper not releasing the lock on log files while Storm tries to clean up the logs. It's a known issue for ZooKeeper on Windows. You can try upgrading to 0.9.1, but I don't think that will fix it. [1]https://issues.apache.org/jira/browse/STORM-28

Re: Fwd: Running word count in Local cluster using Apache Storm

2014-05-30 Thread Neil Shah
Hi, Thanks for the input. I did run the command as suggested. I get the exception below. I am running the command as administrator on Windows 7. A separate question: does Storm 0.7.1, which is specified in the original POM file in the download, support Windows? The exception that I got was: [ERROR] Failed to

Re: Fwd: Running word count in Local cluster using Apache Storm

2014-05-30 Thread Harsha
Hi Neil, I did the following: ~/Downloads/storm-book-examples-ch02-getting_started-8e42636 » mvn clean package » mvn exec:java -Dexec.mainClass="TopologyMain" -Dexec.args="src/main/resources/words.txt" -- Word Counter [word-counter-2] -- really: 1 but: 1 application: 1 is: 2 gre

Fwd: Running word count in Local cluster using Apache Storm

2014-05-30 Thread Neil Shah
Hi, I am following the book "Getting Started with Storm" http://my.safaribooksonline.com/9781449324025?iid=2013-12-blog-storm-book-9781449324025-SBOBlog They have specified Spouts and Bolts at the following link https://github.com/storm-book/examples-ch02-getting_started/zipball/master When I try to ru