Hi all,

suppose you need to process input available on source.host and the results should finally end up on dest.host. A storm topology shall do the processing.

It is easy to write a sprout that fetches the data and emits it into the topology. Similarly a sink-bolt can write the result somewhere.

But now suppose that the data is available only locally on source.host in the file system. Is it possible and natural to make source.host a machine in the Storm cluster but somehow make sure that *only* the sprout is executed on source.host. Similarly, would it be possible to bind a sink bolt to one specific machine, the dest.host?

If this is not a possible or not a preferred way to do it, are there any specific techniques used to provide input to a sprout beyond whatever remote access methods happen to be available (smb, nfs, ssh, http)?

Thanks for any hints,
Harald.

Reply via email to