Jonathan Poltak Samosir created SAMZA-138:
---------------------------------------------
Summary: System that places specified file contents onto stream
Key: SAMZA-138
URL: https://issues.apache.org/jira/browse/SAMZA-138
Project: Samza
Issue Type: New Feature
Affects Versions: 0.7.0
Environment: RHELinux 2.6.18-371.4.1.el5
Reporter: Jonathan Poltak Samosir
Priority: Minor
A fairly straightforward Samza System that reads from a specified file and
places that file's contents onto a SystemStreamPartition for use as input to a
StreamTask.
Roughly based on how the hello-samza example project's WikipediaSystem works
(more on its SystemConsumerFactory than on its SystemConsumer class).
Probably needs a bit of work, but the basic functionality works as intended.
Hopefully useful to some, either as a functioning system or as a base for a
more robust, fuller-featured system that you wish to implement.
Some suggested improvements (not yet implemented):
* handle reading from multiple files ([suggested alternative input
specification|https://mail-archives.apache.org/mod_mbox/incubator-samza-dev/201401.mbox/%3C1B43C7411DB20E47AB0FB62E7262B80179BA7465%40ESV4-MBX01.linkedin.biz%3E],
point 2)
* use the file position (filepos) as the IncomingMessageEnvelope offset ([more info
here|https://mail-archives.apache.org/mod_mbox/incubator-samza-dev/201401.mbox/%3C1B43C7411DB20E47AB0FB62E7262B80179BA749D%40ESV4-MBX01.linkedin.biz%3E])
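
To illustrate the file-position idea from the last suggestion, here is a minimal sketch in Python. It is not the attached system (which would implement Samza's Java interfaces). Envelope, read_envelopes and demo are hypothetical names, and it assumes one message per line, with the offset being the byte position at which that line starts, stored as a string.

```python
import os
import tempfile
from typing import Iterator, NamedTuple


class Envelope(NamedTuple):
    """Hypothetical stand-in for Samza's IncomingMessageEnvelope."""
    offset: str   # byte position where this line starts (assumed convention)
    message: str  # the line's contents without the trailing newline


def read_envelopes(path: str, start_offset: int = 0) -> Iterator[Envelope]:
    """Yield one envelope per line, beginning at byte position start_offset."""
    # Binary mode, so that tell() reports an exact byte position.
    with open(path, "rb") as f:
        f.seek(start_offset)
        while True:
            pos = f.tell()
            raw = f.readline()
            if not raw:  # end of file
                break
            yield Envelope(str(pos), raw.decode("utf-8").rstrip("\r\n"))


def demo():
    """Read a small temporary file in full, then resume from the second line."""
    with tempfile.NamedTemporaryFile("wb", suffix=".txt", delete=False) as tmp:
        tmp.write(b"first\nsecond\nthird\n")
    try:
        all_envs = list(read_envelopes(tmp.name))
        # Resume from the position recorded for the second line.
        resumed = list(read_envelopes(tmp.name, int(all_envs[1].offset)))
        return all_envs, resumed
    finally:
        os.remove(tmp.name)
```

Because each offset is a plain byte position, a restarted consumer under this convention could seek straight to a recorded offset instead of re-reading the file from the beginning.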
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)