Re: Proposal for concrete operator for writing to HDFS file

Yogi Devendra Mon, 07 Mar 2016 01:25:06 -0800

Here is the summary of discussion till now:

   1. Proposed operator is for concrete implementation for writing tuples
   to HDFS. All tuples will be written to same file.
   2. File copy operation will be handled using dedicated component for
   file copy. (Proposal for that will be over another email thread).
   3. File rotation is handled in the following way:
      1. Based on file size
      2. Based on time (every X windows)
      3. If both are specified then based on whichever happens first.
      4. If nothing is specified then based on no new data for one
      application window.
   4. Conversions to json, csv, avro will be not be responsibility of this
   operator. Allowed inputs are byte[] or string.
   5. Custom separators should be allowed. Empty string should be valid
   separator.


Note that, this is just a first iteration implementation of this concrete
operator. We can enhance it later in subsequent iterations.

Also, we expect that things will be more clear when we have first iteration
of other related components ready.

Thanks all for you valuable feedback.

~ Yogi

Re: Proposal for concrete operator for writing to HDFS file

Reply via email to