Thanks Brock,
When I run it as a DataStream I still get some weird characters between records.

[DATA_HERE]ÿÿÿÿ×ùÎ0ÆÜ9Ig::¬                  ;)
 [DATA_HERE]ÿÿÿÿ×ùÎ0ÆÜ9Ig::¬
                                         ;)
                                           Î[DATA_HERE]ÿÿÿÿ×ùÎ0ÆÜ9Ig::¬
                                                                                
                                                                                
                                           ;0
                                                                                
                                                                                
                                             ½[DATA_HERE]ÿÿÿÿ×ùÎ0ÆÜ9Ig::¬
;0
:[DATA_HERE]

I was hoping to avoid the ÿÿÿÿ and the spaces (I'm assuming the spaces are
really characters encoded in a way that -cat can't display).

Any thoughts?

Thanks again,

Chris

-----Original Message-----
From: Brock Noland [mailto:[email protected]] 
Sent: Friday, December 14, 2012 3:52 PM
To: [email protected]
Subject: Re: Flume/HDFS Encoding

Hi,

On Fri, Dec 14, 2012 at 2:48 PM, Cormier, Christopher 
<[email protected]> wrote:
> SEQ!org.apache.hadoop.io.LongWritableorg.apache.hadoop.io.TextY] 
> õpµ^R÷ﳬÕ
>

This is a SequenceFile.

>
> requestToHDFS.sinks.HDFS.hdfs.file.Type = DataStream
>
> # also tried...
>
> #requestToHDFS.sinks.HDFS.hdfs.file.Type = SequenceFile
>

The parameter is hdfs.fileType. See here:

http://flume.apache.org/FlumeUserGuide.html#hdfs-sink

It sounds like you want a text file, so you should use DataStream.
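
As a sketch only, reusing the agent and sink names from your quoted config
and leaving the other sink settings as they are, the corrected line would
presumably read:

    requestToHDFS.sinks.HDFS.hdfs.fileType = DataStream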

Brock
