Noted in juraĆ  Patch provided in FLUME-2451 should fix.

-roshan



On 2/12/15 12:25 PM, "Hari Shreedharan (JIRA)" <[email protected]> wrote:

>
>    [ 
>https://issues.apache.org/jira/browse/FLUME-2375?page=com.atlassian.jira.p
>lugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14318922#com
>ment-14318922 ] 
>
>Hari Shreedharan commented on FLUME-2375:
>-----------------------------------------
>
>Can you post a full stack trace for this one?
>
>> HDFS sink's fail to recover from datanode unavailability
>> --------------------------------------------------------
>>
>>                 Key: FLUME-2375
>>                 URL: https://issues.apache.org/jira/browse/FLUME-2375
>>             Project: Flume
>>          Issue Type: Bug
>>    Affects Versions: v1.4.0
>>            Reporter: David Stendardi
>>              Labels: hdfs, hdfssink
>>
>> Hello !
>> We are running flume-ng with version cdh-4.5-1.4. When a datanode used
>>by flume-ng goes done, we get the following exceptions :
>> {code}
>> 30 Apr 2014 01:10:38,130 ERROR
>>[SinkRunner-PollingRunner-DefaultSinkProcessor]
>>(org.apache.flume.sink.hdfs.AbstractHDFSWriter.isUnderReplicated:96)  -
>>Unexpected error while checking replication factor
>> java.lang.reflect.InvocationTargetException
>>         at sun.reflect.GeneratedMethodAccessor40.invoke(Unknown Source)
>>         at 
>>sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorI
>>mpl.java:25)
>>         at java.lang.reflect.Method.invoke(Method.java:597)
>>         at 
>>org.apache.flume.sink.hdfs.AbstractHDFSWriter.getNumCurrentReplicas(Abstr
>>actHDFSWriter.java:162)
>>         at 
>>org.apache.flume.sink.hdfs.AbstractHDFSWriter.isUnderReplicated(AbstractH
>>DFSWriter.java:82)
>>         at 
>>org.apache.flume.sink.hdfs.BucketWriter.shouldRotate(BucketWriter.java:45
>>2)
>>         at 
>>org.apache.flume.sink.hdfs.BucketWriter.append(BucketWriter.java:387)
>>         at 
>>org.apache.flume.sink.hdfs.HDFSEventSink.process(HDFSEventSink.java:392)
>>         at 
>>org.apache.flume.sink.DefaultSinkProcessor.process(DefaultSinkProcessor.j
>>ava:68)
>>         at 
>>org.apache.flume.SinkRunner$PollingRunner.run(SinkRunner.java:147)
>>         at java.lang.Thread.run(Thread.java:662)
>> {code}
>> These exceptions are logged but not rethrown, and the
>>AbstractHdfsSink::isUnderReplicated still returns false so the writer
>>continue to try writing on the node.
>> Here is how we configured our sink :
>> {code}
>> collector.sinks.hdfs.channel = hdfs
>> collector.sinks.hdfs.type = hdfs
>> collector.sinks.hdfs.hdfs.path =
>>/flume-ng/%{env}/%{avro.fqn}/from_year=%Y/from_date=%Y-%m-%d
>> collector.sinks.hdfs.hdfs.filePrefix = <%= @hostname %>-%H-%{avro.fp}
>> collector.sinks.hdfs.hdfs.fileSuffix = .avro
>> collector.sinks.hdfs.hdfs.rollInterval = 3605
>> collector.sinks.hdfs.hdfs.rollSize = 0
>> collector.sinks.hdfs.hdfs.rollCount = 0
>> collector.sinks.hdfs.hdfs.batchSize = 1000
>> collector.sinks.hdfs.hdfs.txnEventMax = 1000
>> collector.sinks.hdfs.hdfs.callTimeout = 20000
>> collector.sinks.hdfs.hdfs.fileType = DataStream
>> collector.sinks.hdfs.serializer =
>>com.viadeo.event.flume.serializer.AvroEventSerializer$Builder
>> {code}
>
>
>
>--
>This message was sent by Atlassian JIRA
>(v6.3.4#6332)

Reply via email to