[ 
https://issues.apache.org/jira/browse/FLUME-2796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Max Lin updated FLUME-2796:
---------------------------
    Description: 
Due to some error, my flume agent has queued 185204 event messages (more than 1 
TB,  about 7.7 MB /per event in average) in its file channel. 

I tried to restart the flume agent and let the file channel replay, and I got 
the following error message

{noformat}
java.lang.OutOfMemoryError: Java heap space
        at com.google.protobuf.ByteString.copyFrom(ByteString.java:90)
        at com.google.protobuf.ByteString.copyFrom(ByteString.java:99)
        at 
com.google.protobuf.CodedInputStream.readBytes(CodedInputStream.java:294)
        at 
org.apache.flume.channel.file.proto.ProtosFactory$FlumeEvent$Builder.mergeFrom(ProtosFactory.java:5136)
        at 
org.apache.flume.channel.file.proto.ProtosFactory$FlumeEvent$Builder.mergeFrom(ProtosFactory.java:4950)
        at 
com.google.protobuf.CodedInputStream.readMessage(CodedInputStream.java:275)
        at 
org.apache.flume.channel.file.proto.ProtosFactory$Put$Builder.mergeFrom(ProtosFactory.java:3312)
        at 
org.apache.flume.channel.file.proto.ProtosFactory$Put$Builder.mergeFrom(ProtosFactory.java:3164)
        at 
com.google.protobuf.AbstractMessageLite$Builder.mergeFrom(AbstractMessageLite.java:212)
        at 
com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:746)
        at 
com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:238)
        at 
com.google.protobuf.AbstractMessageLite$Builder.mergeDelimitedFrom(AbstractMessageLite.java:282)
        at 
com.google.protobuf.AbstractMessage$Builder.mergeDelimitedFrom(AbstractMessage.java:760)
        at 
com.google.protobuf.AbstractMessageLite$Builder.mergeDelimitedFrom(AbstractMessageLite.java:288)
        at 
com.google.protobuf.AbstractMessage$Builder.mergeDelimitedFrom(AbstractMessage.java:752)
        at 
org.apache.flume.channel.file.proto.ProtosFactory$Put.parseDelimitedFrom(ProtosFactory.java:3121)
        at org.apache.flume.channel.file.Put.readProtos(Put.java:86)
        at 
org.apache.flume.channel.file.TransactionEventRecord.fromByteArray(TransactionEventRecord.java:201)
        at 
org.apache.flume.channel.file.LogFileV3$SequentialReader.doNext(LogFileV3.java:344)
        at 
org.apache.flume.channel.file.LogFile$SequentialReader.next(LogFile.java:498)
        at 
org.apache.flume.channel.file.ReplayHandler.replayLog(ReplayHandler.java:245)
        at org.apache.flume.channel.file.Log.doReplay(Log.java:435)
        at org.apache.flume.channel.file.Log.replay(Log.java:382)

{noformat}



  was:
Due to some error, my flume agent has queued 185204 event messages (more than 1 
TB,  about 7.7 MB /per event in average) in its file channel. 

I tried to restart the flume agent and let the file channel replay, and I got 
the following error message





> File Channel whiech queued more than 1TB data files got OOME when doing replay
> ------------------------------------------------------------------------------
>
>                 Key: FLUME-2796
>                 URL: https://issues.apache.org/jira/browse/FLUME-2796
>             Project: Flume
>          Issue Type: Question
>          Components: File Channel
>    Affects Versions: v1.5.2
>         Environment: CDH 5.3
> Cent OS 
>            Reporter: Max Lin
>            Priority: Blocker
>
> Due to some error, my flume agent has queued 185204 event messages (more than 
> 1 TB,  about 7.7 MB /per event in average) in its file channel. 
> I tried to restart the flume agent and let the file channel replay, and I got 
> the following error message
> {noformat}
> java.lang.OutOfMemoryError: Java heap space
>         at com.google.protobuf.ByteString.copyFrom(ByteString.java:90)
>         at com.google.protobuf.ByteString.copyFrom(ByteString.java:99)
>         at 
> com.google.protobuf.CodedInputStream.readBytes(CodedInputStream.java:294)
>         at 
> org.apache.flume.channel.file.proto.ProtosFactory$FlumeEvent$Builder.mergeFrom(ProtosFactory.java:5136)
>         at 
> org.apache.flume.channel.file.proto.ProtosFactory$FlumeEvent$Builder.mergeFrom(ProtosFactory.java:4950)
>         at 
> com.google.protobuf.CodedInputStream.readMessage(CodedInputStream.java:275)
>         at 
> org.apache.flume.channel.file.proto.ProtosFactory$Put$Builder.mergeFrom(ProtosFactory.java:3312)
>         at 
> org.apache.flume.channel.file.proto.ProtosFactory$Put$Builder.mergeFrom(ProtosFactory.java:3164)
>         at 
> com.google.protobuf.AbstractMessageLite$Builder.mergeFrom(AbstractMessageLite.java:212)
>         at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:746)
>         at 
> com.google.protobuf.AbstractMessage$Builder.mergeFrom(AbstractMessage.java:238)
>         at 
> com.google.protobuf.AbstractMessageLite$Builder.mergeDelimitedFrom(AbstractMessageLite.java:282)
>         at 
> com.google.protobuf.AbstractMessage$Builder.mergeDelimitedFrom(AbstractMessage.java:760)
>         at 
> com.google.protobuf.AbstractMessageLite$Builder.mergeDelimitedFrom(AbstractMessageLite.java:288)
>         at 
> com.google.protobuf.AbstractMessage$Builder.mergeDelimitedFrom(AbstractMessage.java:752)
>         at 
> org.apache.flume.channel.file.proto.ProtosFactory$Put.parseDelimitedFrom(ProtosFactory.java:3121)
>         at org.apache.flume.channel.file.Put.readProtos(Put.java:86)
>         at 
> org.apache.flume.channel.file.TransactionEventRecord.fromByteArray(TransactionEventRecord.java:201)
>         at 
> org.apache.flume.channel.file.LogFileV3$SequentialReader.doNext(LogFileV3.java:344)
>         at 
> org.apache.flume.channel.file.LogFile$SequentialReader.next(LogFile.java:498)
>         at 
> org.apache.flume.channel.file.ReplayHandler.replayLog(ReplayHandler.java:245)
>         at org.apache.flume.channel.file.Log.doReplay(Log.java:435)
>         at org.apache.flume.channel.file.Log.replay(Log.java:382)
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to