>From the logs, the logs look OK and the channel is working fine. It seems to >have been replaying - that is pretty much it.
-- Hari Shreedharan On Monday, February 25, 2013 at 4:28 PM, Rahul Ravindran wrote: > I have attached the zipped log file at > https://issues.apache.org/jira/browse/FLUME-1928 > > From: Hari Shreedharan <[email protected] > (mailto:[email protected])> > To: [email protected] (mailto:[email protected]); Rahul Ravindran > <[email protected] (mailto:[email protected])> > Sent: Monday, February 25, 2013 1:30 PM > Subject: Re: File Channel error stops flume > > Can you send your full logs? I suspect the channel did a full replay because > it was restarted during a restart. (If it did, the logs would show a > BadCheckpointException). > > > Hari > > -- > Hari Shreedharan > > On Monday, February 25, 2013 at 1:20 PM, Rahul Ravindran wrote: > > Thanks Hari. I had waited for 20 minutes and this did not move change. Now, > > after more than an hour, I see it working > > > > From: Hari Shreedharan <[email protected] > > (mailto:[email protected])> > > To: [email protected] (mailto:[email protected]); Rahul Ravindran > > <[email protected] (mailto:[email protected])> > > Sent: Monday, February 25, 2013 12:46 PM > > Subject: Re: File Channel error stops flume > > > > Rahul, > > > > Those messages actually just suggest that your channel is replaying. The > > channel will complete the replay and the agent will start the sinks once > > the channel is ready. It might take a few minutes based on how many events > > you have in the channel. > > > > > > Hari > > > > -- > > Hari Shreedharan > > > > On Monday, February 25, 2013 at 12:07 PM, Rahul Ravindran wrote: > > > Hi, > > > I modified a parameter to the HDFS sink on a flume config (added an > > > idleInterval) on 2 machines. Things worked fine on one, and not on the > > > other. I tried restarting flume a couple of times and I continue seeing > > > the same log statement (bolded below) with no writes to HDFS > > > > > > 25 Feb 2013 08:27:00,174 INFO [Log-BackgroundWorker-ch2] > > > (org.apache.flume.channel.file.EventQueueBackingStoreFile.checkpoint:109) > > > - Start checkpoint for /flume2/checkpoint/checkpoint, elements to sync = > > > 8506 > > > :% > > > 25 Feb 2013 19:55:51,577 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.ReplayHandler.replayLog:236) - Replaying > > > /flume2/data/log-17 > > > 25 Feb 2013 19:55:51,585 INFO [lifecycleSupervisor-1-1] > > > (org.apache.flume.channel.file.ReplayHandler.replayLog:236) - Replaying > > > /flume1/data/log-17 > > > 25 Feb 2013 19:55:51,588 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.tools.DirectMemoryUtils.getDefaultDirectMemorySize:113) > > > - Unable to get maxDirectMemory from VM: NoSuchMethodException: > > > sun.misc.VM.maxDirectMemory(null) > > > 25 Feb 2013 19:55:51,592 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.tools.DirectMemoryUtils.allocate:47) - Direct Memory > > > Allocation: Allocation = 1048576, Allocated = 0, MaxDirectMemorySize = > > > 268435456, Remaining = 268435456 > > > 25 Feb 2013 19:55:51,634 INFO [lifecycleSupervisor-1-1] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.skipToLastCheckpointPosition:466) > > > - fast-forward to checkpoint position: 1622812128 > > > 25 Feb 2013 19:55:51,634 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.skipToLastCheckpointPosition:466) > > > - fast-forward to checkpoint position: 1622720601 > > > 25 Feb 2013 19:55:51,654 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.ReplayHandler.replayLog:236) - Replaying > > > /flume2/data/log-18 > > > 25 Feb 2013 19:55:51,655 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.skipToLastCheckpointPosition:466) > > > - fast-forward to checkpoint position: 1622821593 > > > 25 Feb 2013 19:55:51,655 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.ReplayHandler.replayLog:236) - Replaying > > > /flume2/data/log-19 > > > 25 Feb 2013 19:55:51,656 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.skipToLastCheckpointPosition:466) > > > - fast-forward to checkpoint position: 1622678590 > > > 25 Feb 2013 19:55:51,656 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.ReplayHandler.replayLog:236) - Replaying > > > /flume2/data/log-20 > > > 25 Feb 2013 19:55:51,657 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.skipToLastCheckpointPosition:466) > > > - fast-forward to checkpoint position: 244707334 > > > 25 Feb 2013 19:55:51,657 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.ReplayHandler.replayLog:236) - Replaying > > > /flume2/data/log-21 > > > 25 Feb 2013 19:55:51,657 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.skipToLastCheckpointPosition:466) > > > - fast-forward to checkpoint position: 530601497 > > > 25 Feb 2013 19:55:51,658 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.next:491) - > > > Encountered EOF at 530601497 in /flume2/data/log-21 > > > 25 Feb 2013 19:55:51,658 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.ReplayHandler.replayLog:236) - Replaying > > > /flume2/data/log-22 > > > 25 Feb 2013 19:55:51,658 INFO [lifecycleSupervisor-1-1] > > > (org.apache.flume.channel.file.ReplayHandler.replayLog:236) - Replaying > > > /flume1/data/log-18 > > > 25 Feb 2013 19:55:51,658 WARN [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.skipToLastCheckpointPosition:470) > > > - Checkpoint for file(/flume2/data/log-22) is: 1361844516782, which is > > > beyond the requested checkpoint time: 1361844516783 and position 0 > > > 25 Feb 2013 19:55:51,659 INFO [lifecycleSupervisor-1-1] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.skipToLastCheckpointPosition:466) > > > - fast-forward to checkpoint position: 1622674426 > > > 25 Feb 2013 19:55:51,659 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.ReplayHandler.replayLog:236) - Replaying > > > /flume2/data/log-23 > > > 25 Feb 2013 19:55:51,659 INFO [lifecycleSupervisor-1-1] > > > (org.apache.flume.channel.file.ReplayHandler.replayLog:236) - Replaying > > > /flume1/data/log-19 > > > 25 Feb 2013 19:55:51,659 WARN [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.skipToLastCheckpointPosition:470) > > > - Checkpoint for file(/flume2/data/log-23) is: 1361844516783, which is > > > beyond the requested checkpoint time: 1361844516783 and position 0 > > > 25 Feb 2013 19:55:51,660 INFO [lifecycleSupervisor-1-1] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.skipToLastCheckpointPosition:466) > > > - fast-forward to checkpoint position: 1622239091 > > > 25 Feb 2013 19:55:51,660 INFO [lifecycleSupervisor-1-1] > > > (org.apache.flume.channel.file.ReplayHandler.replayLog:236) - Replaying > > > /flume1/data/log-20 > > > 25 Feb 2013 19:55:51,661 INFO [lifecycleSupervisor-1-1] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.skipToLastCheckpointPosition:466) > > > - fast-forward to checkpoint position: 221490603 > > > 25 Feb 2013 19:55:51,661 INFO [lifecycleSupervisor-1-1] > > > (org.apache.flume.channel.file.ReplayHandler.replayLog:236) - Replaying > > > /flume1/data/log-21 > > > 25 Feb 2013 19:55:51,661 INFO [lifecycleSupervisor-1-1] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.skipToLastCheckpointPosition:466) > > > - fast-forward to checkpoint position: 532696754 > > > 25 Feb 2013 19:55:52,048 INFO [lifecycleSupervisor-1-1] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.next:491) - > > > Encountered EOF at 1623195597 in /flume1/data/log-17 > > > 25 Feb 2013 19:55:52,103 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.next:491) - > > > Encountered EOF at 1623195583 in /flume2/data/log-17 > > > 25 Feb 2013 19:55:52,308 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.next:491) - > > > Encountered EOF at 1623195536 in /flume2/data/log-18 > > > 25 Feb 2013 19:55:52,319 INFO [lifecycleSupervisor-1-1] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.next:491) - > > > Encountered EOF at 1623195584 in /flume1/data/log-18 > > > 25 Feb 2013 19:55:52,418 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.next:491) - > > > Encountered EOF at 1623195600 in /flume2/data/log-19 > > > 25 Feb 2013 19:55:52,439 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.next:491) - > > > Encountered EOF at 244816205 in /flume2/data/log-20 > > > 25 Feb 2013 19:55:52,440 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.ReplayHandler.replayLog:320) - read: > > > 12348, put: 0, take: 0, rollback: 0, commit: 0, skip: 12348, eventCount:0 > > > 25 Feb 2013 19:55:52,441 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.Log.replay:399) - Rolling /flume2/data > > > 25 Feb 2013 19:55:52,441 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.Log.roll:811) - Roll start /flume2/data > > > 25 Feb 2013 19:55:52,443 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFile$Writer.<init>:171) - Opened > > > /flume2/data/log-24 > > > 25 Feb 2013 19:55:52,449 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.Log.roll:826) - Roll end > > > 25 Feb 2013 19:55:52,453 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.EventQueueBackingStoreFile.checkpoint:109) > > > - Start checkpoint for /flume2/checkpoint/checkpoint, elements to sync = > > > 0 > > > 25 Feb 2013 19:55:52,455 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.EventQueueBackingStoreFile.checkpoint:117) > > > - Updating checkpoint metadata: logWriteOrderID: 1361844516784, > > > queueSize: 34525000, queueHead: 40625267 > > > 25 Feb 2013 19:55:52,489 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFileV3$MetaDataWriter.markCheckpoint:85) > > > - Updating log-24.meta currentPosition = 0, logWriteOrderID = > > > 1361844516784 > > > 25 Feb 2013 19:55:52,491 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.Log.writeCheckpoint:886) - Updated > > > checkpoint for file: /flume2/data/log-24 position: 0 logWriteOrderID: > > > 1361844516784 > > > 25 Feb 2013 19:55:52,491 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFile$RandomReader.close:356) - Closing > > > RandomReader /flume2/data/log-17 > > > 25 Feb 2013 19:55:52,497 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFileV3$MetaDataWriter.markCheckpoint:85) > > > - Updating log-17.meta currentPosition = 1622720601, logWriteOrderID = > > > 1361844516784 > > > 25 Feb 2013 19:55:52,499 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFile$RandomReader.close:356) - Closing > > > RandomReader /flume2/data/log-18 > > > 25 Feb 2013 19:55:52,500 INFO [lifecycleSupervisor-1-1] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.next:491) - > > > Encountered EOF at 1623195593 in /flume1/data/log-19 > > > 25 Feb 2013 19:55:52,505 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFileV3$MetaDataWriter.markCheckpoint:85) > > > - Updating log-18.meta currentPosition = 1622821593, logWriteOrderID = > > > 1361844516784 > > > 25 Feb 2013 19:55:52,507 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFile$RandomReader.close:356) - Closing > > > RandomReader /flume2/data/log-19 > > > 25 Feb 2013 19:55:52,513 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFileV3$MetaDataWriter.markCheckpoint:85) > > > - Updating log-19.meta currentPosition = 1622678590, logWriteOrderID = > > > 1361844516784 > > > 25 Feb 2013 19:55:52,514 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFile$RandomReader.close:356) - Closing > > > RandomReader /flume2/data/log-20 > > > 25 Feb 2013 19:55:52,520 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFileV3$MetaDataWriter.markCheckpoint:85) > > > - Updating log-20.meta currentPosition = 244707334, logWriteOrderID = > > > 1361844516784 > > > 25 Feb 2013 19:55:52,521 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFile$RandomReader.close:356) - Closing > > > RandomReader /flume2/data/log-21 > > > 25 Feb 2013 19:55:52,527 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.LogFileV3$MetaDataWriter.markCheckpoint:85) > > > - Updating log-21.meta currentPosition = 530601497, logWriteOrderID = > > > 1361844516784 > > > 25 Feb 2013 19:55:52,529 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.channel.file.FileChannel.start:309) - Queue Size after > > > replay: 34525000 [channel=ch2] > > > 25 Feb 2013 19:55:52,594 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.instrumentation.MonitoredCounterGroup.register:89) - > > > Monitoried counter group for type: CHANNEL, name: ch2, registered > > > successfully. > > > 25 Feb 2013 19:55:52,594 INFO [lifecycleSupervisor-1-0] > > > (org.apache.flume.instrumentation.MonitoredCounterGroup.start:73) - > > > Component type: CHANNEL, name: ch2 started > > > 25 Feb 2013 19:55:52,619 INFO [lifecycleSupervisor-1-1] > > > (org.apache.flume.channel.file.LogFile$SequentialReader.next:491) - > > > Encountered EOF at 222290119 in /flume1/data/log-20 > > > > > > > > > > > > > > > > > > > > > > > > > >
