El mié., 6 mar. 2019 a las 15:41, Peter Viskup (<[email protected]>) escribió:
>
> It is very old 8.15 version, cannot update at the moment.
> Is there any way to reorder the queue entries?

sorry, I missed

> How that could happen?

first-class program bug. Dug out the issue:
https://github.com/rsyslog/rsyslog/issues/1404

commit is:
c49e42f4f8381fc8e92579c41cefb2c85fe45929

HTH
Rainer

>
> On Wed, Mar 6, 2019 at 3:37 PM Rainer Gerhards <[email protected]> 
> wrote:
>>
>> El mié., 6 mar. 2019 a las 15:29, Peter Viskup (<[email protected]>) 
>> escribió:
>> >
>> > Just spent some time to get more debugging information.
>> > Discovered the first queue file is always successfully processed and 
>> > removed by rsyslog, but then it is getting stuck.
>> > The rsyslog reports opening all queue files, but also reports issues with 
>> > dequeueing.
>> >
>> > 1103.425656259:main thread    : FWD queue[DA]: starting queue
>> > 1103.425663333:main thread    : FWD queue[DA]: .qi file name is 
>> > '/var/spool/rsyslog/fq_fwd.qi', len 28
>> > 1103.425669339:main thread    : FWD queue[DA]: I am a child
>> > 1103.425681264:main thread    : file stream /var/spool/rsyslog/fq_fwd.qi 
>> > params: flush interval 0, async write 0
>> > 1103.425695986:main thread    : file '/var/spool/rsyslog/fq_fwd.qi' opened 
>> > as #6 with mode 384
>> > 1103.425706903:main thread    : strm 0x561bc0071b10: opened file 
>> > '/var/spool/rsyslog/fq_fwd.qi' for READ as 6
>> > 1103.425715579:main thread    : strm 0x561bc0071b10: file 6 read 494 bytes
>> > 1103.425734836:main thread    : file stream fq_fwd params: flush interval 
>> > 0, async write 0
>> > 1103.425814667:main thread    : file stream fq_fwd params: flush interval 
>> > 0, async write 0
>> > 1103.425825255:main thread    : file stream fq_fwd params: flush interval 
>> > 0, async write 0
>> > 1103.425841762:main thread    : file '/var/spool/rsyslog/fq_fwd.00000883' 
>> > opened as #7 with mode 384
>> > 1103.425852933:main thread    : strm 0x561bc0072f50: opened file 
>> > '/var/spool/rsyslog/fq_fwd.00000883' for WRITE as 7
>> > 1103.425859374:main thread    : strm 0x561bc0072f50: file 7 seek, pos 
>> > 909312
>> > 1103.425870898:main thread    : file '/var/spool/rsyslog/fq_fwd.00000003' 
>> > opened as #8 with mode 384
>> > 1103.425879677:main thread    : strm 0x561bc0074370: opened file 
>> > '/var/spool/rsyslog/fq_fwd.00000003' for READ as 8
>> > 1103.425885753:main thread    : strm 0x561bc0074370: file 8 seek, pos 0
>> > 1103.425896027:main thread    : file '/var/spool/rsyslog/fq_fwd.00000003' 
>> > opened as #9 with mode 384
>> > 1103.425904545:main thread    : strm 0x561bc0075640: opened file 
>> > '/var/spool/rsyslog/fq_fwd.00000003' for READ as 9
>> > 1103.425910921:main thread    : strm 0x561bc0075640: file 9 seek, pos 0
>> > 1103.425923089:main thread    : strm 0x561bc0071b10: file 
>> > 6(/var/spool/rsyslog/fq_fwd.qi) closing
>> > 1103.425998623:main thread    : FWD queue[DA]: params: type 2, enq-only 0, 
>> > disk assisted 0, spoolDir '/var/spool/rsyslog', maxFileSz 1048576, 
>> > maxQSize 0, lqsize 295760, pqsize 295760, child 1, full delay -1, light 
>> > delay -1, deq batch size 8, high wtrmrk 0, low wtrmrk 1, discardmrk 0, max 
>> > wrkr 1, min msgs f. wrkr 0
>> > 1103.426008834:main thread    : FWD queue[DA]:Reg: finalizing construction 
>> > of worker thread pool (numworkerThreads 1)
>> > 1103.426015987:main thread    : FWD queue[DA]:Reg/w0: finalizing 
>> > construction of worker instance data (for 4 actions)
>> > 1103.426022435:main thread    : FWD queue[DA]: queue finished 
>> > initialization
>> > 1103.426028716:main thread    : FWD queue[DA]:Reg: high activity - 
>> > starting 1 additional worker thread(s).
>> > 1103.426076030:main thread    : FWD queue[DA]:Reg: started with state 0, 
>> > num workers now 1
>> > 1103.426086592:main thread    : FWD queue: DA queue initialized, disk 
>> > queue 0x561bc0071740
>> > 1103.426093044:main thread    : FWD queue: queue finished initialization
>> > 1103.426100687:main thread    : Action builtin:omfwd[0x561bc0061fc0]: 
>> > queue 0x561bc006eb90 started
>> > 1103.426107448:main thread    : Activating Ruleset Queue[(nil)] for 
>> > Ruleset RSYSLOG_DefaultRuleset
>> > ....(lines with main queue init)
>> > 1103.427736595:FWD queue[DA]:Reg/w0: strm 0x561bc0075640: file 9 read 4096 
>> > bytes
>> > 1103.427781297:FWD queue[DA]:Reg/w0: FWD queue[DA]: error -2308 dequeueing 
>> > element - ignoring, but strange things may happen
>>
>> I guess this is the root cause. It is:
>>
>> RS_RET_DS_PROP_SEQ_ERR = -2308,/**< property sequence error
>> deserializing object */
>>
>> So it looks like the properties are not within proper sequence inside
>> the queue file. I remember we had some issues with some older versions
>> (2 yrs ago?) but haven't heard that since then. If it is rsyslog
>> current (8.1901 or later), I'd say there is an issue with the queue
>> file itself. We then need to have a look at it's content.
>>
>> Rainer
>>
>> > 1103.427819347:FWD queue[DA]:Reg/w0: regular consumer finished, 
>> > iret=-2308, szlog 295759 sz phys 295760
>> > 1103.427826789:FWD queue[DA]:Reg/w0: DeleteProcessedBatch: we deleted 0 
>> > objects and enqueued 0 objects
>> > 1103.427833027:FWD queue[DA]:Reg/w0: doDeleteBatch: delete batch from 
>> > store, new sizes: log 295759, phys 295760
>> > 1103.427839158:FWD queue[DA]:Reg/w0: objDeserialize error -2029 during 
>> > header processing - trying to recover
>> > 1103.427845810:FWD queue[DA]:Reg/w0: deserializer has possibly been able 
>> > to re-sync and recover, state 0
>> > 1103.427883003:FWD queue[DA]:Reg/w0: strm 0x561bc0075640: file 9 read 4096 
>> > bytes
>> > 1103.427913803:FWD queue[DA]:Reg/w0: FWD queue[DA]: error -2308 dequeueing 
>> > element - ignoring, but strange things may happen
>> > 1103.427920657:FWD queue[DA]:Reg/w0: regular consumer finished, 
>> > iret=-2308, szlog 295758 sz phys 295760
>> > 1103.427926772:FWD queue[DA]:Reg/w0: DeleteProcessedBatch: we deleted 0 
>> > objects and enqueued 0 objects
>> > 1103.427932759:FWD queue[DA]:Reg/w0: doDeleteBatch: delete batch from 
>> > store, new sizes: log 295758, phys 295760
>> > 1103.427942303:FWD queue[DA]:Reg/w0: objDeserialize error -2029 during 
>> > header processing - trying to recover
>> > 1103.427949142:FWD queue[DA]:Reg/w0: deserializer has possibly been able 
>> > to re-sync and recover, state 0
>> > ....(same log entries)
>> > ....(and the same after switch to next queue file)
>> > 1103.466645194:FWD queue[DA]:Reg/w0: strm 0x561bc0075640: file 9 read 4096 
>> > bytes
>> > 1103.466657741:FWD queue[DA]:Reg/w0: FWD queue[DA]: error -2308 dequeueing 
>> > element - ignoring, but strange things may happen
>> > 1103.466664393:FWD queue[DA]:Reg/w0: regular consumer finished, 
>> > iret=-2308, szlog 295424 sz phys 295760
>> > 1103.466670503:FWD queue[DA]:Reg/w0: DeleteProcessedBatch: we deleted 0 
>> > objects and enqueued 0 objects
>> > 1103.466680360:FWD queue[DA]:Reg/w0: doDeleteBatch: delete batch from 
>> > store, new sizes: log 295424, phys 295760
>> > 1103.466686560:FWD queue[DA]:Reg/w0: objDeserialize error -2029 during 
>> > header processing - trying to recover
>> > 1103.466726572:FWD queue[DA]:Reg/w0: deserializer has possibly been able 
>> > to re-sync and recover, state 0
>> > 1103.466765643:FWD queue[DA]:Reg/w0: FWD queue[DA]: error -2308 dequeueing 
>> > element - ignoring, but strange things may happen
>> > 1103.466772781:FWD queue[DA]:Reg/w0: regular consumer finished, 
>> > iret=-2308, szlog 295423 sz phys 295760
>> > 1103.466779061:FWD queue[DA]:Reg/w0: DeleteProcessedBatch: we deleted 0 
>> > objects and enqueued 0 objects
>> > 1103.466785131:FWD queue[DA]:Reg/w0: doDeleteBatch: delete batch from 
>> > store, new sizes: log 295423, phys 295760
>> > 1103.466791068:FWD queue[DA]:Reg/w0: objDeserialize error -2029 during 
>> > header processing - trying to recover
>> > 1103.466805481:FWD queue[DA]:Reg/w0: strm 0x561bc0075640: file 9 read 46 
>> > bytes
>> > 1103.466813286:FWD queue[DA]:Reg/w0: strm 0x561bc0075640: file 9 read 0 
>> > bytes
>> > 1103.466819304:FWD queue[DA]:Reg/w0: strm 0x561bc0075640: file 9 EOF
>> > 1103.466830741:FWD queue[DA]:Reg/w0: strm 0x561bc0075640: file 9(fq_fwd) 
>> > closing
>> > 1103.466855189:FWD queue[DA]:Reg/w0: file 
>> > '/var/spool/rsyslog/fq_fwd.00000004' opened as #9 with mode 384
>> > 1103.466865694:FWD queue[DA]:Reg/w0: strm 0x561bc0075640: opened file 
>> > '/var/spool/rsyslog/fq_fwd.00000004' for READ as 9
>> > 1103.467635746:FWD queue[DA]:Reg/w0: strm 0x561bc0075640: file 9 read 4096 
>> > bytes
>> > 1103.467646862:FWD queue[DA]:Reg/w0: deserializer has possibly been able 
>> > to re-sync and recover, state 0
>> > 1103.467775663:FWD queue[DA]:Reg/w0: FWD queue[DA]: error -2308 dequeueing 
>> > element - ignoring, but strange things may happen
>> > 1103.467786682:FWD queue[DA]:Reg/w0: regular consumer finished, 
>> > iret=-2308, szlog 295422 sz phys 295760
>> > 1103.467793132:FWD queue[DA]:Reg/w0: DeleteProcessedBatch: we deleted 0 
>> > objects and enqueued 0 objects
>> > 1103.467799188:FWD queue[DA]:Reg/w0: doDeleteBatch: delete batch from 
>> > store, new sizes: log 295422, phys 295760
>> > 1103.467805425:FWD queue[DA]:Reg/w0: objDeserialize error -2029 during 
>> > header processing - trying to recover
>> > 1103.467812104:FWD queue[DA]:Reg/w0: deserializer has possibly been able 
>> > to re-sync and recover, state 0
>> > 1103.467842910:FWD queue[DA]:Reg/w0: strm 0x561bc0075640: file 9 read 4096 
>> > bytes
>> > 1103.467897344:FWD queue[DA]:Reg/w0: FWD queue[DA]: error -2308 dequeueing 
>> > element - ignoring, but strange things may happen
>> > 1103.467913962:FWD queue[DA]:Reg/w0: regular consumer finished, 
>> > iret=-2308, szlog 295421 sz phys 295760
>> > 1103.467957133:FWD queue[DA]:Reg/w0: DeleteProcessedBatch: we deleted 0 
>> > objects and enqueued 0 objects
>> > 1103.467964731:FWD queue[DA]:Reg/w0: doDeleteBatch: delete batch from 
>> > store, new sizes: log 295421, phys 295760
>> > 1103.467976624:FWD queue[DA]:Reg/w0: objDeserialize error -2029 during 
>> > header processing - trying to recover
>> > 1103.467983341:FWD queue[DA]:Reg/w0: deserializer has possibly been able 
>> > to re-sync and recover, state 0
>> >
>> > Seems strange. Any thoughts?
>> >
>> > Peter
>> >
>> >
>> > On Wed, Mar 6, 2019 at 12:10 PM Rainer Gerhards <[email protected]> 
>> > wrote:
>> >>
>> >> looks like two different queue. I need the full log - or at least a
>> >> couple of thousand lines around the place of where you think the error
>> >> is. The 2040 seems to be pretty ok, but the .00001 file open above is
>> >> not.
>> >>
>> >> Rainer
>> >>
>> >> El mié., 6 mar. 2019 a las 10:26, Peter Viskup (<[email protected]>) 
>> >> escribió:
>> >> >
>> >> > Following is complete log entry with 3 lines up and down:
>> >> >
>> >> > 3292.107997776:main thread    : file
>> >> > '/var/spool/rsyslog/fq_fwd.00000001' opened as #-1 with mode 384
>> >> > 3292.108007930:main thread    : strm 0x5557282b2370: open error 2,
>> >> > file '/var/spool/rsyslog/fq_fwd.00000001': No such file or directory
>> >> > 3292.108014657:main thread    : strm 0x5557282afb10: file
>> >> > 6(/var/spool/rsyslog/fq_fwd.qi) closing
>> >> > 3292.108023902:main thread    : FWD queue[DA]: state -2040 reading .qi
>> >> > file - can not read persisted info (if any)
>> >> > 3292.108030552:main thread    : file stream N/A params: flush interval
>> >> > 0, async write 0
>> >> > 3292.108043010:main thread    : file stream N/A params: flush interval
>> >> > 0, async write 0
>> >> > 3292.108052306:main thread    : file stream N/A params: flush interval
>> >> > 0, async write 0
>> >> >
>> >> > Peter
>> >> >
>> >> > On Tue, Mar 5, 2019 at 3:05 PM Rainer Gerhards 
>> >> > <[email protected]> wrote:
>> >> > >
>> >> > > El mar., 5 mar. 2019 a las 15:00, Peter Viskup via rsyslog
>> >> > > (<[email protected]>) escribió:
>> >> > > >
>> >> > > > After rsyslog crash and recover.qi.pl run the DA queue is not 
>> >> > > > dequeued.
>> >> > > > Rsyslog debug prints the message from queue.c file [1].
>> >> > > > What is could be the reason for this? Only some servers are affected
>> >> > > > by this issue. Others dequeue just fine.
>> >> > >
>> >> > > What's the state reported in the message?
>> >> > >
>> >> > > Rainer
>> >> > > >
>> >> > > > [1] 
>> >> > > > https://github.com/rsyslog/rsyslog/blob/master/runtime/queue.c#L863
>> >> > > >
>> >> > > > Peter
>> >> > > > _______________________________________________
>> >> > > > rsyslog mailing list
>> >> > > > http://lists.adiscon.net/mailman/listinfo/rsyslog
>> >> > > > http://www.rsyslog.com/professional-services/
>> >> > > > What's up with rsyslog? Follow https://twitter.com/rgerhards
>> >> > > > NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a 
>> >> > > > myriad of sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT 
>> >> > > > POST if you DON'T LIKE THAT.
_______________________________________________
rsyslog mailing list
http://lists.adiscon.net/mailman/listinfo/rsyslog
http://www.rsyslog.com/professional-services/
What's up with rsyslog? Follow https://twitter.com/rgerhards
NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of 
sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE 
THAT.

Reply via email to