Re: [rsyslog] rsyslog fails to start due to high queue
I will point out that no matter what software you run, you will eventually run into a case like this where the distro packages break for you, and you are faced with the need to run non-standard packages to work around a bug. As Rainer points out, if you aren't willing to compile your own version, even if he were to find that the bug was not fixed in the latest version, and create a fix for it, you still wouldn't be able to use the fix. David Lang On Fri, 25 Nov 2016, Kosta Psimoulis wrote: Thank you for much guys for your support, right now I have a much better picture of what is going on. I am aware how to build from source but this would be something difficult to maintain, I would probably need to create a custom repo. I think I have enough information right now to reconsider and evaluate a business decision whether to use rsyslog or look for another solution. I have some queue files that I can recover and rebuild with the information you have given but I am still not sure of how they got corrupted and what happened to the ones that were in memory, was everything saved on the hard drive or was there information that was lost. Kind Regards, Kosta On Fri, Nov 25, 2016 at 3:42 AM, David Langwrote: On Fri, 25 Nov 2016, mosto...@gmail.com wrote: TBH, it depends if you prefer building from source or instability Kidding away, I know there are problems in older versions, they are Isn't adiscon repo valid? It looks like we don't have a repo for Jessie, at least it's not included in the scripts/config.sh for rsyslog-pkg-debian This probably also means we don't have a Travis test box running Jessie. Packages for Wheezy will probably work (the only thing I can think of that would be likely to break is gnutls related dependencies) David Lang ___ rsyslog mailing list http://lists.adiscon.net/mailman/listinfo/rsyslog http://www.rsyslog.com/professional-services/ What's up with rsyslog? Follow https://twitter.com/rgerhards NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE THAT. ___ rsyslog mailing list http://lists.adiscon.net/mailman/listinfo/rsyslog http://www.rsyslog.com/professional-services/ What's up with rsyslog? Follow https://twitter.com/rgerhards NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE THAT. ___ rsyslog mailing list http://lists.adiscon.net/mailman/listinfo/rsyslog http://www.rsyslog.com/professional-services/ What's up with rsyslog? Follow https://twitter.com/rgerhards NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE THAT.
Re: [rsyslog] rsyslog fails to start due to high queue
Agains't Rainer's advice, we are using adiscon repos and we're quite happy with them... El 25/11/16 a las 09:59, Kosta Psimoulis escribió: Thank you for much guys for your support, right now I have a much better picture of what is going on. I am aware how to build from source but this would be something difficult to maintain, I would probably need to create a custom repo. I think I have enough information right now to reconsider and evaluate a business decision whether to use rsyslog or look for another solution. I have some queue files that I can recover and rebuild with the information you have given but I am still not sure of how they got corrupted and what happened to the ones that were in memory, was everything saved on the hard drive or was there information that was lost. Kind Regards, Kosta On Fri, Nov 25, 2016 at 3:42 AM, David Langwrote: On Fri, 25 Nov 2016, mosto...@gmail.com wrote: TBH, it depends if you prefer building from source or instability Kidding away, I know there are problems in older versions, they are Isn't adiscon repo valid? It looks like we don't have a repo for Jessie, at least it's not included in the scripts/config.sh for rsyslog-pkg-debian This probably also means we don't have a Travis test box running Jessie. Packages for Wheezy will probably work (the only thing I can think of that would be likely to break is gnutls related dependencies) David Lang ___ rsyslog mailing list http://lists.adiscon.net/mailman/listinfo/rsyslog http://www.rsyslog.com/professional-services/ What's up with rsyslog? Follow https://twitter.com/rgerhards NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE THAT. ___ rsyslog mailing list http://lists.adiscon.net/mailman/listinfo/rsyslog http://www.rsyslog.com/professional-services/ What's up with rsyslog? Follow https://twitter.com/rgerhards NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE THAT. ___ rsyslog mailing list http://lists.adiscon.net/mailman/listinfo/rsyslog http://www.rsyslog.com/professional-services/ What's up with rsyslog? Follow https://twitter.com/rgerhards NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE THAT.
Re: [rsyslog] rsyslog fails to start due to high queue
Thank you for much guys for your support, right now I have a much better picture of what is going on. I am aware how to build from source but this would be something difficult to maintain, I would probably need to create a custom repo. I think I have enough information right now to reconsider and evaluate a business decision whether to use rsyslog or look for another solution. I have some queue files that I can recover and rebuild with the information you have given but I am still not sure of how they got corrupted and what happened to the ones that were in memory, was everything saved on the hard drive or was there information that was lost. Kind Regards, Kosta On Fri, Nov 25, 2016 at 3:42 AM, David Langwrote: > On Fri, 25 Nov 2016, mosto...@gmail.com wrote: > > TBH, it depends if you prefer building from source or instability >>> Kidding away, I know there are problems in older versions, they are >>> >> Isn't adiscon repo valid? >> > > It looks like we don't have a repo for Jessie, at least it's not included > in the scripts/config.sh for rsyslog-pkg-debian > > This probably also means we don't have a Travis test box running Jessie. > > Packages for Wheezy will probably work (the only thing I can think of that > would be likely to break is gnutls related dependencies) > > David Lang > > ___ > rsyslog mailing list > http://lists.adiscon.net/mailman/listinfo/rsyslog > http://www.rsyslog.com/professional-services/ > What's up with rsyslog? Follow https://twitter.com/rgerhards > NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad > of sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you > DON'T LIKE THAT. > ___ rsyslog mailing list http://lists.adiscon.net/mailman/listinfo/rsyslog http://www.rsyslog.com/professional-services/ What's up with rsyslog? Follow https://twitter.com/rgerhards NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE THAT.
Re: [rsyslog] rsyslog fails to start due to high queue
On Fri, 25 Nov 2016, mosto...@gmail.com wrote: TBH, it depends if you prefer building from source or instability Kidding away, I know there are problems in older versions, they are Isn't adiscon repo valid? It looks like we don't have a repo for Jessie, at least it's not included in the scripts/config.sh for rsyslog-pkg-debian This probably also means we don't have a Travis test box running Jessie. Packages for Wheezy will probably work (the only thing I can think of that would be likely to break is gnutls related dependencies) David Lang ___ rsyslog mailing list http://lists.adiscon.net/mailman/listinfo/rsyslog http://www.rsyslog.com/professional-services/ What's up with rsyslog? Follow https://twitter.com/rgerhards NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE THAT.
Re: [rsyslog] rsyslog fails to start due to high queue
2016-11-25 9:38 GMT+01:00 mosto...@gmail.com: > >> TBH, it depends if you prefer building from source or instability >> Kidding away, I know there are problems in older versions, they are > > Isn't adiscon repo valid? We don't have everything. Too few contributors :-( Rainer ___ rsyslog mailing list http://lists.adiscon.net/mailman/listinfo/rsyslog http://www.rsyslog.com/professional-services/ What's up with rsyslog? Follow https://twitter.com/rgerhards NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE THAT.
Re: [rsyslog] rsyslog fails to start due to high queue
TBH, it depends if you prefer building from source or instability Kidding away, I know there are problems in older versions, they are Isn't adiscon repo valid? ___ rsyslog mailing list http://lists.adiscon.net/mailman/listinfo/rsyslog http://www.rsyslog.com/professional-services/ What's up with rsyslog? Follow https://twitter.com/rgerhards NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE THAT.
Re: [rsyslog] rsyslog fails to start due to high queue
On Fri, 25 Nov 2016, Kosta Psimoulis wrote: Hello Rainer, Thank you for your honest answer, so you are basically saying that the queue will be lost ? I thought that my current version of rsyslog was stable enough, I guess I was wrong, in that case I will try a different queue system. I am sending a little bit extra debugging information, please let me know if you see anything useful and if you believe that this issue was fixed in 8.23 we do believe that it is fixed in the current version. Unfortunantly, you aren't running the current version. 8.4 is rather old, and even 8.16 is prior to some very substantial reliability fixes, including several that caused segfaults. In the short term, move the queue files to another directory and see if rsyslog starts. If so, we can look at trying to fix the queue files (there is a command that will rebuild the .qi file from the raw queue files, that may solve your issues. with the exception of encryption, the packages for wheezy should have no problems running on Jessie (although I agree that after a year we should be providing a Jessie repo, nobody called it to our attention, so we missed it) Unfortunantly, due to manpower limits, we (the community) can only support the most recent release of rsyslog, we can't do backports of fixes (although that is an option for people who get paid support). We tried to do that sort of thing and the project almost died under the load. When you rely on the distro provided versions instead of the upstream releases, it's on the basis that the distro is providing support for them. If the distro isn't providing support for a bug that you are running into, you need to look at shifting to the upstream version. Sometimes this is going to require that you maintain your own version of the package, built from source (to apply a fix, or because you need a compile-time option that's not in the default packages, or to use a new feature) The number of fixes since 8.16 (let alone 8.4) is huge, trying to backport them all, without backporting the other features added would probably result in a less reliable system than just running the latest 8.24 version. IIRC, 8.16 was the last release before we started using libfastjson, and part of the reason why Debian hasn't picked up the newer versions, even into backports, is concerns over long-term maintinance of libfastjson. But it fixed so many errors that we killing people's systems that we really need to shift. In theory, you should be able to clone the source git repos, and the rsyslog-pkg-debian repo and in the rsyslog-pkg-debian repo under scripts/config.sh add Jessie to the distro list and everything should work. We just haven't tested it yet (and yes, there is a bit more to getting this going, contact me off-list and I'll pass along the build scripts I put together for ubuntu, they should work almost the same on Debian) David Lang Kind Regards, Kosta 1388.716086990:action 3 queue[DA]:Reg/w0: in destructor: sendbuf 0xada9030 1388.716159868:action 3 queue[DA]:Reg/w0: relp engine is dispatching frame with command 'rsp' 1388.716530324:action 3 queue[DA]:Reg/w0: relpSessWaitState returns 10019 1388.718355599:action 3 queue[DA]:Reg/w0: in destructor: sendbuf 0xada9570 1388.719729919:action 3 queue[DA]:Reg/w0: relp engine created new client 0xadae250 1388.719865043:action 3 queue[DA]:Reg/w0: omrelp: endTransaction ==16816== Thread 2 rs:action 3 queue[DA]:R: ==16816== Invalid read of size 8 ==16816==at 0x76FEE94: relpCltHintBurstEnd (in /usr/lib/x86_64-linux-gnu/librelp.so.0.1.0) ==16816==by 0x74EFB7A: ??? (in /usr/lib/rsyslog/omrelp.so) ==16816==by 0x4529ED: ??? (in /usr/sbin/rsyslogd) ==16816==by 0x453517: ??? (in /usr/sbin/rsyslogd) ==16816==by 0x448F8A: ??? (in /usr/sbin/rsyslogd) ==16816==by 0x4457BD: wtiWorker (in /usr/sbin/rsyslogd) ==16816==by 0x4447CB: ??? (in /usr/sbin/rsyslogd) ==16816==by 0x50580A3: start_thread (pthread_create.c:309) ==16816==by 0x5F7A62C: clone (clone.S:111) ==16816== Address 0x20 is not stack'd, malloc'd or (recently) free'd ==16816== ==16816== ==16816== Process terminating with default action of signal 11 (SIGSEGV) ==16816== Access not within mapped region at address 0x20 ==16816==at 0x76FEE94: relpCltHintBurstEnd (in /usr/lib/x86_64-linux-gnu/librelp.so.0.1.0) ==16816==by 0x74EFB7A: ??? (in /usr/lib/rsyslog/omrelp.so) ==16816==by 0x4529ED: ??? (in /usr/sbin/rsyslogd) ==16816==by 0x453517: ??? (in /usr/sbin/rsyslogd) ==16816==by 0x448F8A: ??? (in /usr/sbin/rsyslogd) ==16816==by 0x4457BD: wtiWorker (in /usr/sbin/rsyslogd) ==16816==by 0x4447CB: ??? (in /usr/sbin/rsyslogd) ==16816==by 0x50580A3: start_thread (pthread_create.c:309) ==16816==by 0x5F7A62C: clone (clone.S:111) ==16816== If you believe this happened as a result of a stack ==16816== overflow in your program's main thread (unlikely but ==16816== possible), you can try to increase the
Re: [rsyslog] rsyslog fails to start due to high queue
2016-11-25 9:16 GMT+01:00 Kosta Psimoulis: > Thank you for your honest answer, so you are basically saying that the > queue will be lost ? There is this little tool: https://github.com/rsyslog/rsyslog/blob/master/tools/recover_qi.pl It works for your version as well and, depending on what is wrong, it can fix the queue files. Worth a try. > I thought that my current version of rsyslog was > stable enough, I guess I was wrong, in that case I will try a different > queue system. Well, as always in life: things are deemed stable unless someone finds it isn't the case. Then those things are fixed and we go back to think "we are stable" unless... That's a large part of why new releases happen, and that's part of the frequent release philosophy in open source. > I am sending a little bit extra debugging information, please > let me know if you see anything useful and if you believe that this issue > was fixed in 8.23 It's indeed interesting, as it may point into a bug that was fixed in librelp (I barely remember there was something). Unfortunately, I have made it my policy to NEVER AGAN look at old versions for fixing bugs. I have done so in the past, and wasted month of my life, because in 95% of the cases it turned out I hunt for a bug that was long fixed. So I only do this if I am actually paid for the pain, via an Adiscon support contract. I am NOT trying to get money out of you, and I understand that you do not like to upgrade or are even unable to do so. But again, time is very limited, I have a large TODO list and year-long experience has prooven that hunting bugs in old code bases is fruitless. Even more so if after hours I discover it's already fixed AND you need to apply a code patch, what you don't want to do/ are not permitted to do in the first place. Which means the effort was fruitless and wasteful for everyone involved (yes, this *is* *practical* *experience*, more than once). So the first filter in open source development for me is: if the user (or his org) is unwilling to update to current, there is no point in looking at something that might even remotely be a bug. I hope for your understanding. Rainer > > Kind Regards, > Kosta > > 1388.716086990:action 3 queue[DA]:Reg/w0: in destructor: sendbuf 0xada9030 > 1388.716159868:action 3 queue[DA]:Reg/w0: relp engine is dispatching frame > with command 'rsp' > 1388.716530324:action 3 queue[DA]:Reg/w0: relpSessWaitState returns 10019 > 1388.718355599:action 3 queue[DA]:Reg/w0: in destructor: sendbuf 0xada9570 > 1388.719729919:action 3 queue[DA]:Reg/w0: relp engine created new client > 0xadae250 > 1388.719865043:action 3 queue[DA]:Reg/w0: omrelp: endTransaction > ==16816== Thread 2 rs:action 3 queue[DA]:R: > ==16816== Invalid read of size 8 > ==16816==at 0x76FEE94: relpCltHintBurstEnd (in > /usr/lib/x86_64-linux-gnu/librelp.so.0.1.0) > ==16816==by 0x74EFB7A: ??? (in /usr/lib/rsyslog/omrelp.so) > ==16816==by 0x4529ED: ??? (in /usr/sbin/rsyslogd) > ==16816==by 0x453517: ??? (in /usr/sbin/rsyslogd) > ==16816==by 0x448F8A: ??? (in /usr/sbin/rsyslogd) > ==16816==by 0x4457BD: wtiWorker (in /usr/sbin/rsyslogd) > ==16816==by 0x4447CB: ??? (in /usr/sbin/rsyslogd) > ==16816==by 0x50580A3: start_thread (pthread_create.c:309) > ==16816==by 0x5F7A62C: clone (clone.S:111) > ==16816== Address 0x20 is not stack'd, malloc'd or (recently) free'd > ==16816== > ==16816== > ==16816== Process terminating with default action of signal 11 (SIGSEGV) > ==16816== Access not within mapped region at address 0x20 > ==16816==at 0x76FEE94: relpCltHintBurstEnd (in > /usr/lib/x86_64-linux-gnu/librelp.so.0.1.0) > ==16816==by 0x74EFB7A: ??? (in /usr/lib/rsyslog/omrelp.so) > ==16816==by 0x4529ED: ??? (in /usr/sbin/rsyslogd) > ==16816==by 0x453517: ??? (in /usr/sbin/rsyslogd) > ==16816==by 0x448F8A: ??? (in /usr/sbin/rsyslogd) > ==16816==by 0x4457BD: wtiWorker (in /usr/sbin/rsyslogd) > ==16816==by 0x4447CB: ??? (in /usr/sbin/rsyslogd) > ==16816==by 0x50580A3: start_thread (pthread_create.c:309) > ==16816==by 0x5F7A62C: clone (clone.S:111) > ==16816== If you believe this happened as a result of a stack > ==16816== overflow in your program's main thread (unlikely but > ==16816== possible), you can try to increase the size of the > ==16816== main thread stack using the --main-stacksize= flag. > ==16816== The main thread stack size used in this run was 8388608. > ==16816== > ==16816== HEAP SUMMARY: > ==16816== in use at exit: 1,136,880 bytes in 2,685 blocks > ==16816== total heap usage: 15,434 allocs, 12,749 frees, 3,325,304 bytes > allocated > ==16816== > ==16816== LEAK SUMMARY: > ==16816==definitely lost: 0 bytes in 0 blocks > ==16816==indirectly lost: 0 bytes in 0 blocks > ==16816== possibly lost: 1,152 bytes in 4 blocks > ==16816==still reachable: 1,135,728 bytes in 2,681 blocks > ==16816== suppressed: 0 bytes in 0 blocks >
Re: [rsyslog] rsyslog fails to start due to high queue
Hello Rainer, Thank you for your honest answer, so you are basically saying that the queue will be lost ? I thought that my current version of rsyslog was stable enough, I guess I was wrong, in that case I will try a different queue system. I am sending a little bit extra debugging information, please let me know if you see anything useful and if you believe that this issue was fixed in 8.23 Kind Regards, Kosta 1388.716086990:action 3 queue[DA]:Reg/w0: in destructor: sendbuf 0xada9030 1388.716159868:action 3 queue[DA]:Reg/w0: relp engine is dispatching frame with command 'rsp' 1388.716530324:action 3 queue[DA]:Reg/w0: relpSessWaitState returns 10019 1388.718355599:action 3 queue[DA]:Reg/w0: in destructor: sendbuf 0xada9570 1388.719729919:action 3 queue[DA]:Reg/w0: relp engine created new client 0xadae250 1388.719865043:action 3 queue[DA]:Reg/w0: omrelp: endTransaction ==16816== Thread 2 rs:action 3 queue[DA]:R: ==16816== Invalid read of size 8 ==16816==at 0x76FEE94: relpCltHintBurstEnd (in /usr/lib/x86_64-linux-gnu/librelp.so.0.1.0) ==16816==by 0x74EFB7A: ??? (in /usr/lib/rsyslog/omrelp.so) ==16816==by 0x4529ED: ??? (in /usr/sbin/rsyslogd) ==16816==by 0x453517: ??? (in /usr/sbin/rsyslogd) ==16816==by 0x448F8A: ??? (in /usr/sbin/rsyslogd) ==16816==by 0x4457BD: wtiWorker (in /usr/sbin/rsyslogd) ==16816==by 0x4447CB: ??? (in /usr/sbin/rsyslogd) ==16816==by 0x50580A3: start_thread (pthread_create.c:309) ==16816==by 0x5F7A62C: clone (clone.S:111) ==16816== Address 0x20 is not stack'd, malloc'd or (recently) free'd ==16816== ==16816== ==16816== Process terminating with default action of signal 11 (SIGSEGV) ==16816== Access not within mapped region at address 0x20 ==16816==at 0x76FEE94: relpCltHintBurstEnd (in /usr/lib/x86_64-linux-gnu/librelp.so.0.1.0) ==16816==by 0x74EFB7A: ??? (in /usr/lib/rsyslog/omrelp.so) ==16816==by 0x4529ED: ??? (in /usr/sbin/rsyslogd) ==16816==by 0x453517: ??? (in /usr/sbin/rsyslogd) ==16816==by 0x448F8A: ??? (in /usr/sbin/rsyslogd) ==16816==by 0x4457BD: wtiWorker (in /usr/sbin/rsyslogd) ==16816==by 0x4447CB: ??? (in /usr/sbin/rsyslogd) ==16816==by 0x50580A3: start_thread (pthread_create.c:309) ==16816==by 0x5F7A62C: clone (clone.S:111) ==16816== If you believe this happened as a result of a stack ==16816== overflow in your program's main thread (unlikely but ==16816== possible), you can try to increase the size of the ==16816== main thread stack using the --main-stacksize= flag. ==16816== The main thread stack size used in this run was 8388608. ==16816== ==16816== HEAP SUMMARY: ==16816== in use at exit: 1,136,880 bytes in 2,685 blocks ==16816== total heap usage: 15,434 allocs, 12,749 frees, 3,325,304 bytes allocated ==16816== ==16816== LEAK SUMMARY: ==16816==definitely lost: 0 bytes in 0 blocks ==16816==indirectly lost: 0 bytes in 0 blocks ==16816== possibly lost: 1,152 bytes in 4 blocks ==16816==still reachable: 1,135,728 bytes in 2,681 blocks ==16816== suppressed: 0 bytes in 0 blocks ==16816== Rerun with --leak-check=full to see details of leaked memory ==16816== ==16816== For counts of detected and suppressed errors, rerun with: -v ==16816== ERROR SUMMARY: 1 errors from 1 contexts (suppressed: 0 from 0) Killed On Fri, Nov 25, 2016 at 3:03 AM, Rainer Gerhardswrote: > 2016-11-25 8:59 GMT+01:00 Kosta Psimoulis sourceknowledge.com>: > > Hello, > > > > Perhaps I wasn't clear on the first message but I am getting Segmentation > > fault. > > > > this error was on syslog: > > Nov 24 20:18:25 ip-172-16-1-152 systemd[1]: rsyslog.service: main process > > exited, code=killed, status=11/SEGV > > > > and when I run it debug mode (-dn), it is trying to process the queue and > > gives the following error: > > . > > 9771.929711621:action 3 queue[DA]:Reg/w0: relp engine is dispatching > frame > > with command 'rsp' > > 9771.929715606:action 3 queue[DA]:Reg/w0: in rsp command handler, txnr > 201, > > code 200, text 'OK' > > 9771.929723831:action 3 queue[DA]:Reg/w0: DEL sess 0x7f21a40023b0 unacked > > 1, sessState 6 > > 9771.929727757:action 3 queue[DA]:Reg/w0: in destructor: sendbuf > > 0x7f21a401de00 > > 9771.929733344:action 3 queue[DA]:Reg/w0: relp engine is dispatching > frame > > with command 'rsp' > > 9771.929739980:action 3 queue[DA]:Reg/w0: relpSessWaitState returns 10019 > > 9771.929754387:action 3 queue[DA]:Reg/w0: in destructor: sendbuf > > 0x7f21a4025730 > > 9771.929771799:action 3 queue[DA]:Reg/w0: relp engine created new client > > 0x7f21a40023b0 > > 9771.929780641:action 3 queue[DA]:Reg/w0: omrelp: endTransaction > > Segmentation fault > > > > 8.4.2 is the latest version available as a package on Debian Jessie 8.6 > and > > 8.16 if I use the backports. There are no repositories for Debian Jessie, > > only for Wheezy, are you suggesting to install 8.23 from source on a > > production server ? > > TBH, it depends if you
Re: [rsyslog] rsyslog fails to start due to high queue
2016-11-25 8:59 GMT+01:00 Kosta Psimoulis: > Hello, > > Perhaps I wasn't clear on the first message but I am getting Segmentation > fault. > > this error was on syslog: > Nov 24 20:18:25 ip-172-16-1-152 systemd[1]: rsyslog.service: main process > exited, code=killed, status=11/SEGV > > and when I run it debug mode (-dn), it is trying to process the queue and > gives the following error: > . > 9771.929711621:action 3 queue[DA]:Reg/w0: relp engine is dispatching frame > with command 'rsp' > 9771.929715606:action 3 queue[DA]:Reg/w0: in rsp command handler, txnr 201, > code 200, text 'OK' > 9771.929723831:action 3 queue[DA]:Reg/w0: DEL sess 0x7f21a40023b0 unacked > 1, sessState 6 > 9771.929727757:action 3 queue[DA]:Reg/w0: in destructor: sendbuf > 0x7f21a401de00 > 9771.929733344:action 3 queue[DA]:Reg/w0: relp engine is dispatching frame > with command 'rsp' > 9771.929739980:action 3 queue[DA]:Reg/w0: relpSessWaitState returns 10019 > 9771.929754387:action 3 queue[DA]:Reg/w0: in destructor: sendbuf > 0x7f21a4025730 > 9771.929771799:action 3 queue[DA]:Reg/w0: relp engine created new client > 0x7f21a40023b0 > 9771.929780641:action 3 queue[DA]:Reg/w0: omrelp: endTransaction > Segmentation fault > > 8.4.2 is the latest version available as a package on Debian Jessie 8.6 and > 8.16 if I use the backports. There are no repositories for Debian Jessie, > only for Wheezy, are you suggesting to install 8.23 from source on a > production server ? TBH, it depends if you prefer building from source or instability ;-) Kidding away, I know there are problems in older versions, they are fixed, and so I do not know how *I* can help you without you making sure you run the fixes. If you don't like to apply them yourself (I can understand that), I think you should coordinate with the Debian folks (Michael who packages rsyslog for deb is also on the list) and ask them for update or to apply the relevant patches. For the version you have, I think the solution is to remove the queue files, restart rsyslog and hope for the best. Might work. Sorry I have no better answer, Rainer ___ rsyslog mailing list http://lists.adiscon.net/mailman/listinfo/rsyslog http://www.rsyslog.com/professional-services/ What's up with rsyslog? Follow https://twitter.com/rgerhards NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE THAT.
Re: [rsyslog] rsyslog fails to start due to high queue
Hello, Perhaps I wasn't clear on the first message but I am getting Segmentation fault. this error was on syslog: Nov 24 20:18:25 ip-172-16-1-152 systemd[1]: rsyslog.service: main process exited, code=killed, status=11/SEGV and when I run it debug mode (-dn), it is trying to process the queue and gives the following error: . 9771.929711621:action 3 queue[DA]:Reg/w0: relp engine is dispatching frame with command 'rsp' 9771.929715606:action 3 queue[DA]:Reg/w0: in rsp command handler, txnr 201, code 200, text 'OK' 9771.929723831:action 3 queue[DA]:Reg/w0: DEL sess 0x7f21a40023b0 unacked 1, sessState 6 9771.929727757:action 3 queue[DA]:Reg/w0: in destructor: sendbuf 0x7f21a401de00 9771.929733344:action 3 queue[DA]:Reg/w0: relp engine is dispatching frame with command 'rsp' 9771.929739980:action 3 queue[DA]:Reg/w0: relpSessWaitState returns 10019 9771.929754387:action 3 queue[DA]:Reg/w0: in destructor: sendbuf 0x7f21a4025730 9771.929771799:action 3 queue[DA]:Reg/w0: relp engine created new client 0x7f21a40023b0 9771.929780641:action 3 queue[DA]:Reg/w0: omrelp: endTransaction Segmentation fault 8.4.2 is the latest version available as a package on Debian Jessie 8.6 and 8.16 if I use the backports. There are no repositories for Debian Jessie, only for Wheezy, are you suggesting to install 8.23 from source on a production server ? Kind Regards, Kosta On Fri, Nov 25, 2016 at 1:15 AM, Rainer Gerhards <rgerha...@hq.adiscon.com> wrote: > I don't see anything conrete either, but I would guess from context > that the queue got corrupted and on startup rsyslog may run into a > segfault. But no evidence for that. It is known that versions prior to > 8.22.0 sometimes had problems with corrupted disk queues. > > My suggestion is to delete the queue files and update to the curretn > 8.23.0 version. > > Rainer > > 2016-11-25 4:06 GMT+01:00 David Lang <da...@lang.hm>: > > it's no clear to me that rsyslog actually failed. It looks like systemd > saw > > the warning about ~ and decided to shut it down > > > > David Lang > > > > On Thu, 24 Nov 2016, Kosta Psimoulis wrote: > > > >> Date: Thu, 24 Nov 2016 20:36:47 -0500 > >> From: Kosta Psimoulis <kosta.psimou...@sourceknowledge.com> > >> Reply-To: rsyslog-users <rsyslog@lists.adiscon.com> > >> To: rsyslog@lists.adiscon.com > >> Subject: [rsyslog] rsyslog fails to start due to high queue > >> > >> > >> Hello, > >> > >> There was a misconfiguration for a remote forwarding server, the issue > was > >> fixed and the host was corrected by rsyslog is refusing to start with > the > >> following error: > >> > >> Rsyslog version: 8.4.2 > >> Distribution: Latest Debian Jessie Linux 8.6 > >> > >> -- Unit rsyslog.service has begun starting up. > >> Nov 24 20:18:32 ip-172-16-1-152 systemd[1]: rsyslog.service start > request > >> repeated too quickly, refusing to start. > >> Nov 24 20:18:32 ip-172-16-1-152 systemd[1]: Failed to start System > Logging > >> Service. > >> -- Subject: Unit rsyslog.service has failed > >> -- Defined-By: systemd > >> -- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel > >> > >> Nov 24 20:18:26 ip-172-16-1-152 rsyslogd: [origin software="rsyslogd" > >> swVersion="8.4.2" x-pid="16534" x-info="http://www.rsyslog.com;] start > >> Nov 24 20:18:26 ip-172-16-1-152 rsyslogd-2307: warning: ~ action is > >> deprecated, consider using the 'stop' statement instead [try > >> http://www.rsyslog.com/e/2307 ] > >> Nov 24 20:18:25 ip-172-16-1-152 systemd[1]: rsyslog.service: main > process > >> exited, code=killed, status=11/SEGV > >> Nov 24 20:18:25 ip-172-16-1-152 systemd[1]: Unit rsyslog.service entered > >> failed state. > >> Nov 24 20:18:26 ip-172-16-1-152 systemd[1]: rsyslog.service holdoff time > >> over, scheduling restart. > >> Nov 24 20:18:26 ip-172-16-1-152 systemd[1]: Stopping System Logging > >> Service... > >> Nov 24 20:18:26 ip-172-16-1-152 systemd[1]: Starting System Logging > >> Service... > >> Nov 24 20:18:26 ip-172-16-1-152 systemd[1]: Started System Logging > >> Service. > >> Nov 24 20:18:26 ip-172-16-1-152 rsyslogd: [origin software="rsyslogd" > >> swVersion="8.4.2" x-pid="16540" x-info="http://www.rsyslog.com;] start > >> Nov 24 20:18:26 ip-172-16-1-152 rsyslogd-2307: warning: ~ action is > >> deprecated, consider using the 'stop' statement instead [try > >> http://www.rsyslog.com/e/2307 ] > >> > >>
Re: [rsyslog] rsyslog fails to start due to high queue
I don't see anything conrete either, but I would guess from context that the queue got corrupted and on startup rsyslog may run into a segfault. But no evidence for that. It is known that versions prior to 8.22.0 sometimes had problems with corrupted disk queues. My suggestion is to delete the queue files and update to the curretn 8.23.0 version. Rainer 2016-11-25 4:06 GMT+01:00 David Lang <da...@lang.hm>: > it's no clear to me that rsyslog actually failed. It looks like systemd saw > the warning about ~ and decided to shut it down > > David Lang > > On Thu, 24 Nov 2016, Kosta Psimoulis wrote: > >> Date: Thu, 24 Nov 2016 20:36:47 -0500 >> From: Kosta Psimoulis <kosta.psimou...@sourceknowledge.com> >> Reply-To: rsyslog-users <rsyslog@lists.adiscon.com> >> To: rsyslog@lists.adiscon.com >> Subject: [rsyslog] rsyslog fails to start due to high queue >> >> >> Hello, >> >> There was a misconfiguration for a remote forwarding server, the issue was >> fixed and the host was corrected by rsyslog is refusing to start with the >> following error: >> >> Rsyslog version: 8.4.2 >> Distribution: Latest Debian Jessie Linux 8.6 >> >> -- Unit rsyslog.service has begun starting up. >> Nov 24 20:18:32 ip-172-16-1-152 systemd[1]: rsyslog.service start request >> repeated too quickly, refusing to start. >> Nov 24 20:18:32 ip-172-16-1-152 systemd[1]: Failed to start System Logging >> Service. >> -- Subject: Unit rsyslog.service has failed >> -- Defined-By: systemd >> -- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel >> >> Nov 24 20:18:26 ip-172-16-1-152 rsyslogd: [origin software="rsyslogd" >> swVersion="8.4.2" x-pid="16534" x-info="http://www.rsyslog.com;] start >> Nov 24 20:18:26 ip-172-16-1-152 rsyslogd-2307: warning: ~ action is >> deprecated, consider using the 'stop' statement instead [try >> http://www.rsyslog.com/e/2307 ] >> Nov 24 20:18:25 ip-172-16-1-152 systemd[1]: rsyslog.service: main process >> exited, code=killed, status=11/SEGV >> Nov 24 20:18:25 ip-172-16-1-152 systemd[1]: Unit rsyslog.service entered >> failed state. >> Nov 24 20:18:26 ip-172-16-1-152 systemd[1]: rsyslog.service holdoff time >> over, scheduling restart. >> Nov 24 20:18:26 ip-172-16-1-152 systemd[1]: Stopping System Logging >> Service... >> Nov 24 20:18:26 ip-172-16-1-152 systemd[1]: Starting System Logging >> Service... >> Nov 24 20:18:26 ip-172-16-1-152 systemd[1]: Started System Logging >> Service. >> Nov 24 20:18:26 ip-172-16-1-152 rsyslogd: [origin software="rsyslogd" >> swVersion="8.4.2" x-pid="16540" x-info="http://www.rsyslog.com;] start >> Nov 24 20:18:26 ip-172-16-1-152 rsyslogd-2307: warning: ~ action is >> deprecated, consider using the 'stop' statement instead [try >> http://www.rsyslog.com/e/2307 ] >> >> The warnings are fine, they are from aws cloudinit: >> >> My extra configuration file is: >> $EscapeControlCharactersOnReceive off >> $MaxMessageSize 64k >> local6.info /efs/backup/i-XX/access-backup.log >> local6.err /efs/backup/i-X/error-backup.log >> module(load="omrelp") >> local6.* action( >>type="omrelp" >>RebindInterval="200" >>queue.FileName="apacheRule1" >>queue.SaveOnShutdown="on" >>queue.Type="LinkedList" >>action.resumeRetryCount="-1" >>target="XX.us-east-1.elb.amazonaws.com" port="20514" >> ) >> & stop >> >> >> There are 13 files in /var/spool/rsyslog that won't process, here are some >> info: >> >> root@ip-172-16-1-152:/var/spool/rsyslog# cat apacheRule1.qi >> > +iQueueSize:2:4:9959: >> +tVars.disk.sizeOnDisk:2:8:13356962: >>> >>> End >> >> . >> > +iCurrFNum:2:2:13: >> +pszFName:1:11:apacheRule1: >> +iMaxFiles:2:8:1000: >> +bDeleteOnClose:2:1:0: >> +sType:2:1:1: >> +tOperationsMode:2:1:2: >> +tOpenMode:2:3:384: >> +iCurrOffs:2:6:762769: >> +inode:2:1:0: >>> >>> End >> >> . >> > +iCurrFNum:2:1:1: >> +pszFName:1:11:apacheRule1: >> +iMaxFiles:2:8:1000: >> +bDeleteOnClose:2:1:1: >> +sType:2:1:1: >> +tOperationsMode:2:1:1: >> +tOpenMode:2:3:384: >> +iCurrOffs:2:5:20591: >> +inode:2:6:414600: >>> >>> End >> >> . >> Any hints
Re: [rsyslog] rsyslog fails to start due to high queue
it's no clear to me that rsyslog actually failed. It looks like systemd saw the warning about ~ and decided to shut it down David Lang On Thu, 24 Nov 2016, Kosta Psimoulis wrote: Date: Thu, 24 Nov 2016 20:36:47 -0500 From: Kosta Psimoulis <kosta.psimou...@sourceknowledge.com> Reply-To: rsyslog-users <rsyslog@lists.adiscon.com> To: rsyslog@lists.adiscon.com Subject: [rsyslog] rsyslog fails to start due to high queue Hello, There was a misconfiguration for a remote forwarding server, the issue was fixed and the host was corrected by rsyslog is refusing to start with the following error: Rsyslog version: 8.4.2 Distribution: Latest Debian Jessie Linux 8.6 -- Unit rsyslog.service has begun starting up. Nov 24 20:18:32 ip-172-16-1-152 systemd[1]: rsyslog.service start request repeated too quickly, refusing to start. Nov 24 20:18:32 ip-172-16-1-152 systemd[1]: Failed to start System Logging Service. -- Subject: Unit rsyslog.service has failed -- Defined-By: systemd -- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel Nov 24 20:18:26 ip-172-16-1-152 rsyslogd: [origin software="rsyslogd" swVersion="8.4.2" x-pid="16534" x-info="http://www.rsyslog.com;] start Nov 24 20:18:26 ip-172-16-1-152 rsyslogd-2307: warning: ~ action is deprecated, consider using the 'stop' statement instead [try http://www.rsyslog.com/e/2307 ] Nov 24 20:18:25 ip-172-16-1-152 systemd[1]: rsyslog.service: main process exited, code=killed, status=11/SEGV Nov 24 20:18:25 ip-172-16-1-152 systemd[1]: Unit rsyslog.service entered failed state. Nov 24 20:18:26 ip-172-16-1-152 systemd[1]: rsyslog.service holdoff time over, scheduling restart. Nov 24 20:18:26 ip-172-16-1-152 systemd[1]: Stopping System Logging Service... Nov 24 20:18:26 ip-172-16-1-152 systemd[1]: Starting System Logging Service... Nov 24 20:18:26 ip-172-16-1-152 systemd[1]: Started System Logging Service. Nov 24 20:18:26 ip-172-16-1-152 rsyslogd: [origin software="rsyslogd" swVersion="8.4.2" x-pid="16540" x-info="http://www.rsyslog.com;] start Nov 24 20:18:26 ip-172-16-1-152 rsyslogd-2307: warning: ~ action is deprecated, consider using the 'stop' statement instead [try http://www.rsyslog.com/e/2307 ] The warnings are fine, they are from aws cloudinit: My extra configuration file is: $EscapeControlCharactersOnReceive off $MaxMessageSize 64k local6.info /efs/backup/i-XX/access-backup.log local6.err /efs/backup/i-X/error-backup.log module(load="omrelp") local6.* action( type="omrelp" RebindInterval="200" queue.FileName="apacheRule1" queue.SaveOnShutdown="on" queue.Type="LinkedList" action.resumeRetryCount="-1" target="XX.us-east-1.elb.amazonaws.com" port="20514" ) & stop There are 13 files in /var/spool/rsyslog that won't process, here are some info: root@ip-172-16-1-152:/var/spool/rsyslog# cat apacheRule1.qi End . End . End . Any hints of how I can fix this issue and prevent it from happening in the future ? Kind Regards, Kosta ___ rsyslog mailing list http://lists.adiscon.net/mailman/listinfo/rsyslog http://www.rsyslog.com/professional-services/ What's up with rsyslog? Follow https://twitter.com/rgerhards NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE THAT. ___ rsyslog mailing list http://lists.adiscon.net/mailman/listinfo/rsyslog http://www.rsyslog.com/professional-services/ What's up with rsyslog? Follow https://twitter.com/rgerhards NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE THAT.
[rsyslog] rsyslog fails to start due to high queue
Hello, There was a misconfiguration for a remote forwarding server, the issue was fixed and the host was corrected by rsyslog is refusing to start with the following error: Rsyslog version: 8.4.2 Distribution: Latest Debian Jessie Linux 8.6 -- Unit rsyslog.service has begun starting up. Nov 24 20:18:32 ip-172-16-1-152 systemd[1]: rsyslog.service start request repeated too quickly, refusing to start. Nov 24 20:18:32 ip-172-16-1-152 systemd[1]: Failed to start System Logging Service. -- Subject: Unit rsyslog.service has failed -- Defined-By: systemd -- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel Nov 24 20:18:26 ip-172-16-1-152 rsyslogd: [origin software="rsyslogd" swVersion="8.4.2" x-pid="16534" x-info="http://www.rsyslog.com;] start Nov 24 20:18:26 ip-172-16-1-152 rsyslogd-2307: warning: ~ action is deprecated, consider using the 'stop' statement instead [try http://www.rsyslog.com/e/2307 ] Nov 24 20:18:25 ip-172-16-1-152 systemd[1]: rsyslog.service: main process exited, code=killed, status=11/SEGV Nov 24 20:18:25 ip-172-16-1-152 systemd[1]: Unit rsyslog.service entered failed state. Nov 24 20:18:26 ip-172-16-1-152 systemd[1]: rsyslog.service holdoff time over, scheduling restart. Nov 24 20:18:26 ip-172-16-1-152 systemd[1]: Stopping System Logging Service... Nov 24 20:18:26 ip-172-16-1-152 systemd[1]: Starting System Logging Service... Nov 24 20:18:26 ip-172-16-1-152 systemd[1]: Started System Logging Service. Nov 24 20:18:26 ip-172-16-1-152 rsyslogd: [origin software="rsyslogd" swVersion="8.4.2" x-pid="16540" x-info="http://www.rsyslog.com;] start Nov 24 20:18:26 ip-172-16-1-152 rsyslogd-2307: warning: ~ action is deprecated, consider using the 'stop' statement instead [try http://www.rsyslog.com/e/2307 ] The warnings are fine, they are from aws cloudinit: My extra configuration file is: $EscapeControlCharactersOnReceive off $MaxMessageSize 64k local6.info /efs/backup/i-XX/access-backup.log local6.err /efs/backup/i-X/error-backup.log module(load="omrelp") local6.* action( type="omrelp" RebindInterval="200" queue.FileName="apacheRule1" queue.SaveOnShutdown="on" queue.Type="LinkedList" action.resumeRetryCount="-1" target="XX.us-east-1.elb.amazonaws.com" port="20514" ) & stop There are 13 files in /var/spool/rsyslog that won't process, here are some info: root@ip-172-16-1-152:/var/spool/rsyslog# cat apacheRule1.qi End . End . End . Any hints of how I can fix this issue and prevent it from happening in the future ? Kind Regards, Kosta ___ rsyslog mailing list http://lists.adiscon.net/mailman/listinfo/rsyslog http://www.rsyslog.com/professional-services/ What's up with rsyslog? Follow https://twitter.com/rgerhards NOTE WELL: This is a PUBLIC mailing list, posts are ARCHIVED by a myriad of sites beyond our control. PLEASE UNSUBSCRIBE and DO NOT POST if you DON'T LIKE THAT.