Re: Sendmail eats CPU on r317039 [after -r316874 it may be -r316951 and -r316973 are not enough to fix everything]

2017-04-20 Thread Mark Millard
On 2017-Apr-20, at 2:31 AM, Tom Vijlbrief  wrote:

>> Op wo 19 apr. 2017 09:11 schreef Tom Vijlbrief :
>> I'm currently rebuilding world and kernel on a just completed SVN checkout.
>> 
>> Note that the normal sendmail daemon which listens for incoming traffic does 
>> NOT loop.
>> 
>> The sendmail instance which tries local delivery (echo Hi | mail root) or 
>> the msp_queue instance is looping.
>> 
>> It might be an arm64 specific issue, but a few weeks ago this was not an 
>> issue.
>> 
> I just completed a full rebuild on the Pine64 and I cannot reproduce the 
> problem, so there is probably no issue anymore...
> 
> (Except the spurious interrupts issue)

FYI:

I've not seen a spurious interrupts notification on
arm64/aarch64.

I have seen such on cortex-a7 (the armv6/7 examples
that I have access to are this: so just armv7).


For cortex-a7 I build with the following local experiment
for information gathering:

# svnlite diff /usr/src/sys/arm/arm/gic.c
Index: /usr/src/sys/arm/arm/gic.c
===
--- /usr/src/sys/arm/arm/gic.c  (revision 317015)
+++ /usr/src/sys/arm/arm/gic.c  (working copy)
@@ -672,9 +672,13 @@
 
if (irq >= sc->nirqs) {
 #ifdef GIC_DEBUG_SPURIOUS
+#define EXPECTED_SPURIOUS_IRQ 1023
+   if (irq != EXPECTED_SPURIOUS_IRQ) {
device_printf(sc->gic_dev,
-   "Spurious interrupt detected: last irq: %d on CPU%d\n",
+   "Spurious interrupt %d detected of %d: last irq: %d on 
CPU%d\n",
+   irq, sc->nirqs,
sc->last_irq[PCPU_GET(cpuid)], PCPU_GET(cpuid));
+}
 #endif
return (FILTER_HANDLED);
}
@@ -720,6 +724,16 @@
if (irq < sc->nirqs)
goto dispatch_irq;
 
+   if (irq != EXPECTED_SPURIOUS_IRQ) {
+#undef EXPECTED_SPURIOUS_IRQ
+#ifdef GIC_DEBUG_SPURIOUS
+   device_printf(sc->gic_dev,
+   "Spurious end interrupt %d detected of %d: last irq: %d on 
CPU%d\n",
+   irq, sc->nirqs,
+   sc->last_irq[PCPU_GET(cpuid)], PCPU_GET(cpuid));
+#endif
+   }
+
return (FILTER_HANDLED);
 }

It has never reported a non-1023 IRQ.

Quoting arm_gic_architecture_specification.pdf (various places about the 1023 
IRQ figure):

• A processor reads the GICC_IAR and obtains the interrupt ID 1023, 
indicating a spurious interrupt. The processor can return from its interrupt 
service routine without writing to its GICC_EOIR.

The spurious interrupt ID indicates that the original interrupt is no longer 
pending, typically because another target processor is handling it.

and . . .

• 1023  This value is returned to a processor, in response to an 
interrupt acknowledge, if there is no pending interrupt with sufficient 
priority for it to be signaled to the processor.

and . . .

GICC_IAR read  GICC_CTLR.AckCtl 
 Returned interrupt ID
Highest priority pending interrupt^a is Group 0 Non-secure x
 Interrupt ID 1023

No pending interrupts^a x  x
 Interrupt ID 1023

Interrupt signaling of the required interrupt group by CPU interface disabled
x  x
 Interrupt ID 1023

^a. Of sufficient priority to be signaled to the processor if signaling by the 
CPU interface is enabled.

===
Mark Millard
markmi at dsl-only.net

___
freebsd-current@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to "freebsd-current-unsubscr...@freebsd.org"

Re: Sendmail eats CPU on r317039 [after -r316874 it may be -r316951 and -r316973 are not enough to fix everything]

2017-04-20 Thread Tom Vijlbrief
Op wo 19 apr. 2017 09:11 schreef Tom Vijlbrief <tvijlbr...@gmail.com>:

> I'm currently rebuilding world and kernel on a just completed SVN checkout.
>
> Note that the normal sendmail daemon which listens for incoming traffic
> does NOT loop.
>
> The sendmail instance which tries local delivery (echo Hi | mail root) or
> the msp_queue instance is looping.
>
> It might be an arm64 specific issue, but a few weeks ago this was not an
> issue.
>

I just completed a full rebuild on the Pine64 and I cannot reproduce the
problem, so there is probably no issue anymore...

(Except the spurious interrupts issue)


> Op di 18 apr. 2017 om 21:15 schreef Mark Millard <mar...@dsl-only.net>:
>
>> Ronald Klop ronald-lists at klop.ws wrote on Tue Apr 18 09:59:50 UTC
>> 2017:
>>
>> > there is a thread ono this list about a problem in syslogd which made
>> > syslog-clients (like sendmail busy-looping on logging.
>> > That might be related to this. (it is fixed in the source already, so
>> > upgrading again might help)
>> > See the thread with subject like 'Re: r316958: booting a server takes
>> >10
>> > minutes!'
>> >
>> > Regards,
>> >
>> > Ronald.
>>
>>
>> Yes. But Tom V.'s report is for -r317039, which is after the reported
>> fixes as far as I can tell. Something besides syslogd might also cause
>> problems?
>>
>> In my nearly-default -r317015 ardm64 context [as a VirtualBox guest]
>> I've not seen the problem, where I did before. (The only reason sendmail
>> runs in my context is for the messages FreeBSD sends to it own local
>> accounts. I do not otherwise use mail in this context.)
>>
>> Tom V.'s report vs. others finding lack of a problem suggests that the
>> coverage of the fixes is incomplete somehow but useful. I happen to not
>> be doing whatever causes the problem to appear. I've no clue what might
>> be different or unusual in Tom V.'s context.
>>
>> There is also the possibility that Tom V.'s report is a fully independent
>> issue. But such does not seem all that likely on the initial information.
>>
>>
>> > On 2017-Apr-17, at 7:57 AM, Mark Millard 
>> wrote:
>> >
>> >> Just an FYI of a more recent report of runaway sendmail on a
>> >> more recent system version ( -r317039 ):
>> >>
>> >> Begin forwarded message:
>> >>
>> >>> From: Tom Vijlbrief 
>> >>> Subject: Sendmail eats CPU on r317039
>> >>> Date: April 17, 2017 at 3:39:37 AM PDT
>> >>> To: "freebsd-current at freebsd.org" ,
>> freebsd-arm 
>> >>>
>> >>> On a recent kernel sendmail is constantly consuming CPU.
>> >>>
>> >>> truss -p PID
>> >>>
>> >>> shows:
>> >>>
>> >>> sendto(3,"<22>Apr 17 10:30:33 sendmail[362"...,163,0x0,NULL,0) ERR#55
>> 'No
>> >>> buffer space available'
>> >>> nanosleep({ 0.01000 }) = 0 (0x0)
>> >>> sendto(3,"<22>Apr 17 10:30:33 sendmail[362"...,163,0x0,NULL,0) ERR#55
>> 'No
>> >>> buffer space available'
>> >>> nanosleep({ 0.01000 })
>> >>> ...
>> >>>
>> >>> This is on an arm64 system
>> >>
>> >> Analysis of Tom V.'s context for this may be required.
>>
>> ===
>> Mark Millard
>> markmi at dsl-only.net
>>
>> ___
>> freebsd-...@freebsd.org mailing list
>> https://lists.freebsd.org/mailman/listinfo/freebsd-arm
>> To unsubscribe, send any mail to "freebsd-arm-unsubscr...@freebsd.org"
>>
>
___
freebsd-current@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to "freebsd-current-unsubscr...@freebsd.org"


Re: Sendmail eats CPU on r317039 [after -r316874 it may be -r316951 and -r316973 are not enough to fix everything]

2017-04-19 Thread Tom Vijlbrief
I'm currently rebuilding world and kernel on a just completed SVN checkout.

Note that the normal sendmail daemon which listens for incoming traffic
does NOT loop.

The sendmail instance which tries local delivery (echo Hi | mail root) or
the msp_queue instance is looping.

It might be an arm64 specific issue, but a few weeks ago this was not an
issue.

Op di 18 apr. 2017 om 21:15 schreef Mark Millard <mar...@dsl-only.net>:

> Ronald Klop ronald-lists at klop.ws wrote on Tue Apr 18 09:59:50 UTC 2017:
>
> > there is a thread ono this list about a problem in syslogd which made
> > syslog-clients (like sendmail busy-looping on logging.
> > That might be related to this. (it is fixed in the source already, so
> > upgrading again might help)
> > See the thread with subject like 'Re: r316958: booting a server takes >10
> > minutes!'
> >
> > Regards,
> >
> > Ronald.
>
>
> Yes. But Tom V.'s report is for -r317039, which is after the reported
> fixes as far as I can tell. Something besides syslogd might also cause
> problems?
>
> In my nearly-default -r317015 ardm64 context [as a VirtualBox guest]
> I've not seen the problem, where I did before. (The only reason sendmail
> runs in my context is for the messages FreeBSD sends to it own local
> accounts. I do not otherwise use mail in this context.)
>
> Tom V.'s report vs. others finding lack of a problem suggests that the
> coverage of the fixes is incomplete somehow but useful. I happen to not
> be doing whatever causes the problem to appear. I've no clue what might
> be different or unusual in Tom V.'s context.
>
> There is also the possibility that Tom V.'s report is a fully independent
> issue. But such does not seem all that likely on the initial information.
>
>
> > On 2017-Apr-17, at 7:57 AM, Mark Millard  wrote:
> >
> >> Just an FYI of a more recent report of runaway sendmail on a
> >> more recent system version ( -r317039 ):
> >>
> >> Begin forwarded message:
> >>
> >>> From: Tom Vijlbrief 
> >>> Subject: Sendmail eats CPU on r317039
> >>> Date: April 17, 2017 at 3:39:37 AM PDT
> >>> To: "freebsd-current at freebsd.org" ,
> freebsd-arm 
> >>>
> >>> On a recent kernel sendmail is constantly consuming CPU.
> >>>
> >>> truss -p PID
> >>>
> >>> shows:
> >>>
> >>> sendto(3,"<22>Apr 17 10:30:33 sendmail[362"...,163,0x0,NULL,0) ERR#55
> 'No
> >>> buffer space available'
> >>> nanosleep({ 0.01000 }) = 0 (0x0)
> >>> sendto(3,"<22>Apr 17 10:30:33 sendmail[362"...,163,0x0,NULL,0) ERR#55
> 'No
> >>> buffer space available'
> >>> nanosleep({ 0.01000 })
> >>> ...
> >>>
> >>> This is on an arm64 system
> >>
> >> Analysis of Tom V.'s context for this may be required.
>
> ===
> Mark Millard
> markmi at dsl-only.net
>
> ___
> freebsd-...@freebsd.org mailing list
> https://lists.freebsd.org/mailman/listinfo/freebsd-arm
> To unsubscribe, send any mail to "freebsd-arm-unsubscr...@freebsd.org"
>
___
freebsd-current@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to "freebsd-current-unsubscr...@freebsd.org"


Re: Sendmail eats CPU on r317039 [after -r316874 it may be -r316951 and -r316973 are not enough to fix everything]

2017-04-18 Thread Mark Millard
Ronald Klop ronald-lists at klop.ws wrote on Tue Apr 18 09:59:50 UTC 2017:

> there is a thread ono this list about a problem in syslogd which made  
> syslog-clients (like sendmail busy-looping on logging.
> That might be related to this. (it is fixed in the source already, so  
> upgrading again might help)
> See the thread with subject like 'Re: r316958: booting a server takes >10  
> minutes!'
> 
> Regards,
> 
> Ronald.


Yes. But Tom V.'s report is for -r317039, which is after the reported
fixes as far as I can tell. Something besides syslogd might also cause
problems?

In my nearly-default -r317015 ardm64 context [as a VirtualBox guest]
I've not seen the problem, where I did before. (The only reason sendmail
runs in my context is for the messages FreeBSD sends to it own local
accounts. I do not otherwise use mail in this context.)

Tom V.'s report vs. others finding lack of a problem suggests that the
coverage of the fixes is incomplete somehow but useful. I happen to not
be doing whatever causes the problem to appear. I've no clue what might
be different or unusual in Tom V.'s context.

There is also the possibility that Tom V.'s report is a fully independent
issue. But such does not seem all that likely on the initial information.


> On 2017-Apr-17, at 7:57 AM, Mark Millard  wrote:
> 
>> Just an FYI of a more recent report of runaway sendmail on a
>> more recent system version ( -r317039 ):
>> 
>> Begin forwarded message:
>> 
>>> From: Tom Vijlbrief 
>>> Subject: Sendmail eats CPU on r317039
>>> Date: April 17, 2017 at 3:39:37 AM PDT
>>> To: "freebsd-current at freebsd.org" , 
>>> freebsd-arm 
>>> 
>>> On a recent kernel sendmail is constantly consuming CPU.
>>> 
>>> truss -p PID
>>> 
>>> shows:
>>> 
>>> sendto(3,"<22>Apr 17 10:30:33 sendmail[362"...,163,0x0,NULL,0) ERR#55 'No
>>> buffer space available'
>>> nanosleep({ 0.01000 }) = 0 (0x0)
>>> sendto(3,"<22>Apr 17 10:30:33 sendmail[362"...,163,0x0,NULL,0) ERR#55 'No
>>> buffer space available'
>>> nanosleep({ 0.01000 })
>>> ...
>>> 
>>> This is on an arm64 system
>> 
>> Analysis of Tom V.'s context for this may be required.

===
Mark Millard
markmi at dsl-only.net

___
freebsd-current@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to "freebsd-current-unsubscr...@freebsd.org"


Re: Fwd: Sendmail eats CPU on r317039 [after -r316874 it may be -r316951 and -r316973 are not enough to fix everything]

2017-04-18 Thread Ronald Klop
there is a thread ono this list about a problem in syslogd which made  
syslog-clients (like sendmail busy-looping on logging.
That might be related to this. (it is fixed in the source already, so  
upgrading again might help)
See the thread with subject like 'Re: r316958: booting a server takes >10  
minutes!'


Regards,

Ronald.



On Mon, 17 Apr 2017 16:57:39 +0200, Mark Millard <mar...@dsl-only.net>  
wrote:



Just an FYI of a more recent report of runaway sendmail on a
more recent system version ( -r317039 ):

Begin forwarded message:


From: Tom Vijlbrief 
Subject: Sendmail eats CPU on r317039
Date: April 17, 2017 at 3:39:37 AM PDT
To: "freebsd-current at freebsd.org" ,  
freebsd-arm 


On a recent kernel sendmail is constantly consuming CPU.

truss -p PID

shows:

sendto(3,"<22>Apr 17 10:30:33 sendmail[362"...,163,0x0,NULL,0) ERR#55  
'No

buffer space available'
nanosleep({ 0.01000 }) = 0 (0x0)
sendto(3,"<22>Apr 17 10:30:33 sendmail[362"...,163,0x0,NULL,0) ERR#55  
'No

buffer space available'
nanosleep({ 0.01000 })
...

This is on an arm64 system


Analysis of Tom V.'s context for this may be required.

===
Mark Millard
markmi at dsl-only.net

___
freebsd-current@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to  
"freebsd-current-unsubscr...@freebsd.org"

___
freebsd-current@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to "freebsd-current-unsubscr...@freebsd.org"


Fwd: Sendmail eats CPU on r317039 [after -r316874 it may be -r316951 and -r316973 are not enough to fix everything]

2017-04-17 Thread Mark Millard
Just an FYI of a more recent report of runaway sendmail on a
more recent system version ( -r317039 ):

Begin forwarded message:

> From: Tom Vijlbrief 
> Subject: Sendmail eats CPU on r317039
> Date: April 17, 2017 at 3:39:37 AM PDT
> To: "freebsd-current at freebsd.org" , 
> freebsd-arm 
> 
> On a recent kernel sendmail is constantly consuming CPU.
> 
> truss -p PID
> 
> shows:
> 
> sendto(3,"<22>Apr 17 10:30:33 sendmail[362"...,163,0x0,NULL,0) ERR#55 'No
> buffer space available'
> nanosleep({ 0.01000 }) = 0 (0x0)
> sendto(3,"<22>Apr 17 10:30:33 sendmail[362"...,163,0x0,NULL,0) ERR#55 'No
> buffer space available'
> nanosleep({ 0.01000 })
> ...
> 
> This is on an arm64 system

Analysis of Tom V.'s context for this may be required.

===
Mark Millard
markmi at dsl-only.net

___
freebsd-current@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to "freebsd-current-unsubscr...@freebsd.org"


Sendmail eats CPU on r317039

2017-04-17 Thread Tom Vijlbrief
On a recent kernel sendmail is constantly consuming CPU.

truss -p PID

shows:

sendto(3,"<22>Apr 17 10:30:33 sendmail[362"...,163,0x0,NULL,0) ERR#55 'No
buffer space available'
nanosleep({ 0.01000 }) = 0 (0x0)
sendto(3,"<22>Apr 17 10:30:33 sendmail[362"...,163,0x0,NULL,0) ERR#55 'No
buffer space available'
nanosleep({ 0.01000 })
...

This is on an arm64 system
___
freebsd-current@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to "freebsd-current-unsubscr...@freebsd.org"