from:"Martin Blapp"

Re: Open-vm-tools port available for testing

2008-03-28 Thread Martin Blapp



Hi,

Thanks for doing this, seems to be some problems with the downloading of the 
tarball though. See below:


Just fixed that. The problem was the subdir name.

--
Martin

Martin Blapp, [EMAIL PROTECTED] [EMAIL PROTECTED]
--
ImproWare AG, UNIXSP  ISP, Zurlindenstrasse 29, 4133 Pratteln, CH
Phone: +41 61 826 93 00 Fax: +41 61 826 93 01
PGP: finger -l [EMAIL PROTECTED]
PGP Fingerprint: B434 53FC C87C FE7B 0A18 B84C 8686 EF22 D300 551E
--

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Open-vm-tools port available for testing

2008-03-27 Thread Martin Blapp



Hi,

I've just made a port for FreeBSD 7 and FreeBSD 6 for the
Open-vmware-tools. Any Feedback is welcome.

http://antispam.imp.ch/patches/open-vmware-tools-freebsd-port.tgz

--
Martin

Martin Blapp, [EMAIL PROTECTED] [EMAIL PROTECTED]
--
ImproWare AG, UNIXSP  ISP, Zurlindenstrasse 29, 4133 Pratteln, CH
Phone: +41 61 826 93 00 Fax: +41 61 826 93 01
PGP: finger -l [EMAIL PROTECTED]
PGP Fingerprint: B434 53FC C87C FE7B 0A18 B84C 8686 EF22 D300 551E
--
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Mimedefang crashes on FreeBSD 6 STABLE, amd64

2007-10-19 Thread Martin Blapp



Hi everybody,

I'm trying to get mimedefang running on amd64. But unfortunatly the threaded
milter part ('mimedefang') does segfault after some time, normally 1-2 minutes.

pid 2331 (mimedefang), uid 1001: exited on signal 11 (core dumped)

gdb /idms/bin/mimedefang mimedefang-2331.core
#0  0x00080066389c in pthread_testcancel () from /lib/libpthread.so.2
[New Thread 0x56 (runnable)]
[New Thread 0x599800 (runnable)]
[New Thread 0x546c00 (runnable)]
[New Thread 0x5ad400 (runnable)]
[New Thread 0x5ad000 (runnable)]
[New Thread 0x599c00 (runnable)]
[New Thread 0x560800 (runnable)]
[New Thread 0x55e800 (runnable)]
[New Thread 0x55ec00 (runnable)]
[New Thread 0x55e400 (runnable)]
[New Thread 0x560c00 (runnable)]
[New Thread 0x58fc00 (runnable)]
[New Thread 0x57fc00 (runnable)]
[New Thread 0x599000 (runnable)]
[New Thread 0x52ac00 (runnable)]
[New Thread 0x57f800 (runnable)]
[New Thread 0x58f000 (runnable)]
[New Thread 0x57f400 (runnable)]
[New Thread 0x58f800 (runnable)]
[New Thread 0x52a800 (runnable)]
[New Thread 0x57f000 (runnable)]
[New Thread 0x546800 (runnable)]
[New Thread 0x52a400 (sleeping)]
[New Thread 0x52a000 (LWP 100058)]
[New Thread 0x524000 (runnable)]
[New LWP 100266]

Unfortunaltly the stack trace doesn't seem to be very usable:

(gdb) where
#0  0x00080066c3fc in kse_thr_interrupt () at kse_thr_interrupt.S:2
#1  0x00080065390a in sig_daemon (arg=0x0) at 
/usr/src/lib/libpthread/thread/thr_sig.c:214
#2  0x000800661e2e in kse_sched_single (kmbx=0x521318) at 
/usr/src/lib/libpthread/thread/thr_kern.c:886
#3  0x in ?? ()
Cannot access memory at address 0x7fbff000

(gdb) frame 2
#2  0x000800661e2e in kse_sched_single (kmbx=0x521318) at 
/usr/src/lib/libpthread/thread/thr_kern.c:886

886 pthread_exit(curthread-start_routine(curthread-arg));
(gdb) p kmbx

$1 = (struct kse_mailbox *) 0x521318

(gdb) p *kmbx
$2 = {km_version = 0, km_curthread = 0x0, km_completed = 0x0, km_sigscaught = 
{__bits = {0, 0, 0, 0}}, km_flags = 19,
  km_func = 0x800661560 kse_sched_single, km_stack = {ss_sp = 0x7f9ff000 
Address 0x7f9ff000 out of bounds,
ss_size = 2097152, ss_flags = 0}, km_udata = 0x51c600, km_timeofday = 
{tv_sec = 0, tv_nsec = 0}, km_quantum = 0, km_lwp = 100058,

  __spare2__ = {0, 0, 0, 0, 0, 0, 0}}

I've tried to replace libpthread.so.2 with libc_r.6 or libthr.2, but
this doesn't help at all, I get smiliar segfaults. libc_r.6 is a userland
threading library, so it's definitly not the kernel which has a problem.

Are there known bugs with mimedefang on 64bit architectures ?

--
Martin

Martin Blapp, [EMAIL PROTECTED] [EMAIL PROTECTED]
--
ImproWare AG, UNIXSP  ISP, Zurlindenstrasse 29, 4133 Pratteln, CH
Phone: +41 61 826 93 00 Fax: +41 61 826 93 01
PGP: finger -l [EMAIL PROTECTED]
PGP Fingerprint: B434 53FC C87C FE7B 0A18 B84C 8686 EF22 D300 551E
--

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Mimedefang crashes on FreeBSD 6 STABLE, amd64

2007-10-19 Thread Martin Blapp



A kdump output shows always the same output. The file descriptor '108/0x6c'
doesn't look very valid 

--

 36626 mimedefang RET   kse_release 0
 36626 mimedefang RET   kse_release 0
 36626 mimedefang RET   fork 0
 36626 mimedefang CALL  kse_release(0x537f70)
 36626 mimedefang RET   kse_release 0
 36626 mimedefang RET   kse_release 0
 36626 mimedefang CALL  kse_release(0x521f70)
 36626 mimedefang CALL  gettimeofday(0x7f5f8eb0,0)
 36626 mimedefang CALL  kse_release(0x53ff70)
 36626 mimedefang CALL  kse_release(0x53bf70)
 36626 mimedefang RET   kse_release 0
 36626 mimedefang RET   kse_release 0
 36626 mimedefang CALL  kse_release(0x521f70)
 36626 mimedefang CALL  getpid
 36626 mimedefang RET   getpid 36626/0x8f12
 36626 mimedefang CALL  sendto(0x3,0x7f5f93b0,0x6c,0,0,0)
 36626 mimedefang GIO   fd 3 wrote 108 bytes
   20Oct 19 11:55:57 mimedefang[36626]: 5422080 - 0x540c00: 
mimedefang.c(1924): EXIT cleanup: SMFIS_CONTINUE

 36626 mimedefang RET   sendto 108/0x6c
 36626 mimedefang CALL  gettimeofday(0x7f5f8f10,0)
 36626 mimedefang RET   gettimeofday 0
 36626 mimedefang CALL  getpid
 36626 mimedefang RET   getpid 36626/0x8f12
 36626 mimedefang CALL  sendto(0x3,0x7f5f9410,0x6c,0,0,0)
 36626 mimedefang GIO   fd 3 wrote 108 bytes
   20Oct 19 11:55:57 mimedefang[36626]: 5422080 - 0x540c00: 
mimedefang.c(1888): EXIT mfclose: SMFIS_CONTINUE

 36626 mimedefang RET   sendto 108/0x6c
 36626 mimedefang PSIG  SIGSEGV SIG_DFL
 36626 mimedefang CALL  kse_thr_interrupt(0,0x4,0xb)
 36626 mimedefang NAMI  /var/core/mimedefang-36626.core


--

   20Oct 19 11:53:03 mimedefang[33960]: 5422080 - 0x51de00: 
mimedefang.c(1922): ENTER cleanup
 33960 mimedefang RET   sendto 93/0x5d
 33960 mimedefang CALL  gettimeofday(0x7f5f8eb0,0)
 33960 mimedefang RET   gettimeofday 0
 33960 mimedefang CALL  getpid
 33960 mimedefang RET   getpid 33960/0x84a8
 33960 mimedefang CALL  sendto(0x3,0x7f5f93b0,0x6c,0,0,0)
 33960 mimedefang GIO   fd 3 wrote 108 bytes
   20Oct 19 11:53:03 mimedefang[33960]: 5422080 - 0x51de00: 
mimedefang.c(1924): EXIT cleanup: SMFIS_CONTINUE
 33960 mimedefang RET   sendto 108/0x6c
 33960 mimedefang CALL  gettimeofday(0x7f5f8f10,0)
 33960 mimedefang RET   gettimeofday 0
 33960 mimedefang CALL  getpid
 33960 mimedefang RET   getpid 33960/0x84a8
 33960 mimedefang CALL  sendto(0x3,0x7f5f9410,0x6c,0,0,0)
 33960 mimedefang GIO   fd 3 wrote 108 bytes
   20Oct 19 11:53:03 mimedefang[33960]: 5422080 - 0x51de00: 
mimedefang.c(1888): EXIT mfclose: SMFIS_CONTINUE
 33960 mimedefang RET   sendto 108/0x6c
 33960 mimedefang PSIG  SIGSEGV SIG_DFL
 33960 mimedefang CALL  kse_thr_interrupt(0,0x4,0xb)
 33960 mimedefang NAMI  /var/core/mimedefang-33960.core

--

Martin Blapp, [EMAIL PROTECTED] [EMAIL PROTECTED]
--
ImproWare AG, UNIXSP  ISP, Zurlindenstrasse 29, 4133 Pratteln, CH
Phone: +41 61 826 93 00 Fax: +41 61 826 93 01
PGP: finger -l [EMAIL PROTECTED]
PGP Fingerprint: B434 53FC C87C FE7B 0A18 B84C 8686 EF22 D300 551E
--
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: [Mimedefang] Re: Mimedefang crashes on FreeBSD 6 STABLE, amd64

2007-10-19 Thread Martin Blapp



Hi,

Looks like I found the problem and it was a local patch - ouch. Some
casts that worked in i386 didn't work on amd64 ... sigh.

Sorry for the noise.

--
Martin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Panic: vm_page_free: freeing wired page with 6.2 RELEASE

2007-04-22 Thread Martin Blapp



Hi,

Just got this panic on a loaded mailserver ... The server was rocking stable
up to this panic, and after we loaded it a bit more recently, it paniced.

Any ideas ?

--
Martin

#0  doadump () at pcpu.h:165
#1  0xc067550a in boot (howto=260) at /usr/src/sys/kern/kern_shutdown.c:409
#2  0xc0675831 in panic (fmt=0xc0915bd0 vm_page_free: freeing wired page) at 
/usr/src/sys/kern/kern_shutdown.c:565
#3  0xc07f15a3 in vm_page_free_toq (m=0xc25380b0) at 
/usr/src/sys/vm/vm_page.c:1131
#4  0xc07f0ad1 in vm_page_free (m=0xc25380b0) at /usr/src/sys/vm/vm_page.c:471
#5  0xc07e4be0 in vm_fault (map=0xcadcf378, vaddr=678141952, fault_type=1 
'\001', fault_flags=0)
at /usr/src/sys/vm/vm_fault.c:632
#6  0xc088df07 in trap_pfault (frame=0xeb31f820, usermode=0, eva=678141952) at 
/usr/src/sys/i386/i386/trap.c:722
#7  0xc088dc15 in trap (frame=  {tf_fs = -1065549816, tf_es = -65496, tf_ds = -65496, tf_edi = -548069376, 
tf_esi = 678141952, tf_ebp = -349046640, tf_isp = -349046708, tf_ebx = 16384, 
tf_edx = 678158336, tf_ecx = 4096, tf_eax = -349045504, tf_trapno = 12, tf_err = 
0, tf_eip = -1064779658, tf_cs = 32, tf_eflags = 66054, tf_esp = -349046248, 
tf_ss = -349046324})at /usr/src/sys/i386/i386/trap.c:435

#8  0xc0879d4a in calltrap () at /usr/src/sys/i386/i386/exception.s:139
#9  0xc088c076 in slow_copyin () at /usr/src/sys/i386/i386/support.s:872

Martin Blapp, [EMAIL PROTECTED] [EMAIL PROTECTED]
--
ImproWare AG, UNIXSP  ISP, Zurlindenstrasse 29, 4133 Pratteln, CH
Phone: +41 61 826 93 00 Fax: +41 61 826 93 01
PGP: finger -l [EMAIL PROTECTED]
PGP Fingerprint: B434 53FC C87C FE7B 0A18 B84C 8686 EF22 D300 551E
--

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Panic: vm_page_free: freeing wired page with 6.2 RELEASE

2007-04-22 Thread Martin Blapp



Hi,


Can you post your kernel configuration file?


It's the generic SMP kernel file (which includes GENERIC), with
nothing changed.

sysctl.conf is:

kern.sugid_coredump=1
kern.timecounter.hardware=TSC
kern.corefile=/var/core/%N-%P.core
kern.maxfiles=64000
kern.maxfilesperproc=32000

The server has two CPUs. The only special thing
is that with the current BIOS ACPI is broken.

--
Martin

FreeBSD 6.2-RELEASE #0: Tue Jan 23 11:13:16 CET 2007
[EMAIL PROTECTED]:/usr/obj/usr/src/sys/SMP
Timecounter i8254 frequency 1193182 Hz quality 0
CPU: Intel(R) Xeon(TM) CPU 3.00GHz (3000.13-MHz 686-class CPU)
  Origin = GenuineIntel  Id = 0xf43  Stepping = 3

Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,
SSE2,SS,HTT,TM,PBE
  Features2=0x641dSSE3,RSVD2,MON,DS_CPL,CNTX-ID,CX16,b14
  AMD Features=0x2000LM
  Logical CPUs per core: 2
real memory  = 3623665664 (3455 MB)
avail memory = 3546501120 (3382 MB)
MPTable: IBM ENSW x346 SMP
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 cpu0 (BSP): APIC ID:  0
 cpu1 (AP): APIC ID:  6
ioapic0: Assuming intbase of 0
ioapic1: Assuming intbase of 24
ioapic2: Assuming intbase of 48
ioapic3: Assuming intbase of 72
ioapic4: Assuming intbase of 96
ioapic4 Version 2.0 irqs 96-119 on motherboard
ioapic3 Version 2.0 irqs 72-95 on motherboard
ioapic2 Version 2.0 irqs 48-71 on motherboard
ioapic1 Version 2.0 irqs 24-47 on motherboard
ioapic0 Version 2.0 irqs 0-23 on motherboard
kbd1 at kbdmux0
ath_hal: 0.9.17.2 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413, RF5413)
ACPI-0159: *** Error: AcpiLoadTables: Could not get RSDP, AE_NO_ACPI_TABLES
ACPI-0213: *** Error: AcpiLoadTables: Could not load tables: 
AE_NO_ACPI_TABLES

ACPI: table load failed: AE_NO_ACPI_TABLES
cpu0 on motherboard
cpu1 on motherboard
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

RE: Clamav-90_2 Lockup with freebsd 6.2

2007-03-18 Thread Martin Blapp



Hi,


I am not sure why. But my dual xeon with libthr on clamav-90.1 still gives
very high cpu usage.


It's the same case here. What happens if you limit kern.smp.maxcpus
to 1 ? Does it still use the same amount cpu time ? What happens if
you link clamd against libc_r ?

--
Martin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Clamav-90_2 Lockup with freebsd 6.2

2007-03-16 Thread Martin Blapp



Hi,

I just fixed those issues with the port.

Thanks for reporting !

Martin Blapp, [EMAIL PROTECTED] [EMAIL PROTECTED]
--
ImproWare AG, UNIXSP  ISP, Zurlindenstrasse 29, 4133 Pratteln, CH
Phone: +41 61 826 93 00 Fax: +41 61 826 93 01
PGP: finger -l [EMAIL PROTECTED]
PGP Fingerprint: B434 53FC C87C FE7B 0A18 B84C 8686 EF22 D300 551E
--

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Clamav-90_2 Lockup with freebsd 6.2

2007-03-12 Thread Martin Blapp



Hi,


[snip patch]

Does this patch make the libmap.conf hack unneeded?



I frist thought so, but no, the more threads are running concurrently
the slower libpthreads behaves, as it gets more lock contention. Until
this is addressed somehow, use libthr.so

--
Martin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Clamav-90_2 Lockup with freebsd 6.2

2007-03-10 Thread Martin Blapp



Hi,


Don't use it, it's broken.

-trog


Nope, it looks like a race in cli_scanmail() which deadlocks somewhere
with libpthread.so. This workaround fixes the problem for me. I'm still
investigating where the real cause for this problem is.

--- libclamav/scanners.cTue Feb 13 02:06:28 2007
+++ libclamav/scanners.cSat Mar 10 12:00:16 2007
@@ -38,6 +38,9 @@
 #include netinet/in.h
 #endif

+#  include pthread.h
+static pthread_mutex_t extractmail_mutex = PTHREAD_MUTEX_INITIALIZER;
+
 #if HAVE_MMAP
 #if HAVE_SYS_MMAN_H
 #include sys/mman.h
@@ -1652,12 +1655,16 @@
 /*
  * Extract the attachments into the temporary directory
  */
+pthread_mutex_lock(extractmail_mutex);
 if((ret = cli_mbox(dir, desc, ctx))) {
-   if(!cli_leavetemps_flag)
+   if(!cli_leavetemps_flag) {
+   pthread_mutex_unlock(extractmail_mutex);
cli_rmdirs(dir);
+   }
free(dir);
return ret;
 }
+pthread_mutex_unlock(extractmail_mutex);

 ret = cli_scandir(dir, ctx);

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Clamav-90_2 Lockup with freebsd 6.2

2007-03-10 Thread Martin Blapp



And even more narrowed down ...

--- libclamav/mbox.c.orig   Tue Feb 13 14:06:57 2007
+++ libclamav/mbox.cSat Mar 10 14:09:09 2007
@@ -413,6 +413,7 @@

 #ifdef CL_THREAD_SAFE
 static pthread_mutex_t tables_mutex = PTHREAD_MUTEX_INITIALIZER;
+static  pthread_mutex_t body_mutex = PTHREAD_MUTEX_INITIALIZER;
 #endif

 #ifndefO_BINARY
@@ -1494,6 +1495,7 @@
/*
 * Write out the last entry in the mailbox
 */
+   pthread_mutex_lock(body_mutex);
if((retcode == CL_SUCCESS)  messageGetBody(body)) {
messageSetCTX(body, ctx);
switch(parseEmailBody(body, NULL, mctx, 0)) {
@@ -1505,6 +1507,7 @@
break;
}
}
+   pthread_mutex_unlock(body_mutex);

/*
 * Tidy up and quit

Martin Blapp, [EMAIL PROTECTED] [EMAIL PROTECTED]
--
ImproWare AG, UNIXSP  ISP, Zurlindenstrasse 29, 4133 Pratteln, CH
Phone: +41 61 826 93 00 Fax: +41 61 826 93 01
PGP: finger -l [EMAIL PROTECTED]
PGP Fingerprint: B434 53FC C87C FE7B 0A18 B84C 8686 EF22 D300 551E
--

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Clamav-90_2 Lockup with freebsd 6.2

2007-03-08 Thread Martin Blapp



Hi,


No, It's not. Today I added a new server with fresh clamav-0.90_3
package. Sockstat again started to jump to the sky.


Clamd with libpthread.so is still broken. Please use libthr.so. I'm currently
investigating why libpthreads.so has problems with clamd, and it looks to me
like a library bug.

--
Martin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Clamav-90_2 Lockup with freebsd 6.2 (fwd)

2007-03-05 Thread Martin Blapp



After further analyzing I think that pthread_cond_timedwait() in 
libpthread.so.2 has some issues.


While the default clamd worker timeout of 30 seconds is reached with
libc_r.so.6 and libthr.so.2 and pthread_cond_timedwait() returns ETIMEDOUT
there, libpthread.so doesn't get any ETIMEDOUT errors back at all, and the code
can never reach the part where the worker thread gets detached and the thread
count gets decreased.

With libpthread.so.2 pthread_cond_timedwait() returns always 0.

The manpage tells me:


The pthread_cond_timedwait() function atomically blocks the current
thread waiting on the condition variable specified by cond, and unblocks
the mutex specified by mutex.  The waiting thread unblocks only after
another thread calls pthread_cond_signal(3), or pthread_cond_broadcast(3)
with the same condition variable, or if the system time reaches the time
specified in abstime, and the current thread reacquires the lock on
mutex.


That doesn't seem to work with libpthread.so.2. Any hints ?

--
Martin

Martin Blapp, [EMAIL PROTECTED] [EMAIL PROTECTED]
--
ImproWare AG, UNIXSP  ISP, Zurlindenstrasse 29, 4133 Pratteln, CH
Phone: +41 61 826 93 00 Fax: +41 61 826 93 01
PGP: finger -l [EMAIL PROTECTED]
PGP Fingerprint: B434 53FC C87C FE7B 0A18 B84C 8686 EF22 D300 551E
--

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Clamav-90_2 Lockup with freebsd 6.2

2007-03-04 Thread Martin Blapp



Hi all,

After adding some debug stuff to clamd running with freebsd
libpthread.so I found that:

looking at the output value of 'ps -auxH | grep clamd | grep -v grep | wc -l'
is always staying at 6 threads, but the number of threadpool-thr_alive is
going higher and higher. So threadpool-thr_alive isn't decreased and
is completly incorrect.

In my tests the number of threads with libpthread.so is never going higher than 
6 threads. That explains, why a higher load is going to delay all scan 
operations more and more.


With libthr.so the value of threadpool-thr_alive is equal with the 
output of the ps. It can go very high, but always goes back if no more

work is there to do.

After the maxthreads limit is reached, clamd becomes completly unresponsive
with libpthreads.so.

SIGKILL and SIGSTOP don't work because some (unexisting) worker threads block 
on pthread_cond_timedwait(). Only kill -9 helps. Of course, the counter 
threadpool-thr_alive is wrong and may cause this problem.


Maybe someone with more threads knowledge can help here. I'll now add some debug
stuff to see where threadpool-thr_alive is going to be increased and where it
is decreased. The strange thing is that this works fine with libthr.so.

Martin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Clamav-90_2 Lockup with freebsd 6.2

2007-03-04 Thread Martin Blapp



Hi,


looking at the output value of 'ps -auxH | grep clamd | grep -v grep | wc -l'
is always staying at 6 threads, but the number of threadpool-thr_alive is
going higher and higher. So threadpool-thr_alive isn't decreased and
is completly incorrect.


After setting IdleTimeout very low to 5 seconds, the threadpool-thr_alive gets 
decreased correctly. There are still always 6 threads running, but further 
threads get cleaned up correctly.


IMHO the whole clamd thread management is somehow broken.

--
Martin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Clamav-90_2 Lockup with freebsd 6.2

2007-03-04 Thread Martin Blapp



Hi,


Make sure clamd is checking the return value of pthread_create()
for errors.  You can also set LIBPTHREAD_PROCESS_SCOPE=yes or
LIBPTHREAD_SYSTEM_SCOPE=yes in your environment to force process
scope or system scope threads (libpthread only) for clamd.


Yes, it does check the return value. And setting system or process scope
doesn't make any difference. As I said, increasing the thread count
works, decreading doesn't.

--
Martin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Clamav-90_2 Lockup with freebsd 6.2

2007-03-02 Thread Martin Blapp



Hi,



[/usr/local/sbin/clamd]
libpthread.so.2 libthr.so.2
libpthread.so   libthr.so
libc_r.so.6 libpthread.so.2


Correct, just delete the last line. I forgot to delete
the only entry.



The right side of that last line should probably refer to libthr.so.2.
AFAICS, what you have there makes no sense.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Clamav-90_2 Lockup with freebsd 6.2

2007-03-01 Thread Martin Blapp



Hi,

Clamd is currently broken with libpthread for some threading-reason.
You definitly need to use libthr (which is still CPU hungry, but
works better).

/etc/libmap.conf

[/usr/local/sbin/clamd]
libpthread.so.2 libthr.so.2
libpthread.so   libthr.so
libc_r.so.6 libpthread.so.2

Martin



Dozens? Lucky you. I had 600 of them. OK, now we know that simscan
is not the problem.

--
One cannot sell the earth upon which the people walk
Tacunka Witco


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Clamav-90_2 Lockup with freebsd 6.2

2007-02-27 Thread Martin Blapp



Hi,


Change the threading lib.  It fixed it for us.

% cat /etc/libmap.conf
[clamd]
libc_r.so.5 libthr.so.2
libc_r.so.6 libthr.so.2
libthr.so.2 libthr.so.2
libpthread.so.1 libthr.so.2
libpthread.so.2 libthr.so.2


Even with libthr the CPU usage is still far too high ...
I'm currently looking at the code.

Martin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: [HEADSUP] New unionfs merged to RELENG_6 (stable)

2007-02-17 Thread Martin Blapp



Hi,

I feel the mount_union manpage isn't 100% correct anymore, especially the
the BUGS section ? Or do the things there still apply ?

Martin

Martin Blapp, [EMAIL PROTECTED] [EMAIL PROTECTED]
--
ImproWare AG, UNIXSP  ISP, Zurlindenstrasse 29, 4133 Pratteln, CH
Phone: +41 61 826 93 00 Fax: +41 61 826 93 01
PGP: finger -l [EMAIL PROTECTED]
PGP Fingerprint: B434 53FC C87C FE7B 0A18 B84C 8686 EF22 D300 551E
--

On Wed, 14 Feb 2007, Craig Rodrigues wrote:


Hi,

I have merged the new unionfs implementation from
CURRENT to RELENG_6 (stable branch).  This code
was submitted by Daichi GOTO and Masanori OZAWA.
Many, many thanks to GOTO-san and OZAWA-san for their work.

This code solves a lot of crashing problems that the old
unionfs implementation had.

Reports from people running SMP systems would be appreciated.
There probably is some room for performance optimization on SMP systems.

Also, thanks a lot to Kostik Belousov, who recently fixed some deadlock
problems in vfs_lookup.c, which helps a lot with unionfs.
--
Craig Rodrigues
[EMAIL PROTECTED]
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Status of FreeBSD 6.2?

2006-12-26 Thread Martin Blapp



Build 6.2-RC2   24 December 2006Begin building the second release 
candidate

build for all Tier-1 platforms.


http://www.freebsd.org/releases/6.2R/schedule.html

Martin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: ips(4) in toaster mode

2006-11-18 Thread Martin Blapp



Hi,


Also, the only harddisk on that host is the ips(4), so I can not obtain
a kernel dump. I'm not sure if this is a hardware failure, at least, no
led on the panel is shown red...


Hmm ? We do kernel dumps on ips(4) and it works.

dumpdev=/dev/ipsd0s1b

Martin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: HEADS UP: FreeBSD 5.3, 5.4, 6.0 EoLs coming soon

2006-10-04 Thread Martin Blapp



Hi,


In June 2006, I opened a PR (kern/98622) about a regression on CARP
with IPv6 addresses: CARP is not usable with IPv6. Since I tracked
down the culprit commit (see appropriate info in the PR), I can
affirm that this regression appeared before the 6.1-RELEASE.


Wouldn't it be better to fix this in RELENG_6 before 6.2 RELEASE ?

Martin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: CALL FOR TESTERS! [Re: 6.2 SHOWSTOPPER - em completely unusable on 6.2]

2006-10-04 Thread Martin Blapp



Hi,

What about with just the first change and not the second?  Anyways, I'm 
starting to see a trend here.  Problem reports are clustering around UP

systems, not SMP systems.  I don't know if that's just coincidence or not.


We've got also about twenty SMP Systems, seven of them now with 6.1 Prerelease 
and we don't have any affected systems. bge- and em- cards are working fine, 
even under high load situations.


Martin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: problem with old /usr/src/contrib/amd

2006-10-03 Thread Martin Blapp



Hi,

I'll do an update of /usr/src/contrib/amd soon, after 6.2 is done.
Maybe we will see it then in 6.3 RELEASE :-)

Martin

Martin Blapp, [EMAIL PROTECTED] [EMAIL PROTECTED]
--
ImproWare AG, UNIXSP  ISP, Zurlindenstrasse 29, 4133 Pratteln, CH
Phone: +41 61 826 93 00 Fax: +41 61 826 93 01
PGP: finger -l [EMAIL PROTECTED]
PGP Fingerprint: B434 53FC C87C FE7B 0A18 B84C 8686 EF22 D300 551E
--

On Tue, 3 Oct 2006, Nicolas Martin wrote:


Hi all,
i was wondering if an update of /usr/src/contrib/amd is planned ?
I encounter  a problem using amd with nolock options, and it seems that
this problem was fixed on recent version of am-utils.
Many thanks,
Nicolas
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Disappearing IPv6 default route

2006-10-01 Thread Martin Blapp



Hi,

Will you MFC before Beta 2 ?

Martin


-   } else if (req == RTM_ADD  SDL(gate)-sdl_alen == 0) {
+   } else if (req == RTM_ADD  SDL(gate)-sdl_alen == 0 
+   (rt-rt_flags  RTF_HOST) != 0) {
ln-ln_state = ND6_LLINFO_INCOMPLETE;

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Frequent VFS crashes with RELENG_6

2006-09-30 Thread Martin Blapp



Hi,

1.) Bad ram ? Have you run some memory tester ?

2.) Have you background fsck running on this disk ? If
so try to boot into single user and do a full fsck on this
disk.

Martin

Martin Blapp, [EMAIL PROTECTED] [EMAIL PROTECTED]
--
ImproWare AG, UNIXSP  ISP, Zurlindenstrasse 29, 4133 Pratteln, CH
Phone: +41 61 826 93 00 Fax: +41 61 826 93 01
PGP: finger -l [EMAIL PROTECTED]
PGP Fingerprint: B434 53FC C87C FE7B 0A18 B84C 8686 EF22 D300 551E
--

On Sat, 30 Sep 2006, Vlad GALU wrote:


I've been getting random crashes like the one below, once or twice a
week, always in the same code path. The system is a RELENG_6 as of Wed
Sep 27 11:42:57 EEST 2006, running on amd64.

-- cut here --
#0  doadump () at pcpu.h:172
No locals.
#1  0x8022d033 in boot (howto=260) at 
../../../kern/kern_shutdown.c:409

  first_buf_printf = 1
#2  0x8022d687 in panic (fmt=0xff002bb6e260 °ö¾\) at
../../../kern/kern_shutdown.c:565
  bootopt = 260
  newpanic = 0
  ap = {{gp_offset = 16, fp_offset = 48, overflow_arg_area =
0xa7995790, reg_save_area = 0xa79956b0}}
  buf = vm_page_unwire: invalid wire count: 0, '\0' repeats 218 
times___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

TTY patches available for FreeBSD 5

2006-09-24 Thread Martin Blapp



Hi all,

For those who are interested I've put patches for FreeBSD 5 online.

Anybody having tty troubles and is using the perl module Expect, or
has heavy terminal usage (sshd) on his server and occasional panics
should give them a try. Those patches fix various panics in the tty
and virtual tty codepaths.

http://antispam.imp.ch/08-opensource.html

Martin

Martin Blapp, [EMAIL PROTECTED] [EMAIL PROTECTED]
--
ImproWare AG, UNIXSP  ISP, Zurlindenstrasse 29, 4133 Pratteln, CH
Phone: +41 61 826 93 00 Fax: +41 61 826 93 01
PGP: finger -l [EMAIL PROTECTED]
PGP Fingerprint: B434 53FC C87C FE7B 0A18 B84C 8686 EF22 D300 551E
--
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Please confirm: top(1) cpu time broken on FreeBSD 6.1 (RELEASE/STABLE) SMP ?

2006-07-19 Thread Martin Blapp



Hi,


Disable hyperthreading in your BIOS.


Off course this is a solution, but I don't like
it. On a untuned, unmodified, unpatched system top(1)
should display the correct values IMHO.

Martin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Please confirm: top(1) cpu time broken on FreeBSD 6.1 (RELEASE/STABLE) SMP ?

2006-07-18 Thread Martin Blapp



Hi all,

I'm currently seeing strang things going on on all our
upgraded SMP 2 CPU boxes. 4 Boxes are upgraded from 5.5 to
6 STABLE, one box is running 6.1 RELEASE.

All of them have SMP kernels, just using the provided GENERIC and SMP
kernel config files. Mergemaster has been done on all
boxes.

If a box is full loaded top(1) shows:

CPU states: 41.7% user,  0.0% nice,  7.7% system,  0.6% interrupt, 50.0% idle

I can buildworld -j 100, do verything what I want. The
sum of idle time never goes below 50.0% idle ! And the sum
of total CPU time never goes over 50.0%.

So I wonder if just top is broken (in 6.1 RELEASE too), or if this
is a SCHED4BSD bug.

Can someone confirm this issue ? We see it on IBL X-Series 346 Boxes,
as on IBM 306M Dual Core SMP systems.

Martin

Martin Blapp, [EMAIL PROTECTED] [EMAIL PROTECTED]
--
ImproWare AG, UNIXSP  ISP, Zurlindenstrasse 29, 4133 Pratteln, CH
Phone: +41 61 826 93 00 Fax: +41 61 826 93 01
PGP: finger -l [EMAIL PROTECTED]
PGP Fingerprint: B434 53FC C87C FE7B 0A18 B84C 8686 EF22 D300 551E
--
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Please confirm: top(1) cpu time broken on FreeBSD 6.1 (RELEASE/STABLE) SMP ?

2006-07-18 Thread Martin Blapp



#include stdio.h

main() {
while(1) {}
}

cc -o test.c test

./test 
./test 
./test 
./test 
./test 
./test 
./test 
./test 

CPU states: 50.0% user,  0.0% nice,  0.0% system,  0.0% interrupt, 50.0% idle
Mem: 1160M Active, 988M Inact, 254M Wired, 34M Cache, 112M Buf, 573M Free
Swap: 8192M Total, 8192M Free

  PID USERNAME  THR PRI NICE   SIZERES STATE  C   TIME   WCPU COMMAND
21651 root1 1040  1216K   420K RUN2   0:03 28.85% test
21653 root1 1030  1216K   420K CPU0   0   0:02 28.26% test
21649 root1 1040  1216K   420K RUN2   0:03 26.12% test
21647 root1 1030  1216K   420K RUN2   0:02 25.04% test
21650 root1 1030  1216K   420K RUN0   0:02 24.50% test
21645 root1 1030  1216K   420K RUN0   0:03 23.31% test
21646 root1 1040  1216K   420K RUN2   0:03 22.81% test
21638 root1 1030  1216K   420K RUN2   0:03 21.79% test


Martin Blapp, [EMAIL PROTECTED] [EMAIL PROTECTED]
--
ImproWare AG, UNIXSP  ISP, Zurlindenstrasse 29, 4133 Pratteln, CH
Phone: +41 61 826 93 00 Fax: +41 61 826 93 01
PGP: finger -l [EMAIL PROTECTED]
PGP Fingerprint: B434 53FC C87C FE7B 0A18 B84C 8686 EF22 D300 551E
--

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Please confirm: top(1) cpu time broken on FreeBSD 6.1 (RELEASE/STABLE) SMP ?

2006-07-18 Thread Martin Blapp



Hi,

Yes HTT is disabled (by default as I tought also by freebsd). But shouldn't top 
show then the correct results ?


Martin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Please confirm: top(1) cpu time broken on FreeBSD 6.1 (RELEASE/STABLE) SMP ?

2006-07-18 Thread Martin Blapp



Hi,

Ok, seems to be a HTT issue on a untuned system.

On two boxes I get with 'sysctl -a | grep smp | grep kern.smp.cpus'
kern.smp.cpus: 4

Here I see CPU-IDs 0 and 2 in top. No other CPUs seem to be
used. Here I get the broken CPU usage.

And yes, seeting hyperthreading_allowed=1 corrects the top usage.

On two other boxes where it works I get 'sysctl -a | grep smp | grep 
kern.smp.cpus' kern.smp.cpus: 2


Here I see CPU-ID's 0 and 1 in top. Here everything works as it should.

Martin

Martin Blapp, [EMAIL PROTECTED] [EMAIL PROTECTED]
--
ImproWare AG, UNIXSP  ISP, Zurlindenstrasse 29, 4133 Pratteln, CH
Phone: +41 61 826 93 00 Fax: +41 61 826 93 01
PGP: finger -l [EMAIL PROTECTED]
PGP Fingerprint: B434 53FC C87C FE7B 0A18 B84C 8686 EF22 D300 551E
--

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Crash with FreeBSD 6.1 STABLE of today

2006-06-26 Thread Martin Blapp



Hi Max, Robert and Gavin,


Note, I am not sure if the patch still applies or is correct at all, but from
looking at it (and the name of the file) I seem to remember that there was a
problem with t_pgrp and t_session being accessed unlocked in some places.
Maybe it helps, maybe it doesn't.

[1] http://people.freebsd.org/~mlaier/tty.t_pgrp.diff


Some things are now different. The part at 2551,2567 has now
additional PGRP_LOCKs.

I've also added the proctree locking at line 1654 as I did in the
first patch. This part is unlocked too ... , don't you think ? That's
the place there panic appeared.

@@ -1654,8 +1668,8 @@
!ISSET(tp-t_cflag, CLOCAL)) {
SET(tp-t_state, TS_ZOMBIE);
CLR(tp-t_state, TS_CONNECTED);
+   sx_slock(proctree_lock);
if (tp-t_session) {
-   sx_slock(proctree_lock);
if (tp-t_session-s_leader) {
struct proc *p;

@@ -1664,8 +1678,8 @@
psignal(p, SIGHUP);
PROC_UNLOCK(p);
}
-   sx_sunlock(proctree_lock);
}
+   sx_sunlock(proctree_lock);
ttyflush(tp, FREAD | FWRITE);
return (0);
}

Also this place was unlocked:

@@ -942,8 +952,11 @@
 * Policy -- Don't allow FIOSETOWN on someone else's
 *   controlling tty
 */
-   if (tp-t_session != NULL  !isctty(p, tp))
+   sx_snlock(proctree_lock);
+   if (tp-t_session != NULL  !isctty(p, tp)) {
+   sx_sunlock(proctree_lock);
return (ENOTTY);
+   }

error = fsetown(*(int *)data, tp-t_sigio);
if (error)

Here is the complete patch. What do you thing about it ?
It's a FreeBSD 6 STABLE patch.

I'll test this patch now and tell you any problems with it.
Please do the same if you have time ...

Also available at:

http://mx.imp.ch/patch-tty.t_pgrp.diff

Martin

--- sys/kern/tty.c.orig Thu Dec 15 18:13:40 2005
+++ sys/kern/tty.c  Mon Jun 26 12:53:33 2006
@@ -329,8 +329,10 @@
tp-t_gen++;
tp-t_line = TTYDISC;
tp-t_hotchar = 0;
+   sx_xlock(proctree_lock);
tp-t_pgrp = NULL;
tp-t_session = NULL;
+   sx_xunlock(proctree_lock);
tp-t_state = 0;
knlist_clear(tp-t_rsel.si_note, 0);
knlist_clear(tp-t_wsel.si_note, 0);
@@ -401,11 +403,13 @@
return (0);
if (ISSET(iflag, BRKINT)) {
ttyflush(tp, FREAD | FWRITE);
+   sx_slock(proctree_lock);
if (tp-t_pgrp != NULL) {
PGRP_LOCK(tp-t_pgrp);
pgsignal(tp-t_pgrp, SIGINT, 1);
PGRP_UNLOCK(tp-t_pgrp);
}
+   sx_sunlock(proctree_lock);
goto endcase;
}
if (ISSET(iflag, PARMRK))
@@ -483,23 +487,27 @@
if (!ISSET(lflag, NOFLSH))
ttyflush(tp, FREAD | FWRITE);
ttyecho(c, tp);
+   sx_slock(proctree_lock);
if (tp-t_pgrp != NULL) {
PGRP_LOCK(tp-t_pgrp);
pgsignal(tp-t_pgrp,
CCEQ(cc[VINTR], c) ? SIGINT : 
SIGQUIT, 1);
PGRP_UNLOCK(tp-t_pgrp);
}
+   sx_sunlock(proctree_lock);
goto endcase;
}
if (CCEQ(cc[VSUSP], c)) {
if (!ISSET(lflag, NOFLSH))
ttyflush(tp, FREAD);
ttyecho(c, tp);
+   sx_slock(proctree_lock);
if (tp-t_pgrp != NULL) {
PGRP_LOCK(tp-t_pgrp);
pgsignal(tp-t_pgrp, SIGTSTP, 1);
PGRP_UNLOCK(tp-t_pgrp);
}
+   sx_sunlock(proctree_lock);
goto endcase;
}
}
@@ -617,11 +625,13 @@
 * ^T - kernel info and generate SIGINFO
 */
if

Re: Crash with FreeBSD 6.1 STABLE of today

2006-06-26 Thread Martin Blapp



Hi,

For those who got:

/usr/src/sys/kern/tty.c: In function `ttioctl':
/usr/src/sys/kern/tty.c:955: warning: implicit declaration of function 
`sx_snlock'
/usr/src/sys/kern/tty.c:955: warning: nested extern declaration of `sx_snlock'
*** Error code 1

There was a small typo in the previous patch. I've put the
modified one at:

http://mx.imp.ch/patch-tty.t_pgrp.diff

Thanks for testing !

Martin

Martin Blapp, [EMAIL PROTECTED] [EMAIL PROTECTED]
--
ImproWare AG, UNIXSP  ISP, Zurlindenstrasse 29, 4133 Pratteln, CH
Phone: +41 61 826 93 00 Fax: +41 61 826 93 01
PGP: finger -l [EMAIL PROTECTED]
PGP Fingerprint: B434 53FC C87C FE7B 0A18 B84C 8686 EF22 D300 551E
--

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Crash with FreeBSD 6.1 STABLE of today

2006-06-26 Thread Martin Blapp



Hi all,

So far so good.

A remote stress testing of a tty session over serial cable
with a patched kernel worked fine.

How to proceed now ? The patch also applies to CURRENT
as there where no big changes since the repo has been
branched.

Should I commit it to CURRENT ?


http://mx.imp.ch/patch-tty.t_pgrp.diff


Martin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Crash with FreeBSD 6.1 STABLE of today

2006-06-26 Thread Martin Blapp



Hi,


Is the problem actually understood?  Do we know what's racing with what?
Given there only ever seems to be a single backtrace involved, as far as
I can tell, it's ttymodem racing with tty_close - can those two
functions alone be locked?


Correct: The only place there tp-t_session is set to NULL is in tty_close().

tp-t_pgrp = NULL;
tp-t_session = NULL;
tp-t_state = 0;

But I've seen this comment on peters page 
(http://people.freebsd.org/~peter/smp.html)

At the moment I've hacked all the console code to obtain the scheduler mutex 
instead of spltty()'ing all over the place, because in a word: it can't 
spltty() any more. It's illegal because the console code may be called at any time (see 
the next section). This is a hack because it isn't MP safe against tty 
interrupts running on another cpu. Since the current tty interrupt is a fast 
interrupt, I'm not sure that we're any worse off (fastints aren't masked by the 
cpl anyway).


IMHO it explains why we need the proctree_lock (in early SMP times 
scheduler mutex). Maybe this should be replaced with a proper tty subsystem 
mutex ...)


After looking at our SMPnewgen Page, Poul Henning and Thomas Moestel
are working on locking the tty subsystem. I'll CC those :-)

Martin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Crash with FreeBSD 6.1 STABLE of today

2006-06-23 Thread Martin Blapp



Hi,


(kgdb) p *tp-t_session
Cannot access memory at address 0x0

So here the problem is. Why is tp-t_session empty ? Maybe it has been already
free() earlier and we have some race here ?


Race was exactly my conclusion last time I looked into this.
http://docs.FreeBSD.org/cgi/mid.cgi?20041204110815.E80797

Something, somewhere is playing with t_session without locking...



Any idea how we can track this down ? It's a very bad thing to have in 61.
And yes, the serial output is going over a serial console.

Martin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Crash with FreeBSD 6.1 STABLE of today

2006-06-23 Thread Martin Blapp



Hi,

Maybe this is the solution ? IMHO there is a race window
open between the first tp-t_session test and the locking
of the proc tree.

Martin

+++ src/sys/kern/tty.c
--- src/sys/kern/tty.c

+   sx_slock(proctree_lock);
if (tp-t_session) {
-   sx_slock(proctree_lock);
if (tp-t_session-s_leader) {
struct proc *p;

p = tp-t_session-s_leader;
PROC_LOCK(p);
psignal(p, SIGHUP);
PROC_UNLOCK(p);
}
-   sx_sunlock(proctree_lock);
}
+   sx_sunlock(proctree_lock);

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Crash with FreeBSD 6.1 STABLE of today

2006-06-23 Thread Martin Blapp



Hi,


I'm not sure if t_session is supposed to be protected by the proctree


Correct.


lock though.  With an initial glance of the code, it would seem odd to
be protected by the proctree lock, although I can't see any other locks
Someone with more knowledge of this code will probably know the answer
to this.

There does seem to be a worrying comment above tty_close (which is the
only place that t_session seems to be set to NULL):

* XXX our caller should have done `spltty(); l_close(); tty_close();'
* and l_close() should have flushed, but we repeat the spltty() and
* the flush in case there are buggy callers.

As I understand it, spltty() is now a no-op.  Does this mean that this
code is now essentially running without any locks that were used to
serialise changes to struct tty in days gone by?  Or is the whole tty
subsystem still running under Giant?


I thought this too. Maybe Robert knows more.

Martin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Hardlock with Reboot option in beastie.4th

2006-06-22 Thread Martin Blapp



Hi,

Yesterday I've installed 6.1. After rebooting, I got the beastie
loader screen. I've typed 8 (reboot) to get into the
raid bios again, but then the server freezed. Does this happen
only to me or is it a common problem ?

The server affected is a IBM 346 X-Series server with 2 CPUS.

Martin

Martin Blapp, [EMAIL PROTECTED] [EMAIL PROTECTED]
--
ImproWare AG, UNIXSP  ISP, Zurlindenstrasse 29, 4133 Pratteln, CH
Phone: +41 61 826 93 00 Fax: +41 61 826 93 01
PGP: finger -l [EMAIL PROTECTED]
PGP Fingerprint: B434 53FC C87C FE7B 0A18 B84C 8686 EF22 D300 551E
--

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Crash with FreeBSD 6.1 STABLE of today

2006-06-22 Thread Martin Blapp



Hi,


Unfortunaltly I get this with the debug kernel.
Does one have to boot with the debug.kernel itself
to get a trace which is usable ?


Sigh. A recompile helped !

(kgdb) where
#0  doadump () at pcpu.h:165
#1  0xc066355e in boot (howto=260) at /usr/src/sys/kern/kern_shutdown.c:409
#2  0xc06638b5 in panic (fmt=0xc0891732 %s) at 
/usr/src/sys/kern/kern_shutdown.c:565
#3  0xc085c6b6 in trap_fatal (frame=0xed6e4ab8, eva=4) at 
/usr/src/sys/i386/i386/trap.c:836
#4  0xc085c3bf in trap_pfault (frame=0xed6e4ab8, usermode=0, eva=4) at 
/usr/src/sys/i386/i386/trap.c:744
#5  0xc085bfb5 in trap (frame=
  {tf_fs = 8, tf_es = 40, tf_ds = -1063714776, tf_edi = -1064042304, tf_esi 
= 0, tf_ebp = -311538944, tf_isp = -311538972, tf_ebx = -967615488, tf_edx = 
-1063651212, tf_ecx = -941099136, tf_eax = 0, tf_trapno = 12, tf_err = 0, tf_eip = -1066845359, tf_cs 
= 32, tf_eflags = 66194, tf_esp = -967615488, tf_ss = 0})

at /usr/src/sys/i386/i386/trap.c:434
#6  0xc0848bea in calltrap () at /usr/src/sys/i386/i386/exception.s:139
#7  0xc0693b51 in ttymodem (tp=0xc6535c00, flag=-1063651212) at 
/usr/src/sys/kern/tty.c:1659
#8  0xc0698362 in ptcclose (dev=0x0, flags=3, fmt=8192, td=0xc7e7f780) at 
linedisc.h:136
#9  0xc0638a6f in giant_close (dev=0xcb3c1100, fflag=3, devtype=8192, 
td=0xc7e7f780) at /usr/src/sys/kern/kern_conf.c:266
#10 0xc06162bf in devfs_close (ap=0xed6e4b7c) at 
/usr/src/sys/fs/devfs/devfs_vnops.c:287
#11 0xc086dc1c in VOP_CLOSE_APV (vop=0x0, a=0xc099f874) at vnode_if.c:426
#12 0xc06c87e2 in vn_close (vp=0xc9cdf660, flags=3, file_cred=0x0, 
td=0xc7e7f780) at vnode_if.h:227
#13 0xc06c974a in vn_closefile (fp=0xc6fc5438, td=0xc7e7f780) at 
/usr/src/sys/kern/vfs_vnops.c:865
#14 0xc06162e7 in devfs_close_f (fp=0xc6fc5438, td=0xc7e7f780) at 
/usr/src/sys/fs/devfs/devfs_vnops.c:297
#15 0xc0642cdc in fdrop_locked (fp=0xc6fc5438, td=0xc7e7f780) at file.h:295
#16 0xc0642c29 in fdrop (fp=0xc6fc5438, td=0xc7e7f780) at 
/usr/src/sys/kern/kern_descrip.c:2122
#17 0xc06411c7 in closef (fp=0xc6fc5438, td=0xc7e7f780) at 
/usr/src/sys/kern/kern_descrip.c:1942
#18 0xc063e329 in close (td=0xc7e7f780, uap=0x0) at 
/usr/src/sys/kern/kern_descrip.c:1007
#19 0xc085c9f7 in syscall (frame=
  {tf_fs = 59, tf_es = 59, tf_ds = 59, tf_edi = 0, tf_esi = 673925920, 
tf_ebp = -1077941928, tf_isp = -311538332, tf_ebx = 673852332, tf_edx = 
673925920, tf_ecx = 673925920,
tf_eax = 6, tf_trapno = 12, tf_err = 2, tf_eip = 673354727, tf_cs = 51, 
tf_eflags = 518, tf_esp = -1077941956, tf_ss = 59}) at 
/usr/src/sys/i386/i386/trap.c:981

#20 0xc0848c3f in Xint0x80_syscall () at /usr/src/sys/i386/i386/exception.s:200
#21 0x0033 in ?? ()
Previous frame inner to this frame (corrupt stack?)
(kgdb)

(kgdb) frame 5
#5  0xc085bfb5 in trap (frame=
  {tf_fs = 8, tf_es = 40, tf_ds = -1063714776, tf_edi = -1064042304, tf_esi 
= 0, tf_ebp = -311538944, tf_isp = -311538972, tf_ebx = -967615488, tf_edx = 
-1063651212, tf_ecx =
 -941099136, tf_eax = 0, tf_trapno = 12, tf_err = 0, tf_eip = -1066845359, tf_cs 
= 32, tf_eflags = 66194, tf_esp = -967615488, tf_ss = 0})

at /usr/src/sys/i386/i386/trap.c:434

(kgdb) frame 8
#8  0xc0698362 in ptcclose (dev=0x0, flags=3, fmt=8192, td=0xc7e7f780) at 
linedisc.h:136


136 return ((*linesw[tp-t_line]-l_modem)(tp, flag));
(kgdb) list
131
132 static __inline int
133 ttyld_modem(struct tty *tp, int flag)
134 {
135
136 return ((*linesw[tp-t_line]-l_modem)(tp, flag));
137 }
138
139 #endif /* _KERNEL */

(kgdb) frame 7

(kgdb) p *tp-t_session
Cannot access memory at address 0x0

(kgdb) frame 7
#7  0xc0693b51 in ttymodem (tp=0xc6535c00, flag=-1063651212) at 
/usr/src/sys/kern/tty.c:1659
1659if (tp-t_session-s_leader) {
(kgdb) list
1654!ISSET(tp-t_cflag, CLOCAL)) {
1655SET(tp-t_state, TS_ZOMBIE);
1656CLR(tp-t_state, TS_CONNECTED);
1657if (tp-t_session) {
1658sx_slock(proctree_lock);
1659if (tp-t_session-s_leader) {
1660struct proc *p;
1661
1662p = tp-t_session-s_leader;
1663PROC_LOCK(p);

(kgdb) p *tp-t_session
Cannot access memory at address 0x0

So here the problem is. Why is tp-t_session empty ? Maybe it has been already 
free() earlier and we have some race here ?


Martin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Crash with FreeBSD 6.1 STABLE of today

2006-06-21 Thread Martin Blapp



Hi Robert and others,

I just upgraded from 5.5 (stable btw.) to 6.1 and
after 10 hours I got a nice panic. Does this look
like some tty problem ?

This is the machine which made that many problems with
PREEMTION enabled in earlier releases of 5.x. Is it possible
that I'm hitting now again the same bugs or is it clearly
a tty related problem ?

kgdb /var/core/kernel.debug /var/core/vmcore.6

#0  0xc0663002 in doadump ()
#1  0xc066355e in boot ()
#2  0xc06638b5 in panic ()
#3  0xc085c6b6 in trap_fatal ()
#4  0xc085c3bf in trap_pfault ()
#5  0xc085bfb5 in trap ()
#6  0xc0848bea in calltrap ()
#7  0xc0693b51 in ttymodem ()
#8  0xc0698362 in ptcclose ()
#9  0xc0638a6f in giant_close ()
#10 0xc06162bf in devfs_close ()
#11 0xc086dc1c in VOP_CLOSE_APV ()
#12 0xc06c87e2 in vn_close ()
#13 0xc06c974a in vn_closefile ()
#14 0xc06162e7 in devfs_close_f ()
#15 0xc0642cdc in fdrop_locked ()
#16 0xc0642c29 in fdrop ()
#17 0xc06411c7 in closef ()
#18 0xc063e329 in close ()
#19 0xc085c9f7 in syscall ()
#20 0xc0848c3f in Xint0x80_syscall ()
#21 0x0033 in ?? ()
Previous frame inner to this frame (corrupt stack?)

Unfortunaltly I get this with the debug kernel.
Does one have to boot with the debug.kernel itself
to get a trace which is usable ?

kgdb /var/core/kernel.debug /var/core/vmcore.6

kgdb: kvm_read: invalid address (0x21)
kgdb: kvm_read: invalid address (0x21)
kgdb: kvm_read: invalid address (0x21)
kgdb: kvm_read: invalid address (0x21)
kgdb: kvm_read: invalid address (0x21)
kgdb: kvm_read: invalid address (0x21)
kgdb: kvm_read: invalid address (0x21)
kgdb: kvm_read: invalid address (0x21)
kgdb: kvm_read: invalid address (0x21)
kgdb: kvm_read: invalid address (0x21)
kgdb: kvm_read: invalid address (0x21)
kgdb: kvm_read: invalid address (0x21)
kgdb: kvm_read: invalid address (0x21)
kgdb: kvm_read: invalid address (0x21)
kgdb: kvm_read: invalid address (0x21)
kgdb: kvm_read: invalid address (0x21)
kgdb: kvm_read: invalid address (0x21)
kgdb: kvm_read: invalid address (0x21)
kgdb: kvm_read: invalid address (0x21)
kgdb: kvm_read: invalid address (0x21)
kgdb: kvm_read: invalid address (0x21)

Martin Blapp, [EMAIL PROTECTED] [EMAIL PROTECTED]
--
ImproWare AG, UNIXSP  ISP, Zurlindenstrasse 29, 4133 Pratteln, CH
Phone: +41 61 826 93 00 Fax: +41 61 826 93 01
PGP: finger -l [EMAIL PROTECTED]
PGP Fingerprint: B434 53FC C87C FE7B 0A18 B84C 8686 EF22 D300 551E
--
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: 4.8 PR1 locks up for rm(1) big files

2003-03-05 Thread Martin Blapp


Hi,

 Did you notice what the wait channel was? If the machine is still
 there, you could hit ^T and see what it says.

I just reproduced it on a second faster server. The file is also
80GB big.

Unfortunatly I lost the link of my ssh connection. After
~10 mins the server was again running and up.

So this seems to be a scheduler and softupdates issue !

Martin

To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-stable in the body of the message

HEADS UP: Broken if_dc NICs with MACs like 08:08[...] or 00:00[...]

2003-02-07 Thread Martin Blapp

 */
+   sc-dc_pmode == DC_PMODE_MII 
+   ((txstat  0x)  ~(DC_TXSTAT_ERRSUM|
+   DC_TXSTAT_NOCARRIER|DC_TXSTAT_CARRLOST)))
txstat = ~DC_TXSTAT_ERRSUM;
}

@@ -2553,11 +2649,13 @@
DC_INC(idx, DC_TX_LIST_CNT);
}

-if (idx != sc-dc_cdata.dc_tx_cons) {
+   if (idx != sc-dc_cdata.dc_tx_cons) {
+   /* some buffers have been freed */
sc-dc_cdata.dc_tx_cons = idx;
-ifp-if_flags = ~IFF_OACTIVE;
-}
-ifp-if_timer = (sc-dc_cdata.dc_tx_cnt == 0) ? 0 : 5;
+   ifp-if_flags = ~IFF_OACTIVE;
+   }
+   ifp-if_timer = (sc-dc_cdata.dc_tx_cnt == 0) ? 0 : 5;
+
return;
 }

Index: if_dcreg.h
===
RCS file: /home/ncvs/src/sys/pci/if_dcreg.h,v
retrieving revision 1.4.2.20
diff -u -r1.4.2.20 if_dcreg.h
--- if_dcreg.h  5 Feb 2003 22:10:04 -   1.4.2.20
+++ if_dcreg.h  7 Feb 2003 19:01:18 -
@@ -688,13 +688,14 @@
u_int8_tdc_pmode;
u_int8_tdc_link;
u_int8_tdc_cachesize;
+   int dc_romwidth;
int dc_pnic_rx_bug_save;
unsigned char   *dc_pnic_rx_buf;
int dc_if_flags;
int dc_if_media;
u_int32_t   dc_flags;
u_int32_t   dc_txthresh;
-   u_int8_tdc_srom[1024];
+   u_int8_t*dc_srom;
struct dc_mediainfo *dc_mi;
struct dc_list_data *dc_ldata;
struct dc_chain_datadc_cdata;

Martin Blapp, [EMAIL PROTECTED] [EMAIL PROTECTED]
--
ImproWare AG, UNIXSP  ISP, Zurlindenstrasse 29, 4133 Pratteln, CH
Phone: +41 61 826 93 00 Fax: +41 61 826 93 01
PGP: finger -l [EMAIL PROTECTED]
PGP Fingerprint: B434 53FC C87C FE7B 0A18 B84C 8686 EF22 D300 551E
--



To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-stable in the body of the message

Re: double fault with vinum and 4.5 RC3

2002-01-27 Thread Martin Blapp



Hi Greg.

 As I've mentioned elsewhere, this is seriously suboptimal.  The
 mirror command is a toy for people getting used to Vinum.  You want

Why is this suboptimal ? It's fast done. It seems to work. Else you
have to say: not supported, since it can panic your machine.

 a proper config file.  Then create one drive per spindle, and choose
 your subdisk sizes to match what you want.  Specifically, your config
 file should look like this:

So why this isn't noted in the documentation ? Also the documentation
is not up to date. vinum(4) still refers to raw disks etc.

  took da0 out and replaced it with a new disk. I switched jumpers, so
  da0 got da1, and da1 got da0 and rebooted.

 I still don't know why you're switching jumpers.  That's not needed.
 But it doesn't change anything.

Vinum still doesn't support booting from a vinum volume.

If I do not switch jumpers, I'm not able to boot. I could also change the
SCSI boot ID in the BIOS, but then FreeBSD cannot boot into Multiuser,
since it cannot mount root from ad1s1a (it likes to do it from ad0s1a)

 
  And boom I got a double fault.

 Yes, we've found the cause of that.  What I get now is:

I'll look now it fixes my problem. Just rebooted and made new kernel.

Martin


To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-stable in the body of the message

Re: double fault with vinum and 4.5 RC3

2002-01-27 Thread Martin Blapp



Hi all,

 I tried looking, but the net connection was terrible.  Anyway, I've
 been able to reproduce it here, and I have a patch which should make
 it go away:

Just tried the patch and it fixes my problem completly. Thanks
Thomas for the excelent work you've done.

And thank you Greg for providing a fix.

We really should include this important fix in 4.5-Release. So I sent a copy
to [EMAIL PROTECTED] too. Can you send them your fix too Greg ?

Martin


To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-stable in the body of the message

Re: double fault with vinum and 4.5 RC3

2002-01-27 Thread Martin Blapp



Greg,

 As I've mentioned elsewhere, this is seriously suboptimal.  The
 mirror command is a toy for people getting used to Vinum.  You want
 a proper config file.  Then create one drive per spindle, and choose
 your subdisk sizes to match what you want.  Specifically, your config
 file should look like this:

   drive 0 device /dev/da0e
   drive 1 device /dev/da1e
   volume var setupstate
plex org concat
  sd drive 0 len 2096000s
plex org concat
  sd drive 1 len 2096000s
   volume docsis setupstate
plex org concat
 sd drive 0 len 1257000s
plex org concat
 sd drive 1 len 1257000s
   volume docsisvar setupstate
plex org concat
 sd drive 0 len 4651000s
plex org concat
 sd drive 1 len 4651000s

Is this really mirrored ? I want a mirror, not a concatenated drive !
I want RAID 1.

And - this isn't very optimal for a user to create.

I do not want to calculate the sd offset size. I'm not a
beginner with vinum, but I really do not want to do this.

IMHO there should be a way for vinum to add this dynamically
like:

vinum sd-add -d 0 -n var -l 1GB
vinum sd-add -d 0 -n docsis -l 6GB
vinum sd-add -d 0 -n docsisvar -l end

A program under unix does not have to be very cryptic,
it also has to be userfriendly.

Martin


To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-stable in the body of the message

Re: double fault with vinum and 4.5 RC3

2002-01-27 Thread Martin Blapp



Hi Greg,

 So it seems.  I've heard of this in a couple of cases.  It would be
 interesting to see the log output, but if I understand you correctly
 you have since found a way to work around the problem.  I suspect that
 there was something in the disk label which confused the issue, but it
 would be nice to find what it is.  Possibly /dev/da1e would have
 worked.

I tried first with /dev/da1e. Then I shought I should use /dev/da1s1e.
No way. Errm, If it's allowed to ask, what is exactly the difference
between the name with slice (s1e) and just the slice entry (e) ?

I suppose I know what happened. da1 still had the vinum information
stored at the same place as before from my three mirrors I made before.

One time it worked, but vinum showed me a size of 1024m, exactly the
size what /var was before. You see, da0s1e is exactly at the same place
as the da0s1g with the new config !

I just did:

- Removed the three slices da0s1e, da0s1f, da0s1g.
- Removed the three slices da1s1e, da1s1f, da1s1g.
- Made a big partiton on them.
- Executed newfs on it.
- Relabeled this partition as type vinum
- Executed vinum create config and had the failure.

Should be easy to find out what vinum does wrong here.

Btw: I have a crash dump available for the race you said,
it you are interested. If you login to the machine, there
are also the log entries in /var/log/messages.

Another thing I noted: newfs seems to make less superblock backups
on big volumes now. Vinum still does make as many backups as FreeBSD
did earlier for other filesystems.

At least my experiments lead to some bug discovery ;)

Martin


To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-stable in the body of the message

Re: NTPD dumps core in 4.5-PRERELEASE

2001-12-25 Thread Martin Blapp



Hi Keltia,

 According to Martin Blapp:
  I see something similar here. Nsrexec, a networker client
  which gets started in rc.d segfaults since 1-2 month here.

 Ask Matt for a 6.0 client. He recompiled one two months ago and it has stopped
 segfaulting for me on CURRENT.

drwxr-xr-x  2 root  wheel  512 Dec  3 11:20 nwclient-6.0.2

Martin


To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-stable in the body of the message

Re: StarOffice woes on 4.4RC

2001-08-31 Thread Martin Blapp



Hi,

I suspect a wrong library, thus it is the linux_base port. Can you
do some debugging with kdump and strace ?

Martin

Martin Blapp, [EMAIL PROTECTED]
--
Improware AG, UNIX solution and service provider
Zurlindenstrasse 29, 4133 Pratteln, Switzerland
Phone: +41 061 826 93 00: +41 61 826 93 01
PGP Fingerprint: 57E 7CCD 2769 E7AC C5FA  DF2C 19C6 DCD1 1B3A EC9C
--


To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-stable in the body of the message

52 matches

Mail list logo