RE: unexplained system hangs - possible smbfs issue ??

2005-10-10 Thread Murray Taylor
 

---
The information transmitted in this e-mail is for the exclusive
use of the intended addressee and may contain confidential
and/or privileged material. Any review, re-transmission,
dissemination or other use of it, or the taking of any action
in reliance upon this information by persons and/or entities
other than the intended recipient is prohibited. If you
received this in error, please inform the sender and/or
addressee immediately and delete the material. 

E-mails may not be secure, may contain computer viruses and
may be corrupted in transmission. Please carefully check this
e-mail (and any attachment) accordingly. No warranties are
given and no liability is accepted for any loss or damage
caused by such matters.
---

***This Email has been scanned for Viruses by MailMarshal.***
 

-Original Message-
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Garrett Cooper
Sent: Monday, October 10, 2005 3:35 PM
To: freebsd-questions@freebsd.org
Subject: Re: unexplained system hangs - possible smbfs issue ??

Murray,
 Have you thought of looking into filing a bug report with the  
Samba people (http://www.samba.org/)? This may be an issue with  
either your client program, or the SMB implementation in Win2k3,  
which can be solved by getting the ball rolling with SMB and/or  
possibly MS.
 Either way, that is quite a few files to have to parse through,  
and although it may seem somewhat ludicrous, adding an additional  
script to presort out your minute reports would greatly reduce the  
amount of open-file records you need, and while that may not be a  
permanent solution it can serve as a better base for sorting your  
data. You could just create proper directories on the Win2k3 server,  
like %BASE_DIR%\Year\Day\Hour, if you get a large volume of files, or  
just strictly put them in a daily directory since it sounds like your  
volume is manageable. Plus, it's probably easier for humans to manage  
as opposed to 2000+ flat files in the same directory ;). Any SQL  
would handle this issue nicely as well since one of databases' best  
selling points is this type of application.
-Garrett
___


Garrett,

Thanks for your input, as it all ties things up with the observed
problem
which we have been slowly closing in on by a process of isolating
processes onto a 'sacrifical' host. 

BTW it still seems to be a bit time dependant, unless the 
comments regarding the extra files of zero length mentioned in other
replies and PR's apply here. Our test bed _never_ crashed under high
load testing
(20K+ files grown incrementally).  Maybe the test bed always had
'appropriate'
files and/or file structures. (sigh)

(really silly thought - I wonder if it is as 'simple' as needing 
an even / odd file count when the count gets high?? )

A 'move_files to dated directory' process is being built within the main
process as a final 
operation for cleanup. This will mean that the smbfs will never have
more than
10 files or so in any given 1 minute cycle.

cheers
mjt

 
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: unexplained system hangs - possible smbfs issue ??

2005-10-09 Thread Garrett Cooper

On Oct 9, 2005, at 5:14 PM, Murray Taylor wrote:


Hi all,

I have a system using smbfs that locks up every few hours.

The system is running a cron job every minute, that opens
a directory that is actually a smbfs mounted directory on a
Windows Server 2003 machine. The program reads the directory
looking for a particular file pattern, opens and processes the file
then closes and renames the file.

What this means is that the smnfs mounted directory file count
constantly
increases.

The hang mentioned is just that... the cron job doesnt run, you cannot
login on the console (well you can type root and enter and thats it,
no password prompt appears). If there is an open console session,
a single command can be issued then the console freezes.
The only recovery is a power cycle.

We have run the programs under a test framework to do a month load
of files in an hour, and cant hang things, yet the production box
(which is technically a better machine, but runs the same OS versions
as the testbed) will hang under 'normal' use every few hours. Its  
almost
clock regular, except that it is slowly reducing the time between  
hangs.


We have just purged the Windows directory, and it seems so far to
have extended the hang window.

Are there any known issues with smbfs and large directories???

NB We have other processes that use smbfs without the large number of
files on the Win systems and these have run ok for years.


It will hang on both these production versions.

FreeBSD xxxhostnamexxx 4.10-RELEASE FreeBSD 4.10-RELEASE #0: Tue  
May 25

22:47:12 GMT 2004
[EMAIL PROTECTED]:/usr/obj/usr/src/sys/GENERIC  i386

FreeBSD xxxhostnamexxx 4.10-RELEASE-p7 FreeBSD 4.10-RELEASE-p7 #3: Thu
Apr 14 15:34:37 EST 2005
[EMAIL PROTECTED]:/usr/obj/usr/src/sys/SVMYSQL3  i386


mjt

--
A: Because it fouls the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing on usenet and in e-mail?

Murray Taylor
Bytecraft Systems
P: +61 3 8710 2555
F: +61 3 8710 2599
D: +61 3 9238 4275
E: [EMAIL PROTECTED]


Those are a lot of files =\... a temporary solution might be to  
compress old log files past a particular date, or use an actual  
database system, but yeah... I could see something odd occurring with  
the FreeBSD machine.
However, you didn't provide a lot of information about the FBSD  
machine. What are the specs for it, hardware-wise? Just curious.
Also, what's the approximate amount of files in the directory in  
question?

-Garrett
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


RE: unexplained system hangs - possible smbfs issue ??

2005-10-09 Thread Murray Taylor
 

---
The information transmitted in this e-mail is for the exclusive
use of the intended addressee and may contain confidential
and/or privileged material. Any review, re-transmission,
dissemination or other use of it, or the taking of any action
in reliance upon this information by persons and/or entities
other than the intended recipient is prohibited. If you
received this in error, please inform the sender and/or
addressee immediately and delete the material. 

E-mails may not be secure, may contain computer viruses and
may be corrupted in transmission. Please carefully check this
e-mail (and any attachment) accordingly. No warranties are
given and no liability is accepted for any loss or damage
caused by such matters.
---

***This Email has been scanned for Viruses by MailMarshal.***
 

-Original Message-
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Garrett Cooper
Sent: Monday, October 10, 2005 11:45 AM
To: FreeBSD Questions
Subject: Re: unexplained system hangs - possible smbfs issue ??

On Oct 9, 2005, at 5:14 PM, Murray Taylor wrote:

 Hi all,

 I have a system using smbfs that locks up every few hours.

 The system is running a cron job every minute, that opens
 a directory that is actually a smbfs mounted directory on a
 Windows Server 2003 machine. The program reads the directory
 looking for a particular file pattern, opens and processes the file
 then closes and renames the file.

 What this means is that the smnfs mounted directory file count
 constantly
 increases.

 The hang mentioned is just that... the cron job doesnt run, you cannot
 login on the console (well you can type root and enter and thats it,
 no password prompt appears). If there is an open console session,
 a single command can be issued then the console freezes.
 The only recovery is a power cycle.

 We have run the programs under a test framework to do a month load
 of files in an hour, and cant hang things, yet the production box
 (which is technically a better machine, but runs the same OS versions
 as the testbed) will hang under 'normal' use every few hours. Its  
 almost
 clock regular, except that it is slowly reducing the time between  
 hangs.

 We have just purged the Windows directory, and it seems so far to
 have extended the hang window.

 Are there any known issues with smbfs and large directories???

 NB We have other processes that use smbfs without the large number of
 files on the Win systems and these have run ok for years.


 It will hang on both these production versions.

 FreeBSD xxxhostnamexxx 4.10-RELEASE FreeBSD 4.10-RELEASE #0: Tue  
 May 25
 22:47:12 GMT 2004
 [EMAIL PROTECTED]:/usr/obj/usr/src/sys/GENERIC  i386

 FreeBSD xxxhostnamexxx 4.10-RELEASE-p7 FreeBSD 4.10-RELEASE-p7 #3: Thu
 Apr 14 15:34:37 EST 2005
 [EMAIL PROTECTED]:/usr/obj/usr/src/sys/SVMYSQL3  i386


 mjt

 -- 
 A: Because it fouls the order in which people normally read text.
 Q: Why is top-posting such a bad thing?
 A: Top-posting.
 Q: What is the most annoying thing on usenet and in e-mail?

 Murray Taylor
 Bytecraft Systems
 P: +61 3 8710 2555
 F: +61 3 8710 2599
 D: +61 3 9238 4275
 E: [EMAIL PROTECTED]

 Those are a lot of files =\... a temporary solution might be to  
compress old log files past a particular date, or use an actual  
database system, but yeah... I could see something odd occurring with  
the FreeBSD machine.
 However, you didn't provide a lot of information about the FBSD  
machine. What are the specs for it, hardware-wise? Just curious.
 Also, what's the approximate amount of files in the directory in  
question?
-Garrett



Hi Garrett,

File count is on the order of 2500 and growing...
They are actually message files about 2 k long, which have to be kept
for 20 days
and we get 300 ish per day.

I am going to mod the processes to move files through the smbfs link
to other holding (rotated) directories on the FBSD box, but am trying to
nail down if this 
file count / access method is the root cause of the problem.


The machines are

CPU: Intel(R) Pentium(R) 4 CPU 2.00GHz (2000.04-MHz 686-class CPU)
  Origin = GenuineIntel  Id = 0xf24  Stepping = 4
 
Features=0x3febfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE
,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SS
E2,SS,HTT,TM
real memory  = 266801152 (260548K bytes)
avail memory = 254156800 (248200K bytes)

we have had the same problem on this one

CPU: Intel(R) Xeon(TM) CPU 3.20GHz (3200.13-MHz 686-class CPU)
  Origin = GenuineIntel  Id = 0xf41  Stepping = 1
 
Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE
,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SS
E2,SS,HTT,TM,PBE
  Hyperthreading: 2 logical CPUs
real memory  = 2147274752 (2096948K bytes)
avail memory = 2088042496 (2039104K bytes)

This one was also running the database at the time which got ugly, so

Re: unexplained system hangs - possible smbfs issue ??

2005-10-09 Thread Garrett Cooper

Murray,
Have you thought of looking into filing a bug report with the  
Samba people (http://www.samba.org/)? This may be an issue with  
either your client program, or the SMB implementation in Win2k3,  
which can be solved by getting the ball rolling with SMB and/or  
possibly MS.
Either way, that is quite a few files to have to parse through,  
and although it may seem somewhat ludicrous, adding an additional  
script to presort out your minute reports would greatly reduce the  
amount of open-file records you need, and while that may not be a  
permanent solution it can serve as a better base for sorting your  
data. You could just create proper directories on the Win2k3 server,  
like %BASE_DIR%\Year\Day\Hour, if you get a large volume of files, or  
just strictly put them in a daily directory since it sounds like your  
volume is manageable. Plus, it's probably easier for humans to manage  
as opposed to 2000+ flat files in the same directory ;). Any SQL  
would handle this issue nicely as well since one of databases' best  
selling points is this type of application.

-Garrett
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]