Re: DigiBoard Xem with 2 extenal modules

2008-08-05 Thread Peter Jeremy
On 2008-Aug-04 13:49:58 -0300, Alexandre Biancalana [EMAIL PROTECTED] wrote:
Modems connected but they only work on ports of the first module, any
decice connected on ports of second module does not work.

ISTR the problem I ran into was that the octet read out of the driver
bore little resemblance to the serial data written into the port.
The bit-rate would match and framing was reported as correct but I
would get nonsense in the top or bottom 4 bits.  My notes are at work
and I'll try and dig them up tomorrow.

Any other idea ?

All I can suggest is comparing the Linux driver with the FreeBSD one
(and maybe someone needs to confirm if it works in Linux).  I couldn't
find a difference but maybe someone will see something I missed.

-- 
Peter Jeremy
Please excuse any delays as the result of my ISP's inability to implement
an MTA that is either RFC2821-compliant or matches their claimed behaviour.


pgpUvI9NuauSF.pgp
Description: PGP signature


Re: em(4) on FreeBSD is sometimes annoying

2008-08-05 Thread Martin
Am Mon, 4 Aug 2008 12:12:24 -0700
schrieb Jack Vogel [EMAIL PROTECTED]:

 OK, so your EEPROM is does not have the bug. As I was
 saying before, I would like to see what back to back behavior is.
 
 And, BTW, back to back does NOT mean hook to the switch,
 that's the very thing that is suspicious. It means NIC to NIC,
 no DHCP, assigned addresses.  And then see that you pass
 traffic, and then unhook cable, see if link goes down, reconnect
 and it should go up.

With no /etc/rc.d/netif script involved during startup everything
always works as expected.

If I comment the line ifconfig_re0=DHCP and start my laptop. I can
assign the address. I can ping the other NIC. If can unhook the cable
the LED goes off, link goes down, I can plug it in again, I can ping
again.

I have also no problems if I start without ifconfig_re0=DHCP and run
dhclient em0 while ethernet cable is unplugged. If I plug it in
again, everything works like above.

But:
If I startup with ifconfig_em0=DHCP AND (binary and!) no cable nothing
of this works correctly.

There must be something ifconfig_em=DHCP causes on startup that
running dhclient does not cause and that provokes the dead state.

 Oh, and exactly what kernel, and driver revision are you using.

Kernel is FreeBSD-STABLE compile date Mon Aug 4 13:41:43 CEST 2008.
It's GENERIC. The em(4) driver is the one used in this STABLE version
of yesterday.



I should perhaps mention that I have some other problems with this
laptop. I cannot boot with a PCMCIA wireless card (atheros) already in
the slot, or I get 100% load on cbb(4) and the system is not usable. If
I boot without the Atheros card and I plug it in later, it mostly is
detected.

AND I get sometimes an NMI when Atheros card is in use AND (binary and
again!) the laptop is on battery. This happens also on Linux.

AND the laptop will almost never recognize the Atheros card, if I am
using a text terminal with high resolution (set by vidcontrol MODE_280
with VESA support compiled in; cannot reproduce now, because I'm
using GENERIC).

I don't know if these things are related. I will get some other laptop
in one or two months. I hope these things won't bother me anymore.

--
Martin


signature.asc
Description: PGP signature


Stable SATA pci card for FreeBSD 6.x/7.0

2008-08-05 Thread Sebastiaan van Erk

Hi,

I'm running FreeBSD 6.3 (I know, I should upgrade), and I just bought an 
add-on pci SATA controller for 2 extra SATA disks.


However, a lot of disk activity on the drives will often cause the 
machine to crash and spontaneously reboot. I checked out which chipset 
was on the card with pciconf -lv and I found it was the Sil 3512. 
Googling showed me that I'm not the only one with problems using this card.


Does anybody have experience with a (preferably not too expensive) 
2-port SATA expansion card which does not have any issues running under 
FreeBSD 6.3/7.0?


[pciconf -lv output]
[EMAIL PROTECTED]:10:0:  class=0x018000 card=0x35121095 chip=0x35121095 
rev=0x01

hdr=0x00
vendor = 'Silicon Image Inc (Was: CMD Technology Inc)'
device = 'Sil 3512 SATALink/SATARaid Controller'
class  = mass storage

[/var/log/messages before the crash]
Aug  5 11:16:14 piglet kernel: 
g_vfs_done():mirror/gm1s1e[WRITE(offset=111376236544, 
length=16384)]error = 6

Aug  5 11:16:17 piglet last message repeated 9 times

Regards,
Sebastiaan


smime.p7s
Description: S/MIME Cryptographic Signature


Re: Query timeouts on FreeBSD 7 over network

2008-08-05 Thread Igor V. Ruzanov

I've tried with the ULE scheduler and 4BSD and tried with and with
out PREEMPTION turned on. Nothing makes a difference.
First of all you could try to connect only two machines via cross-over 
cable, no any switches between the machines, no any VLANs and so on. 
FreeBSD-7.0 works better with ULE-scheduler and kernel should be 
preemtive (options PREEMPTION in kernel config).

- what is your kernel config?


I'm pretty sure this is related to the OS or the em driver in some way, because 
if I disable all ICMP rate limiting and run an extended ping from the local 
firewall, I experience a very low amount of random packet loss in no pattern, 
unlike if you have the ICMP rate limiting enabled.
Once again it would be better if you do analyze the traffic going throuth 
the em-interface excluding your DNS testing data. Try to get the network 
with no any walking packets but dnsperf traffic and no any upinks and/or 
downlinks.



Also, are there any documented recommendations for sysctl values for FreeBSD 
when running BIND for optimal performance?

- What options did you provide to configure script during BIND building?
One of necessary options should be --enable-threads if you build BIND 
under FreeBSD 7.0.



+---+
! CANMOS ISP Network!
+---+
! Best regards  !
! Igor V. Ruzanov, network operational staff!
! e-Mail: [EMAIL PROTECTED]   !
+---+

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Stable SATA pci card for FreeBSD 6.x/7.0

2008-08-05 Thread Jeremy Chadwick
On Tue, Aug 05, 2008 at 12:28:40PM +0200, Sebastiaan van Erk wrote:
 However, a lot of disk activity on the drives will often cause the  
 machine to crash and spontaneously reboot. I checked out which chipset  
 was on the card with pciconf -lv and I found it was the Sil 3512.  
 Googling showed me that I'm not the only one with problems using this 
 card.

Yes, most of the Silicon Image ICs I've read about have odd driver
problems or general issues (even under Windows).  The system rebooting
is an odd one; you sure your PSU can handle two disks?

 Does anybody have experience with a (preferably not too expensive)  
 2-port SATA expansion card which does not have any issues running under  
 FreeBSD 6.3/7.0?

Promise makes some consumer-priced cards which work very well under
FreeBSD (sos@ has full documentation on their cards).

Their RAID controllers (the consumer-level ones) **do not** require that
you use RAID; they support JBOD, and the disks will show up under
FreeBSD as ad(4) devices.  (If you choose to use the RAID, you'll still
see the ad(4) disks, but you'll also see an ar(4) device too.  This has
the added advantage of you being able to monitor SMART stats on the
disks themselves directly, etc...

 [pciconf -lv output]
 [EMAIL PROTECTED]:10:0:  class=0x018000 card=0x35121095 chip=0x35121095  
 rev=0x01
 hdr=0x00
 vendor = 'Silicon Image Inc (Was: CMD Technology Inc)'
 device = 'Sil 3512 SATALink/SATARaid Controller'
 class  = mass storage

 [/var/log/messages before the crash]
 Aug  5 11:16:14 piglet kernel:  
 g_vfs_done():mirror/gm1s1e[WRITE(offset=111376236544, length=16384)] error = 6
 Aug  5 11:16:17 piglet last message repeated 9 times

Are you sure this is being caused by the controller?  Have you checked
SMART statistics on both disks?  Assuming error == errno, errno 6 is
Device not configured.

There's been recent discussion of such messages being caused by the use
of gmirror or gjournal, when the mirror/journal is improperly set up.
(In one users' case, he was receiving similar errors, as well as the
filesystem failing during fsck.  Turns out he incorrectly configured
journalling, which nuked the last ~1MB of his UFS filesystem.)

I'm not saying this is the reason for the messages you see, but it's
something to keep in mind.

-- 
| Jeremy Chadwickjdc at parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Query timeouts on FreeBSD 7 over network

2008-08-05 Thread Jeremy Chadwick
On Tue, Aug 05, 2008 at 03:04:50PM +0400, Igor V. Ruzanov wrote:
 I've tried with the ULE scheduler and 4BSD and tried with and with
 out PREEMPTION turned on. Nothing makes a difference.
 First of all you could try to connect only two machines via cross-over  
 cable, no any switches between the machines, no any VLANs and so on.  
 FreeBSD-7.0 works better with ULE-scheduler and kernel should be  
 preemtive (options PREEMPTION in kernel config).
 - what is your kernel config?

 I'm pretty sure this is related to the OS or the em driver in some way, 
 because if I disable all ICMP rate limiting and run an extended ping from 
 the local firewall, I experience a very low amount of random packet loss in 
 no pattern, unlike if you have the ICMP rate limiting enabled.

I believe -stable just got added to this thread, so I'm not sure if
these details were provided prior.  My apologies if this stuff has
already been dealt with.

1) Are there any messages from the kernel about watchdog timeouts or
other anomalies pertaining to the network?  Look in dmesg.

2) pciconf -lv (only include the Ethernet entries please), vmstat -i
and netstat -in output.

3) Try disabling MSI/MSI-X via /boot/loader.conf variables (you'll
need to reboot after this):
 hw.pci.enable_msi=0
 hw.pci.enable_msix=0

4) Disabling TSO on the interface, and in the OS:
 ifconfig emXX -tso
 sysctl net.inet.tcp.tso=0

-- 
| Jeremy Chadwickjdc at parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Stable SATA pci card for FreeBSD 6.x/7.0

2008-08-05 Thread Sebastiaan van Erk

Hi,

Thanks for the reply.

Jeremy Chadwick wrote:

Yes, most of the Silicon Image ICs I've read about have odd driver
problems or general issues (even under Windows).  The system rebooting
is an odd one; you sure your PSU can handle two disks?


Well, I've got a 450W Asus PSU in there, but I've also got 6 hard disks
and 1 dvd-rom drive (mostly inactive) in there. The hard disks are
mostly 250/300GB but the two new ones are 1TB SATA drives. But the 450W
should easily be enough, shouldn't it?

Does anybody have experience with a (preferably not too expensive)  
2-port SATA expansion card which does not have any issues running under  
FreeBSD 6.3/7.0?


Promise makes some consumer-priced cards which work very well under
FreeBSD (sos@ has full documentation on their cards).

 

Their RAID controllers (the consumer-level ones) **do not** require that
you use RAID; they support JBOD, and the disks will show up under
FreeBSD as ad(4) devices.  (If you choose to use the RAID, you'll still
see the ad(4) disks, but you'll also see an ar(4) device too.  This has
the added advantage of you being able to monitor SMART stats on the
disks themselves directly, etc...


I'll have a look at that if I can't get this one stable. They're
reasonably priced, so if they're good with FreeBSD then that looks like
a good option to me.


[pciconf -lv output]
[EMAIL PROTECTED]:10:0:  class=0x018000 card=0x35121095 chip=0x35121095  
rev=0x01

hdr=0x00
vendor = 'Silicon Image Inc (Was: CMD Technology Inc)'
device = 'Sil 3512 SATALink/SATARaid Controller'
class  = mass storage

[/var/log/messages before the crash]
Aug  5 11:16:14 piglet kernel:  
g_vfs_done():mirror/gm1s1e[WRITE(offset=111376236544, length=16384)] error = 6

Aug  5 11:16:17 piglet last message repeated 9 times


Are you sure this is being caused by the controller?  Have you checked
SMART statistics on both disks?  Assuming error == errno, errno 6 is
Device not configured.


I did look at the smart stats [pasted them below]. What I will try next
is just to switch the two 250GB SATA drives on my main board with the
two 1TB drives on the controller and see if I still get the problems if
I really increase the load on the two 1TB drives.


There's been recent discussion of such messages being caused by the use
of gmirror or gjournal, when the mirror/journal is improperly set up.
(In one users' case, he was receiving similar errors, as well as the
filesystem failing during fsck.  Turns out he incorrectly configured
journalling, which nuked the last ~1MB of his UFS filesystem.)

I'm not saying this is the reason for the messages you see, but it's
something to keep in mind.


I'll try reconfigure the geom. I used an online tutorial, but I'm not
quite sure that I did everything correctly, though fsck worked alright.
I did do this one differently than usual though, usually I use full disk
mirror after I already initialized one of the disks, and then I convert
it to a mirror by using:

sysctl kern.geom.debugflags=16
gmirror label -v -b round-robin gm0 /dev/ad0
gmirror insert gm0 /dev/ad2

(Especially useful when you want the entire FreeBSD install to be
mirrored). I guess I can try this on the extra disks as well.

Regards,
Sebastiaan


smime.p7s
Description: S/MIME Cryptographic Signature


Re: Stable SATA pci card for FreeBSD 6.x/7.0

2008-08-05 Thread Sebastiaan van Erk

Hi,

Sorry for forgetting to paste the smart details. Pressed send too quickly.

However, when I did check the smart stats again, I noticed I'd been 
smartctling the wrong disk (duh), and smart was not enabled on the new 
disks. I enabled it now, and it comes with a bunch of warnings and other 
stuff


Considering it wasn't enabled, maybe the errors wouldn't show up anyway, 
but here's the output of the smartctl command just in somebody sees 
something to worry about in it... (The ECC recovery count looks rather 
high, I tried -F samsung and -F samsung2 but that didn't help).


Regards,
Sebastiaan

[EMAIL PROTECTED](ttyp3:59:0):~# smartctl -a /dev/ad4
smartctl version 5.37 [i386-portbld-freebsd6.3] Copyright (C) 2002-6 
Bruce Allen

Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Device Model: SAMSUNG HD103UJ
Serial Number:S13PJ1BQ606865
Firmware Version: 1AA01112
User Capacity:1,000,204,886,016 bytes
Device is:In smartctl database [for details use: -P show]
ATA Version is:   7
ATA Standard is:  Not recognized. Minor revision code: 0x52
Local Time is:Tue Aug  5 15:15:20 2008 CEST

== WARNING: May need -F samsung or -F samsung2 enabled; see manual for 
details.


SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
was never started.
Auto Offline Data Collection: 
Disabled.
Self-test execution status:  (   0) The previous self-test routine 
completed
without error or no self-test 
has ever

been run.
Total time to complete Offline
data collection: (11811) seconds.
Offline data collection
capabilities:(0x7b) SMART execute Offline immediate.
Auto Offline data collection 
on/off support.

Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities:(0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability:(0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time:(   2) minutes.
Extended self-test routine
recommended polling time:( 198) minutes.
Conveyance self-test routine
recommended polling time:(  21) minutes.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG VALUE WORST THRESH TYPE 
UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate 0x000f   253   253   051Pre-fail 
Always   -   0
  3 Spin_Up_Time0x0007   090   090   011Pre-fail 
Always   -   4050
  4 Start_Stop_Count0x0032   100   100   000Old_age 
Always   -   4
  5 Reallocated_Sector_Ct   0x0033   100   100   010Pre-fail 
Always   -   0
  7 Seek_Error_Rate 0x000f   253   253   051Pre-fail 
Always   -   0
  8 Seek_Time_Performance   0x0025   100   100   015Pre-fail 
Offline  -   0
  9 Power_On_Hours  0x0032   100   100   000Old_age 
Always   -   230
 10 Spin_Retry_Count0x0033   100   100   051Pre-fail 
Always   -   0
 11 Calibration_Retry_Count 0x0012   100   100   000Old_age 
Always   -   0
 12 Power_Cycle_Count   0x0032   100   100   000Old_age 
Always   -   4
 13 Read_Soft_Error_Rate0x000e   253   253   000Old_age 
Always   -   0
183 Unknown_Attribute   0x0032   100   100   000Old_age   Always 
  -   0
184 Unknown_Attribute   0x0033   100   100   099Pre-fail  Always 
  -   0
187 Unknown_Attribute   0x0032   100   100   000Old_age   Always 
  -   0
188 Unknown_Attribute   0x0032   100   100   000Old_age   Always 
  -   0
190 Temperature_Celsius 0x0022   056   056   000Old_age   Always 
  -   740818988
194 Temperature_Celsius 0x0022   052   052   000Old_age   Always 
  -   48 (Lifetime Min/Max 0/12324)
195 Hardware_ECC_Recovered  0x001a   100   100   000Old_age   Always 
  -   153751007
196 Reallocated_Event_Count 

Re: Stable SATA pci card for FreeBSD 6.x/7.0

2008-08-05 Thread Jeremy Chadwick
On Tue, Aug 05, 2008 at 03:16:41PM +0200, Sebastiaan van Erk wrote:
 Sorry for forgetting to paste the smart details. Pressed send too quickly.

A note for the list: Sebastiaan and I are discussing the details
off-list.  I don't know if he forgot to CC the list on his replies, or
if he intentionally sent them to me directly.  :-)

Just thought I'd make note of that here, in case readers wonder what
becomes of this issue.

-- 
| Jeremy Chadwickjdc at parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Stable SATA pci card for FreeBSD 6.x/7.0

2008-08-05 Thread Anton - Valqk
Hi,
I'm interested and following the thread,
can you pls send the communication that's of importance to all
at the end?

cheers,
valqk.
Jeremy Chadwick wrote:
 On Tue, Aug 05, 2008 at 03:16:41PM +0200, Sebastiaan van Erk wrote:
   
 Sorry for forgetting to paste the smart details. Pressed send too quickly.
 

 A note for the list: Sebastiaan and I are discussing the details
 off-list.  I don't know if he forgot to CC the list on his replies, or
 if he intentionally sent them to me directly.  :-)

 Just thought I'd make note of that here, in case readers wonder what
 becomes of this issue.

   


-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Max size of one swap slice

2008-08-05 Thread Lin Jui-Nan Eric
Hi All,

Recently we found that we can only allocate 32GB for one swap slice.
Does there is any sysctl oid  or any kernel option to increase it? Why
we have this restriction?
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Stable SATA pci card for FreeBSD 6.x/7.0

2008-08-05 Thread Sebastiaan van Erk

Hi,

Sorry about that, I believe I only messed up on my first reply, and I 
thought I mailed that to the list as well after I noticed I messed up.


Thing is, I'm used to replying to mailing lists using the Reply button 
and unfortunately the reply doesn't go to the mailing list when I do 
that... Some people don't like it when you send it to the mailing list 
and CC it to them personally, but since you apparently do, I'll just use 
reply-all from now on.


Sorry again about the mistake,

Regards,
Sebastiaan

Jeremy Chadwick wrote:

On Tue, Aug 05, 2008 at 03:16:41PM +0200, Sebastiaan van Erk wrote:

Sorry for forgetting to paste the smart details. Pressed send too quickly.


A note for the list: Sebastiaan and I are discussing the details
off-list.  I don't know if he forgot to CC the list on his replies, or
if he intentionally sent them to me directly.  :-)

Just thought I'd make note of that here, in case readers wonder what
becomes of this issue.



smime.p7s
Description: S/MIME Cryptographic Signature


Stuck in geli

2008-08-05 Thread Sean C. Farley

Rarely, a geli partition I have freezes a process in bufwait state.  It
occurs after an ATA timeout message:
Aug  5 03:47:13 thor kernel: ad10: TIMEOUT - WRITE_DMA retrying (1 retry left) 
LBA=219028637

The geli partition resides on an Intel MatrixRAID RAID1 mirror using the
ICH9R chipset (Asus P5K-E/WIFI).  Killing (even -9) the process does not
work.  Rebooting is the only solution, yet the system is unable to flush
the buffers and complete a clean unmounting.

Both drives in the mirror have both survived a smartctl -t offline scan.
Also, a previous time it was with ad12, so I strongly doubt it is the
drive.  It seems like a geli partition is unable to handle a timeout
from a drive.

ad10:
Model Family: Seagate Barracuda 7200.7 and 7200.7 Plus family
Device Model: ST3160827AS

ad12:
Model Family: Seagate Barracuda 7200.7 and 7200.7 Plus family
Device Model: ST3160827AS

Sean

P.S. I could not help myself with the subject line.  :)
--
[EMAIL PROTECTED]
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Max size of one swap slice

2008-08-05 Thread David Wolfskill
On Tue, Aug 05, 2008 at 11:39:11PM +0800, Lin Jui-Nan Eric wrote:
 ...
 Recently we found that we can only allocate 32GB for one swap slice.

Yes.

 Does there is any sysctl oid  or any kernel option to increase it? Why
 we have this restriction?

I don't know why, though I suspect it may have something to do with the
way storage in the swap space is allocated  addressed.

You may, however, have more than 1 swap space.  (Per man pages for
swapon(8) and swapoff(8), the default maximum is 4.)

Peace,
david
-- 
David H. Wolfskill  [EMAIL PROTECTED]
I submit that conspiracy would be an appropriate collective noun for cats.

See http://www.catwhisker.org/~david/publickey.gpg for my public key.


pgp9jUikPpVDV.pgp
Description: PGP signature


Re: Max size of one swap slice

2008-08-05 Thread Chuck Swiger

On Aug 5, 2008, at 8:39 AM, Lin Jui-Nan Eric wrote:

Recently we found that we can only allocate 32GB for one swap slice.
Does there is any sysctl oid or any kernel option to increase it?  Why
we have this restriction?


This size limitation likely predates the availability of disks larger  
than 32GB.


It's hard to conceive of why you'd want to add so much swap space,  
anyway-- if you've got programs which actually need to deal with 10s  
of gigabytes worth of data, then they ought to maintain a smaller/ 
reasonable-sized working set in RAM and do disk I/O as needed  
themselves rather than depend upon the VM pager, anyways.


(Well, when using a BSD-derived kernel, anyways.  Some of the Mach  
kernels support userland VM pager implementations, so that things like  
a database or Photoshop can provide their own pager which understands  
their workload and chooses which pages to evict or replace better than  
the default system pager algorithm can.)


Regards,
--
-Chuck

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Stable SATA pci card for FreeBSD 6.x/7.0

2008-08-05 Thread Sebastiaan van Erk

Jeremy Chadwick wrote:


First and foremost, you've forgotten to CC the mailing list on all but
one of your replies.  I'll assume this is intentional, but it's probably
not for the best, as readers may find your post and wonder what the
outcome was.


It was not intentional, it hit reply instead of reply-all. Sorry. I will 
reply this to the list, so other interested parties can follow the 
thread and your informative replies.



On Tue, Aug 05, 2008 at 02:47:45PM +0200, Sebastiaan van Erk wrote:

Hi,

Thanks for the reply.

Jeremy Chadwick wrote:

Yes, most of the Silicon Image ICs I've read about have odd driver
problems or general issues (even under Windows).  The system rebooting
is an odd one; you sure your PSU can handle two disks?
Well, I've got a 450W Asus PSU in there, but I've also got 6 hard disks  
and 1 dvd-rom drive (mostly inactive) in there. The hard disks are  
mostly 250/300GB but the two new ones are 1TB SATA drives. But the 450W  
should easily be enough, shouldn't it?


Without getting into semantics, a 450W PSU may be on the light side for
6 disks.  I'm fairly amazed you're able to power up that machine without
disk errors or other problems during POST.  You'll be having 6 disks
spin up all simultaneously -- and spin-up is when disks draw the most
power, and possibly during normal operation.

If you have a different (or larger) PSU, I would recommend trying that
to see if it addresses your problem.  A PSU which isn't providing enough
power will cause the disks to occasionally disconnect from the bus, or
the machine sporadtically lock up, reboot (power-cycle), or other odd
things.


Unfortunately I don't have a larger PSU lying around, but I could buy 
one; though I'd like to try some other stuff first because I've had 6 
disks in my PC before without any problems.



[/var/log/messages before the crash]
Aug  5 11:16:14 piglet kernel:   
g_vfs_done():mirror/gm1s1e[WRITE(offset=111376236544, length=16384)] 
error = 6

Aug  5 11:16:17 piglet last message repeated 9 times

Are you sure this is being caused by the controller?  Have you checked
SMART statistics on both disks?  Assuming error == errno, errno 6 is
Device not configured.
I did look at the smart stats [pasted them below]. What I will try next  
is just to switch the two 250GB SATA drives on my main board with the  
two 1TB drives on the controller and see if I still get the problems if  
I really increase the load on the two 1TB drives.


More and more information about your system configuration is coming to
light.  Your original post didn't disclose any of that; now I know you
have 6 disks in the system, 2 of which are using on-board SATA (no idea
what controller), and 2 which are using a Silicon Image controller.
What are the remaining 2 disks connected to?


Sorry that I didn't give you that information immediately. The problem 
when you do that though is that the post is sometimes ignored because it 
is deemed too long or complicated (at least I've seen that happen). I'll 
glady post any relevant data.


My other (on-board) SATA controller is a VIA controller; and I've never 
had any problems with it (although the hardware raid messed up once a 
year or 2 ago, and since then I've been using software raid without any 
issues).


[EMAIL PROTECTED]:15:0:  class=0x010400 card=0x71421462 chip=0x31491106 
rev=0x80 hdr=0x00

vendor = 'VIA Technologies Inc'
device = 'VT8237  VT6410 SATA RAID Controller'
class  = mass storage
subclass   = RAID

The remaining disks are PATA disks which are in the on-board IDE 
controller. It's a legacy computer that's been upgraded a lot, though 
it's not too obsolete, the CPU's a AMD Sempron(tm) Processor 2600+ 
(1599.83-MHz 686-class CPU).



Your recommended method of troubleshooting (swapping the 250G for the
1TB) is a good idea.  But hear me loud and clear: just because you
switch the disks and the problem disappears for a few hours doesn't mean
it's gone.  There have been **many** people who have shown up on the
mailing lists stating I did X thing and now it works!, only to find
that a week later it *didn't* fix the problem.


Yes, I don't really expect it to solve the problem, but was thinking 
that at least I could try and stress test the known working disks on the 
controller and try to see if it's the controller that's the problem or 
the disks (or something else). I've been able to reproduce the crashes 
pretty well by just doing a lot of disk IO on the 1TB disks only (so the 
other disks were pretty idle during the tests).



There's been recent discussion of such messages being caused by the use
of gmirror or gjournal, when the mirror/journal is improperly set up.
(In one users' case, he was receiving similar errors, as well as the
filesystem failing during fsck.  Turns out he incorrectly configured
journalling, which nuked the last ~1MB of his UFS filesystem.)

I'm not saying this is the reason for the messages you see, but it's
something to keep in 

Re: Max size of one swap slice

2008-08-05 Thread Max Laier
On Tuesday 05 August 2008 17:39:11 Lin Jui-Nan Eric wrote:
 Recently we found that we can only allocate 32GB for one swap slice.
 Does there is any sysctl oid  or any kernel option to increase it? Why
 we have this restriction?

this is a consequence of the data structure used to manage swap space.  See 
sys/blist.h for details.  It *seems* that you *might* be able to increase the 
coverage by decreasing BLIST_META_RADIX, but that's from a quick glance and 
most certainly not a good idea.

However, the blist is a abstract enough API so that you can likely replace it 
with something that supports 64bit addresses (and thus 512*2^64 bytes of swap 
space per device) ... but I don't see why you'd want to do something like 
this.  Remember that you need memory to manage your swap space as well!

-- 
/\  Best regards,  | [EMAIL PROTECTED]
\ /  Max Laier  | ICQ #67774661
 X   http://pf4freebsd.love2party.net/  | [EMAIL PROTECTED]
/ \  ASCII Ribbon Campaign  | Against HTML Mail and News
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Max size of one swap slice

2008-08-05 Thread Matthew Dillon
: Recently we found that we can only allocate 32GB for one swap slice.
: Does there is any sysctl oid  or any kernel option to increase it? Why
: we have this restriction?
:
:this is a consequence of the data structure used to manage swap space.  See 
:sys/blist.h for details.  It *seems* that you *might* be able to increase the 
:coverage by decreasing BLIST_META_RADIX, but that's from a quick glance and 
:most certainly not a good idea.
:
:However, the blist is a abstract enough API so that you can likely replace it 
:with something that supports 64bit addresses (and thus 512*2^64 bytes of swap 
:space per device) ... but I don't see why you'd want to do something like 
:this.  Remember that you need memory to manage your swap space as well!
:
:-- 
:/\  Best regards,  | [EMAIL PROTECTED]
:\ /  Max Laier  | ICQ #67774661

The core structures can handle 2 billion swap pages == 2TB of swap,
but the blist code hits arithmatic overflows if a single blist has
more then (0x4000 / BLIST_META_RADIX) = 1G/16 = 64M swap blocks,
or 256GB.

I think the VM/BIO system had additional overflow issues due to
conversions back and forth between PAGE_SIZE and DEV_BSIZE which
further restricted the limit to 32GB.  Those restrictions may be gone
now that FreeBSD is using 64 bit block numbers, so you may be able to
pop it up to 256GB with virtually no effort (but you need to test it
significantly!).

With some work on the blist code only (not its structures) the arithmatic
overflow issues could also be resolved, increasing the swap capability
to 2TB.

I do not recommend changing any of the core blist structure, particularly
not BLIST_META_RADIX.  Just don't try :-).  You do NOT want to bump
the swap block number fields to 64 bits.

Also note that significant memory is used to manage that much swap.  It's
a factor of 1:16384 or so for the blist structures and probably about
the same amount for the vm_object tracking structures.  32G of swap needs
around 2-4MB of wired ram.

-Matt
Matthew Dillon 
[EMAIL PROTECTED]
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: HEADS UP: inpcb/inpcbinfo rwlocking: coming to a 7-STABLE branch near you

2008-08-05 Thread David G Lawrence
 The thrust of this change is to replace the mutexes protecting the inpcb 
 and inpcbinfo data structures with read-write locks (rwlocks).  These 

   That's really cool and directly affects my current work project. I'm
developing (have developed, actually) a multi-threaded, 5000+ member VoIP/SIP
conferencing server called Nconnect. It a primarily UDP application running
on FreeBSD 7. This generates and receives about 250,000 UDP packets a second,
with 200 byte packets, resulting in about 400Mbps of traffic in each
direction. The current bottleneck is the kernel UDP processing. It should
be possible to scale to 1+ members if kernel UDP processing had optimal
concurrency.
   Anyway, thumbs up (and not for the middle-eastern meaning :-)) - I'm
looking forward to the MFC.

-DG

Dr. David G. Lawrence
President
Download Technologies, Inc. - http://www.downloadtech.com - (866) 399 8500
Pave the road of life with opportunities.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Upcoming ABI Breakage in RELENG_7

2008-08-05 Thread Matt
On Tue, Jul 29, 2008 at 10:45 AM, Ken Smith [EMAIL PROTECTED] wrote:

 Normally the FreeBSD Project tries very hard to avoid ABI breakage in
 Stable Branches.  However occasionally the fix for a bug can not be
 implemented without ABI breakage, and it is decided that the fix
 warrants the impact of the ABI breakage.  We have one of those
 situations coming along for RELENG_7 (what will become FreeBSD 7.1).
 The ABI breakage should only impact kernel modules that are not part of
 the baseline system (those will be patched by the MFC) which deal with
 advisory locks.  As such the impact should not cause many people
 problems.

 The work that will be MFCed fixes issues with filesystem advisory locks,
 and moves the advisory locks list from filesystem-private data
 structures into the vnode structure.

 The MFC will be done by Kostantin Belousov some time this coming Friday
 (August 1st, 2008) if you have concerns and want to watch for it.


Just updated to 7-STABLE last night and found that Truecrypt (which
uses a combination of fuse and mdconfig to mount its encrypted
volumes) is now completely unable to mount its encrypted volumes.
I've rebuilt fusefs-kmod and fusefs-libs as well as Truecrypt to no
avail.

As Truecrypt is implementing an encrypted file system, I am assuming
at this point that the kernel ABI breakage may be what is causing this
new problem.  Are the any calls in particular that I should be looking
for in the Truecrypt code to see where it might be choking on the new
kernel ABI?  I've briefly reviewed some of the changes in r181119 but
lack the experience to discern anything useful from them.  I'd like to
file an issue with the Truecrypt devs if I can get an idea of where
the breakage might be.

Thanks,
Matt

 Thanks.

 --
Ken Smith
 - From there to here, from here to  |   [EMAIL PROTECTED]
  there, funny things are everywhere.   |
  - Theodore Geisel |


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Upcoming ABI Breakage in RELENG_7

2008-08-05 Thread Doug Barton

Matt wrote:

Just updated to 7-STABLE last night and found that Truecrypt (which
uses a combination of fuse and mdconfig to mount its encrypted
volumes) is now completely unable to mount its encrypted volumes.
I've rebuilt fusefs-kmod and fusefs-libs as well as Truecrypt to no
avail.


Silly question, but did you build AND install both a new world AND a 
new kernel before you rebuilt your ports?


Doug

--

This .signature sanitized for your protection

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Upcoming ABI Breakage in RELENG_7

2008-08-05 Thread Matt
On Tue, Aug 5, 2008 at 5:09 PM, Doug Barton [EMAIL PROTECTED] wrote:
 Matt wrote:

 Just updated to 7-STABLE last night and found that Truecrypt (which
 uses a combination of fuse and mdconfig to mount its encrypted
 volumes) is now completely unable to mount its encrypted volumes.
 I've rebuilt fusefs-kmod and fusefs-libs as well as Truecrypt to no
 avail.

 Silly question, but did you build AND install both a new world AND a new
 kernel before you rebuilt your ports?

Yes - full, clean (deleted /usr/obj/* before build) build on world +
kernel.  I've run it through a couple ktrace runs looking for obvious
problems, but haven't found anything that I can interpret yet.  It
appears the failures are triggering around some file-descriptor
errors, but nothing that makes any sense yet.  I'll look at things a
little closer when I get some more time - need to boot into a backup
copy of the 7-RELEASE system to get some baselines.

Matt

 Doug

 --

This .signature sanitized for your protection


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Stable SATA pci card for FreeBSD 6.x/7.0

2008-08-05 Thread Jeremy Chadwick
On Tue, Aug 05, 2008 at 07:08:20PM +0200, Sebastiaan van Erk wrote:
 Jeremy Chadwick wrote:
 On Tue, Aug 05, 2008 at 02:47:45PM +0200, Sebastiaan van Erk wrote:
 Hi,

 Thanks for the reply.

 Jeremy Chadwick wrote:
 Yes, most of the Silicon Image ICs I've read about have odd driver
 problems or general issues (even under Windows).  The system rebooting
 is an odd one; you sure your PSU can handle two disks?
 Well, I've got a 450W Asus PSU in there, but I've also got 6 hard 
 disks  and 1 dvd-rom drive (mostly inactive) in there. The hard disks 
 are  mostly 250/300GB but the two new ones are 1TB SATA drives. But 
 the 450W  should easily be enough, shouldn't it?

 Without getting into semantics, a 450W PSU may be on the light side for
 6 disks.  I'm fairly amazed you're able to power up that machine without
 disk errors or other problems during POST.  You'll be having 6 disks
 spin up all simultaneously -- and spin-up is when disks draw the most
 power, and possibly during normal operation.

 If you have a different (or larger) PSU, I would recommend trying that
 to see if it addresses your problem.  A PSU which isn't providing enough
 power will cause the disks to occasionally disconnect from the bus, or
 the machine sporadtically lock up, reboot (power-cycle), or other odd
 things.

 Unfortunately I don't have a larger PSU lying around, but I could buy  
 one; though I'd like to try some other stuff first because I've had 6  
 disks in my PC before without any problems.

See the very bottom of my mail.  I don't believe the PSU is the problem,
after reviewing your SMART statistics.

 ...parts of thread cut...

 My other (on-board) SATA controller is a VIA controller; and I've never  
 had any problems with it (although the hardware raid messed up once a  
 year or 2 ago, and since then I've been using software raid without any  
 issues).

Okay, so you've got an onboard VIA (VT6410) SATA controller, an onboard
VIA IDE controller, and a PCI SATA controller.  I'd still like to know
which disks are attached to what controller, and if any of the devices
are sharing IRQs.  Can you provide the output from the following two
commands?

dmesg | egrep 'atapci|(ad|ata)[0-9]+'
vmstat -i

I'm just trying to narrow stuff down.

 Your recommended method of troubleshooting (swapping the 250G for the
 1TB) is a good idea.  But hear me loud and clear: just because you
 switch the disks and the problem disappears for a few hours doesn't mean
 it's gone.  There have been **many** people who have shown up on the
 mailing lists stating I did X thing and now it works!, only to find
 that a week later it *didn't* fix the problem.

 Yes, I don't really expect it to solve the problem, but was thinking  
 that at least I could try and stress test the known working disks on the  
 controller and try to see if it's the controller that's the problem or  
 the disks (or something else). I've been able to reproduce the crashes  
 pretty well by just doing a lot of disk IO on the 1TB disks only (so the  
 other disks were pretty idle during the tests).

It's interesting that the disks which are giving you trouble are Samsung
disks.  There's some history here which you should be made aware of:

In July, Daniel Eriksson reported data corruption occurring with his
nVidia MCP55 chipset when 1TB Samsung disks were attached to it.  The
same disks on another controller performed fine.  The corruption was
being detected by ZFS as checksum errors.  (UFS/UFS2 won't detect this
sort of thing, unless the corruption is occurring somewhere within the
filesystem tables.)

http://lists.freebsd.org/pipermail/freebsd-stable/2008-July/043427.html

Soren Schmidt (ata(4) author) replied that there are some nVidia
chipset-related fixes for ATA in -CURRENT, and provided a patch.  Daniel
reported that the patch made absolutely no difference:

http://lists.freebsd.org/pipermail/freebsd-stable/2008-July/043434.html

Daniel also tried using a firmware patch for his Samsung disks, which
limit the SATA speed to SATA150, but the speed was still negotiated as
SATA300 (indicating the vendors' own f/w patch is broken, or FreeBSD
does not play well with it).  The f/w patch didn't fix his problem
either:

http://lists.freebsd.org/pipermail/freebsd-stable/2008-July/043432.html

[EMAIL PROTECTED] reported using his MCP55 controller without any
problem -- as long as he didn't use Samsung disks.  He stated that he
believes Samsung disks are PATA disks that use a PATA-to-SATA adapter
inside of the drive, leading to problems (and yes, those adapters are
known to cause all sorts of mayhem):

http://lists.freebsd.org/pipermail/freebsd-stable/2008-July/043485.html

I'm not sure what became of the thread; Daniel never provided a
post-mortem.  I'm left to believe he probably took [EMAIL PROTECTED]'s
advice and switched to another disk vendor.

 ...parts of thread cut...
 ...smartctl output for both Samsung disks...

Thanks for upgrading to 5.38.  All the SMART statistics for these 

Re: Stuck in geli

2008-08-05 Thread Jeremy Chadwick
On Tue, Aug 05, 2008 at 10:45:16AM -0500, Sean C. Farley wrote:
 Rarely, a geli partition I have freezes a process in bufwait state.  It
 occurs after an ATA timeout message:
 Aug  5 03:47:13 thor kernel: ad10: TIMEOUT - WRITE_DMA retrying (1 retry 
 left) LBA=219028637

This looks like the issue I've been tracking for months now.  I'm sorry
the document isn't complete; it's an issue of time...

http://wiki.freebsd.org/JeremyChadwick/ATA_issues_and_troubleshooting

 The geli partition resides on an Intel MatrixRAID RAID1 mirror using the
 ICH9R chipset (Asus P5K-E/WIFI).  Killing (even -9) the process does not
 work.  Rebooting is the only solution, yet the system is unable to flush
 the buffers and complete a clean unmounting.

After reading my above Wiki page, I hope you consider disabling
MatrixRAID and avoiding it entirely on FreeBSD.  There are patches to
address major issues which have been sitting untouched, despite patches
included, for 2+ years.  Draw your own conclusions.

Also, you won't be able to kill -9 a process in that state.  The kernel
(or some piece of it) is hung, not the process.  The fact that a reboot
is required also does not surprise me.

You *might* have been able to detach the ATA/SATA channel using
atacontrol to get access to the system, but then again it might result
in a system panic (see Wiki).

 Both drives in the mirror have both survived a smartctl -t offline scan.

This doesn't really mean anything; the SMART statistics, self-test
log, and error log are what's more important.  Chances are it's not a
disk issue though...

 Also, a previous time it was with ad12, so I strongly doubt it is the
 drive.  It seems like a geli partition is unable to handle a timeout
 from a drive.

The problem is not with geli(4), as I see it.

 ad10:
 Model Family: Seagate Barracuda 7200.7 and 7200.7 Plus family
 Device Model: ST3160827AS

 ad12:
 Model Family: Seagate Barracuda 7200.7 and 7200.7 Plus family
 Device Model: ST3160827AS

My experiences with disk timeouts on FreeBSD is that the OS does not
handle it well at all, regardless of geli(4) being used or not.  The
entire system can deadlock, and in some cases panic (which for me is
the more common result).

I can't help myself here -- Linux's libata handles this much more
elegantly.  In the case of a failure similar to the above, there is a
brief system deadlock and then full system recovery with EIO (I/O error)
being returned to any process stuck in that state.  There *is* data
loss, but I don't think there's anything one can do about that (on Linux
or FreeBSD); journalling filesystems should solve that aspect.

-- 
| Jeremy Chadwickjdc at parodius.com |
| Parodius Networking   http://www.parodius.com/ |
| UNIX Systems Administrator  Mountain View, CA, USA |
| Making life hard for others since 1977.  PGP: 4BD6C0CB |

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Stable SATA pci card for FreeBSD 6.x/7.0

2008-08-05 Thread Andrey V. Elsukov

Sebastiaan van Erk wrote:

[/var/log/messages before the crash]
Aug  5 11:16:14 piglet kernel: 
g_vfs_done():mirror/gm1s1e[WRITE(offset=111376236544, 
length=16384)]error = 6

Aug  5 11:16:17 piglet last message repeated 9 times


Can you show which messages where before these?

--
WBR, Andrey V. Elsukov
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Stable SATA pci card for FreeBSD 6.x/7.0

2008-08-05 Thread Andrew Snow


I heartily recommend 3ware controllers for FreeBSD 6/7, even if you only 
need 2 ports.


- Andrew

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 5.x to 6.x or 7.x with 64MB /

2008-08-05 Thread Morgan Reed
On Tue, Aug 5, 2008 at 2:48 AM, Nick Barnes [EMAIL PROTECTED] wrote:
 It occurs to me that if ad0s1a is insufficient then I could use ad0s1g
 as swap, and repurpose ad0s1b as a new /.  Is it straightforward to
 installworld/mergemaster to somewhere other than / ?

The DESTDIR directive will allow you to redirect your installworld to
a different location, as for mergemaster IIRC this can be done (been a
while since I was working on my embedded stuff that needed all this
FreeBSD 6.something in about 8MB) but I can't remember what switches.

It might be worth considering building /bin and /sbin dynamically
(20oddMB to about 500kB), mind I can't remember where the required
libs would be, if they're in /lib it'd be all good, if they're in
/usr/lib you're stuck.

All things considered I think your best option is to move / to a
different partition. Should be relatively painless to do from a
LiveCD, mount everything, duplicate / to somewhere else, modify fstab
and the bootloader config, reboot.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]