Re: nfs lockd errors after NetApp software upgrade.

2019-12-23 Thread Adam McDougall
On 12/22/19 12:01 PM, Rick Macklem wrote:

> Well, I've noted the flawed protocol. Here's an example (from my limited 
> understanding of these protocols, where there has never been a published 
> spec) :
> - The NLM supports a "blocking lock request" that goes something like this...
>- client requests lock and is willing to wait for it
>- if server has a conflicting lock on the file, it replies "I'll acquire 
> the lock for
>   you when I can and let you know".
>  --> When the conflicting lock is released, the server acquires the lock 
> and does
> a callback (server->client RPC) to tell the client it now has the 
> lock.
> You don't have to think about this for long to realize that any network 
> unreliability
> or partitioning could result in trouble.
> The kernel RPC layer may do some retries of the RPCs (this is controlled by 
> the
> parameters set for the RPC), but at some point the protocol asks the NSM
> (rpc.statd) if the machine is "up" and then uses the NSM's answer to deal 
> with it.
> (The NSM basically pokes other systems and notes they are "up" if they get
>  replies to these pokes. It uses IP broadcast at some point.)
> 
> Now, maybe switching to TCP will make the RPCs reliable enough that it will
> work, or maybe it won't? (It certainly sounds like the Netapp upgrade is 
> causing
> some kind of network issue, and the NLM doesn't tolerate that well.)
> 
> rick

tl;dr I think netapp effectively nerfed UDP lockd performance in newer
versions, maybe cluster mode.

>From my very un-fun experience after migrating our volumes off an older
netapp onto a new netapp with flash drives (plenty fast) running Ontap
9.x ("cluster mode"), our typical IO load from idle time IMAP
connections was enough to overwhelm the new netapp and drive performance
into the ground. The same IO that was perfectly fine on the old netapp.
Going into a workday in this state was absolutely not possible. I opened
a high priority ticket with netapp, didn't really get anywhere that very
long day and settled on nolockd so I could go home and sleep. Both my
hunch later and netapp support suggested switching lockd traffic to TCP
even though I had no network problems (the old netapp was fine). I think
I still run into occasional load issues but the newer netapp OS seemed
way more capable of this load when using TCP lockd. Of course they also
suggested switching to nfsv4 but I could not seriously entertain
validating that type of change for production in less than a day.
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: nfs lockd errors after NetApp software upgrade.

2019-12-19 Thread Adam McDougall
Try changing bool_t do_tcp = FALSE; to TRUE in
/usr/src/sys/nlm/nlm_prot_impl.c, recompile the kernel and try again. I
think this makes it match Linux client behavior. I suspect I ran into
the same issue as you. I do think I used nolockd is a workaround
temporarily. I can provide some more details if it works.

On 12/19/19 9:21 AM, Daniel Braniss wrote:
> 
> 
>> On 19 Dec 2019, at 16:09, Rick Macklem  wrote:
>>
>> Daniel Braniss wrote:
>> [stuff snipped]
>>> all mounts are nfsv3/tcp
>> This doesn't affect what the NLM code (rpc.lockd) uses. I honestly don't 
>> know when
>> the NLM uses tcp vs udp. I think rpc.statd still uses IP broadcast at times.
> can the replay cache have any influence here? I tend to remember way back 
> issues
> with it,
>>
>> To me, it looks like a network configuration issue.
> that was/is my gut feelings too, but, as far as we can tell, nothing has 
> changed in the network infrastructure,
> the problems appeared after the NetAPP’s software was updated, it was working 
> fine till then.
> 
> the problems are also happening on freebsd 12.1
> 
>> You could capture packets (maybe when a client first starts rpc.statd and 
>> rpc.lockd)
>> and then look at them in wireshark. I'd disable statup of rpc.lockd and 
>> rpc.statd
>> at boot for a test client and then run something like:
>> # tcpdump -s 0 -s out.pcap host 
>> - and then start rpc.statd and rpc.lockd
>> Then I'd look at out.pcap in wireshark (much better at decoding this stuff 
>> than
>> tcpdump). I'd look for things like different reply IP addresses from the 
>> Netapp,
>> which might confuse this tired old NLM protocol Sun devised in the mid-1980s.
>>
> it’s going to be an interesting week end :-(
>  
>>> the error is also appearing on freebsd-11.2-stable, I’m now checking if 
>>> it’s also
>>> happening on 12.1
>>> btw, the NetApp version is 9.3P17
>> Yes. I wasn't the author of the NSM and NLM code (long ago I refused to even
>> try to implement it, because I knew the protocol was badly broken) and I 
>> avoid
>> fiddling with. As such, it won't have change much since around FreeBSD7.
> and we haven’t had any issues with it for years, so you must have done 
> something good
> 
> cheers,
>   danny
> 
>>
>> rick
>>
>> cheers,
>>danny
>>
>>> rick
>>>
>>> Cheers
>>>
>>> Richard
>>> (NetApp admin)
>>>
>>> On Wed, 18 Dec 2019 at 15:46, Daniel Braniss 
>>> mailto:da...@cs.huji.ac.il>> wrote:
>>>
>>>
 On 18 Dec 2019, at 16:55, Rick Macklem 
 mailto:rmack...@uoguelph.ca>> wrote:

 Daniel Braniss wrote:

> Hi,
> The server with the problems is running FreeBSD 11.1 stable, it was 
> working fine for >several months,
> but after a software upgrade of our NetAPP server it’s reporting many 
> lockd errors >and becomes catatonic,
> ...
> Dec 18 13:11:02 moo-09 kernel: nfs server fr-06:/web/www: lockd not 
> responding
> Dec 18 13:11:45 moo-09 last message repeated 7 times
> Dec 18 13:12:55 moo-09 last message repeated 8 times
> Dec 18 13:13:10 moo-09 kernel: nfs server fr-06:/web/www: lockd is alive 
> again
> Dec 18 13:13:10 moo-09 last message repeated 8 times
> Dec 18 13:13:29 moo-09 kernel: sonewconn: pcb 0xf8004cc051d0: Listen 
> queue >overflow: 194 already in queue awaiting acceptance (1 occurrences)
> Dec 18 13:14:29 moo-09 kernel: sonewconn: pcb 0xf8004cc051d0: Listen 
> queue >overflow: 193 already in queue awaiting acceptance (3957 
> occurrences)
> Dec 18 13:15:29 moo-09 kernel: sonewconn: pcb 0xf8004cc051d0: Listen 
> queue >overflow: 193 already in queue awaiting acceptance …
 Seems like their software upgrade didn't improve handling of NLM RPCs?
 Appears to be handling RPCs slowly and/or intermittently. Note that no one
 tests it with IPv6, so at least make sure you are still using IPv4 for the 
 mounts and
 try and make sure IP broadcast works between client and Netapp. I think 
 the NLM
 and NSM (rpc.statd) still use IP broadcast sometimes.

>>> we are ipv4 - we have our own class c :-)
 Maybe the network guys can suggest more w.r.t. why, but as I've stated 
 before,
 the NLM is a fundamentally broken protocol which was never published by 
 Sun,
 so I suggest you avoid using it if at all possible.
>>> well, at the moment the ball is on NetAPP court, and switching to NFSv4 at 
>>> the moment is out of the question, it’s
>>> a production server used by several thousand students.
>>>

 - If the locks don't need to be seen by other clients, you can just use 
 the "nolockd"
 mount option.
 or
 - If locks need to be seen by other clients, try NFSv4 mounts. Netapp 
 filers
 should support NFSv4.1, which is a much better protocol that NFSv4.0.

 Good luck with it, rick
>>> thanks
>>>   danny
>>>
 …
 any ideas?

 thanks,
  danny

 ___

Re: Random panics in 11.0 and 12.0 on J1900

2019-07-24 Thread Adam
On Wed, Jul 10, 2019 at 11:28 AM James Snow  wrote:

> I have a set of J1900 hosts running 11.0-RELEASE-p1 that experience
> seemingly random panics.


What is the size of this J1900 set? Do you also have J1900 which do not
exhibit the problem?


> One, memtest has turned up no errors on 12.0 host I witnessed the panic
> on.
>

memtest cannot conclusively confirm dimm is good, it is only conclusive on
bad ones.  You can find more info about others learning this lesson
here(see extended comments):

https://superuser.com/questions/547822/how-many-passes-are-enough-with-memtest


> Two, a small number of systems on the same hardware are running
> 10.3-RELEASE, and have experienced no panics in their history. Panics
> have only happened on 11s, and now 12.
>

Once upon a time in a hypothetical universe, I had a stick of ram which
would run on Win98 for very long periods without issue.  It wouldn't even
boot with Win NT.  After the manufacturer sent the same one back twice, I
tased it and RMA'd again.  This time, I got a new stick and all was good.

The point is memory issues can be very subtle and replacing with known good
modules is the easiest way to be sure.

-- 
Adam
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: trying to expand a zvol-backed bhyve guest which is UFS

2019-05-19 Thread Adam
On Sun, May 19, 2019 at 9:47 PM tech-lists  wrote:

> Thanks very much to you both, all sorted now. I didn't realise there was
> a 2TB limit for MBR either. Can I shrink the 4TB to 2TB on the zfs side
> without scrambling the ufs on the guest?
>

You can snapshot the zvol to be safe, but you should be able to shrink it
to the existing partition size.  If it's a sparse zvol, it may not may that
much difference.
-- 
Adam
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: lightly loaded system eats swap space

2018-06-18 Thread Adam
On Mon, Jun 18, 2018 at 7:27 AM, tech-lists  wrote:

> On 18/06/2018 09:08, Erich Dollansky wrote:
>
>> Hi,
>>
>> On Sun, 17 Jun 2018 23:19:02 +0100
>> tech-lists  wrote:
>>
>> freebsd-11-stable r333874, ZFS raidz1-0 (3x4TB disks), 128GB RAM,
>>> Swap: 4096M Total, 3502M Used, 594M Free, 85% Inuse
>>>
>>
>> this might not be related but I noticed that your swap space is small
>> compared to RAM size. I noticed on a much smaller Raspberry Pi, that it
>> runs into trouble when there is no swap even there is enough RAM
>> available. Is it easily possible for you to add some GB of swap space
>> and let the machine run then?
>>
>> How much swap do the other machines have?
>>
>
> Hi,
>
> Yes, the machine with the problem uses the default 4GB swap. That's all
> the swap it has. The machine without issue has a swapfile installed on a
> SSD in addition to the default 4GB swap.
>
> problematic machine:
> Device  512-blocks UsedAvail Capacity
> /dev/ada0p38388608 3.3G 714M83%
>
> machine without a problem, it has swapfile installed:
> Device  512-blocks UsedAvail Capacity
> /dev/ada0s1b   8262248 1.7G 2.2G44%
> /dev/md0  65536000 1.9G  29G 6%
> Total 73798248 3.7G  32G10%
>
> I added the swapfile a long time ago on this machine due to the same issue.
>
> But my problem isn't so much an out of swapspace problem; all this is, is
> a symptom. My problem is "why is it swapping out at all on a 128GB system
> and why is what's swapped out not being swapped back in again".
>

What is the output of sysctl vm.overcommit? If this system is intended on
being a VM host, then why don't you limit ARC to something reasonable like
Total Mem - Projected VM Mem - Overhead = Ideal ARC .

-- 
Adam
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: lightly loaded system eats swap space

2018-06-17 Thread Adam
On Sun, Jun 17, 2018 at 5:19 PM, tech-lists  wrote:

> Hello list,
>
> context is (server)
> freebsd-11-stable r333874, ZFS raidz1-0 (3x4TB disks), 128GB RAM, E5-2630
> @2.3GHz, generic kernel.
>
> There's one bhyve guest on this server (using 4x cpu and 16GB RAM, also
> freebsd-11-stable)
>
> There have been no special options for zfs configuration on the server,
> apart from several datasets having the compressed property set (lz4).
>
> The server runs nothing else really apart from sshd and it uses ntpd to
> sync local time.
>
> How come such a lightly loaded server with plenty of resources is eating
> up swap? If I run two bhyve instances, i.e. two of the same size as
> indicated above, so 32GB used for the bhyves, I'll get out-of-swapspace
> errors in the daily logs:
>
> +swap_pager_getswapspace(24): failed
> +swap_pager_getswapspace(24): failed
> +swap_pager_getswapspace(24): failed
>
> Here's top, with one bhyve instance running:
>
> last pid: 49494;  load averages:  0.12,  0.13,  0.88
>
>  up 29+11:36:06  22:52:45
> 54 processes:  1 running, 53 sleeping
> CPU:  0.4% user,  0.0% nice,  0.4% system,  0.3% interrupt, 98.9% idle
> Mem: 8664K Active, 52M Inact, 4797M Laundry, 116G Wired, 1391M Buf, 4123M
> Free
> ARC: 108G Total, 1653M MFU, 105G MRU, 32K Anon, 382M Header, 632M Other
>  103G Compressed, 104G Uncompressed, 1.00:1 Ratio
> Swap: 4096M Total, 3502M Used, 594M Free, 85% Inuse
>
>   PID USERNAMETHR PRI NICE   SIZERES STATE   C   TIMEWCPU
> COMMAND
> 49491 root  1   40 16444K 12024K select  9   0:12   6.49% ssh
> 32868 root 12  200  9241M  4038M kqread  2  23.2H   1.30% bhyve
> 49490 root  1  200 10812K  6192K sbwait  5   0:02   0.88% sftp
>
> From the looks of it, a huge amount of ram is wired. Why is that, and how
> would I debug it?
>

That seems to be shown in the output you provided:
ARC: 108G Total, 1653M MFU, 105G MRU, 32K Anon, 382M Header, 632M Other


>
> A server of similar spec which is running freebsd-current with seven bhyve
> instances doesn't have this issue:
>

Based upon the output neither ram nor swap seems like similar spec so I
wonder if you could say what you mean by that.

-- 
Adam
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: FreeBSD 11.1-release - verbose boot causes the machine to restart

2018-01-31 Thread Adam Vande More
On Wed, Jan 31, 2018 at 3:58 PM, Torfinn Ingolfsen <
torfinn.ingolf...@getmail.no> wrote:

> Ok, this I haven't seen before.
> I just installed FreeBSD 11.1-release on a quite new machine[1]. The
> machine is an ASRock BeeBox-S 7100U, a quite "normal" NUC-form machine with
> a i3-7100U (Kaby Lake) and an internal SSD.
> Installation went great (I only had to manually copy /boot/boot1.efi to
> the correct EFI partition. Yes - there are other operating systems on the
> internal SSD), and the machine boots and works normally.
>
> However, if I select verbose boot from the boot menu, the kernel starts
> spitting out kernel messages and after a while the machine restarts.
> Afterwards (after a normal boot) there is no sign of this unusal activity
> either in dmesg output, /var/log/messages or /var/crash.
>
> Is this something developers want to know more about? If so, pointers on
> howe to debug this further is appreciated.
> Details (including dmesg outpuyt from a normal boot) on the FreeBSD
> page[2] of this machine.
>
> References:
> 1) https://sites.google.com/site/tingox/asrock_beebox-s_7100u
> 2) https://sites.google.com/site/tingox/asrock_beebox-s_7100u_fbsd
>
>
Is it a panic reboot?  If so you can try setting the loader or sysctl
option to not reboot after panic or drop to debugger option.

-- 
Adam
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: Random freezes of my FreeBSD droplet (DigitalOcean)

2017-11-22 Thread Adam Vande More
On Wed, Nov 22, 2017 at 7:17 AM, Stefan Lambrev <che...@freebsd-bg.org>
wrote:

> Greetings,
>
> I have a droplet in DO with very light load, currently
> running 11.0-RELEASE-p15 amd64 GENERIC kernel + zfs (1 GB Memory / 30 GB
> Disk / FRA1 - FreeBSD 11.0 zfs)
>
> I know ZFS needs more memory, but the load is really light. Unfortunatelly
> last few weeks I'm experiencing those freezes almost every second day.
> There are no logs or console messages - just freeze. Networks seems to
> work, but nothing else.
>
> Is there anyone with similar experience here?
> Are there any updates in 11.1 that may affect positively my experience in
> the digital ocean cloud?
>

It's entirely possible to run a stable VM using that configuration so you
haven't provided enough details to give any real help.  A common foot
shooting method is putting swap on zvol, but the possibilities are endless.

-- 
Adam
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: Boot hang on Xen after r318347/(310418)

2017-06-03 Thread Adam McDougall
On 05/25/2017 09:28, Adam McDougall wrote:
> On Thu, May 25, 2017 at 10:41:03AM +0100, Roger Pau Monné wrote:
> 
>> On Wed, May 24, 2017 at 06:33:07PM -0400, Adam McDougall wrote:
>>> Hello,
>>>
>>> Recently I made a new build of 11-STABLE but encountered a boot hang
>>> at this state:
>>> http://www.egr.msu.edu/~mcdouga9/pics/r318347-smp-hang.png
>>>
>>> It is easy to reproduce, I can just boot from any 11 or 12 ISO that 
>>> contains the commit.
>>
>> I have just tested latest HEAD (r318861) and stable/11 (r318854) and
>> they both work fine on my environment (a VM with 4 vCPUs and 2GB of
>> RAM on OSS Xen 4.9). I'm also adding Colin in case he has some input,
>> he has been doing some tests on HEAD and AFAIK he hasn't seen any
>> issues.
>>
>>> I compiled various svn revisions to confirm that r318347 caused the 
>>> issue and r318346 is fine. With r318347 or later including the latest 
>>> 11-STABLE, the system will only boot with one virtual CPU in XenServer. 
>>> Any more cpus and it hangs. I also tried a 12 kernel from head this 
>>> afternoon and I have the same hang. I had this issue on XenServer 7 
>>> (Xen 4.7) and XenServer 6.5 (Xen 4.4). I did most of my testing on 7. I 
>>> also did much of my testing with a GENERIC kernel to try to rule out 
>>> kernel configuration mistakes. When it hangs, the performance 
>>> monitoring in Xen tells me at least one CPU is pegged. r318674 boots 
>>> fine on physical hardware without Xen involved.
>>>
>>> Looking at r318347 which mentions EARLY_AP_STARTUP and later seeing 
>>> r318763 which enables EARLY_AP_STARTUP in GENERIC, I tried adding it to 
>>> my kernel but it turned the hang into a panic but with any number of 
>>> CPUs: 
>>> http://www.egr.msu.edu/~mcdouga9/pics/r318347-early-ap-startup-panic.png
>>
>> I guess this is on stable/11 right? The panic looks easier to debug
>> that the hang, so let's start by this one. Can you enable the serial
>> console and kernel debug options in order to get a trace? With just
>> this it's almost impossible to know what went wrong.
> 
> Yes this was on stable/11 amd64.
> 
>>
>> Roger.

I worked on this today and the short version is recent kernels no longer
hang or panic with EARLY_AP_STARTUP which includes the 20170602 iso
images of 11 and 12. Adding EARLY_AP_STARTUP to my kernel config appears
to prevent the hang and something between r318855 (May 24) and r319554
(today, June 3) prevents the panic. I'm tempted to figure out which
commit but I already spent hours bisecting and building today, so since
this seems to be a forward working solution, I'm content. Thanks.
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: Errors with ports on 9.3..

2017-06-03 Thread Adam Vande More
On Sat, Jun 3, 2017 at 12:19 PM, Howard Leadmon <how...@leadmon.net> wrote:

>Thanks for the update, I had the feeling the issue was from it being to
> old.   I have a question, not sure if you know, but I will toss it out.
>  As I mentioned I update using svn for both src and ports, and I am curious
> to know if I can actually bring my src tree up to the most current 10.x
> stable, recompile, and install and have it all run?
>
>  In the past with much older versions, I know file system changes and such
> make it pretty hard to jump major revisions,

so have a little bit of fear about jumping from 9.x to 10.x, and possibly
> even to 11.x if that is now stable.   I am using ZFS, so I guess that would
> be one thing that is outside the norm, but should be part of the base
> kernels now anyway.
>
>  Any input on upgrading would be most appreciated...


I don't know what you know I guess, but it should work following these
instructions:

https://www.freebsd.org/doc/en_US.ISO8859-1/books/handbook/makeworld.html

or these:

https://www.freebsd.org/doc/handbook/updating-upgrading.html

or these:

https://www.freebsd.org/releases/11.0R/installation.html

It is probably wise to make a backup and do a test first.

-- 
Adam
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: Boot hang on Xen after r318347/(310418)

2017-05-25 Thread Adam McDougall
On Thu, May 25, 2017 at 10:41:03AM +0100, Roger Pau Monné wrote:

> On Wed, May 24, 2017 at 06:33:07PM -0400, Adam McDougall wrote:
> > Hello,
> > 
> > Recently I made a new build of 11-STABLE but encountered a boot hang
> > at this state:
> > http://www.egr.msu.edu/~mcdouga9/pics/r318347-smp-hang.png
> > 
> > It is easy to reproduce, I can just boot from any 11 or 12 ISO that 
> > contains the commit.
> 
> I have just tested latest HEAD (r318861) and stable/11 (r318854) and
> they both work fine on my environment (a VM with 4 vCPUs and 2GB of
> RAM on OSS Xen 4.9). I'm also adding Colin in case he has some input,
> he has been doing some tests on HEAD and AFAIK he hasn't seen any
> issues.
> 
> > I compiled various svn revisions to confirm that r318347 caused the 
> > issue and r318346 is fine. With r318347 or later including the latest 
> > 11-STABLE, the system will only boot with one virtual CPU in XenServer. 
> > Any more cpus and it hangs. I also tried a 12 kernel from head this 
> > afternoon and I have the same hang. I had this issue on XenServer 7 
> > (Xen 4.7) and XenServer 6.5 (Xen 4.4). I did most of my testing on 7. I 
> > also did much of my testing with a GENERIC kernel to try to rule out 
> > kernel configuration mistakes. When it hangs, the performance 
> > monitoring in Xen tells me at least one CPU is pegged. r318674 boots 
> > fine on physical hardware without Xen involved.
> > 
> > Looking at r318347 which mentions EARLY_AP_STARTUP and later seeing 
> > r318763 which enables EARLY_AP_STARTUP in GENERIC, I tried adding it to 
> > my kernel but it turned the hang into a panic but with any number of 
> > CPUs: 
> > http://www.egr.msu.edu/~mcdouga9/pics/r318347-early-ap-startup-panic.png
> 
> I guess this is on stable/11 right? The panic looks easier to debug
> that the hang, so let's start by this one. Can you enable the serial
> console and kernel debug options in order to get a trace? With just
> this it's almost impossible to know what went wrong.

Yes this was on stable/11 amd64.

> If you still have that kernel around (and it's debug symbols), can you
> do:
> 
> $ addr2line -e /usr/lib/debug/boot/kernel/kernel.debug 0x80793344
> 
> (The address is the instruction pointer on the crash image, I think I
> got it right)

I'll reproduce this soon and get the results from that command.

> In order to compile a stable/11 kernel with full debugging support you
> will have to add:
> 
> # For full debugger support use (turn off in stable branch):
> options   BUF_TRACKING# Track buffer history
> options   DDB # Support DDB.
> options   FULL_BUF_TRACKING   # Track more buffer history
> options   GDB # Support remote GDB.
> options   DEADLKRES   # Enable the deadlock resolver
> options   INVARIANTS  # Enable calls of extra sanity checking
> options   INVARIANT_SUPPORT   # Extra sanity checks of internal 
> structures, required by INVARIANTS
> options   WITNESS # Enable checks to detect deadlocks and 
> cycles
> options   WITNESS_SKIPSPIN# Don't run witness on spinlocks for 
> speed
> options   MALLOC_DEBUG_MAXZONES=8 # Separate malloc(9) zones
> 
> To your kernel config file.

I'll work on that soon too when I get a chance, thanks.

> 
> Just to be sure, this is an amd64 kernel right?

yes

> 
> Roger.
> ___
> freebsd-stable@freebsd.org mailing list
> https://lists.freebsd.org/mailman/listinfo/freebsd-stable
> To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"
>  
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Boot hang on Xen after r318347/(310418)

2017-05-24 Thread Adam McDougall
Hello,

Recently I made a new build of 11-STABLE but encountered a boot hang
at this state:
http://www.egr.msu.edu/~mcdouga9/pics/r318347-smp-hang.png

It is easy to reproduce, I can just boot from any 11 or 12 ISO that 
contains the commit.

I compiled various svn revisions to confirm that r318347 caused the 
issue and r318346 is fine. With r318347 or later including the latest 
11-STABLE, the system will only boot with one virtual CPU in XenServer. 
Any more cpus and it hangs. I also tried a 12 kernel from head this 
afternoon and I have the same hang. I had this issue on XenServer 7 
(Xen 4.7) and XenServer 6.5 (Xen 4.4). I did most of my testing on 7. I 
also did much of my testing with a GENERIC kernel to try to rule out 
kernel configuration mistakes. When it hangs, the performance 
monitoring in Xen tells me at least one CPU is pegged. r318674 boots 
fine on physical hardware without Xen involved.

Looking at r318347 which mentions EARLY_AP_STARTUP and later seeing 
r318763 which enables EARLY_AP_STARTUP in GENERIC, I tried adding it to 
my kernel but it turned the hang into a panic but with any number of 
CPUs: 
http://www.egr.msu.edu/~mcdouga9/pics/r318347-early-ap-startup-panic.png

I think I verified that this happens with EARLY_AP_STARTUP before
r318347 too so I'll assume it is a different problem.

I may need to do some experimentation to figure out how to get the 
console to pass through hotkeys to drop into a kernel debugger. I could 
also try modifying the kernel config if I can make it print information 
about the hang.

Is there anything else I can provide that might help? Would you prefer 
this be entered in a bugzilla report? Thanks.

___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: [ZFS] files in a weird situtation

2016-12-18 Thread Adam Vande More
On Sun, Dec 18, 2016 at 2:30 AM, David Marec <david.ma...@davenulle.org>
wrote:

> It fails on «No such file or directory».
>

I can't even replicate this portion of things.  Running it under truss
might provide more insight into what is happening.


>
> This file missing, not much works.
> I have installed a copy into `/usr/lib` to make the system run again.
>
> So, I can't install a new world: `install -C` fails in the same way.


I think you have larger issues than a single corrupt file.



-- 
Adam
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Re: [ZFS] files in a weird situtation

2016-12-17 Thread Adam Vande More
On Sat, Dec 17, 2016 at 3:01 PM, David Marec <david.ma...@davenulle.org>
wrote:

> [I had first posted onto the Forum about this issue]
>
> Two months ago,
>
> - next to a call to |`||delete-old-libs`| or `install world`, I don't
> really know -
>
> my box that is following FreeBSD-11 Stable ran into a weird situation.
>
> A set of files, especially `/lib/libjail.so.1` are in both states
> `existing` and `not existing`:
>
> I means:
>
> david:~>cp ~david/libjail.so.1 /lib
> cp: /lib/libjail.so.1: File exists
>
> But:
>
> david:~>ls /lib/libjail.so.1
> ls: /lib/libjail.so.1: No such file or directory
> david:~>find /lib -name "libjail.so.1" -print
> /lib/libjail.so.1
> david:~>find /lib -name "libjail.so.1" -ls
> find: /lib/libjail.so.1: No such file or directory
>
> With deeper investigation, the file is in fact mapped to an `inode`:
>
> root@dmarec:~ # ls -di /lib
> 13 /lib
> root@dmarec:~ # zdb - zroot/ 13 | grep libjail.so.1
> libjail.so.1 = 10552574 (type: Regular File)
>
> Which fails with `zdb` on:
>
> root@dmarec:~ # zdb - zroot/ 10552574
> Dataset zroot [ZPL], ID 21, cr_txg 1, 114G, 2570002 objects, rootbp
> DVA[0]=<0:b97d6ea00:200> DVA[1]=<0:1c212b0400:200> [L0 DMU objset]
> fletcher4 lz4 LE contiguous unique double size=800L/200P
> birth=3852240L/3852240P fill=2570002
> cksum=17b78fb7e4:7c87a526a07:16251edfaae60:2ce0c5734ccf2f
>
> Object  lvl   iblk   dblk  dsize  lsize   %full type
> zdb: dmu_bonus_hold(10552574) failed, errno 2
>
>
> `stat (2)` returns ENOENT when checking for the file:
>  david:~>truss stat -L /lib/libjail.so.1
> ...
> stat("/lib/libjail.so.1",0x7fffe7e8) ERR#2 'No such
> file or
> directory'david:~>truss stat -L /lib/libjail.so.1
>
> A pass with `zfs scrub` didn't help.
>
> Any clue is welcome. What's that `dmu_bonus_hold` stands for ?
>

I am unable to understand what your intent is here.  If you wish to delete
it, you can do:

find . -inum 10552574 -exec rm {} \;

-- 
Adam
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: Hangs with mrsas?

2016-03-07 Thread Adam McDougall
On 03/07/2016 14:09, Garrett Wollman wrote:
> I have a new Dell server with a typical Dell hardware RAID.  pciconf
> identifies it as "MegaRAID SAS-3 3008 [Fury]"; mfiutil reports:
> 
> mfi0 Adapter:
> Product Name: PERC H330 Adapter
>Serial Number: 5AT00PI
> Firmware: 25.3.0.0016
>  RAID Levels:
>   Battery Backup: not present
>NVRAM: 32K
>   Onboard Memory: 0M
>   Minimum Stripe: 64K
>   Maximum Stripe: 64K
> 
> Since I'm running ZFS I have the RAID functions disabled and the
> drives are presented as "system physical drives" ("mfisyspd[0-3]" when
> using mfi(4)).  I wanted to use mrsas(4) instead, so that I could have
> direct access to the drives' SMART functions, and this seemed to work
> after I set the hw.mfi.mrsas_enable tunable, with one major exception:
> all drive access would hang after about 12 hours and the machine would
> require a hard reset to come back up.
> 
> Has anyone seen this before?  The driver in head doesn't appear to be
> any newer.
> 
> -GAWollman

I did some similar testing in late Jan but perhaps not long enough to
notice your symptoms.  I'm pretty certain I used mrsas_enable since that
is what I would plan to use in production.  I had a H330-mini with the
same firmware rev in a R430.  I was testing with some 2.5" Seagate
ST9600205SS 600gb disks from another system.  What kind of disks were
you using and in what kind of configuration?  Does a simpler config stay
up?  If you are using SSD, I wonder if disks would survive?  SSD
firmware issue?  Was it hard hung at the console too?  Can you enter DDB?

If you don't mind, which Dell model is this?

Sorry I don't have any directly helpful suggestions but you have good
timing because this could very well influence hardware choices.  Thanks.
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: dev/random warning on 10-STABLE after r292122 up till r292855

2016-01-05 Thread Adam Vande More
On Mon, Jan 4, 2016 at 3:44 PM, Mark Saad <nones...@longcount.org> wrote:

> All
>  At NYC*BUG we are looking into a warning seen on FreeBSD 10-STABLE amd64
> starting at or about r292122  and still up till r292855.
>
> On boot dmesg logs the following warning not seen on 10.2-RELEASE amd64.
>
> random device not loaded; using insecure entropy
>
> The full dmesg can be seen here
> http://dmesgd.nycbug.org/index.cgi?action=dmesgd=view=2871
>
> I checked in svn and there are no recent changes to sys/dev/random .
>
> Does anyone have any insight into this ?
>

It's more of an informational message about seeding the random number
generator.  Probably man 4 random is the best explanation.



-- 
Adam
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: high disk %busy, while almost nothing happens

2015-11-26 Thread Adam Vande More
On Thu, Nov 26, 2015 at 10:46 PM, Eugene M. Zheganin <e...@norma.perm.ru>
wrote:

> Hi.
>
> On 26.11.2015 14:19, Eugene M. Zheganin wrote:
> > Hi.
> >
> > I'm using FreeBSD 10.1-STABLE as an application server, last week I've
> > noticed that disks are always busy while gstat shows that the activity
> > measured in iops/reads/writes is low, form my point of view:
> >
> >
> >   L(q)  ops/sr/s   kBps   ms/rw/s   kBps   ms/w   %busy Name
> >  8 56 50 520  160.6  6286  157.4  100.2
> gpt/zfsroot0
> >  8 56 51   1474  162.8  5228  174.4   99.9
> gpt/zfsroot1
> >
> > These %busy numbers arent't changing much, and from my point of view
> > both disks do very little.
> >
> The thing is, it was the compression. As soon as I cleared the gzip
> compression from busy datasets, %busy went down, almost to zero.
> Affected datasets were filled with poorly compressionable files, mostly
> archives or zlib-compressed data.
>

Data which isn't very compressible isn't a very great on a transparently
compressed filesystem.  Gzip is particularly bad at this.  LZ4 may have had
only a slight impact.  Setting gzip-1 would have also been less overhead
than the default gzip which I believe is gzip-6.


> And this is kind of counter-intuitive: one could think that worse-case
> scenario would be redundant CPU load, with constand disk i/o. In
> practice, otherwise, high disk %busy happens.
>

Well that's basically what you had.  And %busy is not really meaningful.
L(q) and ops are the ones to keep an eye on.

-- 
Adam
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: ZFS - poor performance with "large" directories

2015-11-24 Thread Adam Vande More
On Tue, Nov 24, 2015 at 8:00 AM, Albert Cervin <alb...@acervin.com> wrote:

> Hi all,
>
> Please feel free to direct me to a list that is more suitable.
>
> We are trying to set up a fileserver solution for a web application that we
> are building. This fileserver is running FreeBSD 10.2 and ZFS. Files are
> written over CIFS with Samba running on the fileserver host.
>
> However, we are seeing en exponential decrease in performance to write to
> the file server when the number of files in the directory grows (when it
> goes up to ~6000 files it becomes unusable and the write time has gone from
> a fraction of a second to ten seconds).
>
> We ran the same setup on a Linux machine with an ext4 file system which did
> NOT suffer from this performance degradation.
>

I should hope not.  ext4 vs zfs comparison isn't fair for either.


>
> Are these "holes" in write speed normal. Since this is the exact symptom we
> are getting when the network writes start to be slow.
>

Totally normal.  You'll want to reference:

https://wiki.freebsd.org/ZFSTuningGuide

In particular for that issue see:

vfs.zfs.txg.timeout
and tuning related to NFS.

Performance is also heavily dependent on pool structure and io
characteristics.  For example, a pool of 3 2 disk mirrors is in general
going to be much faster than 1 6 disk raidz2.


-- 
Adam
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


sshpass

2015-11-23 Thread Adam Vande More
sshpass seems to be broken for at least FreeBSD 10.X targets and here is
some debugging output:

 6347: 0.306948550 write(2,"debug2: input_userauth_info_req:"...,48) = 48
(0x30)
 6347: 0.307046901
openat(0xff9c,0x8008a4fe6,0x2,0x0,0x7ff7fc75ecfc,0x803856010) ERR#6
'Device not configured'
debug1: read_passphrase: can't open /dev/tty: Device not configured
 6347: 0.307174637 write(2,"debug1: read_passphrase: can't o"...,69) = 69
(0x45)
 6347: 0.307292478 open("/dev/tty",O_RDWR|O_CLOEXEC,0163727160) ERR#6
'Device not configured'
debug3: packet_send2: adding 48 (len 10 padlen 6 extra_pad 64)
 6347: 0.307424012 write(2,"debug3: packet_send2: adding 48 "...,64) = 64
(0x40)

It seems various tools like ansible and salt are running into this as well:

https://github.com/saltstack/salt/issues/20565
https://github.com/ansible/ansible/issues/11565
http://sourceforge.net/p/sshpass/bugs/12/ <- Misidentified cause

Can anyone offer insight into this?

Thanks,

-- 
Adam
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: [CTF] pkg 1.6.0

2015-09-22 Thread Adam McDougall
On 09/22/2015 03:20, Ranjan1018 . wrote:
> 2015-09-21 23:27 GMT+02:00 Baptiste Daroussin :
> 
>> Hi all,
>>
>> We are about to release pkg 1.6.0. pkg-devel has been updated to 1.5.99.13
>> aka
>> 1.6.0 rc3 that we hope will become the new pkg 1.6.0 btw the end of the
>> Week
>> (release planned for Saturday 26th of September or no important issues are
>> raised)

> running version 1.5.3 I have this error message:
> # pkg upgrade
> Updating FreeBSD repository catalogue...
> FreeBSD repository is up-to-date.
> All repositories are up-to-date.
> Checking for upgrades (40 candidates): 100%
> Processing candidates (40 candidates): 100%
> Checking integrity... done (1 conflicting)
> pkg: Cannot solve problem using SAT solver:
> dependency rule: package Thunar(l) depends on:
> xfce4-tumbler(r)xfce4-tumbler(l)
> upgrade rule: upgrade local xfce4-tumbler-0.1.31_1 to remote
> xfce4-tumbler-0.1.31_1
> cannot install package xfce4-tumbler, remove it from request? [Y/n]:
> pkg: cannot find xfce4-tumbler in the request
> pkg: cannot solve job using SAT solver
> Checking integrity... done (0 conflicting)
> Your packages are up to date.
> 
> With this version I have been able to update the packages.

Same here, I ran into a conflict with Thunar on at least two computers
with 1.5.6:

pkg: Cannot solve problem using SAT solver:
dependency rule: package Thunar(l) depends on:
xfce4-tumbler(r)xfce4-tumbler(l)
upgrade rule: upgrade local xfce4-tumbler-0.1.31_1 to remote
xfce4-tumbler-0.1.31_1
cannot install package xfce4-tumbler, remove it from request? [Y/n]: ^C

I upgraded to 1.5.99.13 without any problems and it handles Thunar fine
without any workarounds.  Just some extra warnings the first time when
upgrading to 1.5.99 from my own repo:

# pkg upgrade
Updating pkg-desktop repository catalogue...
Fetching meta.txz: 100%260 B   0.3kB/s00:01
Fetching packagesite.txz: 100%  217 KiB 222.0kB/s00:01
Processing entries:   0%
pkg: Skipping unknown key 'messages'
Processing entries:   2%
pkg: Skipping unknown key 'messages'
Processing entries:   4%
pkg: Skipping unknown key 'messages'
Processing entries:   5%
pkg: Skipping unknown key 'messages'
Processing entries:   6%
pkg: Skipping unknown key 'messages'
pkg: Skipping unknown key 'messages'
...
Processing entries: 100%
pkg-desktop repository update completed. 914 packages processed.
New version of pkg detected; it needs to be installed first.
pkg: Skipping unknown key 'messages'
Checking integrity... done (0 conflicting)
The following 1 package(s) will be affected (of 0 checked):

Installed packages to be UPGRADED:
pkg: 1.5.6 -> 1.5.99.13

___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: Routing stops working when we create a new vlan

2015-08-07 Thread Adam Vande More
On Fri, Aug 7, 2015 at 9:54 AM, Marcelo Gondim gon...@bsdinfo.com.br
wrote:


 But I still think it does wrong. Why to create a vlan I need to have this
 parameter configured in rc.conf? Or why it needs to change
 net.inet.ip.forwarding?


Because that is what the directions for configuring a default gateway
state, and the directions are based off of code behavior.

 /etc/rc.d/routing:

if checkyesno gateway_enable; then
ropts_init inet
echo -n ' gateway=YES'
${SYSCTL} net.inet.ip.forwarding=1  /dev/null
else
${SYSCTL} net.inet.ip.forwarding=0  /dev/null
fi


-- 
Adam
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: 4TB Western Digital My Book 1230 USB hard drive not working on 10.2

2015-08-02 Thread Adam McDougall
On 08/02/2015 21:22, Paul Mather wrote:
 I have a 4TB external USB drive (Western Digital My Book 1230) that I am 
 trying to use under FreeBSD/amd64 10.2 (10.2-PRERELEASE #0 r286052: Wed Jul 
 29 20:59:39 EDT 2015).  This system has a MSI 760GMA-P34 (FX) motherboard.
 
 The drive probes unreliably when plugged in to a USB 3 port.  It reliably 
 probes when plugged into a USB 2 port.  However, it works in neither cases.  
 Attempting to dd from the drive results in a dd: /dev/da0: Invalid argument.
 
 When plugged in to a USB 2 port, this is how the drive is probed:
 
 ugen6.2: Western Digital at usbus6
 umass0: Western Digital My Book 1230, class 0/0, rev 2.10/10.65, addr 2 on 
 usbus6
 umass0:  SCSI over Bulk-Only; quirks = 0xc001
 umass0:9:0:-1: Attached to scbus9
 da0 at umass-sim0 bus 0 scbus9 target 0 lun 0
 da0: WD My Book 1230 1065 Fixed Direct Access SPC-4 SCSI device
 da0: Serial Number 57434334453056594A4C4A4A
 da0: 40.000MB/s transfers
 da0: 3815415MB (976746240 4096 byte sectors: 255H 63S/T 60799C)
 da0: quirks=0x2NO_6_BYTE
 ses0 at umass-sim0 bus 0 scbus9 target 0 lun 1
 ses0: WD SES Device 1065 Fixed Enclosure Services SPC-4 SCSI device
 ses0: Serial Number 57434334453056594A4C4A4A
 ses0: 40.000MB/s transfers
 ses0: SCSI-3 ENC Device
 
 When booting with it connected to a USB 3 port, this is what is output:
 
 xhci0: XHCI (generic) USB 3.0 controller mem 0xfeafe000-0xfeaf irq 18 
 at device 0.0 on pci3
 xhci0: 64 bytes context size, 64-bit DMA
 usbus0 on xhci0
 [[...]]
 ohci0: AMD SB7x0/SB8x0/SB9x0 USB controller mem 0xfe7fe000-0xfe7fefff irq 
 16 at device 18.0 on pci0
 usbus1 on ohci0
 ohci1: AMD SB7x0/SB8x0/SB9x0 USB controller mem 0xfe7fd000-0xfe7fdfff irq 
 16 at device 18.1 on pci0
 usbus2 on ohci1
 ehci0: AMD SB7x0/SB8x0/SB9x0 USB 2.0 controller mem 0xfe7ff800-0xfe7ff8ff 
 irq 17 at device 18.2 on pci0
 usbus3: EHCI version 1.0
 usbus3 on ehci0
 ohci2: AMD SB7x0/SB8x0/SB9x0 USB controller mem 0xfe7fc000-0xfe7fcfff irq 
 18 at device 19.0 on pci0
 usbus4 on ohci2
 ohci3: AMD SB7x0/SB8x0/SB9x0 USB controller mem 0xfe7f7000-0xfe7f7fff irq 
 18 at device 19.1 on pci0
 usbus5 on ohci3
 ehci1: AMD SB7x0/SB8x0/SB9x0 USB 2.0 controller mem 0xfe7ff400-0xfe7ff4ff 
 irq 19 at device 19.2 on pci0
 usbus6: EHCI version 1.0
 usbus6 on ehci1
 [[...]]
 ohci4: AMD SB7x0/SB8x0/SB9x0 USB controller mem 0xfe7f6000-0xfe7f6fff irq 
 18 at device 20.5 on pci0
 usbus7 on ohci4
 [[...]]
 usbus0: 5.0Gbps Super Speed USB v3.0
 usbus1: 12Mbps Full Speed USB v1.0
 usbus2: 12Mbps Full Speed USB v1.0
 usbus3: 480Mbps High Speed USB v2.0
 usbus4: 12Mbps Full Speed USB v1.0
 usbus5: 12Mbps Full Speed USB v1.0
 usbus6: 480Mbps High Speed USB v2.0
 usbus7: 12Mbps Full Speed USB v1.0
 ugen7.1: ATI at usbus7
 uhub0: ATI OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1 on usbus7
 ugen6.1: ATI at usbus6
 uhub1: ATI EHCI root HUB, class 9/0, rev 2.00/1.00, addr 1 on usbus6
 ugen5.1: ATI at usbus5
 uhub2: ATI OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1 on usbus5
 ugen4.1: ATI at usbus4
 uhub3: ATI OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1 on usbus4
 ugen3.1: ATI at usbus3
 uhub4: ATI EHCI root HUB, class 9/0, rev 2.00/1.00, addr 1 on usbus3
 ugen2.1: ATI at usbus2
 uhub5: ATI OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1 on usbus2
 ugen1.1: ATI at usbus1
 uhub6: ATI OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1 on usbus1
 ugen0.1: 0x1912 at usbus0
 uhub7: 0x1912 XHCI root HUB, class 9/0, rev 3.00/1.00, addr 1 on usbus0
 uhub0: 2 ports with 2 removable, self powered
 uhub2: 3 ports with 3 removable, self powered
 uhub3: 3 ports with 3 removable, self powered
 uhub5: 3 ports with 3 removable, self powered
 uhub6: 3 ports with 3 removable, self powered
 uhub7: 8 ports with 8 removable, self powered
 [[...]]
 Root mount waiting for: usbus6 usbus3 usbus0
 Root mount waiting for: usbus6 usbus3 usbus0
 uhub1: 6 ports with 6 removable, self powered
 uhub4: 6 ports with 6 removable, self powered
 ugen0.2: vendor 0x1058 at usbus0
 umass0: vendor 0x1058 product 0x1230, class 0/0, rev 3.00/10.65, addr 1 on 
 usbus0
 umass0:  SCSI over Bulk-Only; quirks = 0x8000
 Root mount waiting for: usbus0
 [[...]]
 Root mount waiting for: usbus0
 Root mount waiting for: usbus0
 umass0: Get Max Lun not supported (USB_ERR_TIMEOUT)
 umass0:9:0:-1: Attached to scbus9
 [[...]]
 da0 at umass-sim0 bus 0 scbus9 target 0 lun 0
 da0:Fixed Direct Access SCSI device
 da0: Serial Number WCC4E0VYJLJJ
 da0: 400.000MB/s transfers
 da0: 3815415MB (976746240 4096 byte sectors: 255H 63S/T 60799C)
 da0: quirks=0x2NO_6_BYTE
 
 
 This external USB drive works fine under OSX Yosemite and also when plugged 
 in to my Raspberry Pi 2 running OSMC.
 
 Is there anyone using this model of USB drive under FreeBSD/amd64 10.2?  Is 
 it a matter of finding the correct quirk to get it working?
 
 Cheers,
 
 Paul.

The trouble detecting is probably related to
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=196332

I have trouble with my 2T WD My passport but I 

Re: Getting going with a new Dell 7810

2015-06-17 Thread Adam McDougall
On 06/16/2015 12:55, Richard Kuhns wrote:
 Greetings all,
 
 I've just received a new Dell Precision 7810. I've installed FreeBSD
 10.1 (UEFI boot), checked out sources, built world  kernel and am now
 running r284449. So far, so good.
 
 The problem is Xorg. I'm running the latest Xorg in ports; I just did a
 'make install clean' in /usr/ports/x11/xorg with no errors.
 
 The display card is a FirePro W4100. lspci shows:
 
 03:00.0 VGA compatible controller: Advanced Micro Devices, Inc.
 [AMD/ATI] Cape Verde GL [FirePro W4100]
 

If it is brand new, it is probably not supported and probably won't be
for a while.  Please see https://wiki.freebsd.org/Graphics for a list of
devices which does include your Radeon HD 4670.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: ports/base ntpd rc.d script with WITHOUT_NTP=yes

2015-04-08 Thread Adam McDougall
On 04/08/2015 12:48, Matt Smith wrote:
 Hi,
 
 I just upgraded my server to 10.1-STABLE r281264 and when I ran
 mergemaster it told me that /etc/rc.d/ntpd was stale and would I like to
 delete it. It's never done this before. I've figured out it's because I
 have WITHOUT_NTP=yes in /etc/src.conf. I did this because I use the
 ports version of ntpd and thus wanted to remove the base installed
 version so that when I run commands like ntpq it's using my possibly
 newer port installed version and not the older one.
 
 However, the port version doesn't have its own rc script. It usually
 uses the base version with ntpd_program and ntpd_config set.  With this
 latest change it means I have to have the base version installed again. 
 Is it possible to get the port version to have its own rc script?
 

net/openntpd has an rc script if you don't mind switching.  It is very
very simple to configure.

Ideally the original problem should be solved too but I ran into the
same problem with Kerberos.  I didn't get anywhere in the bug report
where I argued the system scripts still worked fine except for recent
changes in them causing a regression and failure with the port.  Both
situations could probably use a contributed patch to make an rc script.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: SSH hung with an OpenSSH_6.6.1p1 -- OpenSSH_5.8p2_hpn13v11

2015-03-26 Thread Adam McDougall
On 03/26/2015 21:25, Wu ShuKun wrote:
 Okay
 % ssh -v -o KexAlgorithms diffie-hellman-group-exchange-sha1 10.41.172.19
 OpenSSH_6.6.1p1, OpenSSL 1.0.1l-freebsd 15 Jan 2015
 debug1: Reading configuration data /etc/ssh/ssh_config
 debug1: Connecting to 10.41.172.19 [10.41.172.19] port 22.
 debug1: Connection established.
 debug1: identity file /home/wsk/.ssh/id_rsa type -1
 debug1: identity file /home/wsk/.ssh/id_rsa-cert type -1
 debug1: identity file /home/wsk/.ssh/id_dsa type -1
 debug1: identity file /home/wsk/.ssh/id_dsa-cert type -1
 debug1: identity file /home/wsk/.ssh/id_ecdsa type -1
 debug1: identity file /home/wsk/.ssh/id_ecdsa-cert type -1
 debug1: identity file /home/wsk/.ssh/id_ed25519 type -1
 debug1: identity file /home/wsk/.ssh/id_ed25519-cert type -1
 debug1: Enabling compatibility mode for protocol 2.0
 debug1: Local version string SSH-2.0-OpenSSH_6.6.1_hpn13v11 FreeBSD-20140420
 debug1: Remote protocol version 2.0, remote software version
 OpenSSH_5.8p2_hpn13v11 FreeBSD-20110503
 debug1: match: OpenSSH_5.8p2_hpn13v11 FreeBSD-20110503 pat OpenSSH_5*
 compat 0x0c00
 debug1: SSH2_MSG_KEXINIT sent
 debug1: SSH2_MSG_KEXINIT received
 debug1: kex: server-client aes128-ctr hmac-md5 none
 debug1: kex: client-server aes128-ctr hmac-md5 none
 debug1: SSH2_MSG_KEX_DH_GEX_REQUEST(102430728192) sent
 debug1: expecting SSH2_MSG_KEX_DH_GEX_GROUP
 Connection closed by 10.41.172.19
 %

Can you try stopping sshd on the server side and run /usr/sbin/sshd -Dd
then SSH in and see if the server provides a reason for disconnecting
the client?
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: There has to be a better way of merging /etc during a major freebsd-update

2015-03-10 Thread Adam Vande More
On Tue, Mar 10, 2015 at 12:45 PM, Peter Olsson 
list-freebsd-sta...@jyborn.se wrote:

 On Tue, Mar 10, 2015 at 10:17:18AM -0500, Adam Vande More wrote:
  On Tue, Mar 10, 2015 at 7:05 AM, Peter Olsson 
 list-freebsd-sta...@jyborn.se
   wrote:
 
   This flag to mergemaster saved a lot of work when I did
   upgrades the old way, with cvsup and the make steps and
   then mergemaster:
  
 
  https://svnweb.freebsd.org/base?view=revisionrevision=221780
 
  I'd venture to guess the script will work fine on older installs, but
  testing should be done first.
 
  --
  Adam

 This seems like an excellent addition fo freebsd-update,
 how come it isn't added?


It is added, no flag needed.  You are running a version of freebsd-update
which didn't have the feature(8.4).

-- 
Adam
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: There has to be a better way of merging /etc during a major freebsd-update

2015-03-10 Thread Adam Vande More
On Tue, Mar 10, 2015 at 7:05 AM, Peter Olsson list-freebsd-sta...@jyborn.se
 wrote:

 This flag to mergemaster saved a lot of work when I did
 upgrades the old way, with cvsup and the make steps and
 then mergemaster:


https://svnweb.freebsd.org/base?view=revisionrevision=221780

I'd venture to guess the script will work fine on older installs, but
testing should be done first.

-- 
Adam
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: There has to be a better way of merging /etc during a major freebsd-update

2015-03-10 Thread Adam Vande More
On Tue, Mar 10, 2015 at 2:01 PM, Chris H bsd-li...@bsdforge.com wrote:

  
 
  https://svnweb.freebsd.org/base?view=revisionrevision=221780
 
  I'd venture to guess the script will work fine on older installs, but
  testing should be done first.
 Isn't that pretty much what the -F flag, I mentioned does?
 -FIf the files differ only by VCS Id ($FreeBSD) install the
new file.


Op asked for a freebsd-update solution which excludes any mergemaster
solution short of at least a partial rewrite.

-- 
Adam
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Is there a linux_base available for RELENG_9?

2015-03-09 Thread Adam McDougall
On 03/09/2015 20:44, Chris H wrote:
 I performed av svn update for both src (r279796),
 and ports (r380829) last night. building/installing
 world/kernel, went as one would hope. Upgrading ports
 was a different story. Given this box has an nVidia card.
 I usually start by upgrading emulators/linux_base; which
 according to UPDATING; meant linux_base-f10 -- linux_base-c6.
 I deinstalled x11/nvidia-driver, followed by
 emulators/linux_base-f10. I then attempted to make install
 emulators/linux_base-c6, which resulted in a message
 that it wasn't supported.

What was the exact error?

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: gmirror crash writing to disk? Or is it su+j crash?

2013-08-30 Thread Adam Vande More
On Thu, Aug 29, 2013 at 4:35 PM, Zaphod Beeblebrox zbee...@gmail.comwrote:

 So I have a system running:

 FreeBSD walk.dclg.ca 9.2-RC3 FreeBSD 9.2-RC3 # r254952: Wed Aug 28
 03:02:55
 EDT 2013 r...@walk.dclg.ca:/usr/obj/usr/src/sys/STRIKE  i386

 and it has two 2T SATA disks.  To keep this post short, the crash.txt is
 here.


 https://uk.eicat.ca/owncloud/public.php?service=filest=fea9d25579fe0c4afb808859e80e1493

 now curiously, while running a make -j4 buildkernel ... almost every time
 ... it crashes with:

 g_vfs_done():mirror/walke[WRITE(offset=516764794880, length=65536)]error =
 11
 /usr: got error 11 while accessing filesystem
 panic: softdep_deallocate_dependencies: unrecovered I/O error

 ... no error report from the hard drives, simply an error report from the
 mirror.

 The filesystem is ufs with su+j... but I'm not sure this matters here.


Run fsck.


-- 
Adam Vande More
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: ZFS pool with 4k sector size

2013-08-22 Thread Adam McDougall

On 08/22/13 04:23, Trond Endrestøl wrote:

On Thu, 22 Aug 2013 11:40+0400, Michael BlackHeart wrote:


Hello,
I'd like to know what is the best way to convert my pool from 512b sector
size to 4k sector size.

Hardware: 2 x2Tb WD Green with 4k physical sector size

Model Family: Western Digital Caviar Green (AF, SATA 6Gb/s)
Device Model: WDC WD20EARX-00PASB0
Serial Number: WD-WCAZA8280575
LU WWN Device Id: 5 0014ee 206032063
Firmware Version: 51.0AB51
User Capacity: 2 000 398 934 016 bytes [2,00 TB]
Sector Sizes: 512 bytes logical, 4096 bytes physical
Device is: In smartctl database [for details use: -P show]
ATA Version is: ATA8-ACS (minor revision not indicated)
SATA Version is: SATA 3.0, 3.0 Gb/s (current: 3.0 Gb/s)
Local Time is: Thu Aug 22 11:33:16 2013 MSK
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

They're running in a mirror

pool: storage
state: ONLINE
scan: resilvered 48K in 0h0m with 0 errors on Thu Jul 25 19:18:01 2013
config:

NAME STATE READ WRITE CKSUM
storage ONLINE 0 0 0
mirror-0 ONLINE 0 0 0
ada3 ONLINE 0 0 0
ada7 ONLINE 0 0 0

zdb info

storage:
version: 5000
name: 'storage'
state: 0
txg: 1292269
pool_guid: 18442220950447532371
hostid: 708219113
hostname: 'diablo.miekoff.local'
vdev_children: 1
vdev_tree:
type: 'root'
id: 0
guid: 18442220950447532371
create_txg: 4
children[0]:
type: 'mirror'
id: 0
guid: 4289294206539029185
metaslab_array: 33
metaslab_shift: 34
ashift: 9
asize: 2000394125312
is_log: 0
create_txg: 4
children[0]:
type: 'disk'
id: 0
guid: 16348588566764560218
path: '/dev/ada3'
phys_path: '/dev/ada3'
whole_disk: 1
DTL: 95
create_txg: 4
children[1]:
type: 'disk'
id: 1
guid: 7655198429866445090
path: '/dev/ada7'
phys_path: '/dev/ada7'
whole_disk: 1
DTL: 97
create_txg: 4
features_for_read:

As you see ashift is 9 (512b).

I know a common solution with gnop and export-mport pool, but how should I
manage mirror this way? Should I create a mirror on gnop-ed devices and
then export-import?

I'm afraid you're out of luck. You need to backup the data
somehow, recreate the pool with ashift=12, and restore the data.

A better option would be to buy a couple of new drives, assuming you
can connect them to the current system, create a new mirrored pool
with ashift=12, and transfer the data using a recursive set of
snapshots on the current pool and a ZFS send stream sent to the new
pool.


You can zpool detach storage ada7, gnop create -S 4k ada7, zpool create 
storage2 ada7.nop, then copy all of your data into storage2 manually.  
When done, destroy the original zpool then zpool attach storage2 
ada7.nop ada3 which will resilver ada7 onto ada3 to complete the new 
mirror. Then I'd zpool export storage2, destroy the nop or just reboot, 
and re-import storage2 as storage if you wish to rename it.


The risk is losing all of your data if there is a problem while you only 
have one valid copy.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Booting FreeBSD with Syslinux

2013-08-01 Thread Adam Vande More
On Wed, Jul 31, 2013 at 9:47 PM, Daniel O'Connor docon...@gsoft.com.auwrote:


 On 01/08/2013, at 12:15, Warren Block wbl...@wonkity.com wrote:
  Supposedly someone got it to work because there is an entry in the
 syslinux wiki
  http://www.syslinux.org/wiki/index.php/Mboot.c32#FreeBSD_example
 
  I'm following the threads on both lists, and that example looks more
 like a generic template than an actual, working command. kernel_option,
 for example.

 Yeah, I also wonder if it's for booting a XENified FreeBSD or something
 similar (no idea really).

 Maybe I'll just have to stuff the loader in an MFS and boot that :(


galacticdominator% ./mbchk /boot/kernel/kernel
/boot/kernel/kernel: No Multiboot header.

Maybe i386 would work, but apparently not amd64.

-- 
Adam Vande More
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Booting FreeBSD with Syslinux

2013-07-31 Thread Adam Vande More
On Wed, Jul 31, 2013 at 6:43 PM, Daniel O'Connor docon...@gsoft.com.auwrote:


 That was for 7.x though, maybe the kernel has changed a bit.


It doesn't say that at all.  Nor does it say the exact release(even major
branch isn't mentioned) which was confirmed to work, when or if it was
tried, the precise syntax used, or any other type of useful information.

-- 
Adam Vande More
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: experience with 9.2-PRERELEASE

2013-07-18 Thread Adam Vande More
On Thu, Jul 18, 2013 at 3:28 PM, John Reynolds john...@reynoldsnet.orgwrote:

 One person said to fiddle in the BIOS with the USB settings. I went into
 the BIOS and disabled Intel USB 3.0 Mode support (this is a very new
 motherboard with USB 3.0). That fixed the issue! For those curious about
 -current, I also tried to install that and it experienced the same USB
 problems until I disabled this 3.0 mode.


This is worth pursuing on freebsdf-usb@ or by filing a PR.  Only way it
gets fixed is if the right people know about it.

-- 
Adam Vande More
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: experience with 9.2-PRERELEASE

2013-07-17 Thread Adam Vande More
On Wed, Jul 17, 2013 at 7:45 PM, John Reynolds john...@reynoldsnet.orgwrote:

 to the SSD that I used as the primary hard drive. So, then I figured I
 would try a more recent snapshot hoping that something had been spotted and
 fixed already. I got the 9.2-PRERELEASE amd64 snapshot and tried to install
 it. However, I couldn't even get past the first screen of the install
 because of these messages:

 ugen0.2: (Unknown) at usbus0 (disconnected)
 uhub_reattach_port: could not allocate new device

 and the keyboard was non-functional. It just sat there spewing these
 errors about 1 per second.


Try toggling usb compatibility mode in the bios.


-- 
Adam Vande More
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


shutdown -r / shutdown -h / reboot all hang and don't cleanly dismount

2013-06-19 Thread Adam Strohl

Hello -STABLE@,

So I've seen this situation seemingly randomly on a number of both 
physical 9.1 boxes as well as VMs for I would say 6-9 months at least. 
 I finally have a physical box here that reproduces it consistently 
that I can reboot easily (ie; not a production/client server).


No matter what I do:

reboot
shutdown -p
shutdown -r

This specific server will stop at All buffers synced and not actually 
power down or reboot.  KB input seems to be ignored.  This server is a 
ZFS NAS (with GMIRROR for boot blocks) but the other boxes which show 
this are using GMIRRORs for root/swap/boot (no ZFS).


Here is what happens on the console: http://i.imgur.com/1H8JMyB.jpg

When I reset the server it appears that disks were not dismounted 
cleanly ... on this ZFS box it comes back quick because ZFS is good like 
that but on the other servers with GMIRROR roots rebuilding the GMIRROR 
and fscking at the same time is murder on the disk/performance until it 
finishes.


Another interesting thing is that this particular server runs slapd 
(OpenLDAP) which, when it comes back up, has a corrupted DB (easily 
fixed with db_recover, but still).  This might be because FS commits 
aren't happening at the end.   I can even manually stop slapd (service 
slapd stop) then run sync(8) (I assume this does something for ZFS too) 
and it still comes back as hosed if I reboot shortly after.  If I 
start/stop slapd it's fine.  So I feel like there is an FS/dismount 
thing going on here.


Additional information: I also have some boxes which will reboot (ie; 
they don't freeze like some do at the end) but they don't dismount 
cleanly either and have to rebuild both GMIRROR and fsck.  This might be 
a different issue, too.


Anyone have any thoughts?  Let me know if I can provide more details etc.

--
Adam Strohl
http://www.ateamsystems.com/
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: shutdown -r / shutdown -h / reboot all hang and don't cleanly dismount

2013-06-19 Thread Adam Strohl

On 6/19/2013 19:21, Jeremy Chadwick wrote:

On Wed, Jun 19, 2013 at 06:35:57PM +0700, Adam Strohl wrote:

Hello -STABLE@,

So I've seen this situation seemingly randomly on a number of both
physical 9.1 boxes as well as VMs for I would say 6-9 months at
least.  I finally have a physical box here that reproduces it
consistently that I can reboot easily (ie; not a production/client
server).

No matter what I do:

reboot
shutdown -p
shutdown -r

This specific server will stop at All buffers synced and not
actually power down or reboot.  KB input seems to be ignored.  This
server is a ZFS NAS (with GMIRROR for boot blocks) but the other
boxes which show this are using GMIRRORs for root/swap/boot (no
ZFS).

Here is what happens on the console: http://i.imgur.com/1H8JMyB.jpg

When I reset the server it appears that disks were not dismounted
cleanly ... on this ZFS box it comes back quick because ZFS is good
like that but on the other servers with GMIRROR roots rebuilding the
GMIRROR and fscking at the same time is murder on the
disk/performance until it finishes.


1. You mention as well as VMs.  Anything under a virtual machine or
under a hypervisor is going to be very, very, **VERY** different than
bare metal.  So I hope the issues you're talking about above are on bare
metal -- I will assume so.


Nope, I see basically the same thing sometimes under ESXi 5.0 Hypervisor 
(and yes it worries me the implications of something so broad).  Those 
unites I just haven't been able to isolate on a server which isn't 
critical.  Lets focus on this server for now though per your suggestion 
below.




2. We need to know what version of 9.1 you're using, i.e. 9.1-RELEASE.
If you use stable/9 (RELENG_9) we need to see uname -a output (you can
hide the machine name if you want).


Sorry, this ZFS box is 9.1-R P4 (kernel built today):

FreeBSD ilos.dsn 9.1-RELEASE-p4 FreeBSD 9.1-RELEASE-p4 #6: Wed Jun 19 
15:31:12 ICT 2013 root@hostname:/usr/obj/usr/src/sys/ATEAMSYSTEMS  amd64




3. Can we please have dmesg from this machine?  The controller and some
other hardware details matter.


Sure take a look at the full log here: http://pastebin.com/k55gVVuU

This includes a boot, then a reboot as I describe (you can see it logs 
the All Buffers Synced, etc) then powering back on.




4. Does sysctl hw.usb.no_shutdown_wait=1 help you?


Weirdly this allowed it to reboot on the first try (without needing to 
be reset), but not the second.  The Starting background file system 
checks in 60 seconds message appeared ... that only happens when 
something is dirty, right?


So the second try with just this I could ctrl alt del it and it 
responded .. kind of:

http://i.imgur.com/POAIaNg.jpg

Still had to reset it though.



5. Does sysctl hw.acpi.handle_reboot=1 help you?


No change, still responded to a ctrl alt del like above, but like that 
still needs to be reset and comes back dirty.




6. Does sysctl hw.acpi.disable_on_reboot=1 help you?


No change.  Same as above, ctrl alt del responds but needs a hard reset 
still.




7. If none of the above helps, can you please boot verbose mode and then
when the system locks up on shutdown -r now take a picture of the
VGA console?


Lots of debug on boot obviously but not much different on shutdown/hang:
http://i.imgur.com/SgzSsoP.jpg



8. Does the machine run moused(8) (check the process list please, do not
rely on rc.conf) ?


ps -auxww | grep moused reveals nothing running (which is how I have 
things set).





Another interesting thing is that this particular server runs slapd
(OpenLDAP) which, when it comes back up, has a corrupted DB
(easily fixed with db_recover, but still).  This might be because FS
commits aren't happening at the end.   I can even manually stop
slapd (service slapd stop) then run sync(8) (I assume this does
something for ZFS too) and it still comes back as hosed if I reboot
shortly after.  If I start/stop slapd it's fine.  So I feel like
there is an FS/dismount thing going on here.


sync(8) does not do what you think it does.  Please read (not skim) this
entire thread starting here:

http://lists.freebsd.org/pipermail/freebsd-fs/2013-April/thread.html#16982
http://lists.freebsd.org/pipermail/freebsd-fs/2013-April/016982.html


Groking this now ..



Your problem is related to unclean shutdown; fix that and your issues go
away.


Yeah that is my feeling as well.




Additional information: I also have some boxes which will reboot
(ie; they don't freeze like some do at the end) but they don't
dismount cleanly either and have to rebuild both GMIRROR and fsck.
This might be a different issue, too.


Every issue needs to be handled/treated separately.


Sure, I just had run across some threads about that but will focus on 
this ZFS box (and see if anything that fixes here does anything with 
that once I can reliably reproduce it out of production).







--
Adam Strohl
http://www.ateamsystems.com/
___
freebsd-stable

Re: shutdown -r / shutdown -h / reboot all hang and don't cleanly dismount

2013-06-19 Thread Adam Strohl

On 6/19/2013 19:53, Adam Strohl wrote:

sync(8) does not do what you think it does.  Please read (not skim) this
entire thread starting here:

http://lists.freebsd.org/pipermail/freebsd-fs/2013-April/thread.html#16982

http://lists.freebsd.org/pipermail/freebsd-fs/2013-April/016982.html


Groking this now ..



Epic.  So basically mount -u -o ro FS is really what I (and probably 
everyone else) wants and the man page needs a major overhaul + 
disclaimer (and possibly a recommendation to use mount -u -o ro FS 
instead).



--
Adam Strohl
http://www.ateamsystems.com/
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: shutdown -r / shutdown -h / reboot all hang and don't cleanly dismount

2013-06-19 Thread Adam Strohl
: 0
   SyncID: 1
   ID: 1208229558
4. Name: ada3p1
   Mediasize: 131072 (128k)
   Sectorsize: 512
   Stripesize: 4096
   Stripeoffset: 0
   Mode: r1w1e1
   State: ACTIVE
   Priority: 3
   Flags: NONE
   GenID: 0
   SyncID: 1
   ID: 3928010527
5. Name: ada4p1
   Mediasize: 131072 (128k)
   Sectorsize: 512
   Stripesize: 4096
   Stripeoffset: 0
   Mode: r1w1e1
   State: ACTIVE
   Priority: 4
   Flags: NONE
   GenID: 0
   SyncID: 1
   ID: 442340132
6. Name: ada5p1
   Mediasize: 131072 (128k)
   Sectorsize: 512
   Stripesize: 4096
   Stripeoffset: 0
   Mode: r1w1e1
   State: ACTIVE
   Priority: 0
   Flags: NONE
   GenID: 0
   SyncID: 1
   ID: 1281187492


3. Any/all details of your gmirror setup or other things you can
think of when you set it up


The only thing is that we use GMIRROR on the partition level because we 
use GPT (which is clear from the gpart output I think).  I gmirror the 
boot partition only in this case as I use ZFS backed swap and ZFS root 
for this server.



4. Contents of /etc/fstab


 cat /etc/fstab
# DeviceMountpoint  FStype  Options DumpPass#
# NOTE: ZFS root is not managed here
/dev/zvol/zroot/swapnoneswapsw  0   0


5. Contents of /boot/loader.conf


 cat /boot/loader.conf
geom_mirror_load=YES
zfs_load=YES
vfs.root.mountfrom=zfs:zroot
aio_load=YES
if_lagg_load=YES



6. Contents of /etc/rc.conf


#  Don't run FS check and let apps start
#
fsck_y_enable=YES
background_fsck=NO

#  Power management enables SpeedStep and TurboBoost
#
powerd_enable=YES
powerd_flags=-a hiadaptive

#  Networking
#
hostname=hostname
defaultrouter=xxx.xxx.xxx.3
# -- LACP
ifconfig_em0=up
ifconfig_em1=up
cloned_interfaces=lagg0
ifconfig_lagg0=laggproto lacp laggport em0 laggport em1 xxx.xxx.xxx.212/24

#  Services
#
sshd_enable=YES
smartd_enable=YES
samba_enable=YES
zabbix_agentd_enable=YES
zfs_enable=YES
apcupsd_enable=YES
slapd_enable=YES
slapd_flags='-h ldapi://%2fvar%2frun%2fopenldap%2fldapi/ 
ldap://xxx.xxx.xxx.212/ ldap://127.0.0.1/;'

slapd_sockets=/var/run/openldap/ldapi

#  Time Stuff
#
ntpd_enable=YES
ntpd_sync_on_start=YES

#  Mail
#
postfix_enable=YES
sendmail_enable=NO
sendmail_submit_enable=NO
sendmail_outbound_enable=NO
sendmail_msp_queue_enable=NO


7. Contents of /etc/sysctl.conf


kern.maxfiles=25600
kern.maxfilesperproc=16384
net.inet.tcp.sendspace=65536
net.inet.tcp.recvspace=65536


8. Contents of /sys/amd64/conf/ATEAMSYSTEMS


See above




5. Does sysctl hw.acpi.handle_reboot=1 help you?


No change, still responded to a ctrl alt del like above, but like
that still needs to be reset and comes back dirty.



6. Does sysctl hw.acpi.disable_on_reboot=1 help you?


No change.  Same as above, ctrl alt del responds but needs a hard
reset still.


Okay, thank you.


7. If none of the above helps, can you please boot verbose mode and then
when the system locks up on shutdown -r now take a picture of the
VGA console?


Lots of debug on boot obviously but not much different on shutdown/hang:
http://i.imgur.com/SgzSsoP.jpg


It looks to me like the ACPI layer is still actively working at the time
all buffers are synced, meaning the actual reboot phase itself never
happens.  This to me starts to smell of an ACPI problem, but I do not
have the skill set to debug this, and I'm also grasping at straws.
There are many things that happen during that phase of operation,
particularly the USB shutdown phase.


Yeah.  Originally I had even my UPS (APC) disconnected, the only USB 
device (via a port -- I realize there might be MB virtual ports) was a 
Dell KB.




But it all depends on your kernel config, which I've now asked for.


Yeah

--
Adam Strohl
http://www.ateamsystems.com/
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: shutdown -r / shutdown -h / reboot all hang and don't cleanly dismount

2013-06-19 Thread Adam Strohl

On 6/19/2013 21:21, Steven Hartland wrote:

You still need to test if stable/9 fixes your issue though as otherwise
you don't know if the issue your seeing has already been fixed, and if
its the old know ZFS vfs hang on shutdown, it has.


Thanks Steve, understood but probably not going to happen with this box. 
 I can reboot this thing but it's our NAS and not a test bed.  This 
problem on this machine isn't a big deal because its a server and not 
rebooted often (and easy to bring back).  But I more was hoping it would 
let me easily test solutions to the issue since the other servers 
showing the issue are in client production with the mind that the VMs 
not use ZFS also show a similar/identical issue  My gut says it 
appeared in/with 9.1 (We never saw this with 9.0 servers).   It is also 
possible this is a different issue from those other servers and VMs.


How far away is 9.2? ;-P

Depending on how things go with Jeremy I'll probably have to wait this 
out unless I can get a test machine or VM where I can reproduce the 
issue AND upgrade it to -STABLE (again assuming it's even the same issue).

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: shutdown -r / shutdown -h / reboot all hang and don't cleanly dismount

2013-06-19 Thread Adam Strohl

On 6/19/2013 22:04, Jeremy Chadwick wrote:

On Wed, Jun 19, 2013 at 09:15:18PM +0700, Adam Strohl wrote:

On 6/19/2013 20:35, Jeremy Chadwick wrote:


I've snipped out portions which aren't relevant at this point in the
convo.  I'm trying to be terse as much as possible here (honest).

To recap for readers/mailing list:

- Adam seems the same behaviour on systems on bare metal, as well as
   FreeBSD guests running under VMware ESXi 5.0 hypervisor.  However,
   as I stated on the list just yesterday about lock-ups on shutdown,
   every situation may be different and there is a well-established
   history of this problem on FreeBSD where each root cause (bugs)
   were completely different from one another.

- The system we're discussing at this point in the thread is on
   bare metal -- specifically an Asus P8B-X motherboard, with BIOS
   version 6103, driven entirely by on-board Intel AHCI (not BIOS-level
   RAID).

- Adam runs 9.1-RELEASE because of business needs pertaining to
   freebsd-update and binary updates.  (I ask more about this for
   benefits of readers below, however -- because this situation comes
   up a lot and I want to know what real-world admins do)



This is all correct.


Thanks.  I was mainly interested in the storage controller being used
(in this case ahci(4)) and the disks being used (notorious ST3000DM001,
known for excessively parking heads).


Yeah, was not my first choice but then again ... RAIDZ-2 :)  HD
supply chain here (Thailand) is weird considering how many are made
here (and can't buy).  Smartd screams about them possibly needing a
firmware update (they don't according to Seagate).   Had no issues
aside from a failure a month or so again (it's an HD ... it
happens).


Absolutely understood -- and FYI, in case you need backup, your thought
process/conclusion here is spot on (re: it's a MHDD, failures happen).


Indeed :-D



Irrelevant to your shutdown problem: as for smartmontools bitching about
the firmware: no vendors disclose what actual changes go into their
drive firmware updates (vendors if you are reading this: I will have
your souls...), so I have to read a bunch of end-user forums where
nobody knows what they're talking about, and then of course find this
highly educational *cough* article from Adaptec:

http://ask.adaptec.com/app/answers/detail/a_id/17241/~/known-issues-with-seagate-barracuda-7200.14-desktop-drives



Yeah I agree .. I tried to firmware upgrade them when I was building the 
system but it said they didn't qualify when using the boot ISO.  I just 
checked the site and it says no firmware update available too when using 
their search by serial # tool.   At this point I'm leery about updating 
given that I've got data on it anyway.  I do occasionally (maybe once a 
week or two and they're in the same room as me/my office) hear one parking.


I see nothing wrong in smart though, no dmesg errors and have noticed no 
issues with the array and it bench tests at around 850 MB/sec.  Too bad 
10 Gbit equipment isn't cheaper.


Also when I bought the 6 for this array I got a 7th as a cold spare :P


The problem here is that there have been *so many* firmware bugs with
Seagate's drives in the past 2 years or so that it's impossible for me
to know which fixes what.  You buy what you buy because that's what you
buy, and that's cool -- but I avoid their stuff like the plague.


Yeah.  I'd prefer WD myself but this place is swimming in green and 
now red drives.  uhgl.


 Snipping out the unrelated parts ... 


Can you try removing VESA and SC_PIXEL_MODE please?  I know that
sounds crazy (what on earth would that have to do with it?), but
please try it.  I can explain the justification if need be -- I'm being
extra paranoid of something that got discovered here on -stable only a
few days ago.  It's a stretch, but I can see potential relevance.  I can
provide details/links later.


No change unfortunately.




4. Does sysctl hw.usb.no_shutdown_wait=1 help you?


Weirdly this allowed it to reboot on the first try (without needing
to be reset), but not the second.


I'm not surprised.  Pleas re-try with stable/9; Hans has been constantly
working on the USB stack and fixing major bugs.


Got it but probably not going to go this route as it means no more
binary upgrades.  While I can reboot it, it is the office NAS here
and so 'testing out' -STABLE I think probably isn't going to happen.


I understand.  I have a question relating to this below.


Place background_fsck=no in /etc/rc.conf.  If the machine does not
have a clean filesystem on boot-up, you'll know because the system will
immediately begin fsck (in the foreground actively).  You'll recognise
that output if it happens, trust me.


Preaching to the choir, we set this on all servers this one somehow
did not have it set (I think due to ZFS making it unique and not
copying our rc.conf template over properly).


Where should I send my bill for services rendered?  (Totally kidding --
just had some breakfast

Re: sshd didn't run after upgrade to FreeBSD 8.4

2013-06-19 Thread Adam Vande More
On Wed, Jun 19, 2013 at 6:32 PM, Kimmo Paasiala kpaas...@gmail.com wrote:

 You're missing my point totally. The line is commented out in the
 official source of 8.4 and there for I have very hard time believing
 that it would show up uncommented on a fresh 8.4 installation.


I don't think this warrants a mention in the Release Notes for exactly this
point, however it should probably be mentioned in UPDATING.  If nothing
else, that would at least keep UPDATING consistent with previous ssh major
upgrades.

-- 
Adam Vande More
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: 9.1-stable: ATI IXP600 AHCI: CAM timeout

2013-05-29 Thread Adam McDougall
On 05/29/13 10:21, Oliver Fromme wrote:
 Steven Hartland wrote:
   Have you checked your sata cables and psu outputs?
   
   Both of these could be the underlying cause of poor signalling.
 
 I can't easily check that because it is a cheap rented
 server in a remote location.
 
 But I don't believe it is bad cabling or PSU anyway, or
 otherwise the problem would occur intermittently all the
 time if the load on the disks is sufficiently high.
 But it only occurs at tags=3 and above.  At tags=2 it does
 not occur at all, no matter how hard I hammer on the disks.
 
 At the moment I'm inclined to believe that it is either
 a bug in the HDD firmware or in the controller.  The disks
 aren't exactly new, they're 400 GB Samsung ones that are
 several years old.  I think it's not uncommon to have bugs
 in the NCQ implementation in such disks.
 
 The only thing that puzzles me is the fact that the problem
 also disappears completely when I reduce the SATA rev from
 II to I, even at tags=32.
 
 Best regards
Oliver
 
 

Jeremy Chadwick knows of some hardware faults with IXP600/700,
there may be more information on the freebsd-fs mailing list archives or
if you can discuss with him:

http://docs.freebsd.org/cgi/mid.cgi?20130414194440.GB38338

That email mentions port multipliers but the problems may extend beyond.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: recommended memory for zfs

2013-05-09 Thread Adam Vande More
Probably the simplest answer is that you already have sufficient
memory to run ZFS.  As someone already mentioned you should use AMD64,
not i386.  If your setup isn't fast enough with tuning, add more if
it's the bottleneck.

On Thu, May 9, 2013 at 8:47 PM, Benjamin Adams benjamindad...@gmail.com wrote:
 On 05/09/2013 08:53 PM, Shane Ambler wrote:

 On 09/05/2013 22:48, Benjamin Adams wrote:

 Hello zfs question about memory.
 I heard zfs is very ram hungry.
 Service looking to run:
 - nginx
 - postgres
 - php-fpm
 - python

 I have a machine with two quad core cpus but only 4 G Memory

 I'm looking to buy more ram now.
 What would be the recommend amount of memory for zfs across 6 drives on
 this setup?


 I believe I heard a calculation of 1GB cache per 1TB of disk. But
 basically zfs will use all free ram available if you access that much data
 from disk. You will want to set vfs.zfs.arc_max to allow enough ram for your
 apps to work in.

 If you consider the files for your website and the data you store you may
 find that you would never fill more than 500MB of cache.

 If you will be serving large media files that will easily use up the cache
 you could give them their own filesystem that only caches metadata - zfs set
 primarycache=metadata zroot/mediafiles


 Thanks for all the replies  Size of DB and HD's are:

 Current DB Size = 23 GB
 HD sizes = (6) 500 GB drives




 ___
 freebsd-stable@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-stable
 To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org



-- 
Adam Vande More
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: recommended memory for zfs

2013-05-09 Thread Adam Vande More
On Thu, May 9, 2013 at 9:06 PM, Jeremy Chadwick j...@koitsu.org wrote:

 The advice of 1GB of RAM per 1TB of disk space is absolute nonsense on
 numerous levels -- whoever gave this advice to Shane either has no
 understanding of how filesystems/ZFS works, or does but chose to
 simplify to the point where they're providing half-ass information.

IIRC, that used to be the guideline for memory requirements for dedup.



--
Adam Vande More
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Why does poudriere always rebuild nginx and GraphicsMagick13?

2013-02-14 Thread Adam McDougall
On Fri, Feb 15, 2013 at 12:37:19AM +0100, Rainer Duffner wrote:
  
  Am 12.02.2013 um 23:11 schrieb Baptiste Daroussin b...@freebsd.org:
  
   On Tue, Feb 12, 2013 at 10:59:28PM +0100, Rainer Duffner wrote:
   Hi,
   
   poudriere 2.2 here, running on 9.1-amd64
   
   Of the 730-ish ports, whenever I run a build, it always rebuilds the above 
two ports.
   Even if nothing changed.
  
   Options changed, deleting: GraphicsMagick-nox11-1.3.16_1.txz
   Options changed, deleting: nginx-1.2.6,1.txz
  
  Somehow, it thinks the options have changed.
  Maybe, the options-file has an error?
  
  Regards,
  Rainer
  
Try deleting the options file for each and run poudriere twice to test.
I had the same problem with mailman and it turned out I was missing a
required but not enforced option due to another option I had selected.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: About kern.ipc.somaxconn and netstat

2013-01-29 Thread Adam Vande More
On Tue, Jan 29, 2013 at 7:26 PM, Efraín Déctor efraindec...@motumweb.comwrote:

 Hello.

 We have a webserver using FreeBSD, we read about tunning
 kern.ipc.somaxconn (
 http://www.freebsd.org/doc/en_US.ISO8859-1/books/handbook/configtuning-kernel-limits.html)
 so the OS can handle all the connections. Is there a way to know how many
 connections are established in a certain moment?. I know about netstat(1)
 but is there any other command that we can use to know the exact amount of
 how many connections are established?.


sockstat(1)

There are other sysctl's to view connections in a particular state such
as net.inet.tcp.pcblist:



-- 
Adam Vande More
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: time issues and ZFS

2013-01-22 Thread Adam McDougall

On 01/22/13 07:27, Julian Stecklina wrote:

Thus spake Daniel Braniss da...@cs.huji.ac.il:


In the meantime here is some info:
Intel(R) Xeon(R) CPU E5645: running with no problems
   LAPIC(600) HPET(450) HPET1(440) HPET2(440) HPET3(440) i8254(100) RTC(0)

Intel(R) Xeon(R) CPU X5550: this is the problematic, at least for the moment
   HPET(450) HPET1(440) HPET2(440) HPET3(440) LAPIC(400) i8254(100) RTC(0)


Does anyone know why the LAPIC is given a lower priority than HPET in
this case? If you have an LAPIC, it should always be prefered to HPET,
unless something is seriously wrong with it...

Julian

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org



This may help:

Problem with LAPIC timer is that it stops working when CPU goes to C3 
or deeper idle state. These states are not enabled by default, so unless 
you enabled them explicitly, it is safe to use LAPIC. In any case 
present 9-STABLE system should prevent you from using unsafe C-state if 
LAPIC timer is used. From all other perspectives LAPIC is preferable, as 
it is faster and easier to operate then HPET. Latest CPUs fixed the 
LAPIC timer problem, so I don't think that switching to it will be 
pessimistic in foreseeable future.


--
Alexander Motin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: FreeBSD 9.1 - openldap slapd lockups, mutex problems

2013-01-22 Thread Adam McDougall

On 01/22/13 05:19, Kai Gallasch wrote:

Hi.

(Im am sending this to the stable list, because it maybe kernel related.. )

On 9.1-RELEASE I am witnessing lockups of the openldap slapd daemon.

The slapd runs for some days and then hangs, consuming high amounts of CPU.
In this state slapd can only be restarted by SIGKILL.

  # procstat -kk 71195
   PIDTID COMM TDNAME   KSTACK
71195 149271 slapd-mi_switch+0x186 
sleepq_catch_signals+0x2cc sleepq_wait_sig+0x16 _sleep+0x29d do_wait+0x678 
__umtx_op_wait+0x68 amd64_syscall+0x546 Xfast_syscall+0xf7



On UFS2 slapd runs fine, without showing the error.
Has anyone else running openldap-server on FreeBSD 9.1 inside a jail seen 
similar problems?


I have seen openldap spin the cpu and even run out of memory to get 
killed on some of our test systems running ~9.1-rel with zfs.  No jails.
I'm not sure what would have put load on our test systems other than 
nightly scripts.  I had to focus my attention on other servers so I 
don't have one to inspect at this point, but I won't be surprised if I 
see this in production.  Thanks for the tip about it being ZFS related, 
and I'll let you know if I find anything out.  This is mostly a me too 
reply.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Samsung SSD 840 PRO fails to probe

2012-11-26 Thread Adam McDougall

Hello,

My co-worker ordered a Samsung 840 PRO series SSD for his desktop but we 
found 9.0-rel would not probe it and 9.1-rc3 shows some errors.  I got 
past the problem with a workaround of disabling AHCI mode in the BIOS 
which drops it to IDE mode and it detects fine, although runs a little 
slower.  Is there something I can try to make it probe properly in AHCI 
mode?  We also tried moving it to the SATA data and power cables from 
the working SATA HD so I don't think it is the port or controller 
driver.  The same model motherboard from another computer did the same 
thing.  Thanks.


dmesg line when it is working:
ada0: Samsung SSD 840 PRO Series DXM03B0Q ATA-9 SATA 3.x device

dmesg lines when it is not working: (hand transcribed from a picture)
(aprobe0:ahcich0:0:0): SETFEATURES ENABLE SATA FEATURE. ACB: ef 10 00 00 
00 40 00 00 00 00 05 00

(aprobe0:ahcich0:0:0): CAM status: ATA Status Error
(aprobe0:ahcich0:0:0): ATA status: 51 (DRDY SERV ERR), error: 04 (ABRT )
(aprobe0:ahcich0:0:0): RES: 51 04 00 00 00 40 00 00 00 00 00
(aprobe0:ahcich0:0:0): Retrying command
(aprobe0:ahcich0:0:0): SETFEATURES ENABLE SATA FEATURE. ACB: ef 10 00 00 
00 40 00 00 00 00 05 00

(aprobe0:ahcich0:0:0): CAM status: ATA Status Error
(aprobe0:ahcich0:0:0): ATA status: 51 (DRDY SERV ERR), error: 04 (ABRT )
(aprobe0:ahcich0:0:0): RES: 51 04 00 00 00 40 00 00 00 00 00
(aprobe0:ahcich0:0:0): Error 5, Retries exhausted
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Samsung SSD 840 PRO fails to probe

2012-11-26 Thread Adam McDougall

On 11/26/12 14:27, Alexander Motin wrote:

Hi.

On 26.11.2012 20:51, Adam McDougall wrote:

My co-worker ordered a Samsung 840 PRO series SSD for his desktop but we
found 9.0-rel would not probe it and 9.1-rc3 shows some errors.  I got
past the problem with a workaround of disabling AHCI mode in the BIOS
which drops it to IDE mode and it detects fine, although runs a little
slower.  Is there something I can try to make it probe properly in AHCI
mode?  We also tried moving it to the SATA data and power cables from
the working SATA HD so I don't think it is the port or controller
driver.  The same model motherboard from another computer did the same
thing.  Thanks.

dmesg line when it is working:
ada0: Samsung SSD 840 PRO Series DXM03B0Q ATA-9 SATA 3.x device

dmesg lines when it is not working: (hand transcribed from a picture)
(aprobe0:ahcich0:0:0): SETFEATURES ENABLE SATA FEATURE. ACB: ef 10 00 00
00 40 00 00 00 00 05 00
(aprobe0:ahcich0:0:0): CAM status: ATA Status Error
(aprobe0:ahcich0:0:0): ATA status: 51 (DRDY SERV ERR), error: 04 (ABRT )
(aprobe0:ahcich0:0:0): RES: 51 04 00 00 00 40 00 00 00 00 00
(aprobe0:ahcich0:0:0): Retrying command
(aprobe0:ahcich0:0:0): SETFEATURES ENABLE SATA FEATURE. ACB: ef 10 00 00
00 40 00 00 00 00 05 00
(aprobe0:ahcich0:0:0): CAM status: ATA Status Error
(aprobe0:ahcich0:0:0): ATA status: 51 (DRDY SERV ERR), error: 04 (ABRT )
(aprobe0:ahcich0:0:0): RES: 51 04 00 00 00 40 00 00 00 00 00
(aprobe0:ahcich0:0:0): Error 5, Retries exhausted


I believe that is SSD's firmware bug. Probably it declares support for
SATA Asynchronous Notifications in its IDENTIFY data, but returns error
on attempt to enable it. Switching controller to legacy mode disables
that functionality and so works as workaround. Patch below should
workaround the problem from the OS side:

--- ata_xpt.c   (revision 243561)
+++ ata_xpt.c   (working copy)
@@ -745,6 +745,14 @@ probedone(struct cam_periph *periph, union ccb *do
 goto noerror;

 /*
+* Some Samsung SSDs report supported Asynchronous
Notification,
+* but return ABORT on attempt to enable it.
+*/
+   } else if (softc-action == PROBE_SETAN 
+   status == CAM_ATA_STATUS_ERROR) {
+   goto noerror;
+
+   /*
  * SES and SAF-TE SEPs have different IDENTIFY commands,
  * but SATA specification doesn't tell how to identify
them.
  * Until better way found, just try another if first fail.




Thanks for the prompt response and patch, that worked!
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: How go back from X.Y-RELEASE-pZ to X.Y-RELEASE?

2012-11-23 Thread Adam McDougall

On 11/23/2012 6:22 AM, Peter Olsson wrote:

We are currently using cvs for both source and ports.
I have begun changing to portsnap for ports, and I
would also like to try changing at least some of our
servers to freebsd-update.

But all servers have been patched, using either RELENG_8_3
or RELENG_9_0 as cvs tag. I need to revert them to their
respective RELEASE to be able to use freebsd-update.
Complete reinstall from eg CD is not an option, and I don't
want to upgrade to a newer RELEASE at the moment.

Can I change the cvs tags to RELENG_8_3_0_RELEASE or
RELENG_9_0_0_RELEASE, and then build/install world and
kernel as usual?
Or will that method cause problems for the system or the
installed ports?

Thanks!

--
Peter Olssonp...@leissner.se


That is what I would do.  Certainly try it on a non-critical system first,
and take proper consideration for the potential vulnerabilities that will
come back until freebsd-update succeeds.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: SU+J on 9.1-RC2 ISO

2012-11-04 Thread Adam Strohl

On 11/3/2012 1:31, Mateusz Guzik wrote:

Currently when you try to take a snapshot, the kernel checks whether SUJ
is enabled on specified mount-point, and if yes it returns EOPNOTSUPP.

See this commit (MFCed as r230725):
http://svnweb.freebsd.org/base?view=revisionamp;revision=230250



Ahhh excellent to hear. I partition manually these days with 9.0-R 
because most servers are either using gmirror, which I want setup before 
the install, or a RAID card which means partitions need to be aligned to 
the stripe boundaries.  So I just newfs -U -L and keep journaling off 
and wouldn't have realized there is at least some mitigation that will 
make it into 9.1-R.


I still stand by my feeling that it should not be on by default though, 
because it breaks snapshots and by extension dump -L which I consider to 
be a pretty awesome feature of FreeBSD.  If you have partitions with 
enabled it means booting up in single user to undo it which is a hassle 
for a server if it's in production (I realize that's a bit whiny :P).



--
Adam Strohl
http://www.ateamsystems.com/
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Why is SU+J undesirable on SSDs?

2012-11-04 Thread Adam Strohl

On 11/4/2012 5:32, Karl Denninger wrote:

It is utter insanity to enable, by default, filesystem options that
break _*the canonical backup solution*_ in the handbook (dump, when
used with -L, which it must be to dump a live filesystem SAFELY.)


Exactly.


--
Adam Strohl
http://www.ateamsystems.com/
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Why is SU+J undesirable on SSDs?

2012-11-03 Thread Adam Vande More
On Sat, Nov 3, 2012 at 4:30 PM, Brett Glass br...@lariat.net wrote:

 Have been following the thread related to SU+J, and am wondering: why is it
 considered to be undesirable on SSDs (assuming that they have good wear
 leveling)?


Superstition


-- 
Adam Vande More
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: SU+J on 9.1-RC2 ISO

2012-11-02 Thread Adam Strohl

On 11/2/2012 23:47, Bas Smeelen wrote:

Hi

Why are journaled soft updates the default when installing a new system
from a 9.1-RC2 ISO?

I admit I did not pay too much attention when installing a new system
from an 9.1-RC2 ISO and found out when taking a snapshot with dump (dump
-0Lauf) to clone the system. Other systems (9-STABLE, 9.1-RC2 and
9.1-RC3) have been upgraded from 8.X-RELEASE and earlier, so there are
no journaled soft updates enabled, just soft updates, and well there
dump with snapshot works just fine.

Can SU+J be disabled for the 9.1-RELEASE or do you think this is not
going to be a problem for users of FreeBSD? I will have to boot these
two systems single user now to disable the soft updates journal, because
I use dump + restore on live systems, not a problem for me, it is just
an inconvenience.



I have to second this sentiment.  Unless the dump/snapshot issue has 
been resolved they journal should be turned off by default.


It's a really nasty bug that causes an instant panic which is awful if 
the server is in production.  The fact that it happens when you're 
trying to exercise due diligence (ie; backups) is even worse.


-- my .02
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: SU+J on 9.1-RC2 ISO

2012-11-02 Thread Adam Strohl

On 11/3/2012 0:13, Mike Jakubik wrote:

You can disable SU+J after installing, though it would be nice if the
installer gave you a choice.


This assumes that you know about this flaw, which most people do not.

I didn't until I discovered it by panic-ing a perfectly fine running 
server.  Getting burned by a known bug like this shouldn't be SOP for 
users of FreeBSD.


If anything it should be turned off by default, and people can turn it 
on if they want given the landmine it plants.  If they know how to turn 
it on they're much more likely to be aware of the issue.



--
Adam Strohl
http://www.ateamsystems.com/
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


No buffer space available / tcp_inpcb value

2012-10-30 Thread Adam Strohl

Hey -STABLE,

I've got a client who we've setup a FreeBSD cluster for with about a 
dozens servers, all behind two front end proxies/LBs/firewalls which 
also act as NAT gateways for the internal servers.


On the active front end proxy we've started seeing fatal: socket: No 
buffer space available errors during high-peak times.   I can see in 
vmstat -z that this is what is getting denied:


ITEM   SIZE  LIMIT USED FREE  REQ FAIL SLEEP
tcp_inpcb:  392,  32770,   19398, 13372,1449734621,6312858,   0

We've got a lot of the other values bumped, and it appears to be this 
input limit that is getting hit.  There are no other non-zero FAILed 
counters except 64 and 128 buckets which I believe are normal.


I cannot seem to find the sysctl (or equiv) that controls this limit 
though, or even what it is.  Anyone know?


I'm obviously in need of this specific answer, but overall is there a 
codex of vmstat -z's items that explains this that I have just not found 
in my searches?  This isn't the first time I've had to dig into a value 
like this to increase it's limit, but this time I'm not turning anything up.


Any thoughts/ideas appreciated!

--
Adam Strohl
http://www.ateamsystems.com/

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: No buffer space available / tcp_inpcb value

2012-10-30 Thread Adam Strohl

On 10/30/2012 23:05, Adrian Chadd wrote:

Check the output of 'netstat -mb', maybe you're also running out of mbufs?


There was nothing denied there that I can see:

35696/4039/39735 mbufs in use (current/cache/total)
2069/3797/5866/32768 mbuf clusters in use (current/cache/total/max)
2069/2077 mbuf+clusters out of packet secondary zone in use (current/cache)
4/3283/3287/16384 4k (page size) jumbo clusters in use 
(current/cache/total/max)

0/0/0/8192 9k jumbo clusters in use (current/cache/total/max)
0/0/0/4096 16k jumbo clusters in use (current/cache/total/max)
13078K/21735K/34813K bytes allocated to network (current/cache/total)
0/0/0 requests for mbufs denied (mbufs/clusters/mbuf+clusters)
0/0/0 requests for jumbo clusters denied (4k/9k/16k)
0/0/0 sfbufs in use (current/peak/max)
0 requests for sfbufs denied
0 requests for sfbufs delayed
0 requests for I/O initiated by sendfile
0 calls to protocol drain routines




Adrian


On 30 October 2012 06:21, Adam Strohl adams-free...@ateamsystems.com wrote:

Hey -STABLE,

I've got a client who we've setup a FreeBSD cluster for with about a dozens
servers, all behind two front end proxies/LBs/firewalls which also act as
NAT gateways for the internal servers.

On the active front end proxy we've started seeing fatal: socket: No buffer
space available errors during high-peak times.   I can see in vmstat -z
that this is what is getting denied:

ITEM   SIZE  LIMIT USED FREE  REQ FAIL SLEEP
tcp_inpcb:  392,  32770,   19398, 13372,1449734621,6312858,   0

We've got a lot of the other values bumped, and it appears to be this input
limit that is getting hit.  There are no other non-zero FAILed counters
except 64 and 128 buckets which I believe are normal.

I cannot seem to find the sysctl (or equiv) that controls this limit though,
or even what it is.  Anyone know?

I'm obviously in need of this specific answer, but overall is there a codex
of vmstat -z's items that explains this that I have just not found in my
searches?  This isn't the first time I've had to dig into a value like this
to increase it's limit, but this time I'm not turning anything up.

Any thoughts/ideas appreciated!

--
Adam Strohl
http://www.ateamsystems.com/

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org



--
Adam Strohl
http://www.ateamsystems.com/
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: SOLVED: Time Clock Stops in FreeBSD 9.0 guest running under ESXi 5.0

2012-08-03 Thread Adam Strohl
Just a heads up on the original issue, which is FreeBSD's timer/clock 
stopping under ESXi 5.0 and some later versions of VMware Workstation.


I've gotten a few direct messages that this thread ranks high on Google 
but people are missing the solution.  A few months ago I found this 
forum posting (I believe this was linked in this thread already) 
http://unix.derkeiler.com/Mailing-Lists/FreeBSD/stable/2012-03/msg00201.html 



The long and short of it is that changing the kern.timecounter sysctl 
value to ACPI-fast or (ACPI-safe if you're not running 9.x yet) fixes 
the hanging issue so far for us.


To temporarily enable it under 9.x:
sysctl kern.timecounter.hardware=ACPI-fast

Pre 9.x (which doesn't have the ACPI-fast mode):
sysctl kern.timecounter.hardware=ACPI-safe

To make this persist across reboots and be enabled by default add this 
line to your /etc/sysctl.conf


Under 9.x:
kern.timecounter.hardware=ACPI-fast

Pre 9.x:
kern.timecounter.hardware=ACPI-safe

Hope this helps anyone running across this issue.

--
Adam Strohl
http://www.ateamsystems.com/

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: SOLVED: Time Clock Stops in FreeBSD 9.0 guest running under ESXi 5.0

2012-08-03 Thread Adam Strohl

Doh, correct URL for the forum post is:
http://forums.freebsd.org/showthread.php?t=31929page=2

On 8/3/2012 14:38, Adam Strohl wrote:

Just a heads up on the original issue, which is FreeBSD's timer/clock
stopping under ESXi 5.0 and some later versions of VMware Workstation.

I've gotten a few direct messages that this thread ranks high on Google
but people are missing the solution.  A few months ago I found this
forum posting (I believe this was linked in this thread already)
http://unix.derkeiler.com/Mailing-Lists/FreeBSD/stable/2012-03/msg00201.html


The long and short of it is that changing the kern.timecounter sysctl
value to ACPI-fast or (ACPI-safe if you're not running 9.x yet) fixes
the hanging issue so far for us.

To temporarily enable it under 9.x:
sysctl kern.timecounter.hardware=ACPI-fast

Pre 9.x (which doesn't have the ACPI-fast mode):
sysctl kern.timecounter.hardware=ACPI-safe

To make this persist across reboots and be enabled by default add this
line to your /etc/sysctl.conf

Under 9.x:
kern.timecounter.hardware=ACPI-fast

Pre 9.x:
kern.timecounter.hardware=ACPI-safe

Hope this helps anyone running across this issue.




--
Adam Strohl
http://www.ateamsystems.com/
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: apache hangs in wait4

2012-07-07 Thread Adam Vande More
On Sat, Jul 7, 2012 at 9:22 PM, Ivan Voras ivo...@freebsd.org wrote:

 Hello,

 I have a very embarrassing problem where apache22-worker, running
 mod_fcgid with php, perl and python fastcgi processes, hangs daliy in
 wait4:

 # procstat -k 54688
   PIDTID COMM TDNAME   KSTACK
 54688 101355 httpd-mi_switch
 sleepq_catch_signals sleepq_wait_sig _sleep kern_wait sys_wait4
 amd64_syscall Xfast_syscall

 The only suspicious things in logs is this:

 [Sat Jul 07 20:00:01 2012] [notice] SIGUSR1 received.  Doing graceful
 restart
 [Sat Jul 07 20:00:10 2012] [error] FastCGI process 41228 still did not
 exit, terminating forcefully

 The 41228 process is a Perl FastCGI web application using p5-FCGI
 (wwsympa), and it is in the accept wchan.

 Any ideas?


Is it the same time?  newsyslog perhaps?

-- 
Adam Vande More
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Recommendation for Hyervisor to host FreeBSD

2012-07-05 Thread Adam Strohl

On 7/5/2012 21:27, Rainer Duffner wrote:

They come (or came, last time I looked) with a lot of
run-time dependencies and even more at build-time.
And AFAIK, they don't offer the full functionality either.


There is a number of dependencies, but as far as I know it isn't missing 
anything: memory driver, OS control (ie; shutdown), etc.


I manage dozens of FreeBSD VMs under ESXi 3.5, 4.x and 5.0 ... most of 
them using OpenVM tools (ie; the 9.x hosts), works great.



--
Adam Strohl
http://www.ateamsystems.com/


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: fsck_ufs running too often

2012-06-23 Thread Adam Vande More
On Sat, Jun 23, 2012 at 9:54 PM, Jason Hellenthal jhellent...@dataix.netwrote:

 At one point it was proven that background fsck was not benefitial.


Where can we find this proof?

-- 
Adam Vande More
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: How to bind a route to a network adapter and not IP

2012-06-15 Thread Adam McDougall

On 06/15/12 12:19, Hans Petter Selasky wrote:

Hi,

Maybe there is a simple answer, but how do I bind a route to a network
interface in 8-stable? Is that possible at all? I'm asking because the routes
I add in my network setup are lost because of ARP packet drops. I.E. they
exist for a while, but not forever like I want to.

--HPS


Is route add x.x.x.x -iface em0   what you want?
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: IPv6 and CARP crashes boxes

2012-06-12 Thread Adam Strohl

On 6/12/2012 19:48, Pete French wrote:

I ran into some - aliases on a CARP integface did not seem
to work proprly - but if you workaround that then it appears
to work fine. We are using it in production with no problems.


I have noticed this issue (CARP + IPv4 aliases) with older (pre 9.x) 
versions of FreeBSD.


I maintain some legacy 6.2 servers and had to eventually add ifconfig 
statements inside rc.local to get the links to coalesce.  6.2 appears to 
ignore _aliasn directives entirely inside rc.conf, and has real issues 
if you add/delete aliases to a CARP interface while its up (both peers 
end up thinking they're MASTER).


In 9.x it all works as expected at least for IPv4 (rc.conf 
carpn_aliasn entries, aliases, on the fly reconfiguring).


--
Adam Strohl
http://www.ateamsystems.com/
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: IPv6 and CARP crashes boxes

2012-06-12 Thread Adam Strohl

On 6/12/2012 20:08, Pete French wrote:

I have noticed this issue (CARP + IPv4 aliases) with older (pre 9.x)
versions of FreeBSD.


Ah, just to be clear, the only problems I had with aliases weher IPv6 - it
always worked properly with IPv4. But I didnt try on anything pre 8.1!

-pete.


Doh, I caught this just as I hit send :P

--
Adam Strohl
http://www.ateamsystems.com/
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Backups with 9-STABLE -- Options?

2012-06-10 Thread Adam Strohl

On 6/10/2012 3:08, Karl Denninger wrote:

With SU+J as the default filesystem, what options actually WORK now?

1. Dump L will NOT -- it doesn't hang any more but now just bitches
and refuses to run.  I suppose that beats a hang


Heh, yeah that is improved from what it did before ;D


2. Dump without L and take your chances?  What risks am I running by
doing this on a running system?


Depends on what is running and how it does file writes.  For example SQL 
DB storage engines are unlikely to do well (ie; the restore will be 
corrupted if there are changes during the process).  Something like 
CouchDB though which is always consistent on disk probably wouldn't care.


Past specific applications (or user activity) the inherent risk is 
unpredictable usefulness of your backups.  Since you're doing backups as 
a safeguard (and are very likely your last hope if things really go 
wrong) you don't want to find out that a key piece corrupted or missing 
entirely due to files moving around during the dump when you end up 
needing it.



3.  Other?

Dump has been the canonical means of backing up... forever.  And it
still is claimed to be the canonical means in the documentation.

So what options do we have now that actually work -- is there now a new
canonical backup method that is recommended?


My solution is to turn off journals for any build.   Dump is a great 
tool (especially when scripted) and is very efficient.


And as neat as journals are, backups using dump with snapshots is way 
more valuable and important in my book.


My .02.

--
Adam Strohl
http://www.ateamsystems.com/
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Backups with 9-STABLE -- Options?

2012-06-10 Thread Adam Strohl

On 6/10/2012 22:26, Karl Denninger wrote:

Well, backup with snapshots don't do well EITHER on a database unless
you can snapshot BOTH the dbms data store(s) and the transaction log
store(s) /*at the exact same instant*/.  If you cannot then you're
asking for trouble and are likely to get it.  But I've dealt with that
particular gotcha problem in a different way for the DBMS I use
(Postgresql)


You asked what would happen, not what was the best way to back up a SQL 
DB, but your point is valid.


Snapshots don't fix this issue entirely but drastically reduce the 
chance of a 100% broken backup.


SQL servers should be dumped out to disk (ie; mysql_dump) to avoid this 
or have a dedicated backup client (which means you're probably not using 
dump anyway).



So basically what you're saying is that SU+J leaves you exposed to
having no real backup option that provides a rational guarantee of the
ability to restore the backup taken.


That's a bit of a gloss over on what I said.  My point was that you 
might end up missing something if its changing at the time the backup 
was taken.  It really depends on what specifically that server is doing.


There is also a consistency issue too, using snapshots makes it so that 
all the files make sense together, instead of the files getting more and 
more recent as the end of the backup block approaches.


--
Adam Strohl
http://www.ateamsystems.com/
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: su problem

2012-06-09 Thread Adam Strohl

On 6/9/2012 20:29, Sami Halabi wrote:

Hi,
/var/log/messages - no new logs


Sorry if this has been asked, anything in dmesg?
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: su problem

2012-06-09 Thread Adam Strohl

On 6/9/2012 20:33, Sami Halabi wrote:

its the same as /var/log/messages


I assume you mean there is nothing there because it's not the same thing 
(yes dmesg stuff should get logged into syslog but your system obviously 
isn't working right so ...).


Past that I've been skimming this thread since you posted and I can't 
think of anything here that would resolve this except that it might be 
worth a try to have someone ctrl-alt-del it (requires no FreeBSD 
knowledge, passwords, etc by the person doing it and should gracefully 
reboot the server).   Its a total Hail Mary [pass] though [and probably 
won't work].


It might lock you out entirely, too.

P.S.
Beyond this incident obviously setting up a remote console is ideal, 
IPMI is very worth it, but my guess is you'd have it setup if your MB 
had it.  If you don't have an IPMI module and you happen to have another 
box there cross-patching their serial consoles to each other so if one 
goes down you can serial via the other one (ie; server1's com1 to 
server2's com2, and server2's com1 to server1's com2).  You need to set 
this up as root though so no help now.


--
Adam Strohl
http://www.ateamsystems.com/
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Why Are You NOT Using FreeBSD ?

2012-06-09 Thread Adam Strohl

On 6/9/2012 14:50, O. Hartmann wrote:

Lucky man! We are off from some desktop services (like LibreOffice and
Firefox) for more than a week now!


Why did you update to begin with?  Bug/security fix?

--
Adam Strohl
http://www.ateamsystems.com/
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Why Are You NOT Using FreeBSD ?

2012-06-09 Thread Adam Strohl

On 6/9/2012 21:04, O. Hartmann wrote:


Well, this is a good question. Unfortunately, I did an update of the
ports tree and PNG update rushed in. The information in UPDATING came a
in bit later, but since then several ports have been updated already -
and rendered some applications unuseable.

The question why isn't applicable here. Sometimes ports need updates
or a port that is installed reels in another or even an update and this
triggers the avalnche of messes.



Fair enough, I just feel like people reporting 48 hours of not using 
their computer are doing something extraordinarily weird and I'm just 
at a loss as to what they're doing and why.


I get the feeling people are updating their ports tree and then 
recompiling/reinstalling everything just because and then are 
complaining when one thing breaks (its the only thing I can think of).


--
Adam Strohl
http://www.ateamsystems.com/
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Why Are You NOT Using FreeBSD ?

2012-06-09 Thread Adam Strohl

On 6/9/2012 21:36, H wrote:

why is there an update, would be a little bit better


My point was why do you need the update, and can't wait until its been 
better vetted.  The porters do the best they can but can't test everything.



but a real good question would be, why is there a not working/compiling
update released to the ports tree


Because it was just released and every combination of system 
configuration hasn't been tested, so there is some lag time before it 
stabilizes, especially with complicated software.


There in lies the question -- why do you need to compile a port which 
was just released?   Is it a security thing or is it I want the latest 
?  I'm just curious (and totally uninterested in how this ranks in your 
worse question list).


--
Adam Strohl
http://www.ateamsystems.com/
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Why Are You NOT Using FreeBSD ?

2012-06-08 Thread Adam Strohl

On 6/9/2012 3:34, Steve Franks wrote:

Every time libjpeg or
perl or python bumps the rev, I have to explain to my boss that I
won't be using my computer for 48 hours.


Why is this?  And why are you updating every time there is a rev bump?

It almost sounds like you're recompiling everything just for the heck of 
it, though I don't get how even that takes 48 hours.  Even make 
buildworld is done in multi-user mode and so you could use your 
workstation during the build.  And we're talking about ports here so ...


Just curious!

--
Adam Strohl
http://www.ateamsystems.com/
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Load when idl on stable

2012-06-05 Thread Adam McDougall

On 06/05/12 15:37, Albert Shih wrote:

  Le 03/06/2012 ? 23:55:06+0200, Oliver Pinter a écrit

I think, this is the old thread:
http://freebsd.1045724.n5.nabble.com/High-load-event-idl-td5671431.html


Yes. But because I didn't find any solution, I resent the problem.


The interrupt rerouting does not help?


Well I've no idea what you talking but I try every solution describe in the
thread you mentioned. I didn't find any solution.

Regards.

NB: I forget to say I'm not a developer, just sysadmin. I use Stable just
for report here any problem I got.


Try changing kern.eventtimer.timer:

% sysctl kern.eventtimer.timer=LAPIC

How to display your choices ordered by quality:
% sysctl kern.eventtimer

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Why Are You NOT Using FreeBSD ?

2012-06-03 Thread Adam Strohl

On 6/3/2012 10:09, Mark Linimon wrote:

On Sun, Jun 03, 2012 at 01:43:43AM +0200, Fritz Wuehler wrote:

So there could be lots of overlap and just looking at the two numbers
you posted doesn't really tell the whole story.

No, I agree that it doesn't.  I was just trying to add an aside, and
point out that the task would not be trivial.

Since I'm heavily invested in FreeBSD ports I think I need to step back
and let other folks comment in this thread.


I manage and support a little over 50 FreeBSD servers (VMWare, Xen and 
native) and feel that the port system, on the whole, is excellent.  Its 
easily one of the best features about FreeBSD.   Portaudit reports 
issues and I can plan and upgrade them as needed.  Portupgrade works 
great 99% of the time and when it doesn't it has the good sense to roll 
back what its done.  If there is any question as to what it should do it 
errors and tells me, which is exactly what I want it to do.


I've been a FreeBSD user for about 18 years and supported it 
professionally for about 10.  In this thread I've read a few posts that 
contain blanket statements like ports are broken and never work, I'm 
at a loss as to how to respond to this as it is completely counter to my 
experience.   I wish I could see what they were talking about and figure 
out what happened so I could understand what caused them to make such a 
statement.  It's like they're talking about a different OS than the one 
I know.


I've written a simple script to run portaudit and pop up a dialog with 
check boxes that then kicks off portupgrade for the selected ports which 
have issues.   99% of the time its that simple.  This is what I want in 
a server environment.  I do not want things auto-updating (a.k.a. auto 
breaking) or making decisions about supporting libraries behind my back. 
  PHP is a good and common example why: an upgrade can and does break 
web sites that ran fine before.   Updates need to be managed in a 
process which is outside the scope of the OS (because its a server not a 
desktop).  FreeBSD has all these great tools for managing the mechanical 
action of updating and imposes minimal process which is perfect because 
I have my own process.  And if things get mucked up (which mostly isn't 
the ports system fault when it does happen), its easy to back out and 
re-do if needed.


After reading this thread I am wondering if I should clean the update 
dialog script up and submit to the ports tree.  It seems like people 
think the port update process is harder than it is because it lacks a 
Windows Update like dialog which is essentially what this is akin to 
(and there might be a port which does this already, too .. anyone?).  
All the hard stuff has been done by the FreeBSD team, all I did was put 
a bash/dialog script on it.


I very rarely run into ports that don't build on supported versions of 
FreeBSD (ie; ones that haven't reached EoL).  I have a number of 
customers with a few 6.2 boxes [which I can't wait to upgrade] and still 
almost everything builds without tinkering.


All of this is in the scope of servers though (web, DB, application, 
etc) and not on the desktop.  I haven't used a FreeBSD desktop since 
probably 4.x, and while I don't begrudge the work people are doing for 
the desktop experience it just doesn't apply to me nor is it why I love 
FreeBSD.   I won't say something like you're running a server OS on 
your desktop and expecting it to be like a Mac.  What will say is: I'm 
getting from this thread that a lot of the complaints people have seem 
to be based around the desktop.  My guess is that this is a super 
minority of actual use (by server count).


BUT: I feel like people are judging how fit an FreeBSD is for server 
work by how easy/Mac/Windows/whatever like (as many Linux distros try to 
emulate) it is to update.  Not good ... but it makes sense from a 
social/human perspective, and is probably another thing we should 
consider in terms of advocacy.


I'm interested in what people think about this, and yeah this should 
probably be in the advocacy list but its not so thhblt :P


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Why Are You NOT Using FreeBSD?

2012-06-03 Thread Adam Strohl

On 6/3/2012 11:14, Erich wrote:

What I really do not understand in this whole discussion is very simple. Is it 
just a few people who run into problems like this or is this simply ignored by 
the people who set the strategy for FreeBSD?

I mention since yeares here that putting version numbers onto the port tree 
would solve many of these problems. All I get as an answer is that it is not 
possible.

I think that this should be easily possible with the limitation that older 
versions do not have security fixes. Yes, but of what help is a security fix if 
there is no running port for the fix?


I feel like I'm missing something.  Why would you ever want to go back 
to an old version of the ports tree?  You're ignoring tons of security 
issues!


And if a port build is broken then the maintainer needs to fix it, that 
is the solution.


I must be missing something else here, it just seems like the underlying 
need for this is misguided (and dangerous from a security perspective).

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Why Are You NOT Using FreeBSD ?

2012-06-03 Thread Adam Strohl

On 6/3/2012 17:51, Mehmet Erol Sanliturk wrote:

Always I am stressing that to manage FreeBSD,  a fair amount of expertise
is required which I think this level may be reduced by improving the
FreeBSD management by transferring knowledge to its managing parts ( for
example : package management , repair of broken parts , installation steps
to reach a state like in very easily usable Linux distributions such as
Fedora , Mageia , Mandriva , and many others , etc. )


Yeah or a GUI to reduce the need for knowledge transfer.


You know what to do by your expertise gained over use , which such an
expertise is completely missing in a new comer , and even sometimes in very
highly experienced computer professionals because a different operating
system reduces them to a little experienced new starter .



I agree and your issue with USB sticks proves my point.  I've never 
tried to mount an NTFS USB stick and I'm OK with that.  But for you it 
is a big hassle (understandably so) and it has definitely negatively 
impacted your view of FreeBSD.



Compare the cost of a Linux or Windows and personal time , and make a
decision which one to choose .

Another point frequently mentioned is that FreeBSD is leaned toward servers
.
Only I want to say that , Please , install a CentOS , Debian , or Windows
Server trial , and see how a server may be ...


I manage Windows, CentOS and Debian (and RedHat and a few others) 
servers too.   I've found FreeBSD is more reliable on the whole and 
takes less time to maintain (which means less expensive for my clients). 
 This is one area where FreeBSD shines.  And when things do break it is 
possible to recover fairly easily.  That is another.


And yes, in terms of that initial learning curve my experience helps but 
its the OS that is doing the work here.  If I was more experienced with 
Windows or Linux it wouldn't make them any easier to update, either 
though.  So there is a point at which knowing what to do stops being 
the limiting issue and its just ok well this is broken now and it can't 
be cost-effectively fixed.   That crossover point is something that is 
almost never reached with FreeBSD in my experience.


All of this is completely parallel and unrelated to your (or another 
person's) experience as a desktop user though.  What you see is USB 
thumbdrives don't work :)   So you decide to use another OS, and 
probably wouldn't advocate for FreeBSD if presented the chance in a 
server context because of that experience.  That is a shame in my book. 
(I know I'm putting words in your mouth but its simply to illustrate my 
thinking on how public perception is formed).


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Why Are You NOT Using FreeBSD?

2012-06-03 Thread Adam Strohl



On 6/3/2012 19:24, Erich wrote:

yes, you miss a very simple thing. Updated this morning your ports tree. Your 
client asks for something for Monday morning for which you need now a program 
which needs some kind of PNG but you did not install it.

Do you have a machine that is fast enough to upgrade all your ports and still 
finish what your client needs Monday morning?


All I'd need to do is compile and install the libpng and then compile 
the program.   There is no need to upgrade all my ports.




The ports tree is not broken as such. Only the installation gets broken in some 
sense. Have a version number there would allow people to go back to the last 
known working ports tree, install the software - or whatever has to be done - 
with a working system.

Of course, the next step will be an upgrade. But only after the work which 
brings in the money is done.


I don't understand what you are saying here, sorry.  Or why you'd 
upgrade all your ports to install 1 new one.




You do not face this problem on Windows. You can run a 10 year old 'kernel' and 
still install modern software.


Not true at all.  Lots of Windows software requires minimum service pack 
and KB patch levels.




Erich

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Why Are You Using FreeBSD?

2012-06-01 Thread Adam Strohl

On 6/1/2012 17:19, Katinka wrote:
There's a nice discussion going on, over at Phoronix. 
http://phoronix.com/forums/showthread.php?71263
For some reason, they don't seem to like us very much. 


Lots of the comments remind me about Linux vs. Windows in the late 90s, 
and taken with a grain of salt are fairly amusing because of how 
ignorant a lot of them are.


I found this particularly fitting comment at the very end:

If you'd ask me for the biggest difference between Linux and BSD users: 
We know all about Linux - They know nothing about BSD. 


Which is sad really, their lives could be so much easier if only they 
knew how much better it could be ;D  (My opinion of course, I'm sure 
lots of people think Windows Server administration is easier than any 
UNIX -- just not on this list).  To each their own, and arguing about it 
is counter-productive.


I do think that forum post underscores the need for advocacy though -- 
we need to get the message out as to why FreeBSD is better than any OS 
in a lot of applications (which is different than arguing it out on 
Linux forums).  We need them to try it out and expose them to the things 
that make it great so they see it first hand.  Because it is clear most 
of these posters are very ignorant about FreeBSD -- that's really our 
collective fault.


Trolls and fanbois aside there is probably a huge number of Linux admins 
out there who just use it because that is what they use .. in the same 
way that Windows admins in the 90s hadn't really heard of Linux and 
feared it because they didn't understand it.


My 2 cents + attempt at keeping this thread constructive  I think 
I'm going to go sign up for the FreeBSD-advocacy list now ...

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Why Are You Using FreeBSD?

2012-06-01 Thread Adam Strohl

On 6/1/2012 18:03, Jason Leschnik wrote:

I may be totally incorrect with my above ideas, but it's what i would
like to see from FreeBSD *again*... This is the reason in the first
place most people used FreeBSD, stability/scalability/performance are
the hallmarks of FreeBSD. If we have these hard hitting numbers
released frequently it gives the dev team a good indication of how
changes reflect on performance.


This is a good point and the kind of stuff that would make a, for 
example, great Slashdot post once finished.


Of course there would be arguments but I think it would be good 
exposure.  It certainly would be nice to have a place to point to these 
things vs. just saying its more better and stabler, too.  And if its 
not at least its acknowledged so it can be fixed.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Why Are You Using FreeBSD?

2012-05-31 Thread Adam Strohl

On 5/31/2012 21:22, Damien Fleuriot wrote:

On 5/31/12 4:01 PM, Jim Ohlstein wrote:

To add others, in no particular order:

Ease of upgrade. While some have noted that binary upgrades are easier
on Debian, it's far and away superior, IMMHO, to have a locally compiled
system. Many Linux distros have no upgrade path short of a wipe and
re-install.


Far superior, check, FAR MORE TIME CONSUMING, check as well !


This brings up another point: Repair is always possible with FreeBSD.

You can back out all packages or types of packages easily (and 
re-compile or reinstall them if needed).  You can recompile/reinstall 
the OS if needed (somewhere else too and copy it over).  Or just copy 
pieces from a live cd or restore tarball.  And it's pretty 
straightforward to do even for a non-admin person.


You can even restore over a live running system with tar, which I do 
occasionally when cloning machines or restoring them with dump/restore.  
Very slick.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Why Are You Using FreeBSD?

2012-05-31 Thread Adam Strohl


On 5/31/2012 21:47, Damien Fleuriot wrote:

Regarding packages, I've never really explored it, would you detail a bit ?


Well, I really mean the resulting pkg info from a port.  A good example 
is PHP, sometimes you have to say everyone out of the pool because of 
an upgrade:


cd /var/db/pkg  PKGS=`ls | egrep ^(php|pear|pecl)`; for PKG in 
$PKGS; do echo  $PKG; pkg_delete $PKG; done;


Running that a few times until it stops picking things up, then its a 
few commands to re-install PHP and its extensions (because of the 
extensions roll-up port).


You can of course script it further, which is part of why I like FreeBSD 
so much.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Why Are You Using FreeBSD?

2012-05-30 Thread Adam Strohl

On 5/31/2012 1:20, David Chisnall wrote:

I am currently looking at updating some of our advocacy material (which 
advertises exciting new features like SMP support), and before I do I'd like to 
get a better feel for why the rest of you are using FreeBSD.  If you had to 
list the three things you most like about FreeBSD, which would you pick?  Are 
they the same as when you first started using it?


1. High performance with security and stability focus -- truly makes it 
the ideal server platform

2. The ports system (and supporting tools like portupgrade, portaudit, etc)
3. The OS makes sense (as Chris N. mentioned).  The file system 
layout, tools, etc are consistent.


There is so much other stuff too.  Like PF and CARP, ZFS and more ...  a 
kick-ass combo of features and very server-focused.


As a professional admin FreeBSD is a pleasure to work with day in and 
day out.  I've never heard a admins of other OSes say that :P


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Boot hangs on v9 system at CD device probe

2012-05-30 Thread Adam Vande More
On Wed, May 30, 2012 at 4:54 PM, Kevin Oberman kob6...@gmail.com wrote:

 I sent a note about this a couple of weeks ago, but have not heard
 anything. I'm really getting a bit desperate.

 I have a system that I am trying to upgrade from 8.2 to 9.0. I have
 built it and installed the kernel, but it fails to boot. The boot
 freezes after probing for my hard drives during the probe of the
 CDROM. It just sits there, seemingly forever, though I have never
 waited longer then a few minutes.

 The system is a SuperMicro C25BX mother board. The DVD is PATA,
 reported on boot of 8-Stable as:
 acd0: DVDR ATAPI DVD A DH20A4P/9P59 at ata2-master UDMA66

 If I unplug the CDROM, it boots fine, but I really need the device on
 the system, so I really can't leave it unplugged. Also, after the 9
 kernel is installed, my Mk file have been updated so that I can't
 build some ports if I boot the 8.2 kernel. Does anyone remember this
 being reported by others? It was most likely on current, as it was
 probably prior to the release of 9. I googled around, but could not
 find it.

 I'd really appreciate it if anyone can point me toward a solution.


http://lists.freebsd.org/pipermail/freebsd-current/2010-October/020336.html

-- 
Adam Vande More
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: 9.0-RELEASE amd64: No Boot on VMWare Workstation

2012-05-16 Thread Adam Strohl

On 5/16/2012 8:12, Larry Rosenman wrote:

Ok, I'm just impatient.  I let it sit, and it eventually came up.

Would it be possible for the next 9.x release to set hw.memtest.tests=0
when we discover we're under a hypervisor to avoid doing the tests? (or
default it to 0 in the installer kernel?)?



FWIW this seems odd/unique to your setup.

I see no such delay under any VMWare product, though I have not yet 
upgraded to Workstation 8.0.3 from 8.0.0.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: 9.0-RELEASE amd64: No Boot on VMWare Workstation

2012-05-16 Thread Adam Strohl

On 5/16/2012 22:05, Larry Rosenman wrote:

I believe this is due to the 8G of memory I put on it. (I like to build
big VM's.

It's directly proportional to the size of the VM.



Ahh!  Yeah I rarely build a VM with more than a gig or two here in the 
office (ie; where I use Workstation).

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: 9.0-RELEASE amd64: No Boot on VMWare Workstation

2012-05-15 Thread Adam Strohl

On 5/14/2012 22:18, Larry Rosenman wrote:

Is there a known issue with 9.0-RELEASE on amd64 VMWare Workstation?


Since nobody has chimed in I felt I should:

I use FreeBSD 9.0 routinely under VMWare Workstation without issue (my 
current VMWare Workstation version is 8.0.0 which is slightly out of 
date, 8.0.3 is available).



I tried to build a new VM on my new Lenovo W520 Laptop (Windows 7
Pro/64-Bit, 16G ram) and it gets to the Beastie menu, and times out, then
dies.

Any ideas?

What can I provide?


The version of VMWare Workstation you are running.


And, is there an issue with 9 in general, or could I install 8.3 and then
source update it to 9 or 10?


I build test VMs for 9 and do test upgrades from 6.x, 7.x and 8.x to 9 
using VMWare Workstation on my desktop and laptop without issue.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: 9.0-RELEASE amd64: No Boot on VMWare Workstation

2012-05-15 Thread Adam Strohl

On 5/15/2012 21:49, Larry Rosenman wrote:

This is VMWare Workstation 8.0.3 booting off the release ISO.

Ideas?


Is this the installer that doesn't boot or is it the OS after you've 
installed?


If its the former you might just have a bad ISO download. Have/did you 
verified the checksum of the ISO?

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: superpages not solving PV entries limit warning

2012-05-10 Thread Adam Vande More
On Wed, May 9, 2012 at 12:55 PM, Charles Owens
cow...@greatbaysoftware.comwrote:

 Hi fellow BSD-types,

 I have a buy system that forks lots of processes and I see repeatedly the
 message:  Approaching the limit on PV entries, consider increasing either
 the vm.pmap.shpgperproc or the vm.pmap.pv_entry_max tunable.

 System details:

  * 8.1-RELEASE-p2 i386 PAE kernel
  * 6 GB RAM


The warning is not applicable any longer including your version as well as
several previous ones.  The warning has been removed from current releases.

-- 
Adam Vande More
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: FreeBSD 9 gptboot: invalid backup GPT header error (boots fine though)

2012-05-09 Thread Adam Strohl

On 5/2/2012 23:08, Andrey V. Elsukov wrote:

On 02.05.2012 17:53, Adam Strohl wrote:

% gpart recover da0


Good thought, but no dice:

$ gpart recover da0
da0 recovering is not needed


I already saw several reports about gptboot's complains on 3ware
controllers, but don't know what is the problem.
The only guess is that a controller incorrectly handles BIOS requests,
when gptboot tries to read GPT header from the end of a large virtual disk.



Thanks for your input on this Andrey.  Just to clarify I am assuming 
that da0 recovering is not needed means that gpart has no problem 
reading and verifying the backup GPT header?


(which is why its probably the BIOS for the RAID controller as the GPT 
is actually intact)

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: FreeBSD 9 gptboot: invalid backup GPT header error (boots fine though)

2012-05-02 Thread Adam Strohl

Thanks Andrey,

I've just recompiled /boot/gptboot after updating gpt.c and installed it 
via:


gpart bootcode -p /boot/gptboot -i 1 da0

I still see gptboot: invalid backup GPT header on boot (but it does 
still boot).


On 5/2/2012 12:58, Andrey V. Elsukov wrote:

On 30.04.2012 23:14, Adam Strohl wrote:

da0 at tws0 bus 0 scbus0 target 0 lun 0
da0:LSI 9750-8iDISK 5.12  Fixed Direct Access SCSI-5 device
da0: 6000.000MB/s transfers
da0: 2860992MB (5859311616 512 byte sectors: 255H 63S/T 364725C)


Let me know anyone wants to see anything else/has seen this/has any theories!


Can you try patch from the r234693, update and reinstall gptboot, does it help?
http://svnweb.freebsd.org/base?view=revisionrevision=234693


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: FreeBSD 9 gptboot: invalid backup GPT header error (boots fine though)

2012-05-02 Thread Adam Strohl


On 5/2/2012 20:46, Mark Saad wrote:
Did you try to repair the header ? I saw a similar issue on upgraded 
boxes that were 7-STABLE upgraded to 9-STABLE. and recovering made the 
warning go away . I may be way off here but just my 2 cents .


% gpart recover da0 


Good thought, but no dice:

$ gpart recover da0
da0 recovering is not needed
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


FreeBSD 9 gptboot: invalid backup GPT header error (boots fine though)

2012-04-30 Thread Adam Strohl
I've been deploying FreeBSD 9 without issue on a number of 
near-identical servers for a client, but have run into an interesting 
annoyance when I hit the two DB servers.


These DB servers have an LSI 3ware 9750-8i (running a 6 disk RAID 10 in 
a single 3TB virtual volume) which puts them apart from the other two 
servers in this cluster (which don't show either issue I am about to 
discuss).  Otherwise the hardware is identical (Dual Xeon E5620s, 16GB 
RAM).  I've also never seen this before on other physical (or VM) 
FreeBSD 9 instances and I've probably done 50+ FreeBSD 9 VM and physical 
installs at this point (and run through the installer process probably 
over 150 times :P).


Before I get into the GPT error, I want to mention this in case its 
relevant:


I found I had to partition via the shell (gpart create/gpart add/etc 
etc) the disks during install or the kernel would fail to re-mount the 
root disk after booting into the new OS.   If I used the default layout, 
or the partition GUI at all (ie; 'manual mode') the new OS wouldn't 
remount root on boot.


I could manually specify the proper root device ie; ufs:/dev/da0p3 and 
continue booting without issue, so this is an installer thing.   I'm 
sure I could have fixed this in /boot/loader.conf or similar but wanted 
to try to figure out what was breaking (now I know its something the 
installer is doing since it doesn't happen when I do it manually).  So I 
kept reOSing it doing different things and ultimately found shell-based 
manual partitioning worked fine.


However, I see the following error right before BTX comes up (and did 
previously when using the installer's partition GUI):


gptboot: invalid backup GPT header

The machine boots fine, so I'm not stuck  but it is an annoyance for 
an A-type sysadmin like myself.  Even if its superficial I dislike 
setting up a client's machine to generate errors on boot, especially 
without an explanation or understanding behind it.   I also obviously 
wanted to raise the issue here in case there is actually a rare problem 
or this is a symptom of one.


I could find nothing that related specifically to this issue, so I was 
wondering if anyone else had seen this or had thoughts.


My suspicion is that maybe the large size of the volume (3TB or 2.7TB 
formatted) makes it too large for the boot loader to address all of 
and thus can't get to the end of the disk where the backup GPT header is 
to validate it..


Or maybe the RAID adapter is doing something weird at the end of the 
disk.  This seems unlikely since it presents the RAID as a single volume 
so I'd assume it would hide any tagging or RAID meta data from the OS' 
virtual volume though.


That's about all I can think of.

Selected dmesg output:
LSI 3ware device driver for SAS/SATA storage controllers, version: 
10.80.00.003
tws0: LSI 3ware SAS/SATA Storage Controller port 0x1000-0x10ff mem 
0xb194-0xb1943fff,0xb190-0xb193 irq 32 at device 0.0 on pci4

tws0: Using legacy INTx
tws0: Controller details: Model 9750-8i, 8 Phys, Firmware FH9X 
5.12.00.007, BIOS BE9X 5.11.00.006


da0 at tws0 bus 0 scbus0 target 0 lun 0
da0: LSI 9750-8iDISK 5.12 Fixed Direct Access SCSI-5 device
da0: 6000.000MB/s transfers
da0: 2860992MB (5859311616 512 byte sectors: 255H 63S/T 364725C)


Let me know anyone wants to see anything else/has seen this/has any 
theories!


--

Adam Strohl
A-Team Systems
http://ateamsystems.com/

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: VirtualBox problem booting FreeBSD-9.0-RELEASE-amd64-bootonly.iso

2012-04-16 Thread Adam Vande More
On Mon, Apr 16, 2012 at 10:30 PM, Craig Rodrigues rodr...@crodrigues.orgwrote:

 After exchanging a few e-mails with Bernard, I think I understand the
 problem.
 According to the VirtualBox manual:

 http://www.virtualbox.org/manual/ch03.html#intro-64bitguests

 The CPU needs to support hardware-assisted virtualization in order for
 VirtualBox to support 64-bit guests.
 Without the CPU support, VirtualBox cannot do it.  I double checked
 this with CoreInfo.exe from Windows Sysinternals utilities,
 and also from the Intel data sheet for my CPU:

 http://ark.intel.com/products/30787
 (VT-x option not supported)


 I will use QEMU under Windows until I can get a better machine. :)


I believe a Windows 7 product key will activate either 32 or 64 install, so
unless you have a specific reason to install 64 bit then you are likely
better using 32 anyway.  32 bit OS's and apps use less memory than their
corresponding 64 bit counterparts.



-- 
Adam Vande More
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Time Clock Stops in FreeBSD 9.0 guest running under ESXi 5.0

2012-03-19 Thread Adam Strohl

On 3/12/2012 0:01, Ian Lepore wrote:
 It seems unlikely to me that ntpd and the vm tools would be fighting in
 a way that caused this symptom.  The way ntpd affects timing is to step
 the clock (which gets logged), or to numerically steer the kernel's
 timekeeping routines.  The steering is clamped at 500 ppm; to make the
 clock appear to stop it would have to steer at 1e6 ppm.  I've always
 assumed that VM guest services daemons that handle timekeeping use the
 same ntp_adjtime() interface to the kernel timekeeping that ntpd itself
 uses, so the same steering limits would apply.

An excellent point.


 If it happens again, interesting data might be found in the output of:

sysctl kern.timecounter
sysctl kern.eventtimer
vmstat -i
ntpdc -c kerninfo
anything unusual in dmesg output

Will do, I know there was nothing in dmesg, I will definitely check all 
of this though if/when it happens again.  I just brought up another ESXi 
5.0 host with FreeBSD 9.0 VMs (created from dump/restore from the 
existing ones), so there is an increased chance of me seeing this 
hopefully and getting to the bottom of it.  Or it never happens again :P



On 3/19/2012 1:36, Steve Wills wrote:

I've experienced something similar once or twice with ESXi 5.0. The
second time it happened, I found that kern.timecounter.tc.HPET.counter
stopped changing. I was told on IRC that this indicated a hardware
problem, which I took to indicate a possible bug in ESXi. I haven't
upgraded to ESXi 5.0 Update 1 yet to see if that changes anything.
Rebooting of course fixed it, it has been a while since this happened
and it hasn't happened again since so I haven't pursued it. Just another
data point, hope it hopes.


Thanks for the info!  I didn't realize there was an update out already 
for 5.0 (I don't see it on VMWare's site).

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Time Clock Stops in FreeBSD 9.0 guest running under ESXi 5.0

2012-03-10 Thread Adam Strohl
I've now seen this on two different VMs on two different ESXi servers 
(Xeon based hosts but different hardware otherwise and at different 
facilities):


Everything runs fine for weeks then (seemingly) suddenly/randomly the 
clock STOPS.  In the first case I saw a jump backwards of about 15 
minutes (and then a 'freeze' of the clock).  The second time just 'time 
standing still' with no backwards jump.  Logging accuracy is of course 
questionable given the nature of the issue, but nothing really jumps out 
(ie; I don't see NTPd adjusting the time just before this happens or 
anything like that).


Naturally the clock stopping causes major issues, but the machine does 
technically stay running.  My open sessions respond, but anything that 
relies on time moving forward hangs.  I can't even gracefully reboot it 
because shutdown/etc all rely on time moving forward (heh).


So I'm not sure if this is a VMWare/ESXi issue or a FreeBSD issue, or 
some kind of interaction between the two.   I manage lots of VMWare 
based FreeBSD VMs, but these are the only ESXi 5.0 servers and the only 
FreeBSD 9.0 VMs.  I have never seen anything quite like this before, and 
last night as I mentioned above I had it happen for the second time on a 
different VM + ESXi server combo so I'm not thinking its a fluke 
anymore.  I've looked for other reports of this both in VMWare and 
FreeBSD contexts and not seeing anything.


What is interesting is that the 2 servers that have shown this issue 
perform similar tasks, which are different from the other VMs which have 
not shown this issue (yet).  This is 2 VMs out of a dozen VMs spread 
over two ESXi servers on different coasts.  This might be a coincidence 
but seems suspicious. These two VMs run these services (where as the 
other VMs don't):


- BIND
- CouchDB
- MySQL
- NFS server
- Dovecot 2.x

I would also say that these two VMs probably are the most active, have 
the most RAM and consume the most CPU because of what they do (vs. the 
others).


I have disabled NTPd since I am running the OpenVM Tools (which I 
believe should be keeping the time in sync with the ESXi host, which 
itself uses NTP), my only guess is maybe there is some kind of collision 
where NTPd and OpenVMTools were adjusting the time at the same time.  
I'm playing the waiting game now to see what this brings (again though I 
am running NTPd and OpenVMTools on all the other VMs which have yet to 
show this issue).


Anyone seen anything like this?  Ring any bells?

--

Adam Strohl
A-Team Systems
http://ateamsystems.com/

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Time Clock Stops in FreeBSD 9.0 guest running under ESXi 5.0

2012-03-10 Thread Adam Strohl

On 3/10/2012 17:10, Bjoern A. Zeeb wrote:

On 10. Mar 2012, at 08:07 , Adam Strohl wrote:


I've now seen this on two different VMs on two different ESXi servers (Xeon 
based hosts but different hardware otherwise and at different facilities):

Everything runs fine for weeks then (seemingly) suddenly/randomly the clock 
STOPS.


Apart from the ntp vs. openvm-tools thing, do you have an idea what for weeks 
 means in more detail?  Can you check based on last/daily mails/.. how many days it was 
since last reboot to a) see if it's close to a integer wrap-around or b) to give anyone 
who wants to reproduce this maybe a clue on how long they'll have to wait?  For that 
matter, is it a stock 9.0 or your own kernel?  What other modules are loaded?


Uptime was 31 days on the first incident / server (occurred 5 days ago)
Uptime was 4 days on the second incident / server (occurred last night)

One additional unique factor I just thought of: the two problem VMs have 
4 cores allocated to them inside ESXi, while the rest have 2 cores.


Kernel config is a copy of GENERIC (amd64) with the following lines 
added to the bottom.  All the VMs use this same kernel which I compiled 
once and then installed via NFS on the rest:


# -- Add Support for nicer console
#
options VESA
options SC_PIXEL_MODE

# -- IPFW support
#
options IPFIREWALL
options IPFIREWALL_VERBOSE
options IPFIREWALL_VERBOSE_LIMIT=10
options IPDIVERT
options IPFIREWALL_FORWARD
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Request for flowtable testers and actionable feedback RE: flowtable usable or not

2012-03-05 Thread Adam Strohl

On 3/5/2012 15:00, Daniel Kalchev wrote:

I happen to share the opinion and the experience of Mark Linimon in situations 
like this and yes, I do believe you have been rude here. For no reason 
whatsoever.


I agree.  This H person has been hijacking threads over the last week 
or so, and all of the messages I've seen from them boil down trolling.


This is in contrast to the patient, well thought out replies from the 
rest of the list.


I'm at a loss as to what H's endgame is, but it probably has more to 
do with writing poorly executed metaphors than it does with helping 
FreeBSD or its users (whom he/she implies they represent).

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


  1   2   3   4   >