Re: [OmniOS-discuss] r151012 nlockmgr fails to start

2014-10-06 Thread Kevin Swab
I had this same problem after upgrading a system to r151012.  nlockmgr
failed to start because "svc:/network/nfs/status:default" was disabled.
 I enabled that service, then nlockmgr was happy.

I've done 3 other upgrades to r151012 and none of them had a problem
with nlockmgr...

Kevin

On 10/06/2014 09:56 AM, Schweiss, Chip wrote:
> 
> 
> On Mon, Oct 6, 2014 at 9:59 AM, Dan McDonald  > wrote:
> 
> 
> On Oct 6, 2014, at 10:41 AM, Schweiss, Chip  > wrote:
> 
> >
> > Anyone else seeing this in r151012?
> >
> > Any tips on collecting better information on this would be appreciated.
> 
> I saw this in once in 012, but not as persistently as you have.  It
> cleared up for me with a single reboot, but that may have just been
> because I got lucky w.r.t. statd.
> 
> I take it "svcadm disable nlockmgr ; svcadm enable nlockmgr" doesn't
> help?
> 
> 
> I didn't think to try that.  Tried rebooting, but that didn't help.  I
> was already past my maintenance window, so I resorted to backing out to
> r151010.
> 
>  
> 
> 
> And as for output, I've seen in /var/adm/messages what you have, per
> the cited illumos bug (4518).
> 
> 
> 
> I wasn't aware of any new NFS changes that may have caused this to
> become exaggerated in r151012.  But, it's not clear on what is
> triggering this bug either.   On my larger pool systems with lots of NFS
> exports, almost every reboot causes this and a 'svcadm clear' fixes it.  
> 
> Is there something different that a disable and enable does?  I can
> likely get a small window in the next few days to give it a try.  
> 
> -Chip
> 
> 
> ___
> OmniOS-discuss mailing list
> OmniOS-discuss@lists.omniti.com
> http://lists.omniti.com/mailman/listinfo/omnios-discuss
> 

-- 
---
Kevin Swab  UNIX Systems Administrator
ACNSColorado State University
Phone: (970)491-6572Email: kevin.s...@colostate.edu
GPG Fingerprint: 7026 3F66 A970 67BD 6F17  8EB8 8A7D 142F 2392 791C
___
OmniOS-discuss mailing list
OmniOS-discuss@lists.omniti.com
http://lists.omniti.com/mailman/listinfo/omnios-discuss


Re: [OmniOS-discuss] replace disk with difrferent model/size

2014-10-06 Thread Fábio Rabelo
Thanks a lot !

This user intent to replace the entire pool, so it is the best to change it
.

But I have no idea what I have to do so ...

Point me to a doc with this info and I will read and apply .

Many thanks again 


Fábio Rabelo

2014-10-06 11:48 GMT-03:00 Schweiss, Chip :

> Looks like you are trying to add a 4K disk to pool with ashift=9.
>
> If the disk is a 512e disk you can force it to ashift=9 by adding an entry
> in sd.conf.  This will degrade write performance.   It's best to move the
> whole pool to ashift=12 (4K sectors) or find a 512b native disk.
>
> -Chip
>
> On Mon, Oct 6, 2014 at 6:46 AM, Fábio Rabelo 
> wrote:
>
>> Hi to all
>>
>> I have a system with failed hard disk .
>>
>> When I try to replace it, after 3 to 5 seconds "thinking"  the system
>> returns a msg like
>>
>> geometry mismatch error
>>
>> So, how can I replace a failed disk with another model or size ?
>>
>> This system uses 2 TB Hard disks, at first, I've installed 4 TB one,
>> after this geometry error I get a try with 2 TB model, same result .
>>
>>
>> Fábio Rabelo
>> ___
>> OmniOS-discuss mailing list
>> OmniOS-discuss@lists.omniti.com
>> http://lists.omniti.com/mailman/listinfo/omnios-discuss
>>
>
>
___
OmniOS-discuss mailing list
OmniOS-discuss@lists.omniti.com
http://lists.omniti.com/mailman/listinfo/omnios-discuss


Re: [OmniOS-discuss] r151012 nlockmgr fails to start

2014-10-06 Thread Schweiss, Chip
On Mon, Oct 6, 2014 at 9:59 AM, Dan McDonald  wrote:

>
> On Oct 6, 2014, at 10:41 AM, Schweiss, Chip  wrote:
>
> >
> > Anyone else seeing this in r151012?
> >
> > Any tips on collecting better information on this would be appreciated.
>
> I saw this in once in 012, but not as persistently as you have.  It
> cleared up for me with a single reboot, but that may have just been because
> I got lucky w.r.t. statd.
>
> I take it "svcadm disable nlockmgr ; svcadm enable nlockmgr" doesn't help?
>

I didn't think to try that.  Tried rebooting, but that didn't help.  I was
already past my maintenance window, so I resorted to backing out to r151010.



>
> And as for output, I've seen in /var/adm/messages what you have, per the
> cited illumos bug (4518).
>


I wasn't aware of any new NFS changes that may have caused this to become
exaggerated in r151012.  But, it's not clear on what is triggering this bug
either.   On my larger pool systems with lots of NFS exports, almost every
reboot causes this and a 'svcadm clear' fixes it.

Is there something different that a disable and enable does?  I can likely
get a small window in the next few days to give it a try.

-Chip
___
OmniOS-discuss mailing list
OmniOS-discuss@lists.omniti.com
http://lists.omniti.com/mailman/listinfo/omnios-discuss


Re: [OmniOS-discuss] r151012 nlockmgr fails to start

2014-10-06 Thread Dan McDonald

On Oct 6, 2014, at 10:41 AM, Schweiss, Chip  wrote:

> 
> Anyone else seeing this in r151012?  
> 
> Any tips on collecting better information on this would be appreciated.

I saw this in once in 012, but not as persistently as you have.  It cleared up 
for me with a single reboot, but that may have just been because I got lucky 
w.r.t. statd.

I take it "svcadm disable nlockmgr ; svcadm enable nlockmgr" doesn't help?

And as for output, I've seen in /var/adm/messages what you have, per the cited 
illumos bug (4518).

Dan

___
OmniOS-discuss mailing list
OmniOS-discuss@lists.omniti.com
http://lists.omniti.com/mailman/listinfo/omnios-discuss


Re: [OmniOS-discuss] replace disk with difrferent model/size

2014-10-06 Thread Schweiss, Chip
Looks like you are trying to add a 4K disk to pool with ashift=9.

If the disk is a 512e disk you can force it to ashift=9 by adding an entry
in sd.conf.  This will degrade write performance.   It's best to move the
whole pool to ashift=12 (4K sectors) or find a 512b native disk.

-Chip

On Mon, Oct 6, 2014 at 6:46 AM, Fábio Rabelo 
wrote:

> Hi to all
>
> I have a system with failed hard disk .
>
> When I try to replace it, after 3 to 5 seconds "thinking"  the system
> returns a msg like
>
> geometry mismatch error
>
> So, how can I replace a failed disk with another model or size ?
>
> This system uses 2 TB Hard disks, at first, I've installed 4 TB one,
> after this geometry error I get a try with 2 TB model, same result .
>
>
> Fábio Rabelo
> ___
> OmniOS-discuss mailing list
> OmniOS-discuss@lists.omniti.com
> http://lists.omniti.com/mailman/listinfo/omnios-discuss
>
___
OmniOS-discuss mailing list
OmniOS-discuss@lists.omniti.com
http://lists.omniti.com/mailman/listinfo/omnios-discuss


[OmniOS-discuss] r151012 nlockmgr fails to start

2014-10-06 Thread Schweiss, Chip
I've started the process of moving my systems to r151012.  The first
production system I did this on nlockmgr repeatedly went to maintenance
mode.  Several attempts to clear it failed.

I've restarted on the previous boot environment and it starts fine on
r151010.

I think this is related to https://www.illumos.org/issues/4518, but I'm not
sure if this is a new bug.

This system has only 3 zfs folders, all NFS shared.

Unfortunately this being a production system, I cannot dedicate much
downtime to diagnosing this problem.   I mounted the r151012 be, to a
temporary mount point so I could examine logs more closely.
Unfortunately, not much detail there:

root@hcp-iops1:/tmp/omnios-r151012/var/svc/log# cat
network-nfs-nlockmgr\:default.log

...old lines deleted

[ Oct  6 07:05:26 Disabled. ]
[ Oct  6 07:05:48 Enabled. ]
[ Oct  6 07:05:48 Executing start method ("/lib/svc/method/nlockmgr"). ]
[ Oct  6 07:06:17 Method "start" exited with status 1. ]
[ Oct  6 07:06:17 Executing start method ("/lib/svc/method/nlockmgr"). ]
[ Oct  6 07:06:18 Method "start" exited with status 0. ]
[ Oct  6 07:14:06 Disabled. ]
[ Oct  6 07:14:35 Enabled. ]
[ Oct  6 07:14:35 Executing start method ("/lib/svc/method/nlockmgr"). ]
[ Oct  6 07:15:00 Method "start" exited with status 1. ]
[ Oct  6 07:15:00 Executing start method ("/lib/svc/method/nlockmgr"). ]
[ Oct  6 07:15:25 Method "start" exited with status 1. ]
[ Oct  6 07:15:25 Executing start method ("/lib/svc/method/nlockmgr"). ]
[ Oct  6 07:15:50 Method "start" exited with status 1. ]
[ Oct  6 07:18:11 Leaving maintenance because clear requested. ]
[ Oct  6 07:18:11 Enabled. ]
[ Oct  6 07:18:11 Executing start method ("/lib/svc/method/nlockmgr"). ]
[ Oct  6 07:18:36 Method "start" exited with status 1. ]
[ Oct  6 07:18:45 Leaving maintenance because clear requested. ]
[ Oct  6 07:18:45 Enabled. ]
[ Oct  6 07:18:45 Executing start method ("/lib/svc/method/nlockmgr"). ]
[ Oct  6 07:19:10 Method "start" exited with status 1. ]
[ Oct  6 07:19:17 Leaving maintenance because clear requested. ]
[ Oct  6 07:19:17 Enabled. ]
[ Oct  6 07:19:17 Executing start method ("/lib/svc/method/nlockmgr"). ]
[ Oct  6 07:19:42 Method "start" exited with status 1. ]
[ Oct  6 07:20:09 Leaving maintenance because clear requested. ]
[ Oct  6 07:20:09 Enabled. ]
[ Oct  6 07:20:09 Executing start method ("/lib/svc/method/nlockmgr"). ]
[ Oct  6 07:20:34 Method "start" exited with status 1. ]
[ Oct  6 07:21:23 Disabled. ]
[ Oct  6 07:22:36 Enabled. ]
[ Oct  6 07:22:36 Executing start method ("/lib/svc/method/nlockmgr"). ]
[ Oct  6 07:23:06 Method "start" exited with status 1. ]
[ Oct  6 07:23:06 Executing start method ("/lib/svc/method/nlockmgr"). ]
[ Oct  6 07:23:31 Method "start" exited with status 1. ]
[ Oct  6 07:23:31 Executing start method ("/lib/svc/method/nlockmgr"). ]
[ Oct  6 07:23:36 Method "start" exited with status 0. ]

Anyone else seeing this in r151012?

Any tips on collecting better information on this would be appreciated.

-Chip
___
OmniOS-discuss mailing list
OmniOS-discuss@lists.omniti.com
http://lists.omniti.com/mailman/listinfo/omnios-discuss


[OmniOS-discuss] replace disk with difrferent model/size

2014-10-06 Thread Fábio Rabelo
Hi to all

I have a system with failed hard disk .

When I try to replace it, after 3 to 5 seconds "thinking"  the system
returns a msg like

geometry mismatch error

So, how can I replace a failed disk with another model or size ?

This system uses 2 TB Hard disks, at first, I've installed 4 TB one,
after this geometry error I get a try with 2 TB model, same result .


Fábio Rabelo
___
OmniOS-discuss mailing list
OmniOS-discuss@lists.omniti.com
http://lists.omniti.com/mailman/listinfo/omnios-discuss