On Wed, Mar 27, 2013 at 2:44 PM, Andrew Deason <adea...@sinenomine.net> wrote:
> On Tue, 26 Mar 2013 20:04:15 -0400 (EDT)
> Prasad Dharmasena <p...@glue.umd.edu> wrote:
>
>> The vicep* partitions (or volsets), for which the backup dump/butc
>> hang, are not consistent.  If we kill and restart the dump process,
>> some of the previously hung volsets finish while others hang.
>>
>> What info do we need to grab from butc and buserver in order to
>> track the problem?
>
> I assume there's nothing helpful in BackupLog?
>
> I haven't worked with butc/buserver for a long time, so I don't remember
> if there are ways to get more information out of them specifically.
> However, just going by what works in general:
>
> One pretty surefire way of getting to know what's happening is to grab a
> core from the butc and buserver processes while they are hanging ('gcore
> <pid>'). You'll need a developer to look at that to say what's going on,
> and those cores will contain sensitive information. But if there is
> someone you trust enough with it, that will let you know what's
> happening.

there won't be anything sensitive in just a stack trace, and on solaris that
is easily generated with just running pstack and could be shared
with impunity. it may or may not be enough information but it's
certainly a place
to start.

-- 
Derrick
_______________________________________________
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info

Reply via email to