Bug#902117: [Debian-ha-maintainers] Bug#902117: corosync-qdevice will not daemonize/run

2019-08-09 Thread Nickle, Richard
I'm seeing the same behavior in Ubuntu 18.04.

What looks like early termination, and no debug output.

The program is running though:

$ sudo strace corosync-qdevice -df

[...]

openat(AT_FDCWD, "/dev/shm/qb-votequorum-event-12248-19618-30-header",
O_RDWR) = 9
ftruncate(9, 8248)  = 0
mmap(NULL, 8248, PROT_READ|PROT_WRITE, MAP_SHARED, 9, 0) = 0x7f22de722000
openat(AT_FDCWD, "/dev/shm/qb-votequorum-event-12248-19618-30-data",
O_RDWR) = 10
ftruncate(10, 1052672)  = 0
getpid()= 19618
sendto(11, "<30>Aug  9 11:06:16 corosync-qde"..., 102, MSG_NOSIGNAL, NULL,
0) = 102
mmap(NULL, 2105344, PROT_NONE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) =
0x7f22dac82000
mmap(0x7f22dac82000, 1052672, PROT_READ|PROT_WRITE, MAP_SHARED|MAP_FIXED,
10, 0) = 0x7f22dac82000
mmap(0x7f22dad83000, 1052672, PROT_READ|PROT_WRITE, MAP_SHARED|MAP_FIXED,
10, 0) = 0x7f22dad83000
close(10)   = 0
close(9)= 0
sendto(8, "\20", 1, MSG_NOSIGNAL, NULL, 0) = 1
exit_group(1)   = ?
+++ exited with 1 +++


Bug#902117: [Debian-ha-maintainers] Bug#902117: corosync-qdevice will not daemonize/run

2018-08-13 Thread Ferenc Wágner
Control: severity -1 normal
Control: tag -1 + upstream

Valentin Vidic  writes:

> On Fri, Jul 06, 2018 at 12:50:42PM +0200, Ferenc Wágner wrote:
>
>> Thanks for the report.  I've been pretty busy with other tasks, but I'll
>> check this out as soon as possible, your report isn't forgotten.  I ask
>> for you patience till then.
>
> Feri, you still want to check this or should we close this issue?

Hi Valentin,

Sorry for the late response.  I think that "corosync-qdevice -f -d"
being completely silent is unexpected, thus a usability problem.  There
really should be a hint about this in the man page in my opinion.
-- 
Regards,
Feri



Bug#902117: [Debian-ha-maintainers] Bug#902117: corosync-qdevice will not daemonize/run

2018-07-23 Thread Valentin Vidic
On Fri, Jul 06, 2018 at 12:50:42PM +0200, Ferenc Wágner wrote:
> Thanks for the report.  I've been pretty busy with other tasks, but I'll
> check this out as soon as possible, your report isn't forgotten.  I ask
> for you patience till then.

Feri, you still want to check this or should we close this issue?

-- 
Valentin



Bug#902117: [Debian-ha-maintainers] Bug#902117: corosync-qdevice will not daemonize/run

2018-07-07 Thread Valentin Vidic
On Fri, Jul 06, 2018 at 04:43:00PM -0400, Jason Gauthier wrote:
> Now, it's entirely possible that I do have a configuration issue
> causing corosync-qdevice to not start.  However, the real issue is
> that corosync-qdevice does not log anything to stdout when run with
> "-f -d"  (foreground, debug).

Just tried this on unstable and you are right there is no output
for "-f -d", but I do get this in the daemon.log:

Jul  7 14:29:51 sid1 corosync-qdevice[1507]: Configuring qdevice
Jul  7 14:29:51 sid1 corosync-qdevice[1507]: Can't read quorum.device.model 
cmap key.
Jul  7 14:29:55 sid1 corosync-qdevice[1511]: Initializing votequorum
Jul  7 14:29:55 sid1 corosync-qdevice[1511]: shm size:1048589; 
real_size:1052672; rb->word_size:263168
Jul  7 14:29:55 sid1 corosync-qdevice[1511]: shm size:1048589; 
real_size:1052672; rb->word_size:263168
Jul  7 14:29:55 sid1 corosync-qdevice[1511]: shm size:1048589; 
real_size:1052672; rb->word_size:263168
Jul  7 14:29:55 sid1 corosync-qdevice[1511]: Initializing local socket
Jul  7 14:29:55 sid1 corosync-qdevice[1511]: Registering qdevice models
Jul  7 14:29:55 sid1 corosync-qdevice[1511]: Configuring qdevice
Jul  7 14:29:55 sid1 corosync-qdevice[1511]: Can't read quorum.device.model 
cmap key.

Maybe stdout does not exist for this service or you need to
tune this part of corosync.conf:

logging {
fileline:   off
to_stderr:  no
to_logfile: no
logfile:/var/log/corosync/corosync.log
to_syslog:  yes
debug:  off
timestamp:  on
logger_subsys {
subsys: QUORUM
debug:  off
}
}

-- 
Valentin



Bug#902117: [Debian-ha-maintainers] Bug#902117: corosync-qdevice will not daemonize/run

2018-07-06 Thread Jason Gauthier
On Fri, Jul 6, 2018 at 10:20 AM Valentin Vidic  wrote:
>
> On Fri, Jun 22, 2018 at 09:46:36AM -0400, Jason Gauthier wrote:
> > corosync-qdevice is a daemon that runs on each cluster node that help
> > provide a voting subsystem that utilizes corosync-qnet outside the
> > cluster.
> >
> > After installing the packages from debian stretch, and configuring the
> > application, it does not run.  One can use -d and -f to troubleshoot
> > issues, and even in this situation no data is logged to the console,
> > or any syslog messages generated.  The application immediately fails.
>
> corosync-qdevice is configured in corosync.conf, can you share the
> quorum block from there?
>
> If this is not configured I get the following error:
>
> Jul  6 16:08:08 node1 corosync-qdevice[2778]: Can't read quorum.device.model 
> cmap key.
>
> But with a correct configuration it starts fine for me.
>

Interesting.  Let me make some notes.  First, on a 3rd node, I ran
corosync-qnetd, and if not setup then it gives you a nice bulleted
list to complete a certificate based set up.  I did that, and
corosync-qnetd is launched and active on a 3rd node.

Now, it's entirely possible that I do have a configuration issue
causing corosync-qdevice to not start.  However, the real issue is
that corosync-qdevice does not log anything to stdout when run with
"-f -d"  (foreground, debug).

So, if there is a configuration problem I cannot troubleshoot it.  If
I understand the issue correctly, this has something to do with libqb
and (perhaps) the way that it is built.

I grabbed the source code for corosync-qdevice and when I replaced the
logging functionality with a simple varargs based function I was able
to get some output and troubleshoot a little further.

Let me demonstrate:
root@alpha:/home/jgauthier# corosync-qdevice -f -d
root@alpha:/home/jgauthier#

This should result in some form of output.  (This would be a great
test as for you as well)

Now, all that aside, my quorum block looks like this:

quorum {
# Enable and configure quorum subsystem (default: off)
# see also corosync.conf.5 and votequorum.5
#  two_node: 1
provider: corosync_votequorum
expected_votes: 3
device {
votes: 1
model: net
net {
  host: delta
  }
}

delta is the host for corosync-qnetd.

Jason



Bug#902117: [Debian-ha-maintainers] Bug#902117: corosync-qdevice will not daemonize/run

2018-07-06 Thread Valentin Vidic
On Fri, Jun 22, 2018 at 09:46:36AM -0400, Jason Gauthier wrote:
> corosync-qdevice is a daemon that runs on each cluster node that help
> provide a voting subsystem that utilizes corosync-qnet outside the
> cluster.
> 
> After installing the packages from debian stretch, and configuring the
> application, it does not run.  One can use -d and -f to troubleshoot
> issues, and even in this situation no data is logged to the console,
> or any syslog messages generated.  The application immediately fails.

corosync-qdevice is configured in corosync.conf, can you share the
quorum block from there?

If this is not configured I get the following error:

Jul  6 16:08:08 node1 corosync-qdevice[2778]: Can't read quorum.device.model 
cmap key.

But with a correct configuration it starts fine for me.

-- 
Valentin