from:"michael j. goulish \(JIRA\)"

[jira] [Commented] (DISPATCH-106) pn link corruption after router restart

2015-01-29 Thread michael j. goulish (JIRA)


[ 
https://issues.apache.org/jira/browse/DISPATCH-106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14298201#comment-14298201
 ] 

michael j. goulish commented on DISPATCH-106:
-

OK, will do.very interesting -- thanks !





 pn link corruption after router restart
 ---

 Key: DISPATCH-106
 URL: https://issues.apache.org/jira/browse/DISPATCH-106
 Project: Qpid Dispatch
  Issue Type: Bug
  Components: Router Node
Affects Versions: 0.3
Reporter: michael goulish
 Fix For: 0.4


 With the standard 6-node demo network,  (A-D, X, Y)  after killing and 
 restarting node Y, I see a bad link on router D -- which causes D to crash.
 Here is sequence of events from logs of routers and the topologist testing 
 program:
   01:05:05.367 Killing router Y, pid 20074
   01:05:05.367 Sleeping 30 seconds
   01:05:35.367 Restarting router Y, pid 20120
   01:05:38 Router D : last valid origins post to its log file :
Node QDR.C valid origins: []
   01:05:46 Router D posts to its log file:
Exited Router Flux Mode
   01:06:05.368 checking for crash after node bounce
( no crash detected )
   01:06:17 last post to router D log file
ROUTER_LS (trace) RCVD: RA(id=QDR.X area=0 inst=1422165872 
 ls_seq=2 mobile_seq=0)
   01:06:35.369 second check for crash. (none detected)
   01:06:35.370 getting topology
( Node D fails to respond.  PID 20072 )
( core file, timestamped 01:06 )
   here is backtrace from router D's core file
   {
 #0  pn_string_get (string=0xfdfdfdfdbabecafe) at 
 /home/mick/rh-qpid-proton/proton-c/src/object/string.c:120
 #1  0x7ff73fa8e752 in qd_router_link_name (link=0x7ff72800b2d0) at 
 /home/mick/dispatch/src/router_agent.c:112
 #2  0x7ff73fa8e7dd in qd_entity_refresh_router_link 
 (entity=0x7ff7300c9b50, impl=0x7ff72800b2d0)
 at /home/mick/dispatch/src/router_agent.c:120
 #3  0x003e40805d8c in ffi_call_unix64 () from /lib64/libffi.so.6
 #4  0x003e408056bc in ffi_call () from /lib64/libffi.so.6
 #5  0x7ff737d2dc8b in _ctypes_callproc () from 
 /usr/lib64/python2.7/lib-dynload/_ctypes.so
 #6  0x7ff737d27a85 in PyCFuncPtr_call () from 
 /usr/lib64/python2.7/lib-dynload/_ctypes.so
 #7  0x0036df44a0d3 in PyObject_Call () from /lib64/libpython2.7.so.1.0
 #8  0x0036df4de37c in PyEval_EvalFrameEx () from 
 /lib64/libpython2.7.so.1.0
 #9  0x0036df4e21dd in PyEval_EvalCodeEx () from 
 /lib64/libpython2.7.so.1.0
 #10 0x0036df4e088f in PyEval_EvalFrameEx () from 
 /lib64/libpython2.7.so.1.0
 #11 0x0036df4e21dd in PyEval_EvalCodeEx () from 
 /lib64/libpython2.7.so.1.0
 #12 0x0036df4e088f in PyEval_EvalFrameEx () from 
 /lib64/libpython2.7.so.1.0
 #13 0x0036df4e21dd in PyEval_EvalCodeEx () from 
 /lib64/libpython2.7.so.1.0
 #14 0x0036df46f0d8 in ?? () from /lib64/libpython2.7.so.1.0
 #15 0x0036df44a0d3 in PyObject_Call () from /lib64/libpython2.7.so.1.0
 #16 0x0036df4590c5 in ?? () from /lib64/libpython2.7.so.1.0
 #17 0x0036df44a0d3 in PyObject_Call () from /lib64/libpython2.7.so.1.0
 #18 0x0036df44a1b5 in ?? () from /lib64/libpython2.7.so.1.0
 #19 0x0036df44a29e in PyObject_CallFunction () from 
 /lib64/libpython2.7.so.1.0
 #20 0x7ff73fa8d77f in qd_io_rx_handler (context=0x7ff736321e68, 
 msg=0x7ff728019bd0, link_id=0
 at /home/mick/dispatch/src/python_embedded.c:519
 #21 0x7ff73fa92533 in router_rx_handler (context=0x1db5fd0, 
 link=0x7ff730008710, delivery=0x7ff73004cc50)
 at /home/mick/dispatch/src/router_node.c:922
 #22 0x7ff73fa7fa16 in do_receive (pnd=0x1e359a0) at 
 /home/mick/dispatch/src/container.c:221
 #23 0x7ff73fa7fea3 in process_handler (container=0x1dbd6f0, 
 unused=0x1e0a050, qd_conn=0x1e2c6a0)
 at /home/mick/dispatch/src/container.c:362
 #24 0x7ff73fa80135 in handler (handler_context=0x1dbd6f0, 
 conn_context=0x1e0a050, event=QD_CONN_EVENT_PROCESS,
 qd_conn=0x1e2c6a0) at /home/mick/dispatch/src/container.c:438
 #25 0x7ff73fa98346 in process_connector (qd_server=0x1d78460, 
 cxtr=0x1e1b9b0)
 at /home/mick/dispatch/src/server.c:322
 #26 0x7ff73fa98c1f in thread_run (arg=0x1d70d30) at 
 /home/mick/dispatch/src/server.c:546
 #27 0x003e3dc07ee5 in start_thread () from /lib64/libpthread.so.0
 ...
 }
   Let's go up to qd_router_link_name
   at /home/mick/dispatch/src/router_agent.c:112
   (gdb) print * link
 $1 =
 {
   prev = 0x7ff72800b210,
   next = 0x7ff72800b390,
   mask_bit = 3,
   link_type = QD_LINK_ROUTER,
   link_direction = QD_OUTGOING,
   owning_addr =

[jira] [Created] (QPID-4518) Unknown qpidd config-file options should prevent startup.

2012-12-20 Thread michael j. goulish (JIRA)

michael j. goulish created QPID-4518:


 Summary: Unknown qpidd config-file options should prevent startup.
 Key: QPID-4518
 URL: https://issues.apache.org/jira/browse/QPID-4518
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Affects Versions: 0.19
Reporter: michael j. goulish
Assignee: michael j. goulish
Priority: Minor


Mis-spelled config file options are ignored silently.
They should cause a warning, and halt startup.


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@qpid.apache.org
For additional commands, e-mail: dev-h...@qpid.apache.org

[jira] [Closed] (QPID-4003) qpid-cluster to allow --sasl-mechanism option

2012-09-24 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-4003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish closed QPID-4003.


   Resolution: Fixed
Fix Version/s: 0.18

 qpid-cluster to allow --sasl-mechanism option
 -

 Key: QPID-4003
 URL: https://issues.apache.org/jira/browse/QPID-4003
 Project: Qpid
  Issue Type: Improvement
  Components: Python Tools
Affects Versions: 0.14
Reporter: Pavel Moravec
Priority: Minor
  Labels: patch
 Fix For: 0.18

 Attachments: qpid-cluster-mechList.patch

   Original Estimate: 2h
  Remaining Estimate: 2h

 Description of problem:
 qpid-cluster shall have option --sasl-mechanism to select which SASL 
 authentication mechanism it wishes to use. Other qpid tools have it.
 That makes sense e.g. when more mechanisms are allowed and one wishes to 
 authenticate qpid-cluster via Kerberos. Currently qpid-cluster picks up the 
 most secure method (DIGEST-MD5) that can't be changed.
 Trivial patch is already proposed.
 Version-Release number of selected component (if applicable):
 qpid 0.14
 How reproducible:
 100%
 Steps to Reproduce:
 1. don't limit auth.mechanisms in /etc/sasl2/qpidd.conf (comment out 
 mech_list there, if present)
 2. have auth=yes in a clustered broker
 3. Setup Kerberos credentials.
 4. Try to run qpid-cluster authenticating via Kerberos.
  
 Actual results:
 There is no option to pick up a mechanism (in our case, GSSAPI).
 Expected results:
 Have --sasl-mechanism option available.
 Additional info:
 Easy patch proposed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@qpid.apache.org
For additional commands, e-mail: dev-h...@qpid.apache.org

[jira] [Resolved] (QPID-4222) qpid-cluster -C ignores credentials provided

2012-09-24 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-4222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish resolved QPID-4222.
--

Resolution: Fixed

 qpid-cluster -C ignores credentials provided
 

 Key: QPID-4222
 URL: https://issues.apache.org/jira/browse/QPID-4222
 Project: Qpid
  Issue Type: Bug
  Components: Python Tools
Affects Versions: 0.14
Reporter: Pavel Moravec
Priority: Minor
  Labels: patch
 Attachments: 0001-qpid-cluster-C-ignores-credentials.patch

   Original Estimate: 2h
  Remaining Estimate: 2h

 Description of problem:
 qpid-cluster -C shall connect to all brokers of the cluster to gather all 
 connections to the brokers. However it ignores credentials from broker URL. 
 So any connection attempt to a broker with authentication fails.
 Version-Release number of selected component (if applicable):
 any (incl. 0.12)
 How reproducible:
 100%
 Steps to Reproduce:
 1. Start clustered broker with authentication enabled (sasl config without 
 anonymous mechanism)
 2. SASL database to have default credentials stored (guest/guest for realm 
 QPID)
 3. Run:
 qpid-cluster -C guest/guest@${HOSTNAME}:5672
   
 Actual results:
 Traceback (most recent call last):
   File /usr/bin/qpid-cluster, line 316, in ?
 sys.exit(main())
   File /usr/bin/qpid-cluster, line 307, in main
 raise Exception(Failed: %s - %s % (e.__class__.__name__, e))
 Exception: Failed: ConnectionFailed - (None, 'No acceptable SASL 
 authentication mechanism available')
 Expected results:
 list of connections of the cluster to be printed out
 Additional info:
 Patch to be provided

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@qpid.apache.org
For additional commands, e-mail: dev-h...@qpid.apache.org

[jira] [Resolved] (QPID-4221) qpid-cluster -c does not work

2012-09-24 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-4221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish resolved QPID-4221.
--

Resolution: Fixed

 qpid-cluster -c does not work
 -

 Key: QPID-4221
 URL: https://issues.apache.org/jira/browse/QPID-4221
 Project: Qpid
  Issue Type: Bug
  Components: Python Tools
Affects Versions: 0.14
Reporter: Pavel Moravec
Priority: Minor
  Labels: patch
 Attachments: 0001-qpid-cluster-c-doesnt-work.patch, 
 qpid-cluster-ipv6.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 Description of problem:
 qpid-cluster -c ID shall view client connections to specified member, but 
 it does not work at all. 
 Version-Release number of selected component (if applicable):
 any
 How reproducible:
 100%
 Steps to Reproduce:
 1. Setup a qpid cluster
 2. Run qpid-cluster one_broker_address to discover ID of a broker. Output 
 is something like:
 $ qpid-cluster localhost:5672
   Cluster Name: test
 Cluster Status: ACTIVE
   Cluster Size: 2
Members: ID=1.2.3.4:28840 URL=amqp:tcp:1.2.3.4:5672
   : ID=1.2.3.5:21260 URL=amqp:tcp:1.2.3.5:5672
 $
 3. Query for connections on one node using broker ID from previous output, 
 i.e.:
 $ qpid-cluster -c 1.2.3.4:28840 localhost:5672
   
 Actual results:
 Output identical to the one in point 2. is written:
   Cluster Name: test
 Cluster Status: ACTIVE
   Cluster Size: 2
Members: ID=1.2.3.4:28840 URL=amqp:tcp:1.2.3.4:5672
   : ID=1.2.3.5:21260 URL=amqp:tcp:1.2.3.5:5672
 Expected results:
 List of connections, i.e. something like:
 Clients on Member: ID=1.2.3.5:21260:
 dhcp-1-223:5672-1.2.3.4
 dhcp-1-223:5672-1.2.3.5
 localhost.localdomain:5672-127.0.0.1
 Additional info:
 Patch attached, trivial omitting of setting up variables at options parsing

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@qpid.apache.org
For additional commands, e-mail: dev-h...@qpid.apache.org

[jira] [Assigned] (QPID-4317) make browse-only x-arg similar to others.

2012-09-18 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish reassigned QPID-4317:


Assignee: michael j. goulish

 make browse-only x-arg similar to others.
 -

 Key: QPID-4317
 URL: https://issues.apache.org/jira/browse/QPID-4317
 Project: Qpid
  Issue Type: Improvement
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish
Priority: Minor

 Make the x-arg string for the new browse-only queue feature  
 qpid.browse-only to be similar to (almost all of) the others.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@qpid.apache.org
For additional commands, e-mail: dev-h...@qpid.apache.org

[jira] [Created] (QPID-4317) make browse-only x-arg similar to others.

2012-09-18 Thread michael j. goulish (JIRA)

michael j. goulish created QPID-4317:


 Summary: make browse-only x-arg similar to others.
 Key: QPID-4317
 URL: https://issues.apache.org/jira/browse/QPID-4317
 Project: Qpid
  Issue Type: Improvement
  Components: C++ Broker
Reporter: michael j. goulish
Priority: Minor


Make the x-arg string for the new browse-only queue feature  qpid.browse-only 
to be similar to (almost all of) the others.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@qpid.apache.org
For additional commands, e-mail: dev-h...@qpid.apache.org

[jira] [Commented] (QPID-4317) make browse-only x-arg similar to others.

2012-09-18 Thread michael j. goulish (JIRA)


[ 
https://issues.apache.org/jira/browse/QPID-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13457797#comment-13457797
 ] 

michael j. goulish commented on QPID-4317:
--

done in rev 1387135.

 make browse-only x-arg similar to others.
 -

 Key: QPID-4317
 URL: https://issues.apache.org/jira/browse/QPID-4317
 Project: Qpid
  Issue Type: Improvement
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish
Priority: Minor
 Fix For: 0.19


 Make the x-arg string for the new browse-only queue feature  
 qpid.browse-only to be similar to (almost all of) the others.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@qpid.apache.org
For additional commands, e-mail: dev-h...@qpid.apache.org

[jira] [Resolved] (QPID-4317) make browse-only x-arg similar to others.

2012-09-18 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-4317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish resolved QPID-4317.
--

   Resolution: Fixed
Fix Version/s: 0.19

done in rev 1387135.

 make browse-only x-arg similar to others.
 -

 Key: QPID-4317
 URL: https://issues.apache.org/jira/browse/QPID-4317
 Project: Qpid
  Issue Type: Improvement
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish
Priority: Minor
 Fix For: 0.19


 Make the x-arg string for the new browse-only queue feature  
 qpid.browse-only to be similar to (almost all of) the others.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@qpid.apache.org
For additional commands, e-mail: dev-h...@qpid.apache.org

[jira] [Commented] (QPID-4241) browse-only queues

2012-09-14 Thread michael j. goulish (JIRA)


[ 
https://issues.apache.org/jira/browse/QPID-4241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13455959#comment-13455959
 ] 

michael j. goulish commented on QPID-4241:
--

implemented in rev 1382991.
auto-test checked in -- rev 1384851.

 browse-only queues
 --

 Key: QPID-4241
 URL: https://issues.apache.org/jira/browse/QPID-4241
 Project: Qpid
  Issue Type: New Feature
  Components: C++ Broker
Affects Versions: Future
Reporter: michael j. goulish
Assignee: michael j. goulish
 Fix For: 0.19


 Some users need browse-only functionality.  For example, messages represent 
 the an ongoing history of events, all of which are relevant to newly joining 
 consumers.  The messages should remain unconsumed and available to newcomers 
 until they are outdated by time-to-live rules, or the queue is destroyed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@qpid.apache.org
For additional commands, e-mail: dev-h...@qpid.apache.org

[jira] [Resolved] (QPID-4241) browse-only queues

2012-09-14 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-4241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish resolved QPID-4241.
--

   Resolution: Fixed
Fix Version/s: 0.19

impl in rev 1382991
auto-test in rev 1384851.   (oops!)

 browse-only queues
 --

 Key: QPID-4241
 URL: https://issues.apache.org/jira/browse/QPID-4241
 Project: Qpid
  Issue Type: New Feature
  Components: C++ Broker
Affects Versions: Future
Reporter: michael j. goulish
Assignee: michael j. goulish
 Fix For: 0.19


 Some users need browse-only functionality.  For example, messages represent 
 the an ongoing history of events, all of which are relevant to newly joining 
 consumers.  The messages should remain unconsumed and available to newcomers 
 until they are outdated by time-to-live rules, or the queue is destroyed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@qpid.apache.org
For additional commands, e-mail: dev-h...@qpid.apache.org

[jira] [Created] (QPID-4241) browse-only queues

2012-08-23 Thread michael j. goulish (JIRA)

michael j. goulish created QPID-4241:


 Summary: browse-only queues
 Key: QPID-4241
 URL: https://issues.apache.org/jira/browse/QPID-4241
 Project: Qpid
  Issue Type: New Feature
  Components: C++ Broker
Affects Versions: Future
Reporter: michael j. goulish
Assignee: michael j. goulish


Some users need browse-only functionality.  For example, messages represent the 
an ongoing history of events, all of which are relevant to newly joining 
consumers.  The messages should remain unconsumed and available to newcomers 
until they are outdated by time-to-live rules, or the queue is destroyed.


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@qpid.apache.org
For additional commands, e-mail: dev-h...@qpid.apache.org

[jira] [Commented] (QPID-4241) browse-only queues

2012-08-23 Thread michael j. goulish (JIRA)

[
https://issues.apache.org/jira/browse/QPID-4241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13440281#comment-13440281
]

michael j. goulish commented on QPID-4241:
--

Here is the proposed functional specification.

proposed functional specification
{
purpose
{
Some users need browse-only functionality. For example,
messages represent the an ongoing history of events,
all of which are relevant to newly joining consumers.
The messages should remain unconsumed and available to
newcomers until they are outdated by time-to-live rules,
thrown away by a ring-queue, or the queue is destroyed.
}

strategy note
{
ACL and browse-only-queue solutions to this problem are
both possible. But right now we should prefer a
browse-only-queue solution, because

(1) it would satisfy the current customer's requirements, and

(2) the ACL functionality requires a complete overhaul.
}

user interface
{
1. extend x-declare{arguments:{}} in our addressing
to include 'browse-only'.

2. queues that are declared browse-only will allow
subscribers to access queues, and 'acquire' messages
normally -- but that 'acquisition' in this queue would
result in only a browse. The message would remain.

3. no effect on ACLs.
}

testing
{
As user mick, use standard command line tools to:

1. declare a queue with browse-only argument.

2. use standard command line tool to send N
messages to the queue.

3. use standard tool to 'consume' messages from
queue. Confirm that N messages are read.

4. Repeat step 3 at least once, and confirm that
same N messages are again read every time.
( They are not consumed. )

5. Set TTL on messages and confirm that it removes
messages from queue.

6. Confirm same behavior across all queue types.
}
}

browse-only queues
--

Key: QPID-4241
URL: https://issues.apache.org/jira/browse/QPID-4241
Project: Qpid
Issue Type: New Feature
Components: C++ Broker
Affects Versions: Future
Reporter: michael j. goulish
Assignee: michael j. goulish

Some users need browse-only functionality. For example, messages represent
the an ongoing history of events, all of which are relevant to newly joining
consumers. The messages should remain unconsumed and available to newcomers
until they are outdated by time-to-live rules, or the queue is destroyed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@qpid.apache.org
For additional commands, e-mail: dev-h...@qpid.apache.org

[jira] [Created] (QPID-4194) cluster should enable queue events in CATCHUP state

2012-08-06 Thread michael j. goulish (JIRA)

michael j. goulish created QPID-4194:


 Summary: cluster should enable queue events in CATCHUP state
 Key: QPID-4194
 URL: https://issues.apache.org/jira/browse/QPID-4194
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish


pavel moravec has discovered the mechanism for this problem already -- current 
cluster code does not enable queue events until READY state -- but new messages 
may be enqueued during CATCHUP state.
So if you have a replication-queue, which needs those queue-events to be 
generated, a newbie broker -- if it is in CATCHUP state -- will fail to 
replicate any messages that arrive on the replicated queue.
Two-line fix

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@qpid.apache.org
For additional commands, e-mail: dev-h...@qpid.apache.org

[jira] [Resolved] (QPID-4194) cluster should enable queue events in CATCHUP state

2012-08-06 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-4194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish resolved QPID-4194.
--

   Resolution: Fixed
Fix Version/s: 0.19

pavel's fix.

 cluster should enable queue events in CATCHUP state
 ---

 Key: QPID-4194
 URL: https://issues.apache.org/jira/browse/QPID-4194
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish
  Labels: cluster, desync
 Fix For: 0.19


 pavel moravec has discovered the mechanism for this problem already -- 
 current cluster code does not enable queue events until READY state -- but 
 new messages may be enqueued during CATCHUP state.
 So if you have a replication-queue, which needs those queue-events to be 
 generated, a newbie broker -- if it is in CATCHUP state -- will fail to 
 replicate any messages that arrive on the replicated queue.
 Two-line fix

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
To unsubscribe, e-mail: dev-unsubscr...@qpid.apache.org
For additional commands, e-mail: dev-h...@qpid.apache.org

[jira] [Created] (QPID-3438) cluster auth failure increments cnx count

2011-08-19 Thread michael j. goulish (JIRA)

cluster auth failure increments cnx count
-

 Key: QPID-3438
 URL: https://issues.apache.org/jira/browse/QPID-3438
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish
 Fix For: 0.14


If a cluster brokers are authenticating, and an attempted cnx fails due to an 
auth problem, the broker nevertheless increments its cnx counter.  Which means 
it eventually runs out of available connections -- even if there aren't any 
open.  :-(

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] [Created] (QPID-3397) Add Broker management methods for several queue functions

2011-08-05 Thread michael j. goulish (JIRA)

Add Broker management methods for several queue functions
-

 Key: QPID-3397
 URL: https://issues.apache.org/jira/browse/QPID-3397
 Project: Qpid
  Issue Type: Improvement
  Components: C++ Broker
Affects Versions: 0.14
Reporter: michael j. goulish
Assignee: michael j. goulish


Implement broker management functions to 
  (1) return the ID list of all messages on a queue.
  (2) retrieve the header of a given message on a given queue
  (3) retrieve the body of a given message on a given queue
  (4) remove a given message from a given queue.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] [Created] (QPID-3398) LegacyLVQ should not impose extra semantics on remove function

2011-08-05 Thread michael j. goulish (JIRA)

LegacyLVQ should not impose extra semantics on remove function
--

 Key: QPID-3398
 URL: https://issues.apache.org/jira/browse/QPID-3398
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish
 Fix For: 0.14


LegacyLVQ imposes extra semantics on the remove function beyond what its parent 
MessageMap specifies.  
It requires the caller to pass in a message with the same payload -- which 
seems weird.
It should instead pass back the message that was deleted, just like its parent.


--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] [Created] (QPID-3340) restore use of testagent to two cluster tests

2011-07-07 Thread michael j. goulish (JIRA)

restore use of testagent to two cluster tests
-

 Key: QPID-3340
 URL: https://issues.apache.org/jira/browse/QPID-3340
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Affects Versions: 0.14
Reporter: michael j. goulish
Assignee: michael j. goulish
Priority: Minor


In my checkin for JIRA 3337, two of the cluster tests only worked because I 
prevented them from using a client called testagent.  The tests are   
cluster_tests.LongTests.test_management_qmf2   
cluster_tests.LongTests.test_management   .
Restore  'testagent'  to their list of clients, and fix it.


--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] [Created] (QPID-3337) eliminate guest/guest default username/password and use an explicit sasl mechanism list

2011-07-06 Thread michael j. goulish (JIRA)

eliminate guest/guest default username/password and use an explicit sasl 
mechanism list
---

 Key: QPID-3337
 URL: https://issues.apache.org/jira/browse/QPID-3337
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish
 Fix For: 0.14


Currently, we default to using the system-default sasl mechanisms list.  That
list will include GSSAPI if the package is installed on the user's system.  But
merely installing the GSSAPI package does not prepare qpidd to use GSSAPI.  The
user must perform specific config steps to make it work.  And, since GSSAPI
will be selected before other mechanisms, this means that many users will see
qpidd fail as soon as they try  --auth=yes  .

It also seems dangerous to allow PLAIN, since users who install qpidd will then
have an insecure system by default.

By accepting the system-default list we are allowing too many user-surprises.

The solution is to explicitly control the mech list, probably only allowing a
single mechanism such as DIGEST-MD5, and give the user sufficient instruction
on how to set up other mechanisms when they are desired.

NOTE -- I am also allowing  ANONYMOUS, because some python tools do not yet 
know how to send credentials, and this will allow them to continue working.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] [Resolved] (QPID-3337) eliminate guest/guest default username/password and use an explicit sasl mechanism list

2011-07-06 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-3337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish resolved QPID-3337.
--

Resolution: Fixed

checkin 1143536 .

 eliminate guest/guest default username/password and use an explicit sasl 
 mechanism list
 ---

 Key: QPID-3337
 URL: https://issues.apache.org/jira/browse/QPID-3337
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish
 Fix For: 0.14


 Currently, we default to using the system-default sasl mechanisms list.  That
 list will include GSSAPI if the package is installed on the user's system.  
 But
 merely installing the GSSAPI package does not prepare qpidd to use GSSAPI.  
 The
 user must perform specific config steps to make it work.  And, since GSSAPI
 will be selected before other mechanisms, this means that many users will see
 qpidd fail as soon as they try  --auth=yes  .
 It also seems dangerous to allow PLAIN, since users who install qpidd will 
 then
 have an insecure system by default.
 By accepting the system-default list we are allowing too many user-surprises.
 The solution is to explicitly control the mech list, probably only allowing a
 single mechanism such as DIGEST-MD5, and give the user sufficient instruction
 on how to set up other mechanisms when they are desired.
 NOTE -- I am also allowing  ANONYMOUS, because some python tools do not yet 
 know how to send credentials, and this will allow them to continue working.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] [Created] (QPID-3262) Remove support for archaic Boost version 103200

2011-05-17 Thread michael j. goulish (JIRA)

Remove support for archaic Boost version 103200
---

 Key: QPID-3262
 URL: https://issues.apache.org/jira/browse/QPID-3262
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish
Priority: Minor


Remove support for antique Boost version -- no one wants this any more, and I 
promised.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] [Created] (QPID-3239) sasl_fed_ex tests hangs

2011-05-02 Thread michael j. goulish (JIRA)

sasl_fed_ex tests hangs
---

 Key: QPID-3239
 URL: https://issues.apache.org/jira/browse/QPID-3239
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish


the sasl_fed_ex test ( which is the base code for severl tests ) is hanging, 
because qpid-tool can no longer print the status of links between clustered 
brokers.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] [Resolved] (QPID-3239) sasl_fed_ex tests hangs

2011-05-02 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-3239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish resolved QPID-3239.
--

Resolution: Fixed

fixed by checkin 1098704.

 sasl_fed_ex tests hangs
 ---

 Key: QPID-3239
 URL: https://issues.apache.org/jira/browse/QPID-3239
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish

 the sasl_fed_ex test ( which is the base code for severl tests ) is hanging, 
 because qpid-tool can no longer print the status of links between clustered 
 brokers.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] [Commented] (QPID-3168) C++ Broker: fix the --help text description of flow control thresholds.

2011-04-13 Thread michael j. goulish (JIRA)


[ 
https://issues.apache.org/jira/browse/QPID-3168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13019302#comment-13019302
 ] 

michael j. goulish commented on QPID-3168:
--

The change clarifies the text and introduces zero risk.
Nihil Obstat.  Imprimatur.

 C++ Broker: fix the --help text description of flow control thresholds.
 -

 Key: QPID-3168
 URL: https://issues.apache.org/jira/browse/QPID-3168
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Affects Versions: 0.10
Reporter: Ken Giusti
Assignee: Ken Giusti
Priority: Minor
 Fix For: 0.11


 The following text is incorrect:
   --default-flow-stop-threshold %MESSAGES (80) 
 Queue capacity level at which flow 
 control is activated.
   --default-flow-resume-threshold %MESSAGES (70) 
 Queue capacity level at which flow 
 control is de-activated.
 Correction:
   --default-flow-stop-threshold PERCENT (80) 
 Percent of queue's maximum capacity 
 at which flow 
 control is activated.
   --default-flow-resume-threshold PERCENT (70) 
 Percent of queue's maximum capacity 
 at which flow 
 control is de-activated.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] [Created] (QPID-3188) CyrusSasl ctor logic change for callback storage

2011-04-05 Thread michael j. goulish (JIRA)

CyrusSasl ctor logic change for callback storage


 Key: QPID-3188
 URL: https://issues.apache.org/jira/browse/QPID-3188
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker, C++ Client
Reporter: michael j. goulish
Assignee: michael j. goulish


The logic that stores callbacks in the CyrusSasl ctor should be changed to make 
the c++ client behave similarly to other language clients.

If there is no username, then do nothing with either NAME or PASSWD callbacks.
If there is a name but no passwd, then explicitly store an empty PASSWD 
callback.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] [Created] (QPID-3174) broker crash using rdma+sasl

2011-03-31 Thread michael j. goulish (JIRA)

broker crash using rdma+sasl


 Key: QPID-3174
 URL: https://issues.apache.org/jira/browse/QPID-3174
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish


This is sometimes very hard to observe.  I was lucky enough to see it three 
times in the space of 500 iterations yesterday, but now, with same code  
scripts  system, I have no repetition after 6000 iterations.

my tree rev number is  1084895



Here is the backtrace -- all 3 have been identical:

  #0  0x003f8d030265 in raise (sig=value optimized out)
at ../nptl/sysdeps/unix/sysv/linux/raise.c:64
  #1  0x003f8d031d10 in abort () at abort.c:88
  #2  0x003f8d0296e6 in __assert_fail (
  assertion=0x2b28f90400aa completionsNeeded.get()  0,
  file=0x2b28f903fe58 ./qpid/broker/AsyncCompletion.h, line=167,
  function=0x2b28f9040440 void 
qpid::broker::AsyncCompletion::end(qpid::broker::AsyncCompletion::Callback)) 
at assert.c:78
  #3  0x2b28f8fe2935 in qpid::broker::AsyncCompletion::end 
(this=0x2aaab60c2e50,
  cb=value optimized out) at ./qpid/broker/AsyncCompletion.h:167
  #4  0x2b28f8fdf3f3 in qpid::broker::SessionState::handleContent (
  this=0x2aaabc4157d0, frame=..., id=value optimized out)
  at qpid/broker/SessionState.cpp:265
  #5  0x2b28f8fdf830 in qpid::broker::SessionState::handleIn 
(this=0x2aaabc4157d0,
  frame=...) at qpid/broker/SessionState.cpp:355
  #6  0x2b28f94799dc in qpid::amqp_0_10::SessionHandler::handleIn (
  this=0x2aaab0025680, f=...) at qpid/amqp_0_10/SessionHandler.cpp:93
  #7  0x2b28f8f2622b in operator() (this=0x2aaab38cffa0, frame=...)
  at ./qpid/framing/Handler.h:42
  #8  qpid::broker::Connection::received (this=0x2aaab38cffa0, frame=...)
  at qpid/broker/Connection.cpp:164
  #9  0x2b28f8ef8300 in qpid::amqp_0_10::Connection::decode 
(this=0x2aaabc47d240,
  buffer=value optimized out, size=value optimized out)
  at qpid/amqp_0_10/Connection.cpp:58
  #10 0x2b28f94c25e5 in qpid::sys::cyrus::CyrusSecurityLayer::decode (
  this=0x2aaabc468480, input=0x2aaab2aa97d0 , size=118)
  at qpid/sys/cyrus/CyrusSecurityLayer.cpp:59
  #11 0x2b28f999c7d8 in qpid::sys::RdmaIOHandler::readbuff 
(this=0x2aaab002d9e0,
  buff=0x) at qpid/sys/RdmaIOPlugin.cpp:218
  #12 0x2b28f9bba75a in boost::function2void, Rdma::AsynchIO, 
Rdma::Buffer*, std::allocatorboost::function_base ::operator() (this=0x0, 
a0=..., a1=0x6)



here is my broker start script:

-- start script ---

#! /bin/bash

export LD_LIBRARY_PATH=$TRUNK/qpid/cpp/src/.libs

QPID_SRC=$TRUNK/qpid/cpp/src
QPIDD=${QPID_SRC}/.libs/qpidd

echo $QPIDD

rm -rf /tmp/mick
mkdir /tmp/mick

$QPIDD\
  --no-module-dir \
  --load-module ${QPID_SRC}/.libs/rdma.so \
  --data-dir /tmp/mick/data_1 \
  --auth=yes  \
  --mgmt-enable=yes   \
  --port 5813 \
  --log-enable info+  \
  --log-to-file /tmp/mick/qpidd_1.log \
  --log-source yes\
  --sasl-config=${QPID_SRC}/tests/sasl_config \
  -d

echo started broker from $QPIDD
--- end script 


and here is my client iterator script;


-- start script -
#! /bin/bash

rm core.* ~/.qpidd/core*

count=0
while [ $count -lt 1 ]
do
  echo ===
  echo TEST $count
  echo ===
  sleep 1
  core_files=`ls -l core.* ~/.qpidd/core* | wc -l`
  echo core files:  ${core_files}

  if [ ${core_files} -gt 0 ]; then
echo core files found!
exit 1
  else
echo no core files found.
  fi
  ./qpid-perftest --username zig --password zig --protocol rdma --broker 
20.0.40.14 --port 5813 --qt 4 --count 10
  count=$(( $count + 1 ))
done
-- end script ---



the sasl config directory that the broker is pointing at was created by the 
script cpp/src/tests/sasl_test_setup.sh


also, you need to set  ulimit -l 131072(at least that value)   before 
starting the broker.


--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] [Created] (QPID-3171) RDMA client can segfault if no SASL mech specified.

2011-03-30 Thread michael j. goulish (JIRA)

 RDMA client can segfault if no SASL mech specified.


 Key: QPID-3171
 URL: https://issues.apache.org/jira/browse/QPID-3171
 Project: Qpid
  Issue Type: Bug
  Components: C++ Client
Reporter: michael j. goulish
Assignee: michael j. goulish


Using the following scripts to start the broker and to run a perftest client ( 
using RDMA ) I get a client segv about 3% of the time.

broker start script:

#! /bin/bash

export LD_LIBRARY_PATH=$TRUNK/qpid/cpp/src/.libs

QPID_SRC=$TRUNK/qpid/cpp/src
QPIDD=${QPID_SRC}/.libs/qpidd

echo $QPIDD

rm -rf /tmp/mick
mkdir /tmp/mick

$QPIDD\
  --no-module-dir \
  --load-module ${QPID_SRC}/.libs/rdma.so \
  --data-dir /tmp/mick/data_1 \
  --auth=yes  \
  --mgmt-enable=yes   \
  --port 5813 \
  --log-enable info+  \
  --log-to-file /tmp/mick/qpidd_1.log \
  --log-source yes\
  --sasl-config=${QPID_SRC}/tests/sasl_config \
  -d

echo started broker from $QPIDD



client run script:

#! /bin/bash

rm core.*

count=0
while [ $count -lt 1 ]
do
  echo ===
  echo TEST $count
  echo ===
  core_files=`ls -l core.* | wc -l`
  echo core files:  ${core_files}

  if [ ${core_files} -gt 0 ]; then
echo core files found!
exit 1
  else
echo no core files found.
  fi
  ./qpid-perftest --username zig --password zig --protocol rdma --broker 
20.0.40.14 --port 5813 --qt 4 --count 10  
  count=$(( $count + 1 ))
done




--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] [Resolved] (QPID-3171) RDMA client can segfault if no SASL mech specified.

2011-03-30 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-3171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish resolved QPID-3171.
--

Resolution: Fixed

fixed by revision 1087047

  RDMA client can segfault if no SASL mech specified.
 

 Key: QPID-3171
 URL: https://issues.apache.org/jira/browse/QPID-3171
 Project: Qpid
  Issue Type: Bug
  Components: C++ Client
Reporter: michael j. goulish
Assignee: michael j. goulish

 Using the following scripts to start the broker and to run a perftest client 
 ( using RDMA ) I get a client segv about 3% of the time.
 broker start script:
 #! /bin/bash
 export LD_LIBRARY_PATH=$TRUNK/qpid/cpp/src/.libs
 QPID_SRC=$TRUNK/qpid/cpp/src
 QPIDD=${QPID_SRC}/.libs/qpidd
 echo $QPIDD
 rm -rf /tmp/mick
 mkdir /tmp/mick
 $QPIDD\
   --no-module-dir \
   --load-module ${QPID_SRC}/.libs/rdma.so \
   --data-dir /tmp/mick/data_1 \
   --auth=yes  \
   --mgmt-enable=yes   \
   --port 5813 \
   --log-enable info+  \
   --log-to-file /tmp/mick/qpidd_1.log \
   --log-source yes\
   --sasl-config=${QPID_SRC}/tests/sasl_config \
   -d
 echo started broker from $QPIDD
 client run script:
 #! /bin/bash
 rm core.*
 count=0
 while [ $count -lt 1 ]
 do
   echo ===
   echo TEST $count
   echo ===
   core_files=`ls -l core.* | wc -l`
   echo core files:  ${core_files}
   if [ ${core_files} -gt 0 ]; then
 echo core files found!
 exit 1
   else
 echo no core files found.
   fi
   ./qpid-perftest --username zig --password zig --protocol rdma --broker 
 20.0.40.14 --port 5813 --qt 4 --count 10  
   count=$(( $count + 1 ))
 done

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] [Commented] (QPID-3147) Misconfigured tracing/logging can lead to hung threads in logging stack

2011-03-21 Thread michael j. goulish (JIRA)


[ 
https://issues.apache.org/jira/browse/QPID-3147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13009267#comment-13009267
 ] 

michael j. goulish commented on QPID-3147:
--

nice fix.  seems like zero risk.
nihil obstat.  imprimatur.

 Misconfigured tracing/logging can lead to hung threads in logging stack
 ---

 Key: QPID-3147
 URL: https://issues.apache.org/jira/browse/QPID-3147
 Project: Qpid
  Issue Type: Bug
  Components: C++ Client
Affects Versions: 0.10
 Environment: Fedora 14 32-bit
Reporter: Pete MacKinnon
Assignee: Alan Conway
Priority: Minor
  Labels: c++, logging, qpid, thread, trace

 Given the following trace and logging environment variable mix-up like this:
 QPID_TRACE=debug+
 QPID_LOG_ENABLE=
 A C++ client that embeds the Qpid runtime can get hung although it has 
 recognized the misconfiguration.
 #0  0x00555416 in __kernel_vsyscall ()
 #1  0x00f49367 in syscall () at ../sysdeps/unix/sysv/linux/i386/syscall.S:30
 #2  0x002bbfca in __cxxabiv1::__cxa_guard_acquire (g=0x2171ea0)
 at ../../../../libstdc++-v3/libsupc++/guard.cc:293
 #3  0x020ca3b8 in instance ()
 at /usr/include/boost/pool/detail/singleton.hpp:83
 #4  qpid::log::Logger::instance () at qpid/log/Logger.cpp:47
 #5  0x020cd324 in qpid::log::Statement::Initializer::Initializer (
 this=0x2171b88, s=...) at qpid/log/Statement.cpp:61
 #6  0x02075b6c in qpid::Exception::Exception (this=0x880b370, msg=
 Error in environment variables: in option 'trace': invalid bool value\n)
 at qpid/Exception.cpp:31
 #7  0x0208095d in Exception (this=0xbf8d0df0, argc=0, argv=0x0, 
 configFile=, 
 allowUnknown=false) at ../include/qpid/Options.h:210
 #8  qpid::Options::parse (this=0xbf8d0df0, argc=0, argv=0x0, configFile=, 
 allowUnknown=false) at qpid/Options.cpp:352
 #9  0x020c9c57 in qpid::log::Logger::Logger (this=0x2171ec0)
 at qpid/log/Logger.cpp:55
 #10 0x020ca330 in instance ()
 at /usr/include/boost/pool/detail/singleton.hpp:83
 #11 object_creator () at /usr/include/boost/pool/detail/singleton.hpp:66
 #12 __static_initialization_and_destruction_0 ()
 at /usr/include/boost/pool/detail/singleton.hpp:95
 #13 global constructors keyed to Logger.cpp(void) () at 
 qpid/log/Logger.cpp:156
 #14 0x020ebc6d in __do_global_ctors_aux () from /usr/lib/libqpidcommon.so.2
 #15 0x01fc2a84 in _init () from /usr/lib/libqpidcommon.so.2
 #16 0x004f68fc in call_init (l=value optimized out, argc=3, 
 argv=0xbf8d1634, 
 env=0x87feb58) at dl-init.c:68
 #17 0x004f6a19 in _dl_init (main_map=value optimized out, argc=3, 
 argv=0xbf8d1634, env=0x87feb58) at dl-init.c:132
 #18 0x004fa74f in dl_open_worker (a=0xbf8d10d0) at dl-open.c:464
 #19 0x004f6786 in _dl_catch_error (objname=0xbf8d10f8, errstring=0xbf8d10f4, 
 mallocedp=0xbf8d10ff, operate=0x4fa3b0 dl_open_worker, args=0xbf8d10d0)
 at dl-error.c:178

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Commented: (QPID-3154) Fix qpidd late/overran warnings.

2011-03-18 Thread michael j. goulish (JIRA)


[ 
https://issues.apache.org/jira/browse/QPID-3154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13008591#comment-13008591
 ] 

michael j. goulish commented on QPID-3154:
--

This looks reasonable to me (after Alan explained a little more) and zero risk.

 Fix qpidd late/overran warnings.
 

 Key: QPID-3154
 URL: https://issues.apache.org/jira/browse/QPID-3154
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Affects Versions: 0.9
Reporter: Alan Conway
Assignee: Alan Conway
Priority: Minor
 Fix For: 0.10


 Warnings for late-and-overrun tasks were not being correctly reported. They 
 were reported with 0 values.
 Lateness for overrun tasks below the late threshold was not being reported. 
 This leads to reports of e.g. overran by 3ms, with no indication of lateness 
 when in fact the event was overrun _and_ late by 3ms.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Created: (QPID-3150) sasl_fed test fails to specify client's sasl mechanism

2011-03-17 Thread michael j. goulish (JIRA)

sasl_fed test fails to specify client's sasl mechanism
--

 Key: QPID-3150
 URL: https://issues.apache.org/jira/browse/QPID-3150
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish
Priority: Minor


this is a problem with the test in cpp/src/tests/sasl_fed -- not with the 
broker.

the client should use DIGEST-MD5 when connecting to the broker -- but this is 
not specified on the command line.  It was succeeding accidentally on some 
systems but, recently, not on others.


--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Created: (QPID-3152) non-clustered sasl_fed_ex_* tests load cluster.so

2011-03-17 Thread michael j. goulish (JIRA)

non-clustered sasl_fed_ex_* tests load cluster.so
-

 Key: QPID-3152
 URL: https://issues.apache.org/jira/browse/QPID-3152
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish


The 4 non-clustered versions of the sasl_fed_ex_* tests load cluster.so in the 
broker command line (and fail when it isn't present).

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Resolved: (QPID-3152) non-clustered sasl_fed_ex_* tests load cluster.so

2011-03-17 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-3152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish resolved QPID-3152.
--

   Resolution: Fixed
Fix Version/s: 0.10
 Assignee: michael j. goulish

fixed by rev 1082804

 non-clustered sasl_fed_ex_* tests load cluster.so
 -

 Key: QPID-3152
 URL: https://issues.apache.org/jira/browse/QPID-3152
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish
 Fix For: 0.10


 The 4 non-clustered versions of the sasl_fed_ex_* tests load cluster.so in 
 the broker command line (and fail when it isn't present).

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Resolved: (QPID-3150) sasl_fed test fails to specify client's sasl mechanism

2011-03-17 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-3150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish resolved QPID-3150.
--

   Resolution: Fixed
Fix Version/s: 0.10

fixed by 1082685.

 sasl_fed test fails to specify client's sasl mechanism
 --

 Key: QPID-3150
 URL: https://issues.apache.org/jira/browse/QPID-3150
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish
Priority: Minor
 Fix For: 0.10


 this is a problem with the test in cpp/src/tests/sasl_fed -- not with the 
 broker.
 the client should use DIGEST-MD5 when connecting to the broker -- but this is 
 not specified on the command line.  It was succeeding accidentally on some 
 systems but, recently, not on others.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Created: (QPID-3153) sasl_fed_ex_*_cluster tests should test for aisexec running

2011-03-17 Thread michael j. goulish (JIRA)

sasl_fed_ex_*_cluster tests should test for aisexec running
---

 Key: QPID-3153
 URL: https://issues.apache.org/jira/browse/QPID-3153
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish


use ais_exec to make sure that aisexec is actually running.  also use 
with_ais_group to force proper group membership for openais.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Resolved: (QPID-3153) sasl_fed_ex_*_cluster tests should test for aisexec running

2011-03-17 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-3153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish resolved QPID-3153.
--

   Resolution: Fixed
Fix Version/s: 0.11

fixed by rev 1082812.

 sasl_fed_ex_*_cluster tests should test for aisexec running
 ---

 Key: QPID-3153
 URL: https://issues.apache.org/jira/browse/QPID-3153
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish
 Fix For: 0.11


 use ais_exec to make sure that aisexec is actually running.  also use 
 with_ais_group to force proper group membership for openais.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Commented: (QPID-3144) qpidd --check does not work with info logging and --log-to-stdout=yes

2011-03-16 Thread michael j. goulish (JIRA)


[ 
https://issues.apache.org/jira/browse/QPID-3144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13007476#comment-13007476
 ] 

michael j. goulish commented on QPID-3144:
--

good point about the inappropriate position of these log statements in any case.
and especially because of teensyness of change -- i withdraw my objection.

nihil obstat.
imprimatur.



 qpidd --check does not work with info logging and --log-to-stdout=yes
 -

 Key: QPID-3144
 URL: https://issues.apache.org/jira/browse/QPID-3144
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Affects Versions: 0.9
Reporter: Alan Conway
Assignee: Alan Conway
Priority: Minor
 Fix For: 0.10


 qpidd --check is supposed to print the PID of a running qpidd --daemon, for 
 use in scripts and the like. However with info logging and log-to-stdout 
 enabled this does not work because log messages of the form:
2011-03-14 16:21:05 info Loaded Module: /usr/lib64/qpid/daemon/cluster.so
 precede the port number.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Commented: (QPID-3144) qpidd --check does not work with info logging and --log-to-stdout=yes

2011-03-15 Thread michael j. goulish (JIRA)


[ 
https://issues.apache.org/jira/browse/QPID-3144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13007042#comment-13007042
 ] 

michael j. goulish commented on QPID-3144:
--

Fixing this in the code seems too heavy-handed.
Also, the log message that is getting in the way is presumably useful in other 
contexts.

Wouldn't it be better to simply do something like the following in the affected 
scripts?

my_pid=`qpidd --check | grep -v Loaded`

I would propose that we leave this small change out.



 qpidd --check does not work with info logging and --log-to-stdout=yes
 -

 Key: QPID-3144
 URL: https://issues.apache.org/jira/browse/QPID-3144
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Affects Versions: 0.9
Reporter: Alan Conway
Assignee: Alan Conway
Priority: Minor
 Fix For: 0.10


 qpidd --check is supposed to print the PID of a running qpidd --daemon, for 
 use in scripts and the like. However with info logging and log-to-stdout 
 enabled this does not work because log messages of the form:
2011-03-14 16:21:05 info Loaded Module: /usr/lib64/qpid/daemon/cluster.so
 precede the port number.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-2057) qpidd --require-encryption forces SASL security layer to be used even if SSL is in use

2011-03-10 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-2057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish updated QPID-2057:
-

Fix Version/s: 0.11

 qpidd --require-encryption forces SASL security layer to be used even if SSL 
 is in use
 --

 Key: QPID-2057
 URL: https://issues.apache.org/jira/browse/QPID-2057
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Affects Versions: 0.5
Reporter: Ted Ross
Assignee: michael j. goulish
Priority: Minor
 Fix For: 0.11


 If running SSL and using GSSAPI authentication with the --require-encryption 
 option turned on, the SASL layer will force minSsf to be greater than zero, 
 thus causing the SASL security layer to encrypt.
 This is unnecessary double-encryption since SSL is already providing a cipher 
 channel at a lower layer.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-2993) Federated source-local links crash remotely federated cluster member on local cluster startup

2011-03-10 Thread michael j. goulish (JIRA)

[
https://issues.apache.org/jira/browse/QPID-2993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

michael j. goulish updated QPID-2993:
-

Fix Version/s: 0.11

Federated source-local links crash remotely federated cluster member on local
cluster startup
-

Key: QPID-2993
URL: https://issues.apache.org/jira/browse/QPID-2993
Project: Qpid
Issue Type: Bug
Components: C++ Broker, C++ Clustering
Affects Versions: 0.8
Environment: Debian Linux Squeeze, 32-bit, kernel 2.6.36.2, Dell
Poweredge 1950s. Corosync==1.3.0, Openais==1.1.4
Reporter: Mark Moseley
Assignee: michael j. goulish
Fix For: 0.11

Attachments: 2993_bug.sh, cluster-fed-src.sh

This is related to JIRA 2992 that I opened, but this is for source-local
routes. Given the same setup as in JIRA 2992 but using source-local routes
(and obviously with the exchanges switched accordingly in the qpid-route
statements), i.e. cluster A and cluster B with the routes between A1-B1,
when cluster B shuts down in the order B2-B1 and starts back up, the static
routes are not correctly re-bound on cluster A's side. However if cluster B
is shut down in the order B1-B2 and started back up, the route is correctly
created and works. However in the non-functioning case (B2-B1, or A2-A1),
there is an additional side-effect: on node A2, qpidd crashes with the
following error (cluster A is called 'walclust', B is bosclust):
2011-01-07 18:57:35 error Channel exception: not-attached: Channel 1 is not
attached (qpid/amqp_0_10/SessionHandler.cpp:39)
2011-01-07 18:57:35 critical cluster(102.0.0.0:13650 READY/error) local error
2030 did not occur on member 101.0.0.0:9920: not-attached: Channel 1 is not
attached (qpid/amqp_0_10/SessionHandler.cpp:39)
2011-01-07 18:57:35 critical Error delivering frames: local error did not
occur on all cluster members : not-attached: Channel 1 is not attached
(qpid/amqp_0_10/SessionHandler.cpp:39) (qpid/cluster/ErrorCheck.cpp:89)
2011-01-07 18:57:35 notice cluster(102.0.0.0:13650 LEFT/error) leaving
cluster walclust
2011-01-07 18:57:35 notice Shut down
This happens on both sides of the cluster, so it's not limited to one or the
other. This crash does *not* occur in the A1-A2/B1-B2 test (i.e. the test
where the route is re-bound correctly). I can cause this to reoccur pretty
much every time. I've been resetting the cluster completely to a new state
between each test. Occasionally in the B2-B1 test, A1 will also crash with
the same error (and vice versa for A2-A1 for node B1), though most of the
time, it's A2/B2 that crashes.
I was getting this same behaviour prior to upgrading corosync/openais as
well. Previously I was using the stock Squeeze versions of corosync==1.2.1
and openais==1.1.2. The results are the same with corosync=1.3.0 and
openais==1.1.4.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project: http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Created: (QPID-3123) disallow federation links between members of a cluster

2011-03-07 Thread michael j. goulish (JIRA)

disallow federation links between members of a cluster
--

 Key: QPID-3123
 URL: https://issues.apache.org/jira/browse/QPID-3123
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish
Priority: Minor


Attempts to create a federation link between two brokers that are members of 
the same cluster should be disallowed.   Otherwise messages delivered to one 
end of the link are double-delivered to members of the cluster.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Created: (QPID-3117) qpidd --help output on --sasl-config is misleading

2011-03-04 Thread michael j. goulish (JIRA)

qpidd --help output on --sasl-config is misleading
--

 Key: QPID-3117
 URL: https://issues.apache.org/jira/browse/QPID-3117
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish
Priority: Minor


The --help output suggest that --sasl-config points to a file.  But it points 
to a directory.



-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Created: (QPID-3096) remove qrsh from cpp/src/tests

2011-03-01 Thread michael j. goulish (JIRA)

remove qrsh from cpp/src/tests
--

 Key: QPID-3096
 URL: https://issues.apache.org/jira/browse/QPID-3096
 Project: Qpid
  Issue Type: Improvement
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish
Priority: Minor


The qrsh material represents an experiment that was never used, in fact never 
quite completed.  It should be removed.


-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Closed: (QPID-2617) make sasl-based tests config files relocatable

2011-03-01 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-2617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish closed QPID-2617.



implemented, and then secondary problems fixed, in the following svn revs:

947748 
947764 
947850 
947994

 make sasl-based tests config files relocatable
 --

 Key: QPID-2617
 URL: https://issues.apache.org/jira/browse/QPID-2617
 Project: Qpid
  Issue Type: Improvement
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish

 Currently, sasl-based testing is hampered by the fact that the sasl library 
 code is only reading its configuration information from the standard 
 installed location /etc/sasl2 .  This means that sasl installation must be 
 done manually on a machine prior to running sasl-based tests, which makes 
 them not fully automatable.
  1.  Find a way of telling the library to look elsewhere
  2.  expose that mechanism through a broker option 
  3.  check a suitable config file into our source tree, at cpp/src/tests
  4.  reference the locally-created sasldb from the new config file
  5.  the new sasl tests should be runnable by any users -- not just root.

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Commented: (QPID-2617) make sasl-based tests config files relocatable

2011-03-01 Thread michael j. goulish (JIRA)

[
https://issues.apache.org/jira/browse/QPID-2617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13000917#comment-13000917
]

michael j. goulish commented on QPID-2617:
--

Implemented, and then secondary problems fixed, in the following svn versions:

947748
947764
947850
947994

make sasl-based tests config files relocatable
--

Key: QPID-2617
URL: https://issues.apache.org/jira/browse/QPID-2617
Project: Qpid
Issue Type: Improvement
Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish

Currently, sasl-based testing is hampered by the fact that the sasl library
code is only reading its configuration information from the standard
installed location /etc/sasl2 . This means that sasl installation must be
done manually on a machine prior to running sasl-based tests, which makes
them not fully automatable.
1. Find a way of telling the library to look elsewhere
2. expose that mechanism through a broker option
3. check a suitable config file into our source tree, at cpp/src/tests
4. reference the locally-created sasldb from the new config file
5. the new sasl tests should be runnable by any users -- not just root.

--
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project: http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Resolved: (QPID-3096) remove qrsh from cpp/src/tests

2011-03-01 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-3096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish resolved QPID-3096.
--

Resolution: Fixed

fixed in rev  1075874

 remove qrsh from cpp/src/tests
 --

 Key: QPID-3096
 URL: https://issues.apache.org/jira/browse/QPID-3096
 Project: Qpid
  Issue Type: Improvement
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish
Priority: Minor

 The qrsh material represents an experiment that was never used, in fact never 
 quite completed.  It should be removed.

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Assigned: (QPID-2057) qpidd --require-encryption forces SASL security layer to be used even if SSL is in use

2011-02-22 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-2057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish reassigned QPID-2057:


Assignee: michael j. goulish

 qpidd --require-encryption forces SASL security layer to be used even if SSL 
 is in use
 --

 Key: QPID-2057
 URL: https://issues.apache.org/jira/browse/QPID-2057
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Affects Versions: 0.5
Reporter: Ted Ross
Assignee: michael j. goulish
Priority: Minor

 If running SSL and using GSSAPI authentication with the --require-encryption 
 option turned on, the SASL layer will force minSsf to be greater than zero, 
 thus causing the SASL security layer to encrypt.
 This is unnecessary double-encryption since SSL is already providing a cipher 
 channel at a lower layer.

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-2993) Federated source-local links crash remotely federated cluster member on local cluster startup

2011-02-16 Thread michael j. goulish (JIRA)

[
https://issues.apache.org/jira/browse/QPID-2993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

michael j. goulish updated QPID-2993:
-

Attachment: 2993_bug.sh

Federated source-local links crash remotely federated cluster member on local
cluster startup
-

--
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project: http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Issue Comment Edited: (QPID-2993) Federated source-local links crash remotely federated cluster member on local cluster startup

2011-02-16 Thread michael j. goulish (JIRA)


[ 
https://issues.apache.org/jira/browse/QPID-2993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12995336#comment-12995336
 ] 

michael j. goulish edited comment on QPID-2993 at 2/16/11 3:23 PM:
---

Well, with the latest code tree  ( r1071018 ) I do indeed find something 
interesting.  I cannot get it to actually crash, but with the attached script  
( 2993_bug.sh ) running on a fast 8-proc system, I was able to see 2 out of 10 
instances of broker A2 failing to start up, and logging the following error:

2011-02-16 09:34:23 error Channel exception: not-attached: Channel 1 is not 
attached (qpid/amqp_0_10/SessionHandler.cpp:39)
2011-02-16 09:34:23 critical cluster(20.0.100.36:11855 READY/error) local 
error 1498 did not occur on member 20.0.100.36:11714: not-attached: Channel 1 
is not attached (qpid/amqp_0_10/SessionHandler.cpp:39)
2011-02-16 09:34:23 critical Error delivering frames: local error did not 
occur on all cluster members : not-attached: Channel 1 is not attached 
(qpid/amqp_0_10/SessionHandler.cpp:39) (qpid/cluster/ErrorCheck.cpp:89)
2011-02-16 09:34:23 notice cluster(20.0.100.36:11855 LEFT/error) leaving 
cluster A
2011-02-16 09:34:23 notice Shut down

===

and, by the way, the store version is 4439






  was (Author: mgoulish2):
Well, with the latest code tree  ( r1071018 ) I do indeed find something 
interesting.  I cannot get it to actually crash, but with the attached script  
( 2993_bug.sh ) running on a fast 8-proc system, I was able to see 2 out of 10 
instances of broker A2 failing to start up, and logging the following error:

2011-02-16 09:34:23 error Channel exception: not-attached: Channel 1 is not 
attached (qpid/amqp_0_10/SessionHandler.cpp:39)
2011-02-16 09:34:23 critical cluster(20.0.100.36:11855 READY/error) local 
error 1498 did not occur on member 20.0.100.36:11714: not-attached: Channel 1 
is not attached (qpid/amqp_0_10/SessionHandler.cpp:39)
2011-02-16 09:34:23 critical Error delivering frames: local error did not 
occur on all cluster members : not-attached: Channel 1 is not attached 
(qpid/amqp_0_10/SessionHandler.cpp:39) (qpid/cluster/ErrorCheck.cpp:89)
2011-02-16 09:34:23 notice cluster(20.0.100.36:11855 LEFT/error) leaving 
cluster A
2011-02-16 09:34:23 notice Shut down



  
 Federated source-local links crash remotely federated cluster member on local 
 cluster startup
 -

 Key: QPID-2993
 URL: https://issues.apache.org/jira/browse/QPID-2993
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker, C++ Clustering
Affects Versions: 0.8
 Environment: Debian Linux Squeeze, 32-bit, kernel 2.6.36.2, Dell 
 Poweredge 1950s. Corosync==1.3.0, Openais==1.1.4
Reporter: Mark Moseley
Assignee: michael j. goulish
 Attachments: 2993_bug.sh, cluster-fed-src.sh


 This is related to JIRA 2992 that I opened, but this is for source-local 
 routes. Given the same setup as in JIRA 2992 but using source-local routes 
 (and obviously with the exchanges switched accordingly in the qpid-route 
 statements), i.e. cluster A and cluster B with the routes between A1-B1, 
 when cluster B shuts down in the order B2-B1 and starts back up, the static 
 routes are not correctly re-bound on cluster A's side. However if cluster B 
 is shut down in the order B1-B2 and started back up, the route is correctly 
 created and works. However in the non-functioning case (B2-B1, or A2-A1), 
 there is an additional side-effect: on node A2, qpidd crashes with the 
 following error (cluster A is called 'walclust', B is bosclust):
 2011-01-07 18:57:35 error Channel exception: not-attached: Channel 1 is not 
 attached (qpid/amqp_0_10/SessionHandler.cpp:39)
 2011-01-07 18:57:35 critical cluster(102.0.0.0:13650 READY/error) local error 
 2030 did not occur on member 101.0.0.0:9920: not-attached: Channel 1 is not 
 attached (qpid/amqp_0_10/SessionHandler.cpp:39)
 2011-01-07 18:57:35 critical Error delivering frames: local error did not 
 occur on all cluster members : not-attached: Channel 1 is not attached 
 (qpid/amqp_0_10/SessionHandler.cpp:39) (qpid/cluster/ErrorCheck.cpp:89)
 2011-01-07 18:57:35 notice cluster(102.0.0.0:13650 LEFT/error) leaving 
 cluster walclust
 2011-01-07 18:57:35 notice Shut down
 This happens on both sides of the cluster, so it's not limited to one or the 
 other. This crash does *not* occur in the A1-A2/B1-B2 test (i.e. the test 
 where the route is re-bound correctly). I can cause this to reoccur pretty 
 much every time. I've been resetting the cluster completely to a new state 
 between each test. Occasionally in the B2-B1 test, A1 will also crash with 
 the same error (and vice versa for

[jira] Commented: (QPID-2993) Federated source-local links crash remotely federated cluster member on local cluster startup

2011-02-07 Thread michael j. goulish (JIRA)

[
https://issues.apache.org/jira/browse/QPID-2993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12991588#comment-12991588
]

michael j. goulish commented on QPID-2993:
--

I was not able to repro the crash, using the code tree from 10 Jan, after 100
iterations.I will try again with latest code tree.

Federated source-local links crash remotely federated cluster member on local
cluster startup
-

--
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
Apache Qpid - AMQP Messaging Implementation
Project: http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Commented: (QPID-2992) Cluster failing to resurrect durable static route depending on order of shutdown

2011-02-03 Thread michael j. goulish (JIRA)


[ 
https://issues.apache.org/jira/browse/QPID-2992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12990165#comment-12990165
 ] 

michael j. goulish commented on QPID-2992:
--

Using modifications of Ken's script, I have reproduced two
bad behaviors, including the one that Mark is reporting.

I don't think this is a bug... well, sort of. Two, actually.
I will submit a doc bug, and probably one enhancement request.

What's happening is this:  messaging systems that include
clusters and stores are sensitive to timing issues around
events like broker introduction and shut-down.

Here are the timing issues that I know of:

1. When you shut down a cluster that is using a store,
   there must be time for the last-broker-standing to
   realize his status, and mark his store as clean.
   I.e. my store is the one we should use at re-start.
   If all brokers are killed too quickly, this will not
   happen.  The cluster will not be able to restart
   because it will not find any store that has been
   marked clean.


2. When you make a topology change, i.e. adding a route
   from one cluster to another to create a federation-of-
   clusters, if you then shut down the cluster soon afterwards
   you may get it before that topology-change has had a chance
   to propagate across the cluster.

   This can cause a problem on re-start that depends on the
   order in which the brokers are killed.  If you *first* kill
   the broker that knew about the topology change before he
   manages to communicate that knowledge to the other broker,
   that's Bad.  because the other broker will be the last-man-
   standing, and it will be *his* store that gets marked as
   clean!  So his store will be re-used at startup, and the
   cluster will have lost knowledge of the topology change.


By altering the timing of events in Ken's script, I was able
to:

A. get no failures in 200 runs.  (original script, plus explicit
   wait-loops fro brokers.)

B. get 100% failure because of no clean store.  (kill both brokers
   in B cluster too close together.)

C. get the failure that Mark reported, about 7% of the time.
   ( place B1 under load, then kill it too soon after route-
 creation. )


So, here's what I will propose...

I. A bit of documentation ( I will take first sketch-whack at it,
   then give to doc professionals) to centralize description of
   this type of problem -- the two I have mentioned above, plus
   whatever anyone else thinks up that is similar.

   This will include best-practices on how to avoid this type of
   problem.


II. A request for enhancement wherever there is no very good way
to avoid one of these multi-broker race conditions.


III. III'll come back and update this Jira with the numbers of
 any resultant Jiras that I open.



 Cluster failing to resurrect durable static route depending on order of 
 shutdown
 

 Key: QPID-2992
 URL: https://issues.apache.org/jira/browse/QPID-2992
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker, C++ Clustering
Affects Versions: 0.8
 Environment: Debian Linux Squeeze, 32-bit, kernel 2.6.36.2, Dell 
 Poweredge 1950s. Corosync==1.3.0, Openais==1.1.4
Reporter: Mark Moseley
Assignee: michael j. goulish
 Attachments: cluster-fed.sh, error


 I've got a 2-node qpid test cluster at each of 2 datacenters, which are 
 federated together with a single durable static route between each. Qpid is 
 version 0.8. Corosync and openais are stock Squeeze (1.2.1-3 and 1.1.2-2, 
 respectively). OS is Squeeze, 32-bit, on Dell Poweredge 1950s, kernel 2.6.36. 
 The static route is durable and is set up over SSL (but I can replicate as 
 well with non-SSL). I've tried to normalize the hostnames below to make 
 things clearer; hopefully I didn't mess anything up.
 Given two clusters, cluster A (consisting of hosts A1 and A2) and cluster B 
 (with B1 and B2), I've got a static exchange route from A1 to B1, as well as 
 another from B1 to A1. Federation is working correctly, so I can send a 
 message on A2 and have it successfully retrieved on B2. The exchange local to 
 cluster A is walmyex1; the local exchange for B is bosmyex1.
 If I shut down the cluster in this order: B2, then B1, and start back up with 
 B1, B2, the static route route fails to get recreated. That is, on A1/A2, 
 looking at the bindings, exchange 'bosmyex1' does not get re-bound to cluster 
 B; the only output for it in qpid-config exchanges --bindings is just:
 snip
 Exchange 'bosmyex1' (direct)
 /snip
 If however I shut the cluster down in this order: B1,

[jira] Resolved: (QPID-2992) Cluster failing to resurrect durable static route depending on order of shutdown

2011-02-03 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-2992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish resolved QPID-2992.
--

Resolution: Won't Fix

I don't really want to say Won't Fix here -- I really want to say Will make 
two separate Jiras.

( please see my other comment for a complete explanation )


 Cluster failing to resurrect durable static route depending on order of 
 shutdown
 

 Key: QPID-2992
 URL: https://issues.apache.org/jira/browse/QPID-2992
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker, C++ Clustering
Affects Versions: 0.8
 Environment: Debian Linux Squeeze, 32-bit, kernel 2.6.36.2, Dell 
 Poweredge 1950s. Corosync==1.3.0, Openais==1.1.4
Reporter: Mark Moseley
Assignee: michael j. goulish
 Attachments: cluster-fed.sh, error


 I've got a 2-node qpid test cluster at each of 2 datacenters, which are 
 federated together with a single durable static route between each. Qpid is 
 version 0.8. Corosync and openais are stock Squeeze (1.2.1-3 and 1.1.2-2, 
 respectively). OS is Squeeze, 32-bit, on Dell Poweredge 1950s, kernel 2.6.36. 
 The static route is durable and is set up over SSL (but I can replicate as 
 well with non-SSL). I've tried to normalize the hostnames below to make 
 things clearer; hopefully I didn't mess anything up.
 Given two clusters, cluster A (consisting of hosts A1 and A2) and cluster B 
 (with B1 and B2), I've got a static exchange route from A1 to B1, as well as 
 another from B1 to A1. Federation is working correctly, so I can send a 
 message on A2 and have it successfully retrieved on B2. The exchange local to 
 cluster A is walmyex1; the local exchange for B is bosmyex1.
 If I shut down the cluster in this order: B2, then B1, and start back up with 
 B1, B2, the static route route fails to get recreated. That is, on A1/A2, 
 looking at the bindings, exchange 'bosmyex1' does not get re-bound to cluster 
 B; the only output for it in qpid-config exchanges --bindings is just:
 snip
 Exchange 'bosmyex1' (direct)
 /snip
 If however I shut the cluster down in this order: B1, then B2, and start B2, 
 then B1, the static route gets re-bound. The output then is:
 snip
 Exchange 'bosmyex1' (direct)
 bind [unix.boston.cust] = 
 bridge_queue_1_8870523d-2286-408e-b5b5-50d53db2fa61
 /bind
 and I can message over the federated link with no further modification. Prior 
 to a few minutes ago, I was seeing this with the Squeeze stock openais==1.1.2 
 and corosync==1.2.1. In debugging this, I've upgraded both to the latest 
 versions with no change.
 I can replicate this every time I try. These are just test clusters, so I 
 don't have any other activity going on on them, or any other 
 exchanges/queues. My steps:
 On all boxes in cluster A and B:
 * Kill the qpidd if it's running and delete all existing store files, i.e. 
 contents of /var/lib/qpid/
 On host A1 in cluster A (I'm leaving out the -a user/test@host stuff):
 * Start up qpid
 * qpid-config add exchange direct bosmyex1 --durable
 * qpid-config add exchange direct walmyex1 --durable
 * qpid-config add queue walmyq1 --durable
 * qpid-config bind walmyex1 walmyq1 unix.waltham.cust
 On host B1 in cluster B:
 * qpid-config add exchange direct bosmyex1 --durable
 * qpid-config add exchange direct walmyex1 --durable
 * qpid-config add queue bosmyq1 --durable
 * qpid-config bind bosmyex1 bosmyq1 unix.boston.cust
 On cluster A:
 * Start other member of cluster, A2
 * qpid-route route add amqps://user/pass@HOSTA1:5671 
 amqps://user/pass@HOSTB1:5671 walmyex1 unix.waltham.cust -d
 On cluster B:
 * Start other member of cluster, B2
 * qpid-route route add amqps://user/pass@HOSTB1:5671 
 amqps://user/pass@HOSTA1:5671 bosmyex1 unix.boston.cust -d
 On either cluster:
 * Check qpid-config exchanges --bindings to make sure bindings are correct 
 for remote exchanges
 * To see correct behaviour, stop cluster in the order B1-B2, or A1-A2, 
 start cluster back up, check bindings.
 * To see broken behaviour, stop cluster in the order B2-B1, or A2-A1, start 
 cluster back up, check bindings.
 This is a test cluster, so I'm free to do anything with it, debugging-wise, 
 that would be useful. 

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Created: (QPID-3037) document race conditions in complex messaging systems

2011-02-03 Thread michael j. goulish (JIRA)

document race conditions in complex messaging systems
-

 Key: QPID-3037
 URL: https://issues.apache.org/jira/browse/QPID-3037
 Project: Qpid
  Issue Type: Improvement
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish


inspired by QPID-2992

We need some documentation to describe the kinds of 'race' problems you can get 
into when creating and shutting down messaging systems involving clusters, 
store, and federation.  There are 'race conditions' that can give the 
appearance of bugs that depend on, for example, the speed with which you shut 
down brokers and the order in which you shut down brokers in a multi-broker 
messaging system.

We should have a single place in the docs where we collect what we know about 
the most common of these problems, and best practices for avoiding them.

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira



-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Assigned: (QPID-2992) Cluster failing to resurrect durable static route depending on order of shutdown

2011-01-27 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-2992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish reassigned QPID-2992:


Assignee: michael j. goulish  (was: Alan Conway)

 Cluster failing to resurrect durable static route depending on order of 
 shutdown
 

 Key: QPID-2992
 URL: https://issues.apache.org/jira/browse/QPID-2992
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker, C++ Clustering
Affects Versions: 0.8
 Environment: Debian Linux Squeeze, 32-bit, kernel 2.6.36.2, Dell 
 Poweredge 1950s. Corosync==1.3.0, Openais==1.1.4
Reporter: Mark Moseley
Assignee: michael j. goulish
 Attachments: cluster-fed.sh, error


 I've got a 2-node qpid test cluster at each of 2 datacenters, which are 
 federated together with a single durable static route between each. Qpid is 
 version 0.8. Corosync and openais are stock Squeeze (1.2.1-3 and 1.1.2-2, 
 respectively). OS is Squeeze, 32-bit, on Dell Poweredge 1950s, kernel 2.6.36. 
 The static route is durable and is set up over SSL (but I can replicate as 
 well with non-SSL). I've tried to normalize the hostnames below to make 
 things clearer; hopefully I didn't mess anything up.
 Given two clusters, cluster A (consisting of hosts A1 and A2) and cluster B 
 (with B1 and B2), I've got a static exchange route from A1 to B1, as well as 
 another from B1 to A1. Federation is working correctly, so I can send a 
 message on A2 and have it successfully retrieved on B2. The exchange local to 
 cluster A is walmyex1; the local exchange for B is bosmyex1.
 If I shut down the cluster in this order: B2, then B1, and start back up with 
 B1, B2, the static route route fails to get recreated. That is, on A1/A2, 
 looking at the bindings, exchange 'bosmyex1' does not get re-bound to cluster 
 B; the only output for it in qpid-config exchanges --bindings is just:
 snip
 Exchange 'bosmyex1' (direct)
 /snip
 If however I shut the cluster down in this order: B1, then B2, and start B2, 
 then B1, the static route gets re-bound. The output then is:
 snip
 Exchange 'bosmyex1' (direct)
 bind [unix.boston.cust] = 
 bridge_queue_1_8870523d-2286-408e-b5b5-50d53db2fa61
 /bind
 and I can message over the federated link with no further modification. Prior 
 to a few minutes ago, I was seeing this with the Squeeze stock openais==1.1.2 
 and corosync==1.2.1. In debugging this, I've upgraded both to the latest 
 versions with no change.
 I can replicate this every time I try. These are just test clusters, so I 
 don't have any other activity going on on them, or any other 
 exchanges/queues. My steps:
 On all boxes in cluster A and B:
 * Kill the qpidd if it's running and delete all existing store files, i.e. 
 contents of /var/lib/qpid/
 On host A1 in cluster A (I'm leaving out the -a user/test@host stuff):
 * Start up qpid
 * qpid-config add exchange direct bosmyex1 --durable
 * qpid-config add exchange direct walmyex1 --durable
 * qpid-config add queue walmyq1 --durable
 * qpid-config bind walmyex1 walmyq1 unix.waltham.cust
 On host B1 in cluster B:
 * qpid-config add exchange direct bosmyex1 --durable
 * qpid-config add exchange direct walmyex1 --durable
 * qpid-config add queue bosmyq1 --durable
 * qpid-config bind bosmyex1 bosmyq1 unix.boston.cust
 On cluster A:
 * Start other member of cluster, A2
 * qpid-route route add amqps://user/pass@HOSTA1:5671 
 amqps://user/pass@HOSTB1:5671 walmyex1 unix.waltham.cust -d
 On cluster B:
 * Start other member of cluster, B2
 * qpid-route route add amqps://user/pass@HOSTB1:5671 
 amqps://user/pass@HOSTA1:5671 bosmyex1 unix.boston.cust -d
 On either cluster:
 * Check qpid-config exchanges --bindings to make sure bindings are correct 
 for remote exchanges
 * To see correct behaviour, stop cluster in the order B1-B2, or A1-A2, 
 start cluster back up, check bindings.
 * To see broken behaviour, stop cluster in the order B2-B1, or A2-A1, start 
 cluster back up, check bindings.
 This is a test cluster, so I'm free to do anything with it, debugging-wise, 
 that would be useful. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Assigned: (QPID-2993) Federated source-local links crash remotely federated cluster member on local cluster startup

2011-01-27 Thread michael j. goulish (JIRA)

[
https://issues.apache.org/jira/browse/QPID-2993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

michael j. goulish reassigned QPID-2993:

Assignee: michael j. goulish (was: Alan Conway)

Federated source-local links crash remotely federated cluster member on local
cluster startup
-

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

-
Apache Qpid - AMQP Messaging Implementation
Project: http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-2949) broker prompts console interactively for password when --auth=no

2010-11-22 Thread michael j. goulish (JIRA)

[
https://issues.apache.org/jira/browse/QPID-2949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

michael j. goulish updated QPID-2949:
-

Attachment: dont_prompt_me_2.diff

obsoletes previous patch

this patch provides a way to tell SaslFactory that console interaction is NOT
ok. i.e. if the code is running as part of a broker, or a demonized client of
some kind. Just tell it to never do interaction, and any attempt to interact
will be treated as an error.

This script demonstrates that all goes well if you supply enough info :

rm -rf /tmp/data_1 /tmp/data_2
mkdir /tmp/data_1 /tmp/data_2

# in window 1:
../qpidd -p 5672 --data-dir /tmp/data_1 --auth=yes --mgmt-enable=yes
--log-enable info+ ./qpidd_1.log --log-source yes
--sasl-config=/home/mick/trunk/qpid/cpp/src/tests/sasl_config

# in window 2:
../qpidd -p 1 --data-dir /tmp/data_2 --auth=yes --mgmt-enable=yes
--log-enable info+ ./qpidd_1.log --log-source yes
--sasl-config=/home/mick/trunk/qpid/cpp/src/tests/sasl_config

# in window 3 ( from qpid dir )
./tools/src/py/qpid-route dynamic add zig/z...@localhost
zig/z...@localhost:1 qmf.default.direct
# and view the created route
./tools/src/py/qpid-route route list localhost:5672

If you say auth=no, that works fine also.

HOWEVER PLEASE NOTE --

if you say auth=yes, but then do not supply enough into to avoid the need for
interaction, the attempted interaction will result in the connection being
closed. Then the originating broker will re-try the connection, and you will
get a two-broker infinite loop until you fix it.

broker prompts console interactively for password when --auth=no

Key: QPID-2949
URL: https://issues.apache.org/jira/browse/QPID-2949
Project: Qpid
Issue Type: Bug
Components: C++ Broker
Affects Versions: 0.8
Reporter: michael j. goulish
Assignee: michael j. goulish
Priority: Minor
Fix For: 0.9

Attachments: dont_prompt_me_2.diff, dont_prompt_me_2.diff,
dont_prompt_me_2.diff, dont_prompt_me_noauth.diff

As a result of checkin svn r1024541, which promoted some client-side Sasl
code to the common library for use in broker, the broker now prompts for a
password when when it is run with --auth=no !
The attached patch removes this behavior by propagating knowledge of
--auth=no down to SaslFactory. If authorization has been turned off, the
Saslfactory will create a null sasl object, just like it does if the code is
compiled with no Sasl support.
TODO -- also must fix the pathway where auth==yes.
NOTE: this is apparently an irritant rather than a disaster, since it did not
affect make check after the original checkin ( r102451 ).

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

-
Apache Qpid - AMQP Messaging Implementation
Project: http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-2949) broker prompts console interactively for password when --auth=no

2010-11-16 Thread michael j. goulish (JIRA)

[
https://issues.apache.org/jira/browse/QPID-2949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

michael j. goulish updated QPID-2949:
-

Description:
As a result of checkin svn r1024541, which promoted some client-side Sasl code
to the common library for use in broker, the broker now prompts for a password
when when it is run with --auth=no !

The attached patch removes this behavior by propagating knowledge of
--auth=no down to SaslFactory. If authorization has been turned off, the
Saslfactory will create a null sasl object, just like it does if the code is
compiled with no Sasl support.

TODO -- also must fix the pathway where auth==yes.

NOTE: this is apparently an irritant rather than a disaster, since it did not
affect make check after the original checkin ( r102451 ).

was:
As a result of checkin svn r1024541, which promoted some client-side Sasl code
to the common library for use in broker, the broker now prompts for a password
when when it is run with --auth=no !

broker prompts console interactively for password when --auth=no

Attachments: dont_prompt_me.diff

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

-
Apache Qpid - AMQP Messaging Implementation
Project: http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Assigned: (QPID-1672) Inter-broker links need full SASL support

2010-09-29 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-1672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish reassigned QPID-1672:


Assignee: michael j. goulish  (was: Ken Giusti)

 Inter-broker links need full SASL support
 -

 Key: QPID-1672
 URL: https://issues.apache.org/jira/browse/QPID-1672
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Affects Versions: M4
Reporter: Gordon Sim
Assignee: michael j. goulish



-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Closed: (QPID-2617) make sasl-based tests config files relocatable

2010-05-24 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-2617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish closed QPID-2617.


Resolution: Fixed

fixed with svn rev 947748.

 make sasl-based tests config files relocatable
 --

 Key: QPID-2617
 URL: https://issues.apache.org/jira/browse/QPID-2617
 Project: Qpid
  Issue Type: Improvement
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish

 Currently, sasl-based testing is hampered by the fact that the sasl library 
 code is only reading its configuration information from the standard 
 installed location /etc/sasl2 .  This means that sasl installation must be 
 done manually on a machine prior to running sasl-based tests, which makes 
 them not fully automatable.
  1.  Find a way of telling the library to look elsewhere
  2.  expose that mechanism through a broker option 
  3.  check a suitable config file into our source tree, at cpp/src/tests
  4.  reference the locally-created sasldb from the new config file
  5.  the new sasl tests should be runnable by any users -- not just root.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Created: (QPID-2617) make sasl-based tests config files relocatable

2010-05-19 Thread michael j. goulish (JIRA)

make sasl-based tests config files relocatable
--

 Key: QPID-2617
 URL: https://issues.apache.org/jira/browse/QPID-2617
 Project: Qpid
  Issue Type: Improvement
  Components: C++ Broker
Reporter: michael j. goulish
Assignee: michael j. goulish


Currently, sasl-based testing is hampered by the fact that the sasl library 
code is only reading its configuration information from the standard installed 
location /etc/sasl2 .  This means that sasl installation must be done manually 
on a machine prior to running sasl-based tests, which makes them not fully 
automatable.

 1.  Find a way of telling the library to look elsewhere
 2.  expose that mechanism through a broker option 
 3.  check a suitable config file into our source tree, at cpp/src/tests
 4.  reference the locally-created sasldb from the new config file
 5.  the new sasl tests should be runnable by any users -- not just root.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-2187) Allow clients to make secure/authenticated connections to a cluster.

2010-05-14 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-2187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish updated QPID-2187:
-

Attachment: 944158.diff

Cluster + Security
---

* initial observation of a problem was a 2% failure rate in perftests
  of 20,000 messages against a cluster with security enabled.
  Problem was occasional receit of encrypted frames before the
  security codec had been enabled.  This is fixed with locking in
  cluster code (no new locks in broker code) and a callback that is
  fired by broker::ConnectionHandler::Handler to tell the cluster
  code when the opening handshake has finished.
  This was never a problem in the non-clustered broker before because
  everything happened in a single thread.

* the brokers that shadow the connection must not have null
  authenticators rather than real ones, so that they go through all
  the motions but don't do anythig.  Only the directly-connected
  broker can perform the security handshake.

* once the directly-connected broker receives the real user ID
  from its callback, it mcasts that ID to all other brokers.
  Otherwise the shadowing brokers will al think that the user ID
  is anonymous.
  Check this by doing a substantial perftest, and using
  qpid-stat -c localhost:PORT
  to confirm that the brokers all have the same userID for the
  same connection.

* the user ID, negotiated during the Sasl security startup, is
   communicated from the directly connected broker to all other
   cluster brokers.

* If security is *not* being used, then this code should *not* tell
  the brokers anything about the userID -- or it will step on the value
  that is being set by other code pathways.

* test program at cpp/src/tests/cluster_authentication_soak is not yet
  fully automated -- run it with something like
  sudo ./cluster_authentication_soak 500



 Allow clients to make secure/authenticated connections to a cluster.
 

 Key: QPID-2187
 URL: https://issues.apache.org/jira/browse/QPID-2187
 Project: Qpid
  Issue Type: Improvement
 Environment: all
Reporter: Ken Giusti
Assignee: michael j. goulish
 Attachments: 944158.diff


 The current implementation of clustering does not correctly handle 
 authentication correctly.From the trunk build:
 [kgiu...@localhost src]$ ./qpidd --auth yes --realm KGIUSTI.COM --log-enable 
 info+  --load-module ./.libs/cluster.so  --cluster-name ken
 2009-11-02 10:30:58 info Loaded Module: ./.libs/cluster.so
 2009-11-02 10:30:58 info Management enabled
 2009-11-02 10:30:58 notice Initializing CPG
 2009-11-02 10:30:58 notice cluster(127.0.0.1:14581 INIT) membership change: 
 127.0.0.1:14581 (joined: 127.0.0.1:14581(joined) )
 2009-11-02 10:30:58 info No message store configured, persistence is disabled.
 2009-11-02 10:30:58 info SASL enabled
 2009-11-02 10:30:58 notice Listening on TCP port 5672
 2009-11-02 10:30:58 notice cluster(127.0.0.1:14581 INIT) joining cluster ken 
 with url=amqp:tcp:10.16.19.19:5672,tcp:10.16.14.69:5672,tcp:192.168.122.1:5672
 2009-11-02 10:30:58 notice Broker running
 2009-11-02 10:30:58 info cluster(127.0.0.1:14581 READY) member update: 
 127.0.0.1:14581(member)
 2009-11-02 10:30:58 notice cluster(127.0.0.1:14581 READY) first in cluster
 2009-11-02 10:31:05 info SASL: Mechanism list: ANONYMOUS PLAIN DIGEST-MD5 
 LOGIN GSSAPI CRAM-MD5
 2009-11-02 10:31:05 info cluster(127.0.0.1:14581 READY) new local connection 
 127.0.0.1:14581-1
 2009-11-02 10:31:05 info SASL: Starting authentication with mechanism: GSSAPI
 2009-11-02 10:31:05 info SASL: Authentication succeeded for: 
 testu...@kgiusti.com
 2009-11-02 10:31:05 error cluster(127.0.0.1:14581 READY) aborting connection 
 127.0.0.1:14581-1: framing-error: Reserved bits not zero 
 (qpid/framing/AMQFrame.cpp:132)
 2009-11-02 10:31:05 info cluster(127.0.0.1:14581 READY) connection closed 
 127.0.0.1:14581-1
 
 The above error occurs when running perftest against the cluster in the 
 following manner:
 [kgiu...@localhost tests]$ /usr/kerberos/bin/kinit testu...@kgiusti.com
 [kgiu...@localhost tests]$ ./perftest -b localhost.localdomain --mechanism 
 GSSAPI --username testuser --tx 1 --count 1 --summary --log-enable info+
 2009-11-02 10:31:05 info Connecting to tcp:localhost.localdomain:5672
 2009-11-02 10:31:05 info Installing security layer,  SSF: 56
 2009-11-02 10:31:05 warning Connection closed
 Running the same test, but turning off clustering, authentication succeeds.
 Alan has determined that the problem is due to the way the clustered broker 
 constructs the codec chain.  The chain is built without the codec for a 
 secure connection.
 The correct solution would implement a mechanism that allows more generic 
 chaining of the codecs.  It should be possible to

[jira] Closed: (QPID-2187) Allow clients to make secure/authenticated connections to a cluster.

2010-05-14 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-2187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish closed QPID-2187.


Resolution: Fixed

Cluster + Security
---

* initial observation of a problem was a 2% failure rate in perftests
  of 20,000 messages against a cluster with security enabled.
  Problem was occasional receit of encrypted frames before the
  security codec had been enabled.  This is fixed with locking in
  cluster code (no new locks in broker code) and a callback that is
  fired by broker::ConnectionHandler::Handler to tell the cluster
  code when the opening handshake has finished.
  This was never a problem in the non-clustered broker before because
  everything happened in a single thread.

* the brokers that shadow the connection must not have null
  authenticators rather than real ones, so that they go through all
  the motions but don't do anythig.  Only the directly-connected
  broker can perform the security handshake.

* once the directly-connected broker receives the real user ID
  from its callback, it mcasts that ID to all other brokers.
  Otherwise the shadowing brokers will al think that the user ID
  is anonymous.
  Check this by doing a substantial perftest, and using
  qpid-stat -c localhost:PORT
  to confirm that the brokers all have the same userID for the
  same connection.

* the user ID, negotiated during the Sasl security startup, is
   communicated from the directly connected broker to all other
   cluster brokers.

* If security is *not* being used, then this code should *not* tell
  the brokers anything about the userID -- or it will step on the value
  that is being set by other code pathways.

* test program at cpp/src/tests/cluster_authentication_soak is not yet
  fully automated -- run it with something like
  sudo ./cluster_authentication_soak 500



 Allow clients to make secure/authenticated connections to a cluster.
 

 Key: QPID-2187
 URL: https://issues.apache.org/jira/browse/QPID-2187
 Project: Qpid
  Issue Type: Improvement
 Environment: all
Reporter: Ken Giusti
Assignee: michael j. goulish
 Attachments: 944158.diff


 The current implementation of clustering does not correctly handle 
 authentication correctly.From the trunk build:
 [kgiu...@localhost src]$ ./qpidd --auth yes --realm KGIUSTI.COM --log-enable 
 info+  --load-module ./.libs/cluster.so  --cluster-name ken
 2009-11-02 10:30:58 info Loaded Module: ./.libs/cluster.so
 2009-11-02 10:30:58 info Management enabled
 2009-11-02 10:30:58 notice Initializing CPG
 2009-11-02 10:30:58 notice cluster(127.0.0.1:14581 INIT) membership change: 
 127.0.0.1:14581 (joined: 127.0.0.1:14581(joined) )
 2009-11-02 10:30:58 info No message store configured, persistence is disabled.
 2009-11-02 10:30:58 info SASL enabled
 2009-11-02 10:30:58 notice Listening on TCP port 5672
 2009-11-02 10:30:58 notice cluster(127.0.0.1:14581 INIT) joining cluster ken 
 with url=amqp:tcp:10.16.19.19:5672,tcp:10.16.14.69:5672,tcp:192.168.122.1:5672
 2009-11-02 10:30:58 notice Broker running
 2009-11-02 10:30:58 info cluster(127.0.0.1:14581 READY) member update: 
 127.0.0.1:14581(member)
 2009-11-02 10:30:58 notice cluster(127.0.0.1:14581 READY) first in cluster
 2009-11-02 10:31:05 info SASL: Mechanism list: ANONYMOUS PLAIN DIGEST-MD5 
 LOGIN GSSAPI CRAM-MD5
 2009-11-02 10:31:05 info cluster(127.0.0.1:14581 READY) new local connection 
 127.0.0.1:14581-1
 2009-11-02 10:31:05 info SASL: Starting authentication with mechanism: GSSAPI
 2009-11-02 10:31:05 info SASL: Authentication succeeded for: 
 testu...@kgiusti.com
 2009-11-02 10:31:05 error cluster(127.0.0.1:14581 READY) aborting connection 
 127.0.0.1:14581-1: framing-error: Reserved bits not zero 
 (qpid/framing/AMQFrame.cpp:132)
 2009-11-02 10:31:05 info cluster(127.0.0.1:14581 READY) connection closed 
 127.0.0.1:14581-1
 
 The above error occurs when running perftest against the cluster in the 
 following manner:
 [kgiu...@localhost tests]$ /usr/kerberos/bin/kinit testu...@kgiusti.com
 [kgiu...@localhost tests]$ ./perftest -b localhost.localdomain --mechanism 
 GSSAPI --username testuser --tx 1 --count 1 --summary --log-enable info+
 2009-11-02 10:31:05 info Connecting to tcp:localhost.localdomain:5672
 2009-11-02 10:31:05 info Installing security layer,  SSF: 56
 2009-11-02 10:31:05 warning Connection closed
 Running the same test, but turning off clustering, authentication succeeds.
 Alan has determined that the problem is due to the way the clustered broker 
 constructs the codec chain.  The chain is built without the codec for a 
 secure connection.
 The correct solution would implement a mechanism that allows more generic 
 chaining of the codecs.  It should be possible to allow

[jira] Commented: (QPID-2235) clustered broker does not retry CPG calls that return TRY_AGAIN

2009-12-11 Thread michael j. goulish (JIRA)


[ 
https://issues.apache.org/jira/browse/QPID-2235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12789357#action_12789357
 ] 

michael j. goulish commented on QPID-2235:
--

fixed by svn rev 889657.  sorry, i do not have permission to change status of 
this Jira to resolved.   don't know why...


 clustered broker does  not retry CPG calls that return TRY_AGAIN
 

 Key: QPID-2235
 URL: https://issues.apache.org/jira/browse/QPID-2235
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Affects Versions: 0.5
Reporter: Alan Conway
Assignee: Alan Conway
Priority: Minor
 Fix For: 0.6


 Causes occasional failures on joining a cluster like this:
 soak-22: 2009-12-03 15:49:44 notice Initializing CPG
 soak-22: 2009-12-03 15:49:44 critical Unexpected error: Cannot join CPG group 
 soakTestCluster_9edd905b-92b3-4cfb-803f-120d7a088f1f: try again (6)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-1988) unhang failover_soak in case of broker error

2009-07-14 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish updated QPID-1988:
-

Attachment: unhang_failover_soak.patch

 unhang failover_soak in case of broker error
 

 Key: QPID-1988
 URL: https://issues.apache.org/jira/browse/QPID-1988
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
 Attachments: unhang_failover_soak.patch


 Currently failover_soak will deliberately leave its other brokers alive if 
 one of them exits unexpectedly.
 But this makes failover_soak a bad citizen in multi-test systems like make 
 check.
 The added line of code in this patch makes this exit-path from the program 
 the same as all others.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Created: (QPID-1988) unhang failover_soak in case of broker error

2009-07-14 Thread michael j. goulish (JIRA)

unhang failover_soak in case of broker error


 Key: QPID-1988
 URL: https://issues.apache.org/jira/browse/QPID-1988
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
 Attachments: unhang_failover_soak.patch

Currently failover_soak will deliberately leave its other brokers alive if one 
of them exits unexpectedly.
But this makes failover_soak a bad citizen in multi-test systems like make 
check.

The added line of code in this patch makes this exit-path from the program the 
same as all others.





-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-1908) cluster_manager -- testing tool

2009-06-17 Thread michael j. goulish (JIRA)

[
https://issues.apache.org/jira/browse/QPID-1908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

michael j. goulish updated QPID-1908:
-

Attachment: cluster_manager.diff

cluster_manager -- testing tool
---

Key: QPID-1908
URL: https://issues.apache.org/jira/browse/QPID-1908
Project: Qpid
Issue Type: Improvement
Components: C++ Broker
Reporter: michael j. goulish
Attachments: cluster_manager.diff

cluster_manager will start a cluster on a set of host:port URLs that you
specify on the command line.
The hosts can all be different -- so long as they are all clustered and have
openais running on them.
It will then periodically check the qpid cluster and make sure that there are
syill brokers running at the expected places.
( For this, it uses ssh and reads /proc . )
If it finds that one or more brokers are missing, it will start new brokers
to replace them.
This will allows us to run long-term tests in which the clients start and
stop (or are killed) during the test -- they will always know what set of
host:ports to attach to. Clients can be written either to try one host:port
after the other until they connect (which is functionality available from the
client library) -- or so that they try a single host:port -- then wait 30
seconds or so and retry if it doesn't work the first time.
( Because of the use of ssh, this is not a very high-speed application. It
often takes more than 30 seconds to check on a cluster of 3 brokers. )

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

-
Apache Qpid - AMQP Messaging Implementation
Project: http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Created: (QPID-1891) fix declaration problems with boost 1_33_1

2009-06-04 Thread michael j. goulish (JIRA)

fix declaration problems with boost 1_33_1
--

 Key: QPID-1891
 URL: https://issues.apache.org/jira/browse/QPID-1891
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish


Under Boost 1.33.1, the use of boost::starts_withcaused compilation 
problems like this:

/usr/include/boost/algorithm/string/iterator_range.hpp:289: error: 
make_iterator_range is already declared in this scope


So I replaced it with C stdlib equivalents.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-1891) fix declaration problems with boost 1_33_1

2009-06-04 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-1891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish updated QPID-1891:
-

Attachment: (was: tests.diff)

 fix declaration problems with boost 1_33_1
 --

 Key: QPID-1891
 URL: https://issues.apache.org/jira/browse/QPID-1891
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish

 Under Boost 1.33.1, the use of boost::starts_withcaused compilation 
 problems like this:
 /usr/include/boost/algorithm/string/iterator_range.hpp:289: error: 
 make_iterator_range is already declared in this scope
 So I replaced it with C stdlib equivalents.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-1891) fix declaration problems with boost 1_33_1

2009-06-04 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-1891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish updated QPID-1891:
-

Attachment: tests.diff
tests.diff

 fix declaration problems with boost 1_33_1
 --

 Key: QPID-1891
 URL: https://issues.apache.org/jira/browse/QPID-1891
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish

 Under Boost 1.33.1, the use of boost::starts_withcaused compilation 
 problems like this:
 /usr/include/boost/algorithm/string/iterator_range.hpp:289: error: 
 make_iterator_range is already declared in this scope
 So I replaced it with C stdlib equivalents.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-1891) fix declaration problems with boost 1_33_1

2009-06-04 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-1891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish updated QPID-1891:
-

Attachment: forked_broker.diff
tests.diff

 fix declaration problems with boost 1_33_1
 --

 Key: QPID-1891
 URL: https://issues.apache.org/jira/browse/QPID-1891
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
 Attachments: forked_broker.diff, tests.diff


 Under Boost 1.33.1, the use of boost::starts_withcaused compilation 
 problems like this:
 /usr/include/boost/algorithm/string/iterator_range.hpp:289: error: 
 make_iterator_range is already declared in this scope
 So I replaced it with C stdlib equivalents.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-1873) Queue durability does not propagate in cluster newbie broker update.

2009-05-22 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-1873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish updated QPID-1873:
-

Attachment: queue_durability_propagation.diff

 Queue durability does not propagate in cluster newbie broker update.
 

 Key: QPID-1873
 URL: https://issues.apache.org/jira/browse/QPID-1873
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
 Attachments: queue_durability_propagation.diff


 When a newbie broker is added to a cluster, it will hallucinate all of its 
 queues as durable.  
 It happens because the encode / decode stuff in src/qpid/broker/Queue.cpp  
 just doesn't propagate the actual durability status of queues.
 It assumes that all queues are durable.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Created: (QPID-1796) add durability to failover_soak.cpp

2009-04-08 Thread michael j. goulish (JIRA)

add durability to failover_soak.cpp
---

 Key: QPID-1796
 URL: https://issues.apache.org/jira/browse/QPID-1796
 Project: Qpid
  Issue Type: Improvement
  Components: C++ Broker
Reporter: michael j. goulish
Priority: Minor


Add durability option to the cpp failover_soak test.

The failover_soak executable uses several external executables, so they all had 
to be modified as well.

Default is still non-durable.

Setting durability=1 also has the effect of making replaying_sender send 
persistent messages.



-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-1693) Several improvements to cpp/src/tests/failover_soak.cpp

2009-02-26 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish updated QPID-1693:
-

Attachment: failover_soak_improvements.diff

 Several improvements to  cpp/src/tests/failover_soak.cpp  
 --

 Key: QPID-1693
 URL: https://issues.apache.org/jira/browse/QPID-1693
 Project: Qpid
  Issue Type: Improvement
  Components: C++ Broker
Reporter: michael j. goulish
Priority: Minor
 Attachments: failover_soak_improvements.diff


   improvements for failover_soak
   {
 1. Do not kill children if they hang.  Leave
alive for questioning.
 2. Identify what type of child is hanging.
 3. Print a clear end-of-test message for grepping
 ( and clear status indication on same line. )
 4. Detect and error out on spontaneous broker death.
 5. Improve message when child fails to start up.
 6. Don't kill brokers when they've been newly added
to the cluster and are updating.  It generates
spurious errors.
   }

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Created: (QPID-1693) Several improvements to cpp/src/tests/failover_soak.cpp

2009-02-26 Thread michael j. goulish (JIRA)

Several improvements to  cpp/src/tests/failover_soak.cpp  
--

 Key: QPID-1693
 URL: https://issues.apache.org/jira/browse/QPID-1693
 Project: Qpid
  Issue Type: Improvement
  Components: C++ Broker
Reporter: michael j. goulish
Priority: Minor
 Attachments: failover_soak_improvements.diff

  improvements for failover_soak
  {
1. Do not kill children if they hang.  Leave
   alive for questioning.

2. Identify what type of child is hanging.

3. Print a clear end-of-test message for grepping
( and clear status indication on same line. )

4. Detect and error out on spontaneous broker death.

5. Improve message when child fails to start up.

6. Don't kill brokers when they've been newly added
   to the cluster and are updating.  It generates
   spurious errors.
  }


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Created: (QPID-1678) better cluster names in cluster_failover soak

2009-02-24 Thread michael j. goulish (JIRA)

better cluster names in cluster_failover soak 
--

 Key: QPID-1678
 URL: https://issues.apache.org/jira/browse/QPID-1678
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Priority: Minor
 Attachments: failover_soak_uuid.diff

cluster_failover_soak  in   cpp/src/tests currently uses a random number 
between 0 and 1000 as the uniqing factor in its cluster names.   There may have 
been a collision recently when two tests were running simultaneously on one 
clustered setup during a Ptolemy test.

This change make it use UUIDs.   If one of these collides please tell me, so I 
can buy a lotto ticket with that number on it.



-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-1678) better cluster names in cluster_failover soak

2009-02-24 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-1678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish updated QPID-1678:
-

Attachment: failover_soak_uuid.diff

 better cluster names in cluster_failover soak 
 --

 Key: QPID-1678
 URL: https://issues.apache.org/jira/browse/QPID-1678
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Priority: Minor
 Attachments: failover_soak_uuid.diff


 cluster_failover_soak  in   cpp/src/tests currently uses a random number 
 between 0 and 1000 as the uniqing factor in its cluster names.   There may 
 have been a collision recently when two tests were running simultaneously on 
 one clustered setup during a Ptolemy test.
 This change make it use UUIDs.   If one of these collides please tell me, so 
 I can buy a lotto ticket with that number on it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-1674) failover_soak string-out-of-scope error

2009-02-23 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-1674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish updated QPID-1674:
-

Attachment: failover_soak.diff

 failover_soak string-out-of-scope error
 ---

 Key: QPID-1674
 URL: https://issues.apache.org/jira/browse/QPID-1674
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
 Attachments: failover_soak.diff


 Fixes an error caused by a string going out of scope just before ForkedBroker 
 startup.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Created: (QPID-1674) failover_soak string-out-of-scope error

2009-02-23 Thread michael j. goulish (JIRA)

failover_soak string-out-of-scope error
---

 Key: QPID-1674
 URL: https://issues.apache.org/jira/browse/QPID-1674
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
 Attachments: failover_soak.diff

Fixes an error caused by a string going out of scope just before ForkedBroker 
startup.



-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-1651) add editable broker argv to src/tests/ClusterFixture

2009-02-06 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-1651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish updated QPID-1651:
-

Attachment: ClusterFixtureArgv.diff

 add editable broker argv to src/tests/ClusterFixture
 

 Key: QPID-1651
 URL: https://issues.apache.org/jira/browse/QPID-1651
 Project: Qpid
  Issue Type: Improvement
  Components: C++ Broker
Reporter: michael j. goulish
Priority: Minor
 Attachments: ClusterFixtureArgv.diff


 make the argv that is used for the broker creation editable

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-1618) unacked persistent messages don't get to messageStore of newbie cluster broker.

2009-02-05 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish updated QPID-1618:
-

Attachment: qpid_1618.diff

 unacked persistent messages don't get to messageStore of newbie cluster 
 broker.
 ---

 Key: QPID-1618
 URL: https://issues.apache.org/jira/browse/QPID-1618
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Affects Versions: M4
Reporter: michael j. goulish
Priority: Critical
 Attachments: qpid_1618.diff


 When a new broker is added to a cluster, persistent messages that have not 
 yet been ACKed do not get stored in the new brokers messageStore.
 How to reproduce:   ( this is overview -- actual scripts follow )
 
 1. kill any MY_CLUSTER brokers from previous test
 2. remove data dirs from previous test, and recreate
 3. start node 1 as first member of MY_CLUSTER
 4  declare the MY_CLUSTER queue -- durable
 5. connect to it with receiver -- use ack frequency of 10;
 6. connect with publish send only 5 persistent msgs, receiver
will get them but not ack.
 7. start second node
 8. when the state transfer completes kill both nodes.
(receiver should also perish)
 9. start up the second node only, as new first member of MY_CLUSTER.
( I.e. use 2nd broker's data dir path. )
 10. start and attach a new receiver
 PREVIOUS RESULT -- nothing.  messages were never stored in broker2's
messageStore.
 RESULT -- new receiver now gets 5 messages.
   durable queue says that sender confirmed point moved to (5+0)
 ==
 Scripts for reproducing problem.
 ==
 ###
 #step  1 ##
 ###
 #! /bin/bash
 echo 
 echo Step 1: Kill any brokers already running in the MY_CLUSTER cluster
 echo 
 kill -9 `pgrep -f MY_STORE`
 echo There should be no remaining brokers.  Here they are: 
 ps -aef | grep qpidd | grep -v grep
 ###
 #step  2 ##
 ###
 #! /bin/bash
 echo 
 echo Step 2: remove and rebuild the store data dirs.
 echo 
 rm -rf ./data
 mkdir -p ./data/1 ./data/2
 ###
 #step  3 ##
 ###
 #! /bin/bash
 echo 
 echo Step 3: Start the first broker in a cluster, using Store in ./data/1 .
 echo 
 rm broker_1.log
 $QPID_ROOT/cpp/src/qpidd --no-module-dir\
   --load-module $QPID_ROOT/cpp/src/.libs/cluster.so \
   --load-module $STORE_ROOT/cpp/lib/.libs/msgstore.so \
   --cluster-name MY_CLUSTER -p 5813 \
   --auth=no --mgmt-enable=no\
   --log-enable debug --log-to-file ./broker_1.log   \
   --data-dir ./data/1
 ###
 #step  4 ##
 ###
 #! /bin/bash
 echo Step 4: Declaring queue.
 $QPID_ROOT/cpp/examples/direct/declare_queues
 # !!  NOTE  !!
 # edit declare_queues.cpp to do port 5813
 # in call to session.queueDeclare use arg::durable=true
 # and arg::queue=MY_QUEUE,
 # edit exchangeBind call to use:
 # arg::queue = MY_QUEUE,
 ###
 #step  5 ##
 ###
 #! /bin/bash
 echo Step 5:  Starting receiver...
 $QPID_ROOT/cpp/src/tests/receiver \
 -p 5813   \
 --queue MY_QUEUE  \
 --messages 10 \
 --ack-frequency 10
 ###
 #step  6 ##
 ###
 #! /bin/bash
 echo Publish only 5 messages, so the receiver will not yet ack.

[jira] Updated: (QPID-1618) unacked persistent messages don't get to messageStore of newbie cluster broker.

2009-02-05 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish updated QPID-1618:
-

Attachment: (was: qpid_1618.diff)

 unacked persistent messages don't get to messageStore of newbie cluster 
 broker.
 ---

 Key: QPID-1618
 URL: https://issues.apache.org/jira/browse/QPID-1618
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Affects Versions: M4
Reporter: michael j. goulish
Priority: Critical
 Attachments: qpid_1618.diff


 When a new broker is added to a cluster, persistent messages that have not 
 yet been ACKed do not get stored in the new brokers messageStore.
 How to reproduce:   ( this is overview -- actual scripts follow )
 
 1. kill any MY_CLUSTER brokers from previous test
 2. remove data dirs from previous test, and recreate
 3. start node 1 as first member of MY_CLUSTER
 4  declare the MY_CLUSTER queue -- durable
 5. connect to it with receiver -- use ack frequency of 10;
 6. connect with publish send only 5 persistent msgs, receiver
will get them but not ack.
 7. start second node
 8. when the state transfer completes kill both nodes.
(receiver should also perish)
 9. start up the second node only, as new first member of MY_CLUSTER.
( I.e. use 2nd broker's data dir path. )
 10. start and attach a new receiver
 PREVIOUS RESULT -- nothing.  messages were never stored in broker2's
messageStore.
 RESULT -- new receiver now gets 5 messages.
   durable queue says that sender confirmed point moved to (5+0)
 ==
 Scripts for reproducing problem.
 ==
 ###
 #step  1 ##
 ###
 #! /bin/bash
 echo 
 echo Step 1: Kill any brokers already running in the MY_CLUSTER cluster
 echo 
 kill -9 `pgrep -f MY_STORE`
 echo There should be no remaining brokers.  Here they are: 
 ps -aef | grep qpidd | grep -v grep
 ###
 #step  2 ##
 ###
 #! /bin/bash
 echo 
 echo Step 2: remove and rebuild the store data dirs.
 echo 
 rm -rf ./data
 mkdir -p ./data/1 ./data/2
 ###
 #step  3 ##
 ###
 #! /bin/bash
 echo 
 echo Step 3: Start the first broker in a cluster, using Store in ./data/1 .
 echo 
 rm broker_1.log
 $QPID_ROOT/cpp/src/qpidd --no-module-dir\
   --load-module $QPID_ROOT/cpp/src/.libs/cluster.so \
   --load-module $STORE_ROOT/cpp/lib/.libs/msgstore.so \
   --cluster-name MY_CLUSTER -p 5813 \
   --auth=no --mgmt-enable=no\
   --log-enable debug --log-to-file ./broker_1.log   \
   --data-dir ./data/1
 ###
 #step  4 ##
 ###
 #! /bin/bash
 echo Step 4: Declaring queue.
 $QPID_ROOT/cpp/examples/direct/declare_queues
 # !!  NOTE  !!
 # edit declare_queues.cpp to do port 5813
 # in call to session.queueDeclare use arg::durable=true
 # and arg::queue=MY_QUEUE,
 # edit exchangeBind call to use:
 # arg::queue = MY_QUEUE,
 ###
 #step  5 ##
 ###
 #! /bin/bash
 echo Step 5:  Starting receiver...
 $QPID_ROOT/cpp/src/tests/receiver \
 -p 5813   \
 --queue MY_QUEUE  \
 --messages 10 \
 --ack-frequency 10
 ###
 #step  6 ##
 ###
 #! /bin/bash
 echo Publish only 5 messages, so the receiver will not yet

[jira] Created: (QPID-1650) in src/tests, separate the code for ClusterFixture

2009-02-05 Thread michael j. goulish (JIRA)

in src/tests, separate the code for ClusterFixture 
---

 Key: QPID-1650
 URL: https://issues.apache.org/jira/browse/QPID-1650
 Project: Qpid
  Issue Type: Improvement
  Components: C++ Broker
Reporter: michael j. goulish
Priority: Minor
 Attachments: separate_ClusterFixture.diff

Separating out the code for ClusterFixture into its own files so it can be 
re-used.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-1627) Three T-Shirt Alternatives

2009-02-03 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-1627?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish updated QPID-1627:
-

Attachment: qpid_shirts.jpg

 Three T-Shirt Alternatives
 --

 Key: QPID-1627
 URL: https://issues.apache.org/jira/browse/QPID-1627
 Project: Qpid
  Issue Type: Wish
Reporter: michael j. goulish
Priority: Trivial
 Attachments: qpid_shirts.jpg


 All three are based on the I'm with Qpid!  idea.
 Just different shapes for the arrow.
 Original,  Manta,  and  Chevron.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Created: (QPID-1624) t-shirt image concept

2009-01-31 Thread michael j. goulish (JIRA)

t-shirt image concept
-

 Key: QPID-1624
 URL: https://issues.apache.org/jira/browse/QPID-1624
 Project: Qpid
  Issue Type: Wish
Reporter: michael j. goulish
Priority: Trivial
 Attachments: im_with_qpid.jpg

Here's a concept for a t-shirt logo based on the I'm with Qpid! slogan.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-1624) t-shirt image concept

2009-01-31 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-1624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish updated QPID-1624:
-

Attachment: im_with_qpid.jpg

 t-shirt image concept
 -

 Key: QPID-1624
 URL: https://issues.apache.org/jira/browse/QPID-1624
 Project: Qpid
  Issue Type: Wish
Reporter: michael j. goulish
Priority: Trivial
 Attachments: im_with_qpid.jpg


 Here's a concept for a t-shirt logo based on the I'm with Qpid! slogan.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-1618) unacked persistent messages don't get to messageStore of newbie cluster broker.

2009-01-29 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish updated QPID-1618:
-

Attachment: (was: unack_newbie.diff)

 unacked persistent messages don't get to messageStore of newbie cluster 
 broker.
 ---

 Key: QPID-1618
 URL: https://issues.apache.org/jira/browse/QPID-1618
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Affects Versions: M4
Reporter: michael j. goulish
Priority: Critical

 When a new broker is added to a cluster, persistent messages that have not 
 yet been ACKed do not get stored in the new brokers messageStore.
 How to reproduce:   ( this is overview -- actual scripts follow )
 
 1. kill any MY_CLUSTER brokers from previous test
 2. remove data dirs from previous test, and recreate
 3. start node 1 as first member of MY_CLUSTER
 4  declare the MY_CLUSTER queue -- durable
 5. connect to it with receiver -- use ack frequency of 10;
 6. connect with publish send only 5 persistent msgs, receiver
will get them but not ack.
 7. start second node
 8. when the state transfer completes kill both nodes.
(receiver should also perish)
 9. start up the second node only, as new first member of MY_CLUSTER.
( I.e. use 2nd broker's data dir path. )
 10. start and attach a new receiver
 PREVIOUS RESULT -- nothing.  messages were never stored in broker2's
messageStore.
 RESULT -- new receiver now gets 5 messages.
   durable queue says that sender confirmed point moved to (5+0)
 ==
 Scripts for reproducing problem.
 ==
 ###
 #step  1 ##
 ###
 #! /bin/bash
 echo 
 echo Step 1: Kill any brokers already running in the MY_CLUSTER cluster
 echo 
 kill -9 `pgrep -f MY_STORE`
 echo There should be no remaining brokers.  Here they are: 
 ps -aef | grep qpidd | grep -v grep
 ###
 #step  2 ##
 ###
 #! /bin/bash
 echo 
 echo Step 2: remove and rebuild the store data dirs.
 echo 
 rm -rf ./data
 mkdir -p ./data/1 ./data/2
 ###
 #step  3 ##
 ###
 #! /bin/bash
 echo 
 echo Step 3: Start the first broker in a cluster, using Store in ./data/1 .
 echo 
 rm broker_1.log
 $QPID_ROOT/cpp/src/qpidd --no-module-dir\
   --load-module $QPID_ROOT/cpp/src/.libs/cluster.so \
   --load-module $STORE_ROOT/cpp/lib/.libs/msgstore.so \
   --cluster-name MY_CLUSTER -p 5813 \
   --auth=no --mgmt-enable=no\
   --log-enable debug --log-to-file ./broker_1.log   \
   --data-dir ./data/1
 ###
 #step  4 ##
 ###
 #! /bin/bash
 echo Step 4: Declaring queue.
 $QPID_ROOT/cpp/examples/direct/declare_queues
 # !!  NOTE  !!
 # edit declare_queues.cpp to do port 5813
 # in call to session.queueDeclare use arg::durable=true
 # and arg::queue=MY_QUEUE,
 # edit exchangeBind call to use:
 # arg::queue = MY_QUEUE,
 ###
 #step  5 ##
 ###
 #! /bin/bash
 echo Step 5:  Starting receiver...
 $QPID_ROOT/cpp/src/tests/receiver \
 -p 5813   \
 --queue MY_QUEUE  \
 --messages 10 \
 --ack-frequency 10
 ###
 #step  6 ##
 ###
 #! /bin/bash
 echo Publish only 5 messages, so the receiver will not yet ack.

[jira] Updated: (QPID-1618) unacked persistent messages don't get to messageStore of newbie cluster broker.

2009-01-27 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish updated QPID-1618:
-

Attachment: unack_newbie.diff

 unacked persistent messages don't get to messageStore of newbie cluster 
 broker.
 ---

 Key: QPID-1618
 URL: https://issues.apache.org/jira/browse/QPID-1618
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Affects Versions: M4
Reporter: michael j. goulish
Priority: Critical
 Attachments: unack_newbie.diff


 When a new broker is added to a cluster, persistent messages that have not 
 yet been ACKed do not get stored in the new brokers messageStore.
 How to reproduce:   ( this is overview -- actual scripts follow )
 
 1. kill any MY_CLUSTER brokers from previous test
 2. remove data dirs from previous test, and recreate
 3. start node 1 as first member of MY_CLUSTER
 4  declare the MY_CLUSTER queue -- durable
 5. connect to it with receiver -- use ack frequency of 10;
 6. connect with publish send only 5 persistent msgs, receiver
will get them but not ack.
 7. start second node
 8. when the state transfer completes kill both nodes.
(receiver should also perish)
 9. start up the second node only, as new first member of MY_CLUSTER.
( I.e. use 2nd broker's data dir path. )
 10. start and attach a new receiver
 PREVIOUS RESULT -- nothing.  messages were never stored in broker2's
messageStore.
 RESULT -- new receiver now gets 5 messages.
   durable queue says that sender confirmed point moved to (5+0)
 ==
 Scripts for reproducing problem.
 ==
 ###
 #step  1 ##
 ###
 #! /bin/bash
 echo 
 echo Step 1: Kill any brokers already running in the MY_CLUSTER cluster
 echo 
 kill -9 `pgrep -f MY_STORE`
 echo There should be no remaining brokers.  Here they are: 
 ps -aef | grep qpidd | grep -v grep
 ###
 #step  2 ##
 ###
 #! /bin/bash
 echo 
 echo Step 2: remove and rebuild the store data dirs.
 echo 
 rm -rf ./data
 mkdir -p ./data/1 ./data/2
 ###
 #step  3 ##
 ###
 #! /bin/bash
 echo 
 echo Step 3: Start the first broker in a cluster, using Store in ./data/1 .
 echo 
 rm broker_1.log
 $QPID_ROOT/cpp/src/qpidd --no-module-dir\
   --load-module $QPID_ROOT/cpp/src/.libs/cluster.so \
   --load-module $STORE_ROOT/cpp/lib/.libs/msgstore.so \
   --cluster-name MY_CLUSTER -p 5813 \
   --auth=no --mgmt-enable=no\
   --log-enable debug --log-to-file ./broker_1.log   \
   --data-dir ./data/1
 ###
 #step  4 ##
 ###
 #! /bin/bash
 echo Step 4: Declaring queue.
 $QPID_ROOT/cpp/examples/direct/declare_queues
 # !!  NOTE  !!
 # edit declare_queues.cpp to do port 5813
 # in call to session.queueDeclare use arg::durable=true
 # and arg::queue=MY_QUEUE,
 # edit exchangeBind call to use:
 # arg::queue = MY_QUEUE,
 ###
 #step  5 ##
 ###
 #! /bin/bash
 echo Step 5:  Starting receiver...
 $QPID_ROOT/cpp/src/tests/receiver \
 -p 5813   \
 --queue MY_QUEUE  \
 --messages 10 \
 --ack-frequency 10
 ###
 #step  6 ##
 ###
 #! /bin/bash
 echo Publish only 5 messages, so the receiver will not yet ack.

[jira] Created: (QPID-1618) unacked persistent messages don't get to messageStore of newbie cluster broker.

2009-01-27 Thread michael j. goulish (JIRA)

unacked persistent messages don't get to messageStore of newbie cluster broker.
---

 Key: QPID-1618
 URL: https://issues.apache.org/jira/browse/QPID-1618
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Affects Versions: M4
Reporter: michael j. goulish
Priority: Critical
 Attachments: unack_newbie.diff


When a new broker is added to a cluster, persistent messages that have not yet 
been ACKed do not get stored in the new brokers messageStore.


How to reproduce:   ( this is overview -- actual scripts follow )


1. kill any MY_CLUSTER brokers from previous test

2. remove data dirs from previous test, and recreate

3. start node 1 as first member of MY_CLUSTER

4  declare the MY_CLUSTER queue -- durable

5. connect to it with receiver -- use ack frequency of 10;

6. connect with publish send only 5 persistent msgs, receiver
   will get them but not ack.

7. start second node

8. when the state transfer completes kill both nodes.
   (receiver should also perish)

9. start up the second node only, as new first member of MY_CLUSTER.
   ( I.e. use 2nd broker's data dir path. )

10. start and attach a new receiver

PREVIOUS RESULT -- nothing.  messages were never stored in broker2's
   messageStore.

RESULT -- new receiver now gets 5 messages.
  durable queue says that sender confirmed point moved to (5+0)




==
Scripts for reproducing problem.
==


###
#step  1 ##
###
#! /bin/bash

echo 
echo Step 1: Kill any brokers already running in the MY_CLUSTER cluster
echo 
kill -9 `pgrep -f MY_STORE`

echo There should be no remaining brokers.  Here they are: 
ps -aef | grep qpidd | grep -v grep


###
#step  2 ##
###
#! /bin/bash

echo 
echo Step 2: remove and rebuild the store data dirs.
echo 
rm -rf ./data
mkdir -p ./data/1 ./data/2

###
#step  3 ##
###
#! /bin/bash


echo 
echo Step 3: Start the first broker in a cluster, using Store in ./data/1 .
echo 
rm broker_1.log
$QPID_ROOT/cpp/src/qpidd --no-module-dir\
  --load-module $QPID_ROOT/cpp/src/.libs/cluster.so \
  --load-module $STORE_ROOT/cpp/lib/.libs/msgstore.so \
  --cluster-name MY_CLUSTER -p 5813 \
  --auth=no --mgmt-enable=no\
  --log-enable debug --log-to-file ./broker_1.log   \
  --data-dir ./data/1

###
#step  4 ##
###
#! /bin/bash

echo Step 4: Declaring queue.
$QPID_ROOT/cpp/examples/direct/declare_queues

# !!  NOTE  !!
# edit declare_queues.cpp to do port 5813
# in call to session.queueDeclare use arg::durable=true
# and arg::queue=MY_QUEUE,
# edit exchangeBind call to use:
# arg::queue = MY_QUEUE,






###
#step  5 ##
###
#! /bin/bash
echo Step 5:  Starting receiver...

$QPID_ROOT/cpp/src/tests/receiver \
-p 5813   \
--queue MY_QUEUE  \
--messages 10 \
--ack-frequency 10

###
#step  6 ##
###
#! /bin/bash

echo Publish only 5 messages, so the receiver will not yet ack.

$QPID_ROOT/cpp/src/tests/publish \
-p 5813  \
--count 5\
--durable yes\
--destination amq.direct \
--routing-key routing_key\
--log-enable debug
#! /bin/bash

[jira] Reopened: (QPID-1611) queue durability is lost on broker-newbie sync.

2009-01-23 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-1611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish reopened QPID-1611:
--


My attached fix is OK wrt the Broker -- but breaks MessageStore tests.



 queue durability is lost on broker-newbie sync.
 ---

 Key: QPID-1611
 URL: https://issues.apache.org/jira/browse/QPID-1611
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Priority: Critical
 Fix For: M5

 Attachments: 480871.diff


 queue durability does not survive cluster broker newbie sync.
 When a new broker is syncing up with a cluster, it gets all the information 
 it needs to preserve queue durability, 
 But the code pathway that we go through in that case does not tell the 
 journaling code to create a journal.
 This has the effect of making the queue not durable after all.
 !  catch attached patch !

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-1611) queue durability is lost on broker-newbie sync.

2009-01-23 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-1611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish updated QPID-1611:
-

Attachment: (was: 480871.diff)

 queue durability is lost on broker-newbie sync.
 ---

 Key: QPID-1611
 URL: https://issues.apache.org/jira/browse/QPID-1611
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Priority: Critical
 Fix For: M5


 queue durability does not survive cluster broker newbie sync.
 When a new broker is syncing up with a cluster, it gets all the information 
 it needs to preserve queue durability, 
 But the code pathway that we go through in that case does not tell the 
 journaling code to create a journal.
 This has the effect of making the queue not durable after all.
 !  catch attached patch !

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-1611) queue durability is lost on broker-newbie sync.

2009-01-23 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-1611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish updated QPID-1611:
-

Attachment: qpid_1611.diff

 Previous version had problem with MessageStore code during its recovery 
operation.

This version adds a way for the RecoverManager to tell the Queue code that we 
are dong a recovery -- thus avoiding a double queue creation that was messing 
up Store.



 queue durability is lost on broker-newbie sync.
 ---

 Key: QPID-1611
 URL: https://issues.apache.org/jira/browse/QPID-1611
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Priority: Critical
 Fix For: M5

 Attachments: qpid_1611.diff


 queue durability does not survive cluster broker newbie sync.
 When a new broker is syncing up with a cluster, it gets all the information 
 it needs to preserve queue durability, 
 But the code pathway that we go through in that case does not tell the 
 journaling code to create a journal.
 This has the effect of making the queue not durable after all.
 !  catch attached patch !

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-1611) patch for BZ 480871

2009-01-22 Thread michael j. goulish (JIRA)


 [ 
https://issues.apache.org/jira/browse/QPID-1611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

michael j. goulish updated QPID-1611:
-

Attachment: 480871.diff

 patch for BZ 480871
 ---

 Key: QPID-1611
 URL: https://issues.apache.org/jira/browse/QPID-1611
 Project: Qpid
  Issue Type: Bug
  Components: C++ Broker
Reporter: michael j. goulish
Priority: Critical
 Attachments: 480871.diff


 BZ 480871
 queue durability does not survive cluster broker newbie sync.
 !  catch attached patch !

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
Apache Qpid - AMQP Messaging Implementation
Project:  http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

[jira] Updated: (QPID-1611) queue durability is lost on broker-newbie sync.

2009-01-22 Thread michael j. goulish (JIRA)

[
https://issues.apache.org/jira/browse/QPID-1611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

michael j. goulish updated QPID-1611:
-

Description:

queue durability does not survive cluster broker newbie sync.

When a new broker is syncing up with a cluster, it gets all the information it
needs to preserve queue durability,
But the code pathway that we go through in that case does not tell the
journaling code to create a journal.
This has the effect of making the queue not durable after all.

! catch attached patch !

was:
BZ 480871

queue durability does not survive cluster broker newbie sync.

! catch attached patch !

Summary: queue durability is lost on broker-newbie sync. (was: patch
for BZ 480871)

queue durability is lost on broker-newbie sync.
---

Key: QPID-1611
URL: https://issues.apache.org/jira/browse/QPID-1611
Project: Qpid
Issue Type: Bug
Components: C++ Broker
Reporter: michael j. goulish
Priority: Critical
Attachments: 480871.diff

queue durability does not survive cluster broker newbie sync.
When a new broker is syncing up with a cluster, it gets all the information
it needs to preserve queue durability,
But the code pathway that we go through in that case does not tell the
journaling code to create a journal.
This has the effect of making the queue not durable after all.
! catch attached patch !

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

-
Apache Qpid - AMQP Messaging Implementation
Project: http://qpid.apache.org
Use/Interact: mailto:dev-subscr...@qpid.apache.org

94 matches

Mail list logo