- **status**: assigned --> accepted
---
** [tickets:#240] cpsv : checkpoint apis fail with try again continuously when
multinode applications on try to invoke the api at the same time (70 nodes)**
**Status:** accepted
**Milestone:** future
**Created:** Thu May 16, 2013 06:21 AM UTC by A V Mahesh (AVM)
**Last Updated:** Thu May 16, 2013 06:21 AM UTC
**Owner:** A V Mahesh (AVM)
>From : http://devel.opensaf.org/ticket/2954
The issue is seen on SLES 70 node VM setup. Changeset 3855
Two ckpt applications are running on each node on all the 70 nodes. One of the
application that is running on the SC-1 creates an asynchronous collocated
checkpoint. The rest of the applications across the cluster tries to open the
same checkpoint. When try again is returned to the application, the application
waits for 500ms before retrying the api. Some applications continuously get try
again and after 3 minutes the application exits (application specific timeout).
This issue is reproducible only in a 70 node cluster. Traces are available and
huge.
---
Sent from sourceforge.net because [email protected] is
subscribed to https://sourceforge.net/p/opensaf/tickets/
To unsubscribe from further messages, a project admin can change settings at
https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
Slashdot TV. Video for Nerds. Stuff that Matters.
http://pubads.g.doubleclick.net/gampad/clk?id=160591471&iu=/4140/ostg.clktrk
_______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets