- **status**: review --> fixed
- **assigned_to**: Minh Hon Chau -->  nobody 
- **Blocker**:  --> False
- **Comment**:

release: [cbfaa7cb69aa32c05abd9bb6177410d1af1cd45c]
develop: [c0b25cd7a1a94d386d813735c72d22abded8583b]



---

** [tickets:#2432] dtm: Node reboot because transportd reads invalid pid of 
dtmd**

**Status:** fixed
**Milestone:** 5.17.06
**Created:** Wed Apr 19, 2017 12:02 AM UTC by Minh Hon Chau
**Last Updated:** Thu Apr 20, 2017 06:25 AM UTC
**Owner:** nobody


There's an unexpected node reboot during Opensaf node startup

2017-01-24 18:19:53 SC-1 opensafd: Starting OpenSAF Services(5.2.M0 - 
8532:b6df9e2a2b8b:default) (Using TCP)
2017-01-24 18:19:53 SC-1 osaftransportd[398]: Started
2017-01-24 18:19:53 SC-1 osafdtmd[393]: mkfifo already exists: 
/var/lib/opensaf/osafdtmd.fifo File exists
2017-01-24 18:19:53 SC-1 osaftransportd[398]: Rebooting OpenSAF NodeId = 0 EE 
Name = No EE Mapped, Reason: osafdtmd failed to start, OwnNodeId = 0, 
SupervisionTime = 60
2017-01-24 18:19:53 SC-1 osafdtmd[393]: Started

Another attempt to reproduce this problem by adding more debug log:

2017-04-18 18:01:14 SC-1 opensafd: Starting OpenSAF Services(5.2.0 - 
0:000000000000) (Using TCP)
2017-04-18 18:01:14 SC-1 osaftransportd[380]: fifo_file 
/var/lib/opensaf/osaftransportd.fifo
2017-04-18 18:01:14 SC-1 osaftransportd[380]: mkfifo already exists: 
/var/lib/opensaf/osaftransportd.fifo File exists
2017-04-18 18:01:14 SC-1 osafdtmd[386]: fifo_file /var/lib/opensaf/osafdtmd.fifo
2017-04-18 18:01:14 SC-1 osafdtmd[386]: mkfifo already exists: 
/var/lib/opensaf/osafdtmd.fifo File exists
2017-04-18 18:01:15 SC-1 osaftransportd[380]: __pidfile 
/var/run/opensaf/osaftransportd.pid
2017-04-18 18:01:15 SC-1 osaftransportd[380]: Started
2017-04-18 18:01:15 SC-1 osaftransportd[380]: WA file_path_:/var/run/opensaf, 
file_name_:osafdtmd.pid
2017-04-18 18:01:15 SC-1 osafdtmd[386]: __pidfile /var/run/opensaf/osafdtmd.pid
2017-04-18 18:01:15 SC-1 osaftransportd[380]: WA file name: osafdtmd.pid created
2017-04-18 18:01:15 SC-1 osaftransportd[380]: WA rdstate 6, pid: 4294967295
2017-04-18 18:01:15 SC-1 osaftransportd[380]: Rebooting OpenSAF NodeId = 0 EE 
Name = No EE Mapped, Reason: osafdtmd failed to start, OwnNodeId = 0, 
SupervisionTime = 60
2017-04-18 18:01:15 SC-1 osafdtmd[386]: Started

It could be because osaftransportd fails to read pid in osafdmtd.pid



---

Sent from sourceforge.net because [email protected] is 
subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets

Reply via email to