Re: [dpdk-dev] [PATCH v4 06/24] ethdev: enable hotplug on multi-process

Zhang, Qi Z Tue, 26 Jun 2018 05:20:02 -0700


> -----Original Message-----
> From: Burakov, Anatoly
> Sent: Tuesday, June 26, 2018 8:09 PM
> To: Zhang, Qi Z <qi.z.zh...@intel.com>; tho...@monjalon.net
> Cc: Ananyev, Konstantin <konstantin.anan...@intel.com>; dev@dpdk.org;
> Richardson, Bruce <bruce.richard...@intel.com>; Yigit, Ferruh
> <ferruh.yi...@intel.com>; Shelton, Benjamin H
> <benjamin.h.shel...@intel.com>; Vangati, Narender
> <narender.vang...@intel.com>
> Subject: Re: [PATCH v4 06/24] ethdev: enable hotplug on multi-process
> 
> On 26-Jun-18 8:08 AM, Qi Zhang wrote:
> > We are going to introduce the solution to handle different hotplug
> > cases in multi-process situation, it include below scenario:
> >
> > 1. Attach a share device from primary
> > 2. Detach a share device from primary
> > 3. Attach a share device from secondary 4. Detach a share device from
> > secondary 5. Attach a private device from secondary 6. Detach a
> > private device from secondary 7. Detach a share device from secondary
> > privately 8. Attach a share device from secondary privately
> >
> > In primary-secondary process model, we assume device is shared by default.
> > that means attach or detach a device on any process will broadcast to
> > all other processes through mp channel then device information will be
> > synchronized on all processes.
> >
> > Any failure during attaching process will cause inconsistent status
> > between processes, so proper rollback action should be considered.
> > Also it is not safe to detach a share device when other process still
> > use it, so a handshake mechanism is introduced.
> >
> > This patch covers the implementation of case 1,2,5,6,7,8.
> > Case 3,4 will be implemented on separate patch as well as handshake
> > mechanism.
> >
> > Scenario for Case 1, 2:
> >
> > attach device
> > a) primary attach the new device if failed goto h).
> > b) primary send attach sync request to all secondary.
> > c) secondary receive request and attach device and send reply.
> > d) primary check the reply if all success go to i).
> > e) primary send attach rollback sync request to all secondary.
> > f) secondary receive the request and detach device and send reply.
> > g) primary receive the reply and detach device as rollback action.
> > h) attach fail
> > i) attach success
> >
> > detach device
> > a) primary perform pre-detach check, if device is locked, goto i).
> > b) primary send pre-detach sync request to all secondary.
> > c) secondary perform pre-detach check and send reply.
> > d) primary check the reply if any fail goto i).
> > e) primary send detach sync request to all secondary
> > f) secondary detach the device and send reply (assume no fail)
> > g) primary detach the device.
> > h) detach success
> > i) detach failed
> >
> > Case 5, 6:
> > Secondary process can attach private device which only visible to
> > itself, in this case no IPC is involved, primary process is not
> > allowed to have private device so far.
> >
> > Case 7, 8:
> > Secondary process can also temporally to detach a share device "privately"
> > then attach it back later, this action also not impact other processes.
> >
> > APIs changes:
> >
> > rte_eth_dev_attach and rte_eth_dev_attach are extended to support
> > share device attach/detach in primary-secondary process model, it will
> > be called in case 1,2,3,4.
> >
> > New API rte_eth_dev_attach_private and rte_eth_dev_detach_private are
> > introduced to cover case 5,6,7,8, this API can only be invoked in
> > secondary process.
> >
> > Signed-off-by: Qi Zhang <qi.z.zh...@intel.com>
> > ---
> 
> <snip>
> 
> > +static int
> > +handle_primary_request(const struct rte_mp_msg *msg, const void
> > +*peer) {
> > +
> > +   struct rte_mp_msg mp_resp;
> > +   const struct eth_dev_mp_req *req =
> > +           (const struct eth_dev_mp_req *)msg->param;
> > +   struct eth_dev_mp_req *resp =
> > +           (struct eth_dev_mp_req *)mp_resp.param;
> > +   struct mp_reply_bundle *bundle;
> > +   int ret = 0;
> > +
> > +   memset(&mp_resp, 0, sizeof(mp_resp));
> > +   strlcpy(mp_resp.name, ETH_DEV_MP_ACTION_REQUEST,
> sizeof(mp_resp.name));
> > +   mp_resp.len_param = sizeof(*req);
> > +   memcpy(resp, req, sizeof(*resp));
> > +
> > +   bundle = calloc(1, sizeof(*bundle));
> > +   if (bundle == NULL) {
> > +           resp->result = -ENOMEM;
> > +           ret = rte_mp_reply(&mp_resp, peer);
> > +           if (ret) {
> > +                   ethdev_log(ERR, "failed to send reply to primary 
> > request\n");
> > +                   return ret;
> > +           }
> > +   }
> > +
> > +   bundle->msg = *msg;
> > +   bundle->peer = peer;
> > +
> > +   ret = rte_eal_mp_task_add(__handle_primary_request, bundle);
> > +   if (ret) {
> > +           resp->result = ret;
> > +           ret = rte_mp_reply(&mp_resp, peer);
> > +           if (ret) {
> > +                   ethdev_log(ERR, "failed to send reply to primary 
> > request\n");
> > +                   return ret;
> > +           }
> > +   }
> 
> What you're doing here is quite dangerous. The parameter "const void *peer"
> is only guaranteed to be valid at the time of the callback - not necessarily
> afterwards. So, if you're handing off sending replies to a separate thread,
> things might blow up because the pointer may no longer be valid.


OK, so what about clone the content a buffer, I think the content should be 
valid before reply is sent, right?

Thanks
Qi
> 
> --
> Thanks,
> Anatoly

Re: [dpdk-dev] [PATCH v4 06/24] ethdev: enable hotplug on multi-process

Reply via email to