[ https://issues.apache.org/jira/browse/ZOOKEEPER-634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12792942#action_12792942 ]
Andrew Reynhout commented on ZOOKEEPER-634: ------------------------------------------- Epilogue: I spoke too soon, I guess I had been staring at the data for too long. Some parts of the Stat struct were getting repointed or overwritten by the time the values were shuttled into the Ruby glue. Aside from the variation between 32b and 64b, the variations were repeatable (always the same changes, always the same wrong data), so I thought I saw something that wasn't there. Properly duping the struct before passing to the Ruby thread resolved it. > some elements of struct Stat have incorrect values when returned > asynchronously > ------------------------------------------------------------------------------- > > Key: ZOOKEEPER-634 > URL: https://issues.apache.org/jira/browse/ZOOKEEPER-634 > Project: Zookeeper > Issue Type: Bug > Components: c client > Affects Versions: 3.2.1 > Environment: OSX 10.5, 10.6 (32b and 64b, respectively) > Reporter: Andrew Reynhout > Priority: Minor > > I'm working on Ruby bindings for ZK (3.2.2) via the C API. This involves > some painful kludgery due to MRI Ruby's threading model, but I believe this > bug report is valid: > Some of the elements of Stat structs, when returned via the C client, have > values inconsistent with the synchronous call, zkCli output, and > expectations. E.g.: > {noformat:title=zkCli command line} > $ zk/bin/zkCli.sh -server localhost:2182 set /lala lala234 > Connecting to localhost:2182 > WATCHER:: > WatchedEvent: Server state change. New state: SyncConnected > cZxid = 3 > ctime = Sat Dec 19 14:32:03 EST 2009 > mZxid = 69 > mtime = Sat Dec 19 15:31:46 EST 2009 > pZxid = 3 > cversion = 0 > dataVersion = 4 > aclVersion = 0 > ephemeralOwner = 0 > dataLength = 7 > numChildren = 0 > {noformat} > {noformat:title=Ruby script + verbose ZK log output} > 2009-12-19 15:31:48,952:45753(0x751000):zoo_de...@process_completions@1902: > Switching on cptr->completion_type (async) > 2009-12-19 15:31:48,952:45753(0x751000):zoo_de...@process_completions@1916: > Calling COMPLETION_STAT for xid=4b2d3837 rc=0 > 2009-12-19 15:31:48,952:45753(0x751000):zoo_de...@process_completions@1949: > --- PRE ruby callback. > ----------------------- > &res.stat 0x00750e30 > .czxid 3 > .mzxid 69 > .ctime 1261251123149 > .mtime 1261254706180 > .version 4 > .cversion 0 > .aversion 0 > .ephemeralOwner 0 > .dataLength 7 > .numChildren 0 > .pzxid 3 > ======================= > 2009-12-19 > 15:31:48,952:45753(0x751000):zoo_de...@ruby_stat_completion_wrapper@1702: --- > creating ruby thread. > 2009-12-19 > 15:31:48,952:45753(0x7ee000):zoo_de...@ruby_stat_completion_wrapper_2@1685: > --- invoking ruby callback. > ----------------------- > cb->dc 0x002fe000 > cb->rc 0 > cb->ctx 0x01a03870 > cb->stat 0x00750e30 > ->czxid 3 > ->mzxid 69 > ->ctime 1261251123149 > ->mtime 140735387442616 > ->version 0 > ->cversion 0 > ->aversion 27264392 > ->ephemeralOwner 4302638816 > ->dataLength -2100901903 > ->numChildren 32767 > ->pzxid 3 > ======================= > 2009-12-19 > 15:31:48,952:45753(0x7ee000):zoo_de...@ruby_stat_completion_wrapper_2@1687: > --- callback returned, freeing struct. > 2009-12-19 > 15:31:48,952:45753(0x7ee000):zoo_de...@ruby_stat_completion_wrapper_2@1689: > --- struct freed. > sync returned #<ZooKeeperFFI::Stat:0x000001013783d0> > async callback.return_code 0 > attr ?? sync async > ---- -- ---- ----- > czxid OK 3 3 > mzxid OK 69 69 > ctime OK 1261251123149 1261251123149 > mtime -- 1261254706180 140735387442616 > version -- 4 0 > cversion OK 0 0 > aversion -- 0 27264392 > ephemeralOwner -- 0 4302638816 > dataLength -- 7 -2100901903 > numChildren -- 0 32767 > pzxid OK 3 3 > 2009-12-19 15:31:49,052:45753(0x6ce000):zoo_de...@do_io@316: IO thread > terminated > 2009-12-19 15:31:49,052:45753(0x751000):zoo_de...@do_completion@335: > completion thread terminated > 2009-12-19 15:31:49,052:45753(0x705f3be0):zoo_i...@zookeeper_close@2407: > Closing zookeeper session 125a86bdc640020 to [127.0.0.1:2182] > {noformat} > The last set of lines, above, are output from my Ruby code, comparing the > differences between sync and async calls. I added the LOG_DEBUG in > zookeeper.c to dump the contents of res.stat before it gets shipped off to > Ruby...as you can see, the LOG_DEBUG values match the values in the async > Ruby call, but they're wrong. > I've currently only tested on OSX 10.5 (32bit) and 10.6 (64bit), but > sometimes different elements are wrong on each. I can get a comprehensive > list if it's useful, but as examples: > {noformat:title=data variation examples} > ## aexists > # ctime (int64_t): > # 32b OK > # 64b OK > # dataLength (int32_t): > # 32b OK > # 64b NG (== -2100901903 instead of 10) > ## aget > # ctime (int64_t): > # 32b OK > # 64b NG (== 0 instead of something unixtimey e.g. 1261251123149) > # dataLength (int32_t): > # 32b NG (== -1856148083 instead of 10) > # 64b OK > ## aset > # ctime (int64_t): > # 32b OK > # 64b OK > # dataLength (int32_t): > # 32b OK > # 64b NG (== -2100901903 instead of 10) > {noformat} > Lastly, a snippet of my changes to zookeeper.c, hopefully showing that I > haven't done any harm to the data. Note that I'm also seeing similar data > variations for COMPLETION_DATA, which also returns a Stat struct. I haven't > finished the Ruby code for COMPLETION_ACLLIST (again, returns a Stat), so the > same issue might arise there as well. > I realize that hacking the ZK C API is not a great way to build a foreign > binding. :( > {code:title=src/c/src/zookeeper.c} > case COMPLETION_STAT: > LOG_DEBUG(("Calling COMPLETION_STAT for xid=%x > rc=%d",cptr->xid,rc)); > if (rc) { > //cptr->c.stat_result(rc, 0, cptr->data); > ruby_stat_completion_wrapper(cptr->c.stat_result, rc, 0, > (void *)cptr->data); > } else { > struct SetDataResponse res; > deserialize_SetDataResponse(ia, "reply", &res); > LOG_DEBUG(("--- PRE ruby callback.\n\ > -----------------------\n\ > &res.stat 0x%08x\n\ > .czxid %ld\n\ > .mzxid %ld\n\ > .ctime %ld\n\ > .mtime %ld\n\ > .version %d\n\ > .cversion %d\n\ > .aversion %d\n\ > .ephemeralOwner %ld\n\ > .dataLength %d\n\ > .numChildren %d\n\ > .pzxid %ld\n\ > =======================", > &res.stat, > res.stat.czxid, res.stat.mzxid, res.stat.ctime, res.stat.mtime, > res.stat.version, res.stat.cversion, res.stat.aversion, > res.stat.ephemeralOwner, res.stat.dataLength, res.stat.numChildren, > res.stat.pzxid )); > //cptr->c.stat_result(rc, &res.stat, cptr->data); > ruby_stat_completion_wrapper(cptr->c.stat_result, rc, > &res.stat, (void *)cptr->data); > deallocate_SetDataResponse(&res); > } > break; > {code} -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.