Re: Full specs for ruby libraries

Kevin Ballard Wed, 04 Jun 2008 12:14:28 -0700

On Jun 4, 2008, at 11:22 AM, Kevin Clark wrote:

On Mon, Jun 2, 2008 at 6:03 PM, Kevin Ballard <[EMAIL PROTECTED]>wrote:

Any comments/suggestions?


First off, great work. This is a major improvement to the libraries.
Here's my notes from reviewing the commits (through 713371df7f):


Thank you. I've responded below.

For those playing along, you can view these by going to
http://github.com/kballard/thrift/commits/COMMIT

7a1b95c8d886
returns (lines 50, 31):

The first item in the array (field name) isn't actually used/supplied currently

May want to do the same in the tests for consistency.
* I'm not really sure why it's returned at all. Legacy?

I put them in the tests because I figure as long as the code ignoresthem, everything is fine, but if someone fixes the libraries to startcaring, the tests shouldn't break. I assume it's in there (but this isjust a guess) because it makes inspecting the network traffic by handa lot easier to debug, but since the binary protocol doesn't send thenames (and that's the only concrete protocol in the ruby libraries),it hardly matters.

c7101e6334f3b
Consequences of including Thrift in the example group?

AFAIK there should be none. You'll notice all of my specs arestructured this way, creating a concrete example group and includingThrift, simply to make the testing code simpler (as it doesn't have toconstantly namespace all the thrift classes). Note that in665d95c51fe6 I ended up changing the names of all these example groupsto avoid spec leakage when running multiple specs at once, but thesame idea is there.

6cda1b66c24
On testing the remapped methods: Could we mock to expect the old
method called by the new?
Alternatively, if they're the same method (I don't think they are),
it's possible method()
could be used for equality.

No we can't. The way deprecate! works, it actually fetches the newmethod directly with instance_method, binds it to the current object,and calls it. I suppose I could test to ensure instance_method() iscalled with the new symbol. I'll look into doing that now.

665d95c51f
What's going on here?

When I started using the concrete ExampleGroup classes, I didn't run`rake spec` until I'd written several of them (instead I ran theindividual specs separately). Running all the specs at once revealedthat having each ExampleGroup named ThriftSpec was, in fact, re-opening the same class over and over (which makes sense), re-includingThrift over and over, and actually leaking some information betweenspecs. I forget exactly what actually leaked, but `rake spec` wasbroken before that change, and worked fine after that change.

1d897acba89
Range errors on Bignum are ok but on integer aren't?

There's a few reasons I made this change. The biggest one is the codehere assumes it's writing signed integers (which makes sense, all thelibraries assume they're dealing with signed integers and returnsigned from read_foo). The range-checking code there was enforcing thesigned-ness of the integers (although, looking back, I allowedunsigned bytes in write_byte, which differs from write_i16 andwrite_i32). This actually broke the binary protocol because it writesout an i32 with the high bit set when it writes its version(0x80010000), then coerces it back to unsigned when reading (read_i32| VERSION_MASK converts the return value of read_i32 to the equivalentunsigned value with the same bitfield). My two options were to eitherincrease the range bounds of i16/i32 to allow writing unsignedintegers, or remove the range checking. I opted to remove the rangechecking primarily because, looking at the Java and CPP libraries,there's no explicit range checking there, and I figured with thatprecedent I'd rather not slow down the binary protocol by adding extratests to every single integer write. I mean, if the user writes a longas an i16, they shouldn't be the least surprised to find it getsclipped.

But there was nothing I could do about the Bignum range error(Array#pack raises that), so I simply encoded that behavior in thetests.

If there is a real desire to re-introduce range checking I cancertainly do that, with the ranges expanded to include unsignedintegers. Another option is to introduce read/write_ui* functions, butthat would make the ruby libs different than all other libraries.

800706783f1
I think this might have been correct. We don't want the server bailing
out if something goes wrong.
We should discuss.


The problem was it wasn't catching exceptions. The construct

  rescue Exception => e

is clearly designed to catch all exceptions, except it wasn't. I don'tknow what quirk of Ruby was preventing that from working, but intesting it would explicitly catch Exception objects but not e.g.StandardError objects. Using


  rescue => e

seems to catch them all. Granted, I didn't test, this could becatching all subclasses of StandardError rather than Exception, butthe only reason a non-StandardError exception should be thrown is ifthe processor or handler triggers a syntax or load error, and I'mperfectly happy with the server blowing up on that.

210803a358
I think switching sets to use Set breaks backwards compatability.
There's also a bug we need to be aware of that allows arrays to be
passed as sets (because .each doesn't care, and the value of the hash)
isn't used.

I think breaking backwards compat might be ok here, but we need todiscuss

on the list.

This actually should continue to work with pre-generated code, theonly observable difference in clients is when they call a method thatreturns a Thrift::SET, it'll return a Set instead of a hash. But Idon't think that's too big of a change to ask clients to accommodate.I will note that passing a Set to a method, and having a hash-styleset as the default value of a generated struct should actuallycontinue to work (384064468d). Also, there's a decent chance thatclient code that handles the return value will still work anyway, forexample


  client.returnSet("foo").each { |k,v| p k }

That will print the exact same thing regardless of whether a Set or aHash is returned. I realize we can't assume all code (or even mostcode) handles Set return values this way, but my point is there's verylittle to change. Unfortunately there's no way to deprecate the oldstyle, because the method doesn't have any way of knowing what it'sexpected to return.

384064468d
Ah, you got it (the set thing). That might be ok then. May want to do
further testing.

See above. If you have suggestions for any more tests you'd like todo, or any genius ideas about preserving backwards compatibility, I'mall ears. I just felt strongly enough that using {"a" => true, "b" =>true} as a set was too ugly.

c83c437d7b38
I'm trying to remember if get_buffer is for the fast binaryprotocol. I think
at this point it only uses borrow and consume. I'll double check.

If the fast binary protocol uses it, it needs to be changed.#get_buffer was preventing a pretty serious memory optimization(42f75ed74ad). Basically, the MemoryBuffer was holding on to everypiece of data written to it, even though the only way to retrieve dataonce it had been read back out was with #get_buffer (which nobody inthe ruby libraries used). This meant that if a single MemoryBuffer wasused to transport 2MB of data, until that MemoryBuffer was finalizedthose 2MB would have effectively leaked. Without #get_buffer theMemoryBuffer could toss any data the moment it was read.

5f29875a24
This is a good change. I think we should prefer testing with literals
over method calls.


That was my thought.

e27daba8897
Wait what? A hash as the second arg to raise?

Yep. Thrift::Test::Xception is a generated struct from theThriftTest.thrift file. It inherits from StandardError, but it alsoincludes Thrift::Struct. The problem here is that while StandardErrorwants a message as its only arg, Thrift::Struct has other ideas. It re-defines the initializer to take a hash that's used to populate itsfields. When you say


  raise Thrift::Test::Xception, 'error'

What you're really saying is

  raise Thrift::Test::Xception.new('error')

This didn't used to break because Thrift::Struct used the #[] methodexclusively on its argument. What this meant, though, is that thisraise was effectively doing the same thing as


  raise Thrift::Test::Exception.new

This bug cropped up when I made a seemingly-innocuous change to theThrift::Struct initializer, in which I added the line

d.fetch(name.to_s) # d here is the variable used for the hashargument

This suddenly caused the specs to blow up on that line inraiseException(). The only explanation was d.fetch() was raising anexception, which it was, because it was being given 'error' instead ofa Hash.

In other words, this code was always broken, it was just staying quietuntil I played with it. The new form of the raise is equivalent to


  raise Thrift::Test::Xception.new(:message => 'error')

and that behaves exactly as desired.

-Kevin Ballard

--
Kevin Ballard
[EMAIL PROTECTED]

Re: Full specs for ruby libraries

Reply via email to