RPC versioning

Doug Cutting Fri, 03 Oct 2008 09:37:44 -0700

It has been proposed in the discussions defining Hadoop 1.0 that weextend our back-compatibility policy.


http://wiki.apache.org/hadoop/Release1.0Requirements

Currently we only attempt to promise that application code will runwithout change against compatible versions of Hadoop. If one hasclusters running different yet compatible versions, then one must use adifferent classpath for each cluster to pick up the appropriate versionof Hadoop's client libraries.

The proposal is that we extend this, so that a client library from oneversion of Hadoop will operate correctly with other compatible Hadoopversions, i.e., one need not alter one's classpath to contain theidentical version, only a compatible version.

Question 1: Do we need to solve this problem soon, for release 1.0,i.e., in order to provide a release whose compatibility lifetime is ~1year, instead of the ~4months of 0. releases? This is not clear to me.Can someone provide cases where using the same classpath when talkingto multiple clusters is critical?

Assuming it is, to implement this requires RPC-level support forversioning. We could add this by switching to an RPC mechanism withbuilt-in, automatic versioning support, like Thrift, Etch or ProtocolBuffers. But none of these is a drop-in replacement for Hadoop RPC.They will probably not initially meet our performance and scalabilityrequirements. Their adoption will also require considerable anddestabilizing changes to Hadoop. Finally, it is not today clear whichof these would be the best candidate. If we move too soon, we mightregret our choice and wish to move again later.

So, if we answer yes to (1) above, wishing to provide RPCback-compatibility in 1.0, but do not want to hold up a 1.0 release, isthere an alternative to switching? Can we provide incrementalversioning support to Hadoop's existing RPC mechanism that will sufficeuntil a clear replacement is available?


Below I suggest a simple versioning style that Hadoop might use to
permit its RPC protocols to evolve compatibly until an RPC system with
built-in versioning support is selected.  This is not intended to be a
long-term solution, but rather something that would permit us to more
flexibly evolve Hadoop's protocols over the next year or so.

This style assumes a globally increasing Hadoop version number.  For
example, this might be the subversion repository version of trunk when
a change is first introduced.

When an RPC client and server handshake, they exchange version
numbers.  The lower of their two version numbers is selected as the
version for the connection.

Let's walk through an example.  We start with a class that contains
no versioning information and a single field, 'a':

  public class Foo implements Writable {
    int a;

    public void write(DataOutput out) throws IOException {
      out.writeInt(a);
    }

    public void readFields(DataInput in) throws IOException {
      a = in.readInt();
    }
  }

Now, in version 1, we add a second field, 'b' to this:

  public class Foo implements Writable {
    int a;
    float b;                                        // new field

    public void write(DataOutput out) throws IOException {
      int version = RPC.getVersion(out);
      out.writeInt(a);
      if (version >= 1) {                           // peer supports b
        out.writeFloat(b);                          // send it
      }
    }

    public void readFields(DataInput in) throws IOException {
      int version = RPC.getVersion(in);
      a = in.readInt();
      if (version >= 1) {                           // if supports b
        b = in.readFloat();                         // read it
      }
    }
  }

Next, in version 2, we remove the first field, 'a':

  public class Foo implements Writable {
    float b;

    public void write(DataOutput out) throws IOException {
      int version = RPC.getVersion(out);
      if (version < 2) {                            // peer wants a
        out.writeInt(0);                            // send it
      }
      if (version >= 1) {
        out.writeFloat(b);
      }
    }

    public void readFields(DataInput in) throws IOException {
      int version = RPC.getVersion(in);
      if (version < 2) {                            // peer writes a
        in.readInt();                               // ignore it
      }
      if (version >= 1) {
        b = in.readFloat();
      }
    }
  }

Could something like this work? It would require just some minorchanges to Hadoop's RPC mechanism, to support the version handshake.Beyond that, it could be implemented incrementally as RPC protocolsevolve. It would require some vigilance, to make sure that versioninglogic is added when classes change, but adding automated tests againstprior versions would identify lapses here.

This may appear to add a lot of version-related logic, but withautomatic versioning, in many cases, some version-related logic is stillrequired. In simple cases, one adds a completely new field with adefault value and is done, with automatic versioning handling much ofthe work. But in many other cases an existing field is changed and theapplication must translate old values to new, and vice versa. Thesecases still require application logic, even with automatic versioning.So automatic versioning is certainly less intrusive, but not as much asone might first assume.

The fundamental question is how soon we need to address inter-versionRPC compatibility. If we wish to do it soon, I think we'd be wise toconsider a solution that's less invasive and that does not force us intoa potentially premature decision.


Doug

RPC versioning

Reply via email to