Re: Redesign of the Java API

Johan Stuyts Wed, 10 Sep 2008 04:20:35 -0700

I agree with Bryan that we should break these up into separate JIRAissues

ASAP.  I also think that is important to think of some things in a cross-
language manner.  For example, using an options object for constructors
doesn't really require any cross-language consideration, changing the
TProtocol interface requires some because it is best to keep uniformity,
and changing the message protocol requires a lot because it is necessary
to keep interoperability.

Okay, as soon as there is consensus that an issue is worth addressing Iwill create a JIRA issue for it.

I agree that message protocol changes should only be considered if itprovides enough benefit.

I do feel however that uniformity across languages is less important. Ifmaintainers of a language are willing to do a bit of extra work to mapfrom the protocol interface of other languages to their language, Iwouldn't mind having a different protocol interface in that language.People working on multiple languages might object though.

- A different set of functions for TProtocol so protocol implementations
have more flexibility (For example see

https://issues.apache.org/jira/browse/THRIFT-110 (Changes to the IDLwould

still be required to be able to implement a compact format)).

I think this has to be considered in a cross-language way.  The fact that
TProtocol has the same interface in each language means that it is much
easier to code up the same protocol in each language.

I agree that it is easier if the protocol interfaces are the same acrossall languages, but the new interface won't make it difficult to keeplanguages in sync: instead of writing a field with two methods you write afield with one method.

Sure it is possible to do the same thing with a stateful protocol, but astateful protocol is much more complex than a stateless one. Note that ifyou want to verify that the data is always correct the protocol must bestateful, because you would have to check that a stop field is notfollowed by other fields in a structure for example. Neither the currentprotocol interface nor the proposed protocol interface will allow this tobe done in a stateless manner.

- Removal of the name when writing the beginning of the structure as
nobody seems to need this
(https://issues.apache.org/jira/browse/THRIFT-8). There is no need tohang
on to obsolete constructs and confuse new users.
This is currently used in the TDebugProtocol in C++ (which produces
human-readable representations of Thrift structs) and theTSimpleJSONProtocolin Java which is a lossy write-only protocol that produceseasily-consumable
JSON strings for scripting languages.

Okay, I'll document that the name of the structure that is passed to theprotocol is meant for human consumption only, and will close THRIFT-8.

- Remove the Client and Processor classes from generated code ofservices
so server and client implementations have more freedom concerning how
(service and) function selection information is communicated and
processed, and how I/O is handled. See below.
I don't think that these need to be removed. They are the most commonuse
case by far, but it is completely possible to replace them.

It just does not feel right to me where this code is located now. Theprocessing can differ significantly between implementations. Providing onemethod of request processing in the generated code is confusing. Clientand server implementors might hesitate to ignore the code and write theirown request processing, because they feel that would be like workingagainst the framework.

Also if improvements to the request handling have to be made, allgenerated code has to be regenerated. If the code is moved to the clientand server implementations only the library needs to be upgraded and alldeployed code benefits.

To be able to replace Client and Processor other interfaces and classesare needed (i.e. the Service and Function types I described earlier).Again I feel it would be confusing if the generated code contained bothClient and Processor, and Service and Functions.

- Add an interface to the generated code and add support classes fordoing
asynchronous calls (i.e. support for sequence IDs). See below.
This can be done without changing any of the existing interfaces.

From the perspective of a caller of a function it does not look this way.How do I sent a request for which I either:

- poll for the result, or
- get notified by an event.

Each client implementation has to provide its own mechanism now. It wouldbe nice if Thrift provided standard interfaces (for polling, notificationsor both) for handling asynchronous calls.

- Drop the 'T' prefix of types as this is not customary in Java.

As I said when this was suggested for Ruby, I think that the T
prefix makes it easier to import classes without name collisions and
distinguishes specific interfaces like TProtocol from generic names
like "Protocol" or "Binary".

I think that (except for the case Bryan mentioned) collisions will be veryrare.

This is something that we discussed a few months ago when we had ameetingabout creating a more compact protocol. It think the generic term weused
to describe it was "out of bad exchange of IDL information".  I think it
is a great idea, but it has to be implemented cross-language.

Interoperability is my main objective. I will not implement anotherprotocol without consulting other people.

This is not the case. For example, the TNonblockingServer in C++accessessockets directly (via libevent) and hands TMemoryBuffers to theProcessor.
The purpose of the TServerTransport is to make it easier to implement
servers that are less I/O-aware (like the TThreadPoolServer) in a uniform
fashion.

I probably was too bold in my first statement. I think it is desirable tokeep the transports to reuse common methods of communication, but feelthat the transport handling should be pushed more to the background, i.e.an implementation detail instead of a core abstraction.

The extra layer of transports around I/O streams seems redundant to me.

I would be open to dropping the Transport interface in favor of something
built into Java, but I am not able to find anything that supports both
input and output.  For example, it looks like java.net.Socket extends
Object and implements no interfaces.

What about using 'InputStream' and 'OutputStream'? A protocol can be giventhe streams which it will read from and write to directly.

I think it is even possible to split protocol into two interfaces:InputProtocol and OutputProtcol.


--
Kind regards,

Johan Stuyts

Re: Redesign of the Java API

Reply via email to