Concern Creep on the Processor interface

Berin Loritsch Tue, 11 Oct 2005 07:19:49 -0700

The Processor interface used to be very simple, and reasonablydocumented. Over time it has adopted new methods as part of itscontract, and those have not been well documented. The only reason thatI am bringing this up is that I am trying to implement my own Processor,and there is a lot that the interface requires that is of little or noconcern to me. First lets see what it used to be 2 years and 7 months ago:


interface Processor
{
   boolean process(Environment env) throws Exception;


   // the remaining methods were introduced in 2.1
   ProcessingPipeline processInternal(Environment env) throws Exception;
   Configuration getComponentConfigurations();
}

Already we see we added some scope creep from the 2.0 to the 2.1 series(the last I worked on Cocoon was the 2.0 series). For example, why isit necessary for a Processor contract to expose the componentconfigurations? The "processInternal" method is a coin toss.Presumably it is to enable cocoon:// or sitemap:// psuedo-protocols tobe more consistent--allowing a parent processor to callprocessInternal() on child processors. Nevertheless, one would wonderwhy the original process() method wasn't changed to return aProcessingPipeline instead of a boolean in this case.

At this point I also want to point out that the original process()method has decent JavaDocs so that you can understand its purpose andwhy it exists, the remaining methods are not that way.

A month later the getComponentConfigurations() method was refactored toreturn a Map--presumably of component Configuration objects, but thereis still no documentation on what the expected keys are.

Three months later processInternal was changed to buildPipline (samearguments and return value)--a better picture but still nothing in theJavaDocs to help understand the method purpose.

Two months later we add the Processor getRootProcessor() method tosupport internal redirects. Now this is one thing that makes Processorsmuch more difficult to implement. Why can't such a thing be handled bya ProcessorHelper or something. The root processor problem isorthagonal to the responsibilities of just one processor.

16 months, 2 weeks ago we had the biggest change to the wholeinterface. We have an interface with an internal class?! TheInternalPipelineDescription has a reason for existing, I'm sure.However I do have to wonder why it is part of the interface. At thispoint we are specifying implementation details in the interface. Thecontract of the Processor is no longer an active component (i.e. I tellyou how to do something), but a passive one (i.e. I ask you how to dostuff for myself). The buildPipeline() method is now altered to use theInternalPipelineDescription instead of return a ProcessingPipeline. Atthe same time we add the getContext() and getSourceResolver() methods.My head is now realing. This is pure insanity. Why not just get rid ofthe interface and simply use a base class? After all we are no longerdocumenting a contract, we are documenting how to implement theProcessor. My guess is that limitations in the TreeProcessor approachcaused this to be necessary. But again, couldn't most of these thingshave been handled by an external helper or utility class? Does itreally need to affect the interface?

11 months, 3 weeks ago we refactored the getComponentConfigurations()again so that we now have just an array of configurations. Not a biggy,but I'm still not convinced it is needed here.

3 months ago we have the last change to the Processor interface, and Iam convinced this should have been a TreeProcessor interface thatextends the core Processor interface. We added methods for setting,getting, and removing attributes for the sitemap interpreters.

The bottom line is that we have exploded the complexity of what wasoriginally intended to be a light-weight interface. The only solutionfor the processor is a complex solution. The only implementation for aprocessor is the tree processor. We've made sure that the interfacerequires it to be that way. I've got much simpler needs, and there is awhole host of issues with implementing all these methods that donothing. I'd like to see if we can't separate all the differentconcerns in the Processor interface into multiple interfaces. What isthe core concerns?

I'm in the process of identifying the real contracts, and I'll haveanother post about that.

Concern Creep on the Processor interface

Reply via email to