[RAT] Pipelines...

Robert Burrell Donkin Mon, 05 Aug 2013 07:12:26 -0700

Essentially, Rat is simple.

A source (perhaps a file system or a compressed archive) is walked,producing documents. Each document (perhaps a file in a file system, ora resources in an archive) flows through a pipeline - a series ofprocessing steps, enriching with various meta-data. An end pointcollates the data.


It seems to me that the current code fails to express this

...

At the moment, IDocumentAnalyser[1] is implemented by most steps in thepipeline (and other stuff too), wired together in a potentially flexiblefashion. This now seems over-engineered to me.

I think a concrete Pipeline would be more obvious, with controlledextension points at each step of the processing.


Opinions...?
Objections...?

Robert

[1]http://svn.apache.org/viewvc/creadur/rat/trunk/apache-rat-core/src/main/java/org/apache/rat/document/IDocumentAnalyser.java?view=markup

[RAT] Pipelines...

Reply via email to