Hi,

an AE or CAS consumer can keep state and use it to aggregate information over
all the CASes it sees. Once the last document produced by the reader has been
processed, collectionProcessComplete() is called on the AE. This is the point
where you can evaluate the aggregated information further or persist the
results somewhere.
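
To make that concrete, here is a minimal sketch in plain Java with no UIMA
dependency. The class DocumentFrequencyAggregator is hypothetical and only
mirrors the lifecycle. In a real AE you would extend JCasAnnotator_ImplBase,
override process(JCas) and collectionProcessComplete(), and read the text via
jcas.getDocumentText(). Note that the real collectionProcessComplete() returns
void; the sketch returns the result only to keep the example short.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;

/** Hypothetical stand-in for a stateful AE; not part of the UIMA API. */
class DocumentFrequencyAggregator {
    // State that survives across documents: term -> number of documents containing it.
    private final Map<String, Integer> documentFrequency = new TreeMap<>();
    private int documentCount = 0;

    /** Called once per document (in UIMA: process(JCas)). */
    void process(String documentText) {
        documentCount++;
        // Count each term at most once per document.
        Set<String> terms =
                new HashSet<>(Arrays.asList(documentText.toLowerCase().split("\\s+")));
        for (String term : terms) {
            if (!term.isEmpty()) {
                documentFrequency.merge(term, 1, Integer::sum);
            }
        }
    }

    /** Called once after the last document (in UIMA: collectionProcessComplete()). */
    Map<String, Integer> collectionProcessComplete() {
        // This is where clustering, trend detection or persisting would happen.
        return new TreeMap<>(documentFrequency);
    }

    int getDocumentCount() {
        return documentCount;
    }

    public static void main(String[] args) {
        DocumentFrequencyAggregator aggregator = new DocumentFrequencyAggregator();
        aggregator.process("uima cas uima");
        aggregator.process("cas consumer");
        // prints {cas=2, consumer=1, uima=1}
        System.out.println(aggregator.collectionProcessComplete());
    }
}
```

A clustering engine or trending-topic detector would follow the same pattern:
collect per-document features in process() and do the set-level computation at
the end.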

Note that AEs can by default be deployed multiple times, in which case each
instance sees only part of the data, whereas CAS consumers (CCs) by default
cannot be deployed multiple times, so a CC sees every CAS.
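
The effect of multiple deployment on aggregated state can be illustrated with
another small plain-Java sketch. CountingComponent and DeploymentDemo are
hypothetical names, and the round-robin distribution is an assumption made only
for the demo; it is not how UIMA necessarily assigns CASes to instances.

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical component that just counts the documents it sees. */
class CountingComponent {
    private int seen = 0;

    void process(String document) {
        seen++;
    }

    int collectionProcessComplete() {
        return seen;
    }
}

class DeploymentDemo {
    /** Spreads the documents over the given number of instances (round-robin, assumed). */
    static int[] run(List<String> documents, int instances) {
        CountingComponent[] pool = new CountingComponent[instances];
        for (int i = 0; i < instances; i++) {
            pool[i] = new CountingComponent();
        }
        for (int i = 0; i < documents.size(); i++) {
            pool[i % instances].process(documents.get(i));
        }
        // Each instance reports only what it has seen itself.
        int[] totals = new int[instances];
        for (int i = 0; i < instances; i++) {
            totals[i] = pool[i].collectionProcessComplete();
        }
        return totals;
    }

    public static void main(String[] args) {
        List<String> documents = Arrays.asList("d1", "d2", "d3", "d4");
        // One instance, like a CC: prints [4]
        System.out.println(Arrays.toString(run(documents, 1)));
        // Two instances, like a multiply deployed AE: prints [2, 2]
        System.out.println(Arrays.toString(run(documents, 2)));
    }
}
```

With more than one instance, no single instance holds the full aggregate, so
set-level analysis needs either a single deployment or an extra merge step.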

-- Richard

On 03.10.2013, at 07:14, ThanhDK <[email protected]> wrote:

> Hi all,
> 
> I am new to UIMA and from what I see, the concept of an AE is very
> single-document centric. My question is, from the UIMA point of view, what is
> the standard way to write an analysis component whose input is a set of
> documents? For instance, a clustering engine that puts similar documents
> into the same basket, or a trending-topic detector that detects new topics
> in a set of documents.
> 
> I had a look at the CPE before, but it looks to me like just an iterator that
> collects documents one by one, sends them through the AEs and collects the output.
> 
> Regards
