Hi, an AE/CasConsumer can have a state and use that aggregate information over all the CASes it sees. When the last document in the set produced by the reader is reached, the event collectionProcessComplete() is triggered on the AE. This is the point where further evaluation on the aggregated information can happen or where the results can be persisted somewhere.
Mind that AEs can by default be deployed multiple times, meaning that each only sees a part of the data, while CCs per default cannot be deployed multiple times, meaning they see each CAS. -- Richard On 03.10.2013, at 07:14, ThanhDK <[email protected]> wrote: > Hi all, > > I am new to UIMA and from what I see, the concept of AE is very > single-document centric. My question is, from UIMA point of view, what is > the standard way to write a analysis component of which input is a set of > documents? For instance, a clustering engine that clusters similar documents > to the same basket, or an trending topic detector that detect new topics > from a set of documents. > > I had a look at the CPE before but it looks to me like just a iterator that > collect documents one by one, send it through the AEs and collects the output. > > Regards
