Hi Tommaso, Thanks for the reply!
> Apart from the specific algorithms used for clustering / collapsing, which > would define the UIMA pipeline implementations/configurations, you could > use SolrCas [3] to finally write data in the index. Are there algorithm implementations built into the UIMA project that could be used as-is or tweaked, or would we need to create these from scratch? We have quite a large investment in the Hadoop ecosystem. Is it a common use-case to parallelise UIMA jobs using map/reduce?
