I'm looking at creating a distributed streaming pipeline for processing text documents (e.g. cleaning, NER and machine learning). Documents will generally be under 1 MB and processing will be stateless. I was aiming to feed documents from various sources, along with additional data, into Kafka to be streamed to the processing pipeline in Samza. Would this be an appropriate use case for Samza?
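To make the question more concrete, here is a rough sketch of the kind of stateless, per-message step I have in mind. It is plain Python and does not use Kafka or Samza at all. The JSON message layout (`id`, `text`), the function names and the `MAX_BYTES` limit are assumptions for illustration only, and the NER and machine learning stages are left out.

```python
import json
import re

MAX_BYTES = 1_000_000  # assumption: documents stay under roughly 1 MB


def clean_text(text: str) -> str:
    """Drop control characters (except tab/newline/CR) and collapse whitespace."""
    text = re.sub(r"[\x00-\x08\x0b\x0c\x0e-\x1f]", "", text)
    return re.sub(r"\s+", " ", text).strip()


def process_message(payload: bytes) -> bytes:
    """Stateless step: one input message in, one output message out.

    The hypothetical message format is a UTF-8 JSON object with
    "id" and "text" fields; nothing is kept between calls.
    """
    if len(payload) > MAX_BYTES:
        raise ValueError("document larger than expected")
    doc = json.loads(payload.decode("utf-8"))
    cleaned = clean_text(doc["text"])
    out = {"id": doc["id"], "text": cleaned, "n_tokens": len(cleaned.split())}
    return json.dumps(out).encode("utf-8")
```

Because each call depends only on its input message, a step like this could in principle run in parallel across partitions without any shared store, which is the property I am hoping the pipeline can rely on.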
- Samza for text processing Rob Martin
- Re: Samza for text processing Jagadish Venkatraman
- Re: Samza for text processing Rob Martin
- Re: Samza for text processing Jagadish Venkatraman