Stephane,

I think too much will depend on the nature of your data and the flow gauntlet you run it through. Out of the box, NiFi can run on a t2.micro, although a modest flow will quickly exceed that. A flow doing a high volume of regular expressions in parallel might benefit from a compute-optimized instance. Some flows with simple processing of many large objects will be bound more by IO than CPU. And the performance of the systems NiFi connects with is likely to be a big factor.
Learning which of these problems you will have requires developing and running the flow for a while. I recommend a general-purpose instance until you scale up enough to know which, if any, specialized instance optimized for compute, memory, or IO would help. You might also consider the disk configurations and provisioned IOPS options there. The great thing about EC2 is that you can start small and trade up to a bigger instance when you know more.

Thanks,

James

On Wed, Jun 29, 2016 at 8:51 PM, Stéphane Maarek <[email protected]> wrote:

> Hi,
>
> I'm wondering which instance on AWS EC2 is best suited for NiFi (let's say
> for a standalone). I'm wondering if it's a compute instance (c4), or
> something else? and why?
>
> Thanks for your help!
> Stephane
