Stephane,

I think the answer depends too much on the nature of your data and the
flow you run it through to name one best instance type.  Out of the box,
NiFi can run on a t2.micro, although even a modest flow will quickly
outgrow it.  A flow doing a high volume of regular expressions in parallel
might benefit from a compute-optimized instance.  Some flows that do simple
processing of many large objects will be bound more by IO than by CPU.  And
the performance of the systems NiFi connects to is likely to be a big
factor.

Learning which of these bottlenecks you will hit requires developing and
running the flow for a while.  I recommend a general-purpose instance until
you scale up enough to know which, if any, specialized instance optimized
for compute, memory, or IO would help.  You might also consider the disk
configurations and provisioned IOPS options that EC2 offers.  The great
thing about EC2 is that you can start small and trade up to a bigger
instance when you know more.
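
As a rough way to tell which of these cases applies on a Linux instance,
one could compare two samples of the aggregate "cpu" line in /proc/stat
taken while the flow is running.  The sketch below is only an illustration:
the function names and the 50% threshold are my own assumptions, not
anything NiFi or AWS provides, and it says nothing about memory pressure or
slow downstream systems.

```python
# Hypothetical helper: classify a sampling window as CPU-bound or IO-bound
# from two readings of the aggregate "cpu" line in Linux /proc/stat.
# Assumes the line has at least the first eight counters.

FIELDS = ("user", "nice", "system", "idle", "iowait", "irq", "softirq", "steal")


def read_cpu_line(path="/proc/stat"):
    """Return the first line of /proc/stat (the aggregate 'cpu' line)."""
    with open(path) as f:
        return f.readline()


def parse_cpu_line(line):
    """Parse 'cpu  user nice system idle iowait irq softirq steal ...'."""
    parts = line.split()
    if not parts or parts[0] != "cpu":
        raise ValueError("expected the aggregate 'cpu' line")
    return dict(zip(FIELDS, (int(v) for v in parts[1:1 + len(FIELDS)])))


def classify(before, after, threshold=0.5):
    """Label the interval between two samples.

    threshold is an arbitrary assumption: the share of elapsed CPU time
    that must be busy (or waiting on IO) before we call it a bottleneck.
    """
    delta = {k: after[k] - before[k] for k in FIELDS}
    total = sum(delta.values())
    if total <= 0:
        return "unknown"
    busy = sum(delta[k] for k in ("user", "nice", "system", "irq", "softirq"))
    if busy / total >= threshold:
        return "cpu-bound"
    if delta["iowait"] / total >= threshold:
        return "io-bound"
    return "neither"


# Example use on a live Linux host (not run here):
#   import time
#   a = parse_cpu_line(read_cpu_line()); time.sleep(60)
#   b = parse_cpu_line(read_cpu_line())
#   print(classify(a, b))
```

A "cpu-bound" result over long windows would point toward a
compute-optimized instance, and "io-bound" toward faster disks or
provisioned IOPS; "neither" suggests staying on a general-purpose instance
and looking at the connected systems instead.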

Thanks,

James

On Wed, Jun 29, 2016 at 8:51 PM, Stéphane Maarek <[email protected]>
wrote:

> Hi,
>
> I'm wondering which instance on AWS EC2 is best suited for NiFi (let's say
> for a standalone). I'm wondering if it's a compute instance (c4), or
> something else? and why?
>
> Thanks for your help!
> Stephane
>
