I will check,

thank you!

*Monica Franceschini*
Solution Architecture Manager

*Big Data Competence Center
Engineering Group*
Corso Stati Uniti 23/C, 35127 Padova, Italia
Tel: +39 049.8283547
Fax: +39 049.8692566
Twitter: @twittmonique
www.spagobi.org <http://www.spagobi.org> - www.eng.it <http://www.eng.it/web/eng_en/home> *proud SpagoBI supporter and contributor*
SpagoBI


Respect the environment. Please don't print this e-mail unless you really need to.

The information transmitted is intended only for the person or entity to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipient is prohibited. If you received this in error, please contact the sender and delete the material from any computer.

Il 27/10/2016 15:55, Bryan Bende ha scritto:
In the case of a GetFile processor it is managed by a single node since the file being picked up is on the local filesystem of one of the nodes.

There are other approaches to parallelize work... If you had a shared network location you can use ListFile + FetchFile in a certain way so that one node does the listing, and then all nodes do fetching. The same can be done for ListHDFS + FetchHDFS, and some other List + Fetch processors.

This post talks about some of this: https://community.hortonworks.com/articles/16120/how-do-i-distribute-data-across-a-nifi-cluster.html

-Bryan


On Thu, Oct 27, 2016 at 9:40 AM, Monica Franceschini <[email protected] <mailto:[email protected]>> wrote:

    Thank you Bryan,

    yes that's what I meant and it makes sense to me. Only a further
    question: is this stream parallelized if needed on the
    (hypothetical) Nifi cluster  or it is managed by a single node?

    Cheers

    Monica




Reply via email to