Hello, We have use case where we have to load two tables say Customer (here customer ID is a sequence and gets generated while we load data) and purchase_order. While loading purchase_order need to populate customer_id based on SSN present in the purchase_order table. Since there is this dependency, trying to create this in one process group with Step1 - load customer, step2 - load purchase order with dummy customer_id, step 3 - join purchase_order and customer based on ssn and populate customer_id in purchase_order.
While doing so, there are multiple flow files generated for customer table as we are loading this data based on partition. Would like to know, how to trigger next processor only once, when all flow files are processed by previous processor? Looking for help or if there are any better approaches to achieve this? Thanks Sam
