New reply on DataCleaner's online discussion forum (https://datacleaner.org/forum):
Kasper Sørensen replied to subject 'When you split rows in a job at point A you only want to see them splitted after point A, right?'

-------------------

You've hit a limit in what DC's engine currently supports. Changing the cardinality of rows (as the Tokenizer can do) is not that common, and it has the side effect that the change is applied to all rows once the operation has run.

Unfortunately, I don't think we have a really good workaround. Maybe the Grouper component can be used in between to correlate the records that shouldn't have been split in the first place, but that's a really dirty workaround IMO :-/

The cause is that the engine simply converts the graph into a list of components, each with a set of conditions on it. This worked really well until we started introducing components like the Tokenizer that can change the cardinality of rows. Only a few components can do this, and new components that need it generally create an output data stream to avoid the issue. But the issue should probably be fixed in the engine anyway.

-------------------

View the topic online to reply: https://datacleaner.org/topic/1118/When-you-split-rows-in-a-job-at-point-A-you-only-want-to-see-them-splitted-after-point-A%2C-right%3F
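The behaviour described in the reply can be illustrated with a small toy model. This is only a sketch under stated assumptions, not DataCleaner's actual engine code: the `Component` class, the `run_flattened` function, and the component names are all hypothetical. It assumes the job graph has already been flattened into an ordered list of components, each guarded by a condition, and shows why a component that sits on a different branch of the graph still ends up seeing the split rows.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List

Row = Dict[str, str]


@dataclass
class Component:
    """Hypothetical stand-in for an engine component in a flattened job."""
    name: str
    condition: Callable[[Row], bool]      # does this component apply to the row?
    process: Callable[[Row], List[Row]]   # returns one or more output rows


def run_flattened(components: List[Component], rows: Iterable[Row]) -> Dict[str, int]:
    """Push rows through a flat, ordered component list.

    Returns how many rows each component processed.
    """
    seen = {c.name: 0 for c in components}
    for row in rows:
        current = [row]
        for comp in components:
            next_rows: List[Row] = []
            for r in current:
                if comp.condition(r):
                    seen[comp.name] += 1
                    next_rows.extend(comp.process(r))
                else:
                    next_rows.append(r)
            # Whatever the previous component emitted is what every
            # later component in the list receives.
            current = next_rows
    return seen


def tokenize(row: Row) -> List[Row]:
    # Changes cardinality: one input row becomes one row per token.
    return [{**row, "token": t} for t in row["text"].split()]


def passthrough(row: Row) -> List[Row]:
    return [row]


# In the (imagined) graph, "analyzer_on_other_branch" hangs off the source,
# not off the tokenizer. In the flattened list it simply comes later.
components = [
    Component("tokenizer", lambda r: True, tokenize),
    Component("analyzer_after_split", lambda r: True, passthrough),
    Component("analyzer_on_other_branch", lambda r: True, passthrough),
]

counts = run_flattened(components, [{"text": "a b c"}])
print(counts)
```

With the single input row `{"text": "a b c"}`, the tokenizer processes one row while both later components process three, including the one that was never meant to be downstream of the split. A per-component condition cannot undo this, because the original unsplit row no longer exists once the tokenizer has run.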
