New discussion topic on DataCleaner's online discussion forum (https://datacleaner.org/forum):
wvholland posted the subject 'When you split rows in a job at point A you only want to see them splitted after point A, right?' ------------------- Sorry for the subject, but let's start with an example ... Suppose you use the 'default' customers.csv (as part of the distribution) and analyze that with a value distribution on id. Now you will see that (at least id 1-100) will have unique entries. For the sake of this example, now let's assume ... we add a tokenizer on the streetname, and split the tokens to individual rows (don't ask me why). And we add another value distribution and use the newly tokenized rows. So we end up with 2 value distributions, one pointing to the original data source (not aware of a tokenizer and splitting of rows) and another completely aware of that. Now, what we see is: that both analyzers are impacted by the splitting of the rows. Would it be possible that only after the transformer of splitting the tokens, all components further on that chain will be impacted by that splitting? ------------------- View the topic online to reply - go to https://datacleaner.org/topic/1118/When-you-split-rows-in-a-job-at-point-A-you-only-want-to-see-them-splitted-after-point-A%2C-right%3F -- You received this message because you are subscribed to the Google Groups "DataCleaner-notify" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/datacleaner-notify. For more options, visit https://groups.google.com/d/optout.
