New discussion topic on DataCleaner's online discussion forum 
(https://datacleaner.org/forum):

wvholland posted the subject 'When you split rows in a job at point A you only 
want to see them splitted after point A, right?'

-------------------

Sorry for the subject, but let's start with an example ...
Suppose you use the 'default' customers.csv (as part of the distribution) and 
analyze that with a value distribution on id. Now you will see that (at least 
id 1-100) will have unique entries. 

For the sake of this example, now let's assume ...
we add a tokenizer on the streetname, and split the tokens to individual rows 
(don't ask me why). And we add another value distribution and use the newly 
tokenized rows.

So we end up with 2 value distributions, one pointing to the original data 
source (not aware of a tokenizer and splitting of rows) and another completely 
aware of that.

Now, what we see is: that both analyzers are impacted by the splitting of the 
rows. Would it be possible that only after the transformer of splitting the 
tokens, all components further on that chain will be impacted by that splitting?


-------------------

View the topic online to reply - go to 
https://datacleaner.org/topic/1118/When-you-split-rows-in-a-job-at-point-A-you-only-want-to-see-them-splitted-after-point-A%2C-right%3F

-- 
You received this message because you are subscribed to the Google Groups 
"DataCleaner-notify" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/datacleaner-notify.
For more options, visit https://groups.google.com/d/optout.

Reply via email to