Suppose we have 3 people: Francis, Cedric and Nina. Based on what
they have eaten, we want to cluster them into "diet" and "non-diet" groups.

# Original data file, named "food.csv":
Francis|potato
Francis|chocolate
Francis|chocolate
Francis|milk
Cedric|vegetable
Cedric|vegetable
Cedric|potato
Nina|potato
Nina|chocolate
Nina|chocolate
Nina|potato

# Step 1: I plan to reshape the food.csv file into the following frame:
            Francis  Cedric  Nina
potato            1       1     2
chocolate         2       0     2
milk              1       0     0
fish              0       1     0
vegetable         0       2     0

But I have trouble reshaping the original file into the form I want.
So far I have only tried:
> org <- read.table("food.csv", col.names=c("name","food_list"), sep="|")  # import file into R
> x <- data.frame(Francis=rep(0,times=5))    # set up the frame
> rownames(x) <- org$food_list
Here it fails, and I don't know how to continue.
Please help me out with the proper R command.
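
For reference, here is a minimal sketch of one way the frame in Step 1 might be built, using table() to cross-tabulate food against name. The rownames() assignment above fails because org$food_list holds 11 values with duplicates while x has only 5 rows. The sketch reads the rows from an inline copy instead of food.csv so that it runs on its own; the names lines, foods, people and counts are only illustrative. "fish" is kept as a level even though none of the rows listed above contain it, so its counts come out as zero here.

```r
# Inline copy of the food.csv rows listed above (assumption: the file holds exactly these rows)
lines <- c("Francis|potato", "Francis|chocolate", "Francis|chocolate", "Francis|milk",
           "Cedric|vegetable", "Cedric|vegetable", "Cedric|potato",
           "Nina|potato", "Nina|chocolate", "Nina|chocolate", "Nina|potato")
org <- read.table(text = lines, sep = "|",
                  col.names = c("name", "food_list"),
                  stringsAsFactors = FALSE)

# Fix the row and column order explicitly, so foods with no rows still appear
foods  <- c("potato", "chocolate", "milk", "fish", "vegetable")
people <- c("Francis", "Cedric", "Nina")

# Count how often each (food, person) pair occurs
counts <- table(food = factor(org$food_list, levels = foods),
                name = factor(org$name, levels = people))

# Optional: turn the contingency table into a plain data frame
x <- as.data.frame.matrix(counts)
print(x)
```

With the real file, the read.table() call would take "food.csv" in place of text = lines.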

# Step 2: I want to draw 3 overlaid barplots, with the 5 food names on the
x-axis and the number of occurrences of each food per person on the y-axis.
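
A rough sketch of what Step 2 could look like, assuming that bars placed side by side per food are an acceptable reading of "overlapped". It uses barplot() with beside = TRUE on a people-by-food count table; the vectors name and food simply repeat the rows of food.csv listed above, and the variable names are illustrative.

```r
# Same rows as in food.csv above, written as two parallel vectors
name <- c(rep("Francis", 4), rep("Cedric", 3), rep("Nina", 4))
food <- c("potato", "chocolate", "chocolate", "milk",
          "vegetable", "vegetable", "potato",
          "potato", "chocolate", "chocolate", "potato")

foods  <- c("potato", "chocolate", "milk", "fish", "vegetable")
people <- c("Francis", "Cedric", "Nina")

# Rows = people, columns = foods, so that each food becomes one group on the x-axis
by_person <- table(factor(name, levels = people),
                   factor(food, levels = foods))

# beside = TRUE draws one bar per person within each food group
mids <- barplot(by_person, beside = TRUE,
                legend.text = TRUE,
                xlab = "food", ylab = "occurrences")
```

Leaving out beside = TRUE would stack the three people in one bar per food instead, which makes the individual counts harder to compare.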

# Step 3: By comparing the barplots, I should be able to conclude that
Francis and Nina are in the "non-diet" group, and Cedric is in the
"diet" group.

Well, that's my simple strategy for "clustering". If any of you has a better
suggestion, I would really appreciate it.
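
As one possible alternative to comparing plots by eye, here is a hedged sketch using hierarchical clustering: dist() on the per-person count vectors, then hclust() and cutree() with 2 groups. This is only one of many approaches, and the choice of Euclidean distance on raw counts is an assumption. The clustering only says who is grouped together; which group to call "diet" or "non-diet" still has to be decided by looking at the foods.

```r
# Same rows as in food.csv above, written as two parallel vectors
name <- c(rep("Francis", 4), rep("Cedric", 3), rep("Nina", 4))
food <- c("potato", "chocolate", "chocolate", "milk",
          "vegetable", "vegetable", "potato",
          "potato", "chocolate", "chocolate", "potato")

# One row per person, one column per food, as a plain integer matrix
m <- unclass(table(name, food))

# Euclidean distance between the people's count vectors
d <- dist(m)

# Agglomerative clustering (default: complete linkage), cut into 2 groups
hc <- hclust(d)
groups <- cutree(hc, k = 2)
print(groups)
```

On these rows Francis and Nina end up in the same group and Cedric in the other, which matches the conclusion expected in Step 3.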

Thanks!


______________________________________________
R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
