Hi, For my project i am trying find item sets. I have been givena data file with data in the following form
lines with starts with letter A, those tells us the ID of the website and the Title The line with ”C” separates users and the lines with ”V” are the websites they visited. A,1121,1,"Microsoft Magazine","/magazine" A,1083,1,"MS Access Support","/msaccesssupport" A,1145,1,"Visual Fox Pro Support","/vfoxprosupport" A,1276,1,"Visual Test Support","/vtestsupport" C,"10001",10001 V,1000,1 V,1001,1 V,1002,1 C,"10002",10002 V,1001,1 V,1003,1 C,"10003",10003 V,1001,1 V,1003,1 V,1004,1 I am supposed to classify and filter the data based on theuser and the websites he visited. But the entire data is in one file and myprogram reads the file line by line. I am not able to figure out how to processthe user and then the websites he visited to bucket them into a list in whicheach line consists of websites visited by that user alone. could someone help me with generating the output file which must have a list of buckets, each bucket with the sites visited by one user. For example "Support Desktop","Knowledge Base" "Support Desktop","Knowledge Base","Microsoft.com Search" "Norway" "misc" "Knowledge Base","Microsoft.com Search" "International IE content" "Microsoft.com Search"
project.py
Description: Binary data
_______________________________________________ Chennaipy mailing list Chennaipy@python.org https://mail.python.org/mailman/listinfo/chennaipy