It depends: what are you doing with the data? Also, what are the value types? Are you interested in all of them for analysis?
One thing to note: if your data contains unique IDs, they may need to be converted to ordinal ints for Mahout. You would map each ID to an ordinal int on input, and then apply the reverse mapping to the output to get your original IDs back.

On May 22, 2014, at 10:47 PM, Chhaya Vishwakarma <[email protected]> wrote:

Hi,

I have a CSV file with the following columns: name,age,salary,experience

When I convert it to a sequence file, what exactly happens to the data? What will the sequence file look like? And once the sequence file is converted to vectors, what do they look like?

I want to understand what happens when we create sequence files and vectors from input data.

Regards,
Chhaya Vishwakarma
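
A rough sketch of the ID mapping mentioned in the reply above, written in plain Python for brevity. It does not use any Mahout API; the function name, variable names, and sample IDs are all made up for illustration, and it assumes the IDs fit in memory.

```python
def build_id_maps(raw_ids):
    """Assign each distinct ID an ordinal int, in first-seen order.

    Returns (id_to_ordinal, ordinal_to_id): a dict for the forward
    mapping and a list for the reverse mapping.
    """
    id_to_ordinal = {}
    ordinal_to_id = []
    for raw in raw_ids:
        if raw not in id_to_ordinal:
            id_to_ordinal[raw] = len(ordinal_to_id)
            ordinal_to_id.append(raw)
    return id_to_ordinal, ordinal_to_id


# Hypothetical IDs as they might appear in an input file.
raw_ids = ["user-a7", "user-c3", "user-a7", "user-b9"]

id_to_ordinal, ordinal_to_id = build_id_maps(raw_ids)

# Forward map: what you would feed in instead of the original IDs.
encoded = [id_to_ordinal[r] for r in raw_ids]

# Reverse map: applied to the output to recover the original IDs.
decoded = [ordinal_to_id[i] for i in encoded]
```

The reverse mapping has to be kept (for example, written to a side file) for as long as you need to interpret the output, since the ordinals on their own carry no meaning.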
