It depends on what you are doing with the data. What are the value types, and 
are you interested in all of them for analysis?

One thing to note: if your data contains unique IDs, they may need to be 
converted into ordinal Ints for Mahout. That means mapping each ID to an 
ordinal Int on input and, when you get the data back out, applying the 
reverse map to recover your original IDs.
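
A minimal sketch of that round trip, in plain Python rather than any Mahout 
API. The function name build_id_index and the sample names are made up for 
illustration; the only assumption is that ordinals are assigned in first-seen 
order starting at 0.

```python
def build_id_index(ids):
    """Assign each distinct ID an ordinal int, in first-seen order."""
    id_to_ordinal = {}   # forward map: original ID -> ordinal int
    ordinal_to_id = []   # reverse map: list index is the ordinal
    for raw_id in ids:
        if raw_id not in id_to_ordinal:
            id_to_ordinal[raw_id] = len(ordinal_to_id)
            ordinal_to_id.append(raw_id)
    return id_to_ordinal, ordinal_to_id


# Hypothetical IDs, e.g. the "name" column of a CSV file.
names = ["alice", "bob", "alice", "carol"]

id_to_ordinal, ordinal_to_id = build_id_index(names)

# Forward map before handing the data to the job.
encoded = [id_to_ordinal[n] for n in names]

# Reverse map on whatever ordinals come back out.
decoded = [ordinal_to_id[i] for i in encoded]
```

Both maps have to be kept (or persisted) alongside the job, since the output 
will only contain the ordinals.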

On May 22, 2014, at 10:47 PM, Chhaya Vishwakarma 
<[email protected]> wrote:


Hi,

I have a CSV file with the following columns: name, age, salary, experience

When I convert it to a sequence file, what exactly happens to the data?
What will the sequence file look like?

And once the sequence file is converted to vectors, what does it look like?
I want to understand what happens when we create sequence files and vectors 
from input data.

Regards,
Chhaya Vishwakarma


