[R] Importing data into R and combining 2 files

2009-05-14 Thread Sunita22

Hello

I have to import 2 txt files into R. 1 file contains the data and the other
contains the header, column headings, datatypes and labels for the data.

I have 2 problems:

1) my data file has mixed type of data e.g. 1 2 3 4 5 3-5 02/04/06 3 4 5 and
so on, the data file is tab separated. when I import it, the data is getting
stored in one single variable say V1. I need to separate it into rows and
columns. how do I this? Which commands in R would be useful for the same?

2) The other file is also tab separated. the 6 lines contains header and
introduction as in the name of the dataset, year, etc. and then column names
its datatypes and labels. After importing the data in this file also gets
stored in one single variable. I need to separate it into rows and columns.
how do I this? Which commands in R would be useful for the same?

Thank you in advance 

Regards
Sunita
-- 
View this message in context: 
http://www.nabble.com/Importing-data-into-R-and-combining-2-files-tp23545291p23545291.html
Sent from the R help mailing list archive at Nabble.com.

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Re: [R] Importing data into R and combining 2 files

2009-05-14 Thread jim holtman
What have you tried?  Check the Intro manual for hints.

?read.table   probably using sep='\t'

On Thu, May 14, 2009 at 1:30 PM, Sunita22 sunita...@gmail.com wrote:


 Hello

 I have to import 2 txt files into R. 1 file contains the data and the other
 contains the header, column headings, datatypes and labels for the data.

 I have 2 problems:

 1) my data file has mixed type of data e.g. 1 2 3 4 5 3-5 02/04/06 3 4 5
 and
 so on, the data file is tab separated. when I import it, the data is
 getting
 stored in one single variable say V1. I need to separate it into rows and
 columns. how do I this? Which commands in R would be useful for the same?

 2) The other file is also tab separated. the 6 lines contains header and
 introduction as in the name of the dataset, year, etc. and then column
 names
 its datatypes and labels. After importing the data in this file also gets
 stored in one single variable. I need to separate it into rows and columns.
 how do I this? Which commands in R would be useful for the same?

 Thank you in advance

 Regards
 Sunita
 --
 View this message in context:
 http://www.nabble.com/Importing-data-into-R-and-combining-2-files-tp23545291p23545291.html
 Sent from the R help mailing list archive at Nabble.com.

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide
 http://www.R-project.org/posting-guide.htmlhttp://www.r-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.




-- 
Jim Holtman
Cincinnati, OH
+1 513 646 9390

What is the problem that you are trying to solve?

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Re: [R] Importing data into R and combining 2 files

2009-05-14 Thread Sunita Patil
Hello

Yes I have used read.table(file name, sep=\t) for reading the text file

Thank you

On Thu, May 14, 2009 at 11:07 PM, jim holtman jholt...@gmail.com wrote:

 What have you tried?  Check the Intro manual for hints.

 ?read.table   probably using sep='\t'

 On Thu, May 14, 2009 at 1:30 PM, Sunita22 sunita...@gmail.com wrote:


 Hello

 I have to import 2 txt files into R. 1 file contains the data and the
 other
 contains the header, column headings, datatypes and labels for the data.

 I have 2 problems:

 1) my data file has mixed type of data e.g. 1 2 3 4 5 3-5 02/04/06 3 4 5
 and
 so on, the data file is tab separated. when I import it, the data is
 getting
 stored in one single variable say V1. I need to separate it into rows and
 columns. how do I this? Which commands in R would be useful for the same?

 2) The other file is also tab separated. the 6 lines contains header and
 introduction as in the name of the dataset, year, etc. and then column
 names
 its datatypes and labels. After importing the data in this file also gets
 stored in one single variable. I need to separate it into rows and
 columns.
 how do I this? Which commands in R would be useful for the same?

 Thank you in advance

 Regards
 Sunita
 --
 View this message in context:
 http://www.nabble.com/Importing-data-into-R-and-combining-2-files-tp23545291p23545291.html
 Sent from the R help mailing list archive at Nabble.com.

 __
 R-help@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-help
 PLEASE do read the posting guide
 http://www.R-project.org/posting-guide.htmlhttp://www.r-project.org/posting-guide.html
 and provide commented, minimal, self-contained, reproducible code.




 --
 Jim Holtman
 Cincinnati, OH
 +1 513 646 9390

 What is the problem that you are trying to solve?




-- 
Our Thoughts have the Power to Change our Destiny.
Sunita

[[alternative HTML version deleted]]

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Re: [R] Importing data into R and combining 2 files

2009-05-14 Thread Andy Choens
On Thu, 2009-05-14 at 10:30 -0700, Sunita22 wrote:
 Hello
 
 I have to import 2 txt files into R. 1 file contains the data and the other
 contains the header, column headings, datatypes and labels for the data.
 

This is your first complicating factor.

 I have 2 problems:
 
 1) my data file has mixed type of data e.g. 1 2 3 4 5 3-5 02/04/06 3 4 5 and
 so on, the data file is tab separated. when I import it, the data is getting
 stored in one single variable say V1. I need to separate it into rows and
 columns. how do I this? Which commands in R would be useful for the same?
 

This shouldn't be too hard.

 2) The other file is also tab separated. the 6 lines contains header and
 introduction as in the name of the dataset, year, etc. and then column names
 its datatypes and labels. After importing the data in this file also gets
 stored in one single variable. I need to separate it into rows and columns.
 how do I this? Which commands in R would be useful for the same?
 

This isn't that hard either, but it's not all in the best place.


The following is my 2 cents on this. I don't know what platform you are
on, so it's possible that my reference to sed may be more trouble than
it's worth. You have it if you are running Linux or OS X. 

Your data structure is part of the problem. Where is this data set
coming from? That could be a key piece of information that could help
someone show you a short cut. 

I would start by rolling your two files together into one big happy tab
separated file. You can remove the header entirely. It's just going to
get in the way. I am assuming that the order of your variables
(horizontally) are in the same order in the two files. I would double
check that these are in the same horizontal oder before actually
proceeding any further.

Delete the header. It's not going to to import correctly with
read.table(). You could stick this in as a note in your .R code if you
would like. (#)

As for labels, it is often easier in R to drop the integer = factor
label structure found in programs like SPSS. Rather than 1=Yes 5=No I
use Yes and No in the actual data. For most categorical data, this makes
it easier to work with. For ordinal data it can be more of a problem
though. If it's all just categorical, I would use a tool such as sed
(Linux/Unix commandline) to go through and apply my labels. Or you can
pull the data into R first and then do this with R. It's your choice. If
you are on Windows and don't know what sed is, forget about sed and just
use R to reassign your variables. R may make your life easier here.

When you use read.table to import your text file, it will store it in a
single variable. This variable will be a data frame and should preserve
your individual columns and rows. If you aren't familiar with data
frames, you should really start with some introductory material. I will
assume that you are in a hurry. There are some really great texts such
as Introduction to R that you should read, but a quick primer can be
found at:

http://www.statmethods.net/index.html

This is an especially good link if you've ever used SPSS/PSPP before
trying to use R, since the author also started in SPSS and understands
how/why R is confusing to people making this switch. There are also some
good links to other introductory materials that you should read.

Since you have all of your labels in a separate

Note: You will get more help on this forum if your request for help
includes reproducible code/information. Thus, if you told us how to
reproduce a dummy example of your two text file, (although this may be
private/proprietary), examples of the code you have tried and what you
get as a result usually results in better answers.



-- 
This is the price and the promise of citizenship.
- Barack Obama

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.