Hi Arun,

Thanks for replying.

Sorry I didn't list it, I do have democracy index in my dataset.

The full  set includes these columns:

Country Code (3 letter abbreviation)
Country
Education
Freedom house demo index
log population
log real gdp
nominal savings
polity demo index
year (5 yr interval: 1950, 1955, 1960....)
sample (not sure what it's for yet)
world income instrument

This is a set of panel data for some 200 countries with 10 obs each. each
country has it's own missing data for various columns.

I imported the csv file into an object in R called Table 1.

Here is the few lines I wrote so far:

#Importing data from excel to R

Table1 <- read.csv("5YearPanel.csv")
Table1 <- data.frame(Table1)
Table1

# Deleting Netherlands data from Table1 and naming the new table deDutch

deDutch <- subset(Table1, country!="Netherlands")
deDutch

What I am trying to do next is to clean the data in R as follows:

1) Take avg values of each column for each country.
2) Store these values in a new object.
3) Based on the median income, I want to divide them into a subset called
high income (i.e. >median), and a subset called low income (i.e. <=median).
4) Once I get it cleaned, I believe I can start running regressions with the
data.

I'll look at your comments and try things out first.

Thank you!







--
View this message in context: 
http://r.789695.n4.nabble.com/keep-average-values-and-delete-duplicate-rows-in-R-tp4647677p4647681.html
Sent from the R help mailing list archive at Nabble.com.

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to