I have a data frame of ~200 columns and ~20,000 rows where each column
consists of binary responses (0,1) and a 9 for missing data. I am
interested in finding the columns for which there are fewer than 100
individuals with responses of 0. 

I can use an apply function to generate a table for each column, but I'm
not certain whether I can subset a list based on some criterion as
subset() is designed for vectors, matrices or dataframes.

For example, I can use the following:
tt <- apply(data, 2, table)

Which returns an object of class list. Here is some sample output from
tt

$R0235940b

    0     1     9 
 2004  1076 15361 

$R0000710a

    0     9 
    2 18439 

$R0000710b

    0     1     9 
 3333  3941 11167 

tt$R0000710a meets my criteria and I would want to be able to easily
find this instead of rolling through the entire output. Is there a way
to subset this list to identify the columns which meet the criteria I
note above?


Thanks,
Harold




        [[alternative HTML version deleted]]

______________________________________________
[email protected] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html

Reply via email to