Re: [R] Getting a list of unique gene names from a list with semi-colons

R. Michael Weylandt <michael.weyla...@gmail.com> Fri, 06 Jan 2012 18:19:29 -0800

I think something like this should work (untested), where XXX is your column of gene names:

unique(unlist(strsplit(XXX, ";")))
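
For a fuller picture, here is a small self-contained sketch of the same idea, splitting on the semicolon that separates the names in your data. The vector `genes` is only a hypothetical stand-in for your column (values copied from your message), and the `as.character()` call is my assumption in case the column was read in as a factor, which strsplit() will not accept.

```r
# Hypothetical stand-in for the gene-name column (values taken from the question)
genes <- c("FAM81A", "LOC283050;LOC283050;LOC283050;ZMIZ1", "PINK1;PINK1",
           "MRPL12;MRPL12", "C1orf114", "MMS19;UBTD1")

# as.character() guards against a factor column; fixed = TRUE treats ";" literally
parts <- strsplit(as.character(genes), ";", fixed = TRUE)

# Flatten the list of character vectors and drop repeats, keeping first-seen order
unique_genes <- unique(unlist(parts))
print(unique_genes)
```

With your real data you would replace `genes` with the column itself, e.g. `mydata$genecol` (both names made up here).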


Michael

On Jan 6, 2012, at 8:05 PM, Kurinji Pandiyan <kurinji.pandi...@gmail.com> wrote:

> Hello,
> 
> I have one column in my dataframe that has gene names of interest.
> Unfortunately, due to the fact that some probes lie between two genes or
> two transcripts of a gene, it looks something like this -
> 
> FAM81A
> LOC283050;LOC283050;LOC283050;ZMIZ1
> PINK1;PINK1
> MRPL12;MRPL12
> C1orf114
> MMS19;UBTD1
> 
> I would like to know how to get a list with all the names with no
> semi-colons and removing the replicates. I would like the end result to
> look like -
> 
> FAM81A
> LOC283050
> ZMIZ1
> PINK1
> MRPL12
> C1orf114
> MMS19
> UBTD1
> 
> Thanks a lot for your help!
> Kurinji
> 
> ______________________________________________
> R-help@r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
