Re: [galaxy-dev] Query Fastq files for particular sequence elements

Hans-Rudolf Hotz Thu, 14 Jun 2012 10:29:18 -0700

Hi Jane

I recommend mapping the data again yourself....

Alternatively, you might wanna play with 'grep' (if you have the GalaxyUnix tools installed in your Galaxy server), or use the tool 'Selectlines that match an expression'. I would do a Fastq to Tab on your datafirst. Or you can try the emboss tool 'fuzznuc' on the Fasta version ofyou data.

...but assuming you are talking about 'big' fastq files, mapping thedata again yourself is most likely the way to go.


Regards, Hans



On 06/14/2012 07:12 PM, Jane Dorweiler wrote:

Greetings all,
I've been trying to find a way to query fastq files for particular
sequence elements. Our data was mapped using BWA by our collaborator,
and repetitive elements 'ignored', but we are now interested in
determining whether a couple specific repetitive elements of interest
are differentially represented in the raw read files. Are there any
tools that anyone has developed to do anything like this -- and that
perhaps I'm simply missing as I explore the available tools?
In the short term, I've written a very crude python script to begin
exploring the question, but I'm sure there has to be a much better way.
If there are no such tools available, I'm hopeful that someone might
have some helpful suggestions, or that perhaps it could be explored
during the upcoming conference &/or training day in July.
Thanks and Best Regards, Jane
--
Jane E. Dorweiler, PhD



___________________________________________________________
Please keep all replies on the list by using "reply all"
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

   http://lists.bx.psu.edu/

___________________________________________________________
Please keep all replies on the list by using "reply all"
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

 http://lists.bx.psu.edu/

Re: [galaxy-dev] Query Fastq files for particular sequence elements

Reply via email to