It works, thanks!
I am wondering why using Text Manipulation/Compute function, galaxy
changes brackets '[' to '/__ob__/' and /'__cb__' for ']', /so for this:
/str(c1)[1:2] --> //str(c1)__ob__1:2__cb__ /
thanks a lot,
The tool "Fetch Sequences -> Extract Genomic DNA" can be used to
extract fasta sequences. The coordinates can be BED, GTF, etc. and the
"genome" doesn't necessarily have to be an actual genome, just a fasta
file in your history.
To subset a data string, the tool "Text Manipulation -> Trim" might be
helpful. This would only work if you want to use the same rules for an
entire file (or split your file up and run the tool on those subfiles
using different rules). Practical for some cases, but not all.
And the final option is for coordinate data - tools in "Operate on
Genomic Intervals". Once you have the final coordinate set, going back
and using the "Fetch Sequences" tool can capture the associated result
fasta sequence, from a native genome or a fasta file in your history,
as described above.
Hopefully this gives you an option that will work for your project,
On 6/5/11 7:14 AM, Michal Stuglik wrote:
I am wondering if galaxy has tool to substring/extract sequence/text
from another sequence/text based on coordinates in columns (start, end
column) or how to do it in Text Manipulation/Compute?
all the best,
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org. Please keep all replies on the list by
using "reply all" in your mail client. For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:
To manage your subscriptions to this and other Galaxy lists,
please use the interface at: