Re: [galaxy-user] Extract Genomic DNA Problem

2011-06-22 Thread Stephen Taylor

Hi jen,


In your original dataset, there are extra spaces around the tabs.
Where ^I indicates a tab and $ indicates an end-of-line character,
the entire datafile looks like this:

chr5 ^I 47258168 ^I 47259240$
chr18 ^I 1938527 ^I 1939965$
chr2 ^I 101973625 ^I 101974007$
chr4 ^I 75653898 ^I 75674045$
chr19 ^I 4258837 ^I 4263299$
chr4 ^I 39330049 ^I 39372715$
chr4 ^I 9606881 ^I 9610083$
chr15 ^I 7264937 ^I 7265599$
chr21 ^I 6659189 ^I 6667015$
chr2 ^I 351239 ^I 352821$

This could have been introduced in many ways, which is why the tools in
Text Manipulation can be so handy. Hopefully this helps!



Ah I see. Thanks for the tip.

Steve


Best,

Jen
Galaxy team

On 6/21/11 7:58 AM, Stephen Taylor wrote:

Hi Jeremy,



This is a formatting issue with your input file; it needs to be
tab-delimited but it's not currently. You'll need to:

(a) convert spaces to tabs using the Convert delimiters to Tabs tool;
(b) click on the pencil icon and set the data type to BED.


Thanks, this works, but as a user I cannot see (but obviously you can
:-)) that there is a difference between my original and the one I did
step (a) and (b) on. I thought I had uploaded a bed file and converted
tabs to spaces.

The data is shared here:

http://main.g2.bx.psu.edu/u/stephentaylor/h/test

Not working is my original

Working is the new data that I did (a) and (b) on.

What did I miss?

Thanks,

Steve



Best,
J.

On Jun 21, 2011, at 8:45 AM, Stephen Taylor wrote:


Hi,

I was trying to extract FASTA sequences using the following tab
separated data for Chicken on the Galaxy Main server:

chr5 47258168 47259240
chr18 1938527 1939965
chr2 101973625 101974007
chr4 75653898 75674045
chr19 4258837 4263299
chr4 39330049 39372715
chr4 9606881 9610083
chr15 7264937 7265599
chr21 6659189 6667015
chr2 351239 352821


I got the following galaxy output:



7: Extract Genomic DNA on data 6
empty
format: fasta, database: galGal3
Info: 10 warnings, 1st is: Unable to fetch the sequence from
'47258168' to '1072' for build 'galGal3'.
Skipped 10 invalid lines, 1st is #1, chr5 47258168 47259240

Any ideas what I am doing wrong?

Thanks,

Steve
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org. Please keep all replies on the list by
using reply all in your mail client. For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

http://lists.bx.psu.edu/




___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org. Please keep all replies on the list by
using reply all in your mail client. For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

http://lists.bx.psu.edu/




___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

 http://lists.bx.psu.edu/


Re: [galaxy-user] Extract Genomic DNA Problem

2011-06-22 Thread Fabian Owuor
Hi all,

i am not sure if this is the right forum.

but i need as much access to start up information on the system for
developers.

we are part of a group in Kenya trying to come up with some apps on a
gnome sequencing project we are starting here.

regards
-- 
Fabian .J. Owuor
 
 www.adelphitrading.co.ke | www.iHub.co.ke (red member)
 www.four99.co.ke | www.fabian.me.ke | www.epiphany.co.ke
 Yu: +254 753.333.824 | Safaricom: +254 721.948.852 | Skype: kootie
 Orange: +254 772.189.962 | Airtel: +254 735.270.852 | tweet: @theusfab

___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-user] Extract Genomic DNA Problem

2011-06-21 Thread Jeremy Goecks
Stephen,

This is a formatting issue with your input file; it needs to be tab-delimited 
but it's not currently. You'll need to:

(a) convert spaces to tabs using the Convert delimiters to Tabs tool;
(b) click on the pencil icon and set the data type to BED.

Best,
J.

On Jun 21, 2011, at 8:45 AM, Stephen Taylor wrote:

 Hi,
 
 I was trying to extract FASTA sequences using the following tab separated 
 data for Chicken on the Galaxy Main server:
 
 chr5   4725816847259240
 chr18  1938527 1939965
 chr2   101973625   101974007
 chr4   7565389875674045
 chr19  4258837 4263299
 chr4   3933004939372715
 chr4   9606881 9610083
 chr15  7264937 7265599
 chr21  6659189 6667015
 chr2   351239  352821
 
 
 I got the following galaxy output:
 
 
 
 7: Extract Genomic DNA on data 6
 empty
 format: fasta, database: galGal3
 Info: 10 warnings, 1st is: Unable to fetch the sequence from '47258168' to 
 '1072' for build 'galGal3'.
 Skipped 10 invalid lines, 1st is #1, chr5 47258168 47259240
 
 Any ideas what I am doing wrong?
 
 Thanks,
 
 Steve
 ___
 The Galaxy User list should be used for the discussion of
 Galaxy analysis and other features on the public server
 at usegalaxy.org.  Please keep all replies on the list by
 using reply all in your mail client.  For discussion of
 local Galaxy instances and the Galaxy source code, please
 use the Galaxy Development list:
 
 http://lists.bx.psu.edu/listinfo/galaxy-dev
 
 To manage your subscriptions to this and other Galaxy lists,
 please use the interface at:
 
 http://lists.bx.psu.edu/


___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-user] Extract Genomic DNA Problem

2011-06-21 Thread Stephen Taylor

Hi Jeremy,



This is a formatting issue with your input file; it needs to be tab-delimited 
but it's not currently. You'll need to:

(a) convert spaces to tabs using the Convert delimiters to Tabs tool;
(b) click on the pencil icon and set the data type to BED.


Thanks, this works, but as a user I cannot see (but obviously you can :-)) that there is a difference between my 
original and the one I did step (a) and (b) on. I thought I had uploaded a bed file and converted tabs to spaces.


The data is shared here:

http://main.g2.bx.psu.edu/u/stephentaylor/h/test

Not working is my original

Working is the new data that I did (a) and (b) on.

What did I miss?

Thanks,

Steve



Best,
J.

On Jun 21, 2011, at 8:45 AM, Stephen Taylor wrote:


Hi,

I was trying to extract FASTA sequences using the following tab separated data 
for Chicken on the Galaxy Main server:

chr5 4725816847259240
chr181938527 1939965
chr2 101973625   101974007
chr4 7565389875674045
chr194258837 4263299
chr4 3933004939372715
chr4 9606881 9610083
chr157264937 7265599
chr216659189 6667015
chr2 351239  352821


I got the following galaxy output:



7: Extract Genomic DNA on data 6
empty
format: fasta, database: galGal3
Info: 10 warnings, 1st is: Unable to fetch the sequence from '47258168' to 
'1072' for build 'galGal3'.
Skipped 10 invalid lines, 1st is #1, chr5 47258168 47259240

Any ideas what I am doing wrong?

Thanks,

Steve
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

http://lists.bx.psu.edu/




___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

 http://lists.bx.psu.edu/


Re: [galaxy-user] Extract Genomic DNA Problem

2011-06-21 Thread Jennifer Jackson

Hi Stephen,

In your original dataset, there are extra spaces around the tabs.
Where ^I indicates a tab and $ indicates an end-of-line character, 
the entire datafile looks like this:


chr5 ^I 47258168 ^I 47259240$
chr18 ^I 1938527 ^I 1939965$
chr2 ^I 101973625 ^I 101974007$
chr4 ^I 75653898 ^I 75674045$
chr19 ^I 4258837 ^I 4263299$
chr4 ^I 39330049 ^I 39372715$
chr4 ^I 9606881 ^I 9610083$
chr15 ^I 7264937 ^I 7265599$
chr21 ^I 6659189 ^I 6667015$
chr2 ^I 351239 ^I 352821$

This could have been introduced in many ways, which is why the tools in 
Text Manipulation can be so handy. Hopefully this helps!


Best,

Jen
Galaxy team

On 6/21/11 7:58 AM, Stephen Taylor wrote:

Hi Jeremy,



This is a formatting issue with your input file; it needs to be
tab-delimited but it's not currently. You'll need to:

(a) convert spaces to tabs using the Convert delimiters to Tabs tool;
(b) click on the pencil icon and set the data type to BED.


Thanks, this works, but as a user I cannot see (but obviously you can
:-)) that there is a difference between my original and the one I did
step (a) and (b) on. I thought I had uploaded a bed file and converted
tabs to spaces.

The data is shared here:

http://main.g2.bx.psu.edu/u/stephentaylor/h/test

Not working is my original

Working is the new data that I did (a) and (b) on.

What did I miss?

Thanks,

Steve



Best,
J.

On Jun 21, 2011, at 8:45 AM, Stephen Taylor wrote:


Hi,

I was trying to extract FASTA sequences using the following tab
separated data for Chicken on the Galaxy Main server:

chr5 47258168 47259240
chr18 1938527 1939965
chr2 101973625 101974007
chr4 75653898 75674045
chr19 4258837 4263299
chr4 39330049 39372715
chr4 9606881 9610083
chr15 7264937 7265599
chr21 6659189 6667015
chr2 351239 352821


I got the following galaxy output:



7: Extract Genomic DNA on data 6
empty
format: fasta, database: galGal3
Info: 10 warnings, 1st is: Unable to fetch the sequence from
'47258168' to '1072' for build 'galGal3'.
Skipped 10 invalid lines, 1st is #1, chr5 47258168 47259240

Any ideas what I am doing wrong?

Thanks,

Steve
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org. Please keep all replies on the list by
using reply all in your mail client. For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

http://lists.bx.psu.edu/




___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org. Please keep all replies on the list by
using reply all in your mail client. For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

http://lists.bx.psu.edu/


--
Jennifer Jackson
http://usegalaxy.org/
http://galaxyproject.org/
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

 http://lists.bx.psu.edu/


Re: [galaxy-user] Extract Genomic DNA Problem

2011-06-21 Thread John David Osborne
Hi Jen,

Where do you get your AXT or NIB files in order to do the extract genome 
operation? I understand that extract genomic DNA is dependent on those files 
and correct paths/files in AlignSec.loc?

This is for our local instance of Galaxy.

 -John




From: galaxy-user-boun...@lists.bx.psu.edu 
[galaxy-user-boun...@lists.bx.psu.edu] On Behalf Of Jennifer Jackson 
[j...@bx.psu.edu]
Sent: Tuesday, June 21, 2011 10:23 AM
To: Stephen Taylor
Cc: galaxy-u...@bx.psu.edu
Subject: Re: [galaxy-user] Extract Genomic DNA Problem

Hi Stephen,

In your original dataset, there are extra spaces around the tabs.
Where ^I indicates a tab and $ indicates an end-of-line character,
the entire datafile looks like this:

chr5 ^I 47258168 ^I 47259240$
chr18 ^I 1938527 ^I 1939965$
chr2 ^I 101973625 ^I 101974007$
chr4 ^I 75653898 ^I 75674045$
chr19 ^I 4258837 ^I 4263299$
chr4 ^I 39330049 ^I 39372715$
chr4 ^I 9606881 ^I 9610083$
chr15 ^I 7264937 ^I 7265599$
chr21 ^I 6659189 ^I 6667015$
chr2 ^I 351239 ^I 352821$

This could have been introduced in many ways, which is why the tools in
Text Manipulation can be so handy. Hopefully this helps!

Best,

Jen
Galaxy team

On 6/21/11 7:58 AM, Stephen Taylor wrote:
 Hi Jeremy,


 This is a formatting issue with your input file; it needs to be
 tab-delimited but it's not currently. You'll need to:

 (a) convert spaces to tabs using the Convert delimiters to Tabs tool;
 (b) click on the pencil icon and set the data type to BED.

 Thanks, this works, but as a user I cannot see (but obviously you can
 :-)) that there is a difference between my original and the one I did
 step (a) and (b) on. I thought I had uploaded a bed file and converted
 tabs to spaces.

 The data is shared here:

 http://main.g2.bx.psu.edu/u/stephentaylor/h/test

 Not working is my original

 Working is the new data that I did (a) and (b) on.

 What did I miss?

 Thanks,

 Steve


 Best,
 J.

 On Jun 21, 2011, at 8:45 AM, Stephen Taylor wrote:

 Hi,

 I was trying to extract FASTA sequences using the following tab
 separated data for Chicken on the Galaxy Main server:

 chr5 47258168 47259240
 chr18 1938527 1939965
 chr2 101973625 101974007
 chr4 75653898 75674045
 chr19 4258837 4263299
 chr4 39330049 39372715
 chr4 9606881 9610083
 chr15 7264937 7265599
 chr21 6659189 6667015
 chr2 351239 352821


 I got the following galaxy output:

 

 7: Extract Genomic DNA on data 6
 empty
 format: fasta, database: galGal3
 Info: 10 warnings, 1st is: Unable to fetch the sequence from
 '47258168' to '1072' for build 'galGal3'.
 Skipped 10 invalid lines, 1st is #1, chr5 47258168 47259240

 Any ideas what I am doing wrong?

 Thanks,

 Steve
 ___
 The Galaxy User list should be used for the discussion of
 Galaxy analysis and other features on the public server
 at usegalaxy.org. Please keep all replies on the list by
 using reply all in your mail client. For discussion of
 local Galaxy instances and the Galaxy source code, please
 use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

 To manage your subscriptions to this and other Galaxy lists,
 please use the interface at:

 http://lists.bx.psu.edu/


 ___
 The Galaxy User list should be used for the discussion of
 Galaxy analysis and other features on the public server
 at usegalaxy.org. Please keep all replies on the list by
 using reply all in your mail client. For discussion of
 local Galaxy instances and the Galaxy source code, please
 use the Galaxy Development list:

 http://lists.bx.psu.edu/listinfo/galaxy-dev

 To manage your subscriptions to this and other Galaxy lists,
 please use the interface at:

 http://lists.bx.psu.edu/

--
Jennifer Jackson
http://usegalaxy.org/
http://galaxyproject.org/
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-user] Extract Genomic DNA Problem

2011-06-21 Thread Jennifer Jackson

Hi John,

Currently, these come from the UCSC Genome Browser's download area.
http://hgdownload.cse.ucsc.edu/downloads.html

AXT files usually come from the primary data source. UCSC provides tools 
to create NIB, 2bit, etc. files from fasta (found under Source 
section, same link as above). If you have other genomes not in their 
primarily vertebrate dataset that you want to use with tools that 
require these formats, the tools would allow you to create your own.


The Galaxy team, too, has been discussing creating files like this for a 
wider range of genomes, but no specific plans are in place yet.


Please let us know if we can help more,

Best,

Jen
Galaxy team

On 6/21/11 10:24 AM, John David Osborne wrote:

Hi Jen,

Where do you get your AXT or NIB files in order to do the extract genome 
operation? I understand that extract genomic DNA is dependent on those files 
and correct paths/files in AlignSec.loc?

This is for our local instance of Galaxy.

  -John




From: galaxy-user-boun...@lists.bx.psu.edu 
[galaxy-user-boun...@lists.bx.psu.edu] On Behalf Of Jennifer Jackson 
[j...@bx.psu.edu]
Sent: Tuesday, June 21, 2011 10:23 AM
To: Stephen Taylor
Cc: galaxy-u...@bx.psu.edu
Subject: Re: [galaxy-user] Extract Genomic DNA Problem

Hi Stephen,

In your original dataset, there are extra spaces around the tabs.
Where ^I indicates a tab and $ indicates an end-of-line character,
the entire datafile looks like this:

chr5 ^I 47258168 ^I 47259240$
chr18 ^I 1938527 ^I 1939965$
chr2 ^I 101973625 ^I 101974007$
chr4 ^I 75653898 ^I 75674045$
chr19 ^I 4258837 ^I 4263299$
chr4 ^I 39330049 ^I 39372715$
chr4 ^I 9606881 ^I 9610083$
chr15 ^I 7264937 ^I 7265599$
chr21 ^I 6659189 ^I 6667015$
chr2 ^I 351239 ^I 352821$

This could have been introduced in many ways, which is why the tools in
Text Manipulation can be so handy. Hopefully this helps!

Best,

Jen
Galaxy team

On 6/21/11 7:58 AM, Stephen Taylor wrote:

Hi Jeremy,



This is a formatting issue with your input file; it needs to be
tab-delimited but it's not currently. You'll need to:

(a) convert spaces to tabs using the Convert delimiters to Tabs tool;
(b) click on the pencil icon and set the data type to BED.


Thanks, this works, but as a user I cannot see (but obviously you can
:-)) that there is a difference between my original and the one I did
step (a) and (b) on. I thought I had uploaded a bed file and converted
tabs to spaces.

The data is shared here:

http://main.g2.bx.psu.edu/u/stephentaylor/h/test

Not working is my original

Working is the new data that I did (a) and (b) on.

What did I miss?

Thanks,

Steve



Best,
J.

On Jun 21, 2011, at 8:45 AM, Stephen Taylor wrote:


Hi,

I was trying to extract FASTA sequences using the following tab
separated data for Chicken on the Galaxy Main server:

chr5 47258168 47259240
chr18 1938527 1939965
chr2 101973625 101974007
chr4 75653898 75674045
chr19 4258837 4263299
chr4 39330049 39372715
chr4 9606881 9610083
chr15 7264937 7265599
chr21 6659189 6667015
chr2 351239 352821


I got the following galaxy output:



7: Extract Genomic DNA on data 6
empty
format: fasta, database: galGal3
Info: 10 warnings, 1st is: Unable to fetch the sequence from
'47258168' to '1072' for build 'galGal3'.
Skipped 10 invalid lines, 1st is #1, chr5 47258168 47259240

Any ideas what I am doing wrong?

Thanks,

Steve
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org. Please keep all replies on the list by
using reply all in your mail client. For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

http://lists.bx.psu.edu/




___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org. Please keep all replies on the list by
using reply all in your mail client. For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

http://lists.bx.psu.edu/


--
Jennifer Jackson
http://usegalaxy.org/
http://galaxyproject.org/
___
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using reply all in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

   http://lists.bx.psu.edu/listinfo/galaxy