Hello Jianguang,
Setting the type during loading from the FTP area into a history as a
dataset is optional. Through experience, I believe that this helps to
speed up the process, but this is purely anecdotal.
For datasets that have been imported, but have not had format
auto-detected, or the format detected was incorrect or not specific
enough (e.g. "fastq" when you want "fastqsanger"), just edit the
dataset's attributes. Click on the pencil icon in the upper right corner
of any dataset, click on the dataset tab in the form that comes up in
the middle panel, pick the type from the menu, and save.
This can be done with any dataset, at any point. After running certain
tools, reassignment of datatype or other metadata (column assignments,
found on the first tab of the same "Edit Attributes" form above) is
needed - often the tool will note if this is the case.
Galaxy has some logic that will prevent the misalignment of obviously
incorrect metadata - including widely inappropriate datatypes. But
tuning the type to be specific and correct between a group of common
format datatypes (for example: a group like "tabular, bed, interval")
would be for you to ensure.
Setting "database" is also optional and can be performed during or after
loading, or at any other time, by clicking through the pencil icon on
Edit Attributes form (first tab).
In most cases - skip converting spaces to tabs - unless you really are
working with a strict tabular dataset and are certain that no fields
contain internal whitespace (including informative/metadata headers).
Hopefully this helps,
Jen
Galaxy team
On 3/21/13 7:30 AM, Du, Jianguang wrote:
Hi Everyone,
When I upload my datasets onto my history via FTP method (using
FileZilla), do I need to specify the file format under "File Format"
of "Upload File from your computer"?
I noticed that the screencast of how to upload datasets via FTP just
leaves the "File Format" as "Auto-detect". However, I also noticed
this sentence in the help for Auto-detect: "the system will attempt to
detect Axt, Fasta, Fastqsolexa, Gff, Gff3, Html, Lav, Maf, Tabular,
Wiggle, Bed and Interval (Bed with headers) formats". Do I need to
specify the format of my datasets if the format of my datasets is not
listed in the sentence above?
Thanks.
Jianguang
___________________________________________________________
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org. Please keep all replies on the list by
using "reply all" in your mail client. For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:
http://lists.bx.psu.edu/listinfo/galaxy-dev
To manage your subscriptions to this and other Galaxy lists,
please use the interface at:
http://lists.bx.psu.edu/
--
Jennifer Hillman-Jackson
Galaxy Support and Training
http://galaxyproject.org
___________________________________________________________
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org. Please keep all replies on the list by
using "reply all" in your mail client. For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:
http://lists.bx.psu.edu/listinfo/galaxy-dev
To manage your subscriptions to this and other Galaxy lists,
please use the interface at:
http://lists.bx.psu.edu/