Re: [galaxy-dev] best practice for data matrices with metadata

2011-02-17 Thread Keith Sheppard
This seems really interesting! Just from a quick investigation it
looks like it is targeting the kind of problem I want to solve. I
wonder if you (or anyone else) have some experience to share with
bioHDF in terms of how easy it is to use or how much traction it's
gaining in the community? The mail list is very quiet
http://mail.hdfgroup.org/pipermail/biohdf_hdfgroup.org/

Thanks,
Keith

On Wed, Feb 16, 2011 at 4:21 PM, Kevin  wrote:
> I am wondering if  the biohdf project is a good way to keep the meta data.
>
> Cheers
> Kevin
> On 17-Feb-2011, at 5:11 AM, Keith Sheppard  wrote:
>
>> Hi All,
>>
>> I'm working on building some galaxy tools which can be used together
>> in a workflow and one data structure that seems to show up a lot is a
>> data matrix where both the data rows and data columns have annotations
>> which I would like to preserve through the workflow.
>>
>> Here is an example that hopefully make this more concrete: we have X
>> genotyping probes in a microarray that we have run on Y samples which
>> results in a data matrix with X rows and Y columns. Each of the probes
>> (rows) has annotation data like allele (A vs B allele), sequence, SNP
>> ID, etc and each of the columns (samples) has it's own annotation data
>> like strain, date etc...
>>
>> So what do you think is the best way to represent this kind of
>> structure? Does galaxy have a mechanism that allows you to associate
>> files as a logical group which would allow my intensity data and meta
>> data to stay together as a "dataset" without having to be in the same
>> file?
>>
>> Thank you
>> Keith
>> ___
>> To manage your subscriptions to this and other Galaxy lists, please use
>> the interface at:
>>
>>  http://lists.bx.psu.edu/
>



-- 
keithsheppard.name

___
To manage your subscriptions to this and other Galaxy lists, please use the 
interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] best practice for data matrices with metadata

2011-02-16 Thread Kevin
I am wondering if  the biohdf project is a good way to keep the meta  
data.


Cheers
Kevin
On 17-Feb-2011, at 5:11 AM, Keith Sheppard  wrote:


Hi All,

I'm working on building some galaxy tools which can be used together
in a workflow and one data structure that seems to show up a lot is a
data matrix where both the data rows and data columns have annotations
which I would like to preserve through the workflow.

Here is an example that hopefully make this more concrete: we have X
genotyping probes in a microarray that we have run on Y samples which
results in a data matrix with X rows and Y columns. Each of the probes
(rows) has annotation data like allele (A vs B allele), sequence, SNP
ID, etc and each of the columns (samples) has it's own annotation data
like strain, date etc...

So what do you think is the best way to represent this kind of
structure? Does galaxy have a mechanism that allows you to associate
files as a logical group which would allow my intensity data and meta
data to stay together as a "dataset" without having to be in the same
file?

Thank you
Keith
___
To manage your subscriptions to this and other Galaxy lists, please  
use the interface at:


 http://lists.bx.psu.edu/

___
To manage your subscriptions to this and other Galaxy lists, please use the 
interface at:

 http://lists.bx.psu.edu/


[galaxy-dev] best practice for data matrices with metadata

2011-02-16 Thread Keith Sheppard
Hi All,

I'm working on building some galaxy tools which can be used together
in a workflow and one data structure that seems to show up a lot is a
data matrix where both the data rows and data columns have annotations
which I would like to preserve through the workflow.

Here is an example that hopefully make this more concrete: we have X
genotyping probes in a microarray that we have run on Y samples which
results in a data matrix with X rows and Y columns. Each of the probes
(rows) has annotation data like allele (A vs B allele), sequence, SNP
ID, etc and each of the columns (samples) has it's own annotation data
like strain, date etc...

So what do you think is the best way to represent this kind of
structure? Does galaxy have a mechanism that allows you to associate
files as a logical group which would allow my intensity data and meta
data to stay together as a "dataset" without having to be in the same
file?

Thank you
Keith
___
To manage your subscriptions to this and other Galaxy lists, please use the 
interface at:

  http://lists.bx.psu.edu/