Pawel Sztromwasser wrote:
> Hello BASErs,
> 
> After little hands-on session with BASE I have reported some 
> bugs/comments on trac:
> 
> http://base.thep.lu.se/ticket/1360#preview
> http://base.thep.lu.se/ticket/1361#preview

Great. We'll have a look at them.


> These are rather minor things, but there is something that worries us 
> more. I know that the array design concept is not really holding when 
> comes to Illumina technology. It is more like a list of features. I am 
> also aware of that some features are being removed from consecutive 
> versions of .bgx files. Thus the 'skip feature' option had to be 
> introduced to import plugin allowing skipping some features that are 
> present in raw data files, but not .bgx files. It is a common practice 
> to run this plugin with skip option set to true.
>
> This has a serious implication: one can accidentally import data from 
> human IBS file to a raw bioassay that has a mouse (for example) array 
> design attached to it. If features are allowed to be skipped, a common 
> subset of features will be imported to db and no warnings will be issued 
> about wrong design. To mislead the user even more, the raw data file 
> will be marked as validated (because plugin finished run with no errors).
> 
> The array design concept was intended to provide a way of checking if 
> the right raw data files are linked with right samples/extracts in the 
> experiment, and that the right data is analyzed in experiment. It seems 
> that it might not be the case for Illumina.
> 
> Do you think there is a way to improve this and provide control over 
> what gets imported with what design?

One problem is that the raw data files contains no information whatsoever about
which array design that was used. Do you have any ideas yourself? Given a BGX 
file
and two raw data files (one matching and one none-matching) how do you tell
them apart?

The only thing I can think of is to report an error if the number of skipped
data lines goes above a certain threshold. Any idea about what a good value
for that threshold might be? 10? 100?

Another possibility is to add a check in the experiment overview that compare
the number of features on the array design with the number of raw data spots in
the raw bioassay. If the difference is too big a warning could be generated.

/Nicklas

------------------------------------------------------------------------------
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with 
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
_______________________________________________
The BASE general discussion mailing list
basedb-users@lists.sourceforge.net
unsubscribe: send a mail with subject "unsubscribe" to
basedb-users-requ...@lists.sourceforge.net

Reply via email to