Once upon a time, Y D Sun wrote:
> I find that the elapsed time of filtering CDS for a sequence is
> proportional to the total number of sequences stored in a database. For
> exmaple, when there is only one sequence in the database, the filtering
> takes 5 seconds. If one more sequence is added to the DB, the filtering
> time for one sequence will take about 10 seconds. When there are 3
> sequences in DB, the filtering time will be about 15 seconds.
> 
> What is the cause of such a problem? 

Just to double-check...  I presume you're still talking about code
of the form:

    SequenceDB seqdb = new BioSQLSequenceDB(...);
    Sequence seq = seqdb.getSequence("foo");
    FeatureHolder cds = seq.filter(new FeatureFilter.ByType("CDS"));

If the time to do that shows any strong dependancy on the
number of sequences in the database, I'd suggest that this
is (very) strong evidence of missing/corrupted indices.  Are
you using the tweaked schema I sent to you last week?  Are
you still of PostgreSQL, or have you tried MySQL?

    Thomas.
_______________________________________________
Biojava-l mailing list  -  [EMAIL PROTECTED]
http://biojava.org/mailman/listinfo/biojava-l

Reply via email to