Has someone benchmarked this?

I could easily stand corrected on this (not tested by me), but I'd look for a way to touch the variable outPutData "by reference" (in other words touching as little as possible).

"Put into", I think, has more overhead. In a situation with so much data, every little bit helps.

I know that in HC, when you put somevariable into tOutVariable, that the entire tOutVariable gets recreated, not just appended.

Ken Ray has a demonstration of the the use of "by reference" but does not allude to any speed improvement.

http://www.sonsothunder.com/devres/revolution/tips/scrp003.htm


Also again I could be corrected, but perhaps moving the data from field "massList" into a variable might speed things up a bit. I'm wondering that ANY referring to any field from inside such a long loop is a possible time consumer.

another thing -- the last line in the loop

put "====================" & return after outputData

perhaps that could be a constant, defined at the top of the script


constant kBarDisplay = "===================="

then use this in the loop
put kBarDisplay & return after outputData

this one might have negligible difference, but I haven't tested this.


Thanks to everyone who responded (Mark Schonewille, Mark Smith, Viktoras Didziulis, Robert Brenstein, apologies if I forgot anyone) - a whole range of solutions, including arrays, databases and 'just' variables. Anyway, I coded up the brute force method without any finess... here's the business bit of the script with some frippery...




put "NEW SEARCH, MASS = " & pepMass & " at " & ppm & " ppm error" & return after outputData







I ran a brutal test, of 45,000 lines in massList and 16,000 lines in seqDB

My crude attempt seems to be capable, even running within the Rev IDE, of completing the 720million comparisons in about 30minutes (OK, admittedly CoreDuo 2.66GHz, 2GB RAM). That's 24million a minute! (I deliberately put some searches that would match at the end of seqDB, to be sure I searched through most of the file each time). I am pretty happy with this, and I'd be looking for at least a 10-fold gain in speed to code up a harder solution.



Do you experts thing a 10-fold gain is feasible? 100-fold?

Rob

________________________________

Prof R J Beynon[h]
Proteomics and Functional Genomics Group
Faculty of Veterinary Science
University of Liverpool
Crown Street, Liverpool L69 7ZJ

________________________________

Phone: +44 151 794 4312

Fax: +44 151 794 4243

Email: [EMAIL PROTECTED]

http://www.liv.ac.uk/pfg

________________________________



This email was sent on Sun, 15 Jul, 2007 at 8:58 PM.



_______________________________________________
use-revolution mailing list
[email protected]
Please visit this url to subscribe, unsubscribe and manage your subscription preferences:
http://lists.runrev.com/mailman/listinfo/use-revolution


--


stephen barncard
s a n  f r a n c i s c o
- - -  - - - - - - - - -



_______________________________________________
use-revolution mailing list
[email protected]
Please visit this url to subscribe, unsubscribe and manage your subscription 
preferences:
http://lists.runrev.com/mailman/listinfo/use-revolution

Reply via email to