Jim Ault wrote:

Note to Richard Gaskin, the benchmark master:  Is there any benefit to using

sort list1 numeric ascending by length(each)
sort list2 numeric ascending by length(each)
--then doing repeat for each
   ...
end repeat
--such that the lists are skewed so that the shortest comparisons will occur
closest to the beginning of the variable, as opposed to the extreme of the
shortest match happening at the end of the variable list?  Obviously the
same number of match operations must occur, but the sorted lists might yield
more time savings than the sorting operations consume.  Perhaps the reverse
is true (sort descending).

My guess is that for short lists of short lines it will make no difference; for this purpose, lists of 2000 lines or so would still count as short.

Hard to say, though I agree it's useful to note that a given algorithm may perform better in some cases and worse in others depending on how well it scales.

In general, custom sort functions are computationally intensive, measuring only slightly better than the less concise syntax needed to accomplish the same task longhand. When you ask the engine to do a lot of work, it's going to take time, even if the syntax that triggers it seems lightweight. :)
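To illustrate that point, here is a rough sketch (handler and variable names are mine, not from the thread) of two sort keys; the engine must evaluate the "by" expression once per line to build its keys, and a custom function key adds a handler call per line on top of that:

```livecode
-- hypothetical helper: sort key computed by a script function
function byFirstWordLength pLine
   -- one handler call per line just to build the sort key
   return the length of word 1 of pLine
end byFirstWordLength

on sortDemo @pList
   -- built-in expression key: concise syntax, real work underneath
   sort lines of pList numeric by length(each)
   -- custom function key: costlier still
   sort lines of pList numeric by byFirstWordLength(each)
end sortDemo
```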

As for how that all plays out as you describe, I don't know offhand and would have to test it out. Not likely this week, as I have a few deadlines to meet.

Unrelated to your specific question but relevant to this thread: a reminder that the split command is also computationally intensive, despite its seductively trim syntax. Under the hood it has to walk the whole text, keeping track of delimiters along the way, parsing out the parts and putting them into array slots, so its cost is really not much different from a "repeat for each" loop. If the data already lives in an array that's no problem, of course, since you won't be using split at all; but if you need to transform data from lists to arrays and back again, you may see a net performance loss compared with keeping it in a list and using "repeat for each".
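For the concrete case of summing a return-delimited list of numbers, the two routes look like this (a sketch with my own names; the point is that split does a full pass of its own before the loop even starts):

```livecode
on sumBoth pList
   local tData, tTotal
   -- Approach 1: split to an array first; split must walk the whole
   -- text, tracking delimiters and filling array slots
   put pList into tData
   split tData by return
   repeat for each element tNum in tData
      add tNum to tTotal
   end repeat
   -- Approach 2: stay in the list; one pass, no array to build
   put 0 into tTotal
   repeat for each line tNum in pList
      add tNum to tTotal
   end repeat
end sumBoth
```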

So while I don't have a specific answer to your question and in that regard am close to useless here <g>, perhaps the only thing I could contribute would be to stress that there is sometimes an inverse correlation between the simplicity of syntax and the complexity of the algorithm that syntax will invoke.

How that plays out in practice depends on many things, but here are two general guidelines I've adopted from my testing:

When a task requires accessing specific elements from a list, using an array will be about an order of magnitude faster than using "line x of ..." or "lineOffset".
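The three access styles side by side, as a sketch (parameter names are mine):

```livecode
on lookupDemo pList, pArray, pIndex, pKey
   local tValue
   -- array lookup: near-constant time per access
   put pArray[pKey] into tValue
   -- chunk lookup: the engine counts return delimiters from the top
   -- of the variable on every access, so cost grows with pIndex
   put line pIndex of pList into tValue
   -- lineOffset: scans the text for a match on every call
   put lineOffset(pKey, pList) into tValue
end lookupDemo
```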

However, when you need to traverse an entire data set, "repeat for each line" performs surprisingly well, testing here about 20% faster than "repeat for each element".
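A simple way to check numbers like these against your own data is a milliseconds harness along these lines (my own sketch, not Richard's test code):

```livecode
on benchTraversal pList
   local tStart, tCount
   put the milliseconds into tStart
   repeat for each line tLine in pList
      add 1 to tCount
   end repeat
   -- swap in "repeat for each element" over an array to compare
   answer "repeat for each line:" && (the milliseconds - tStart) && "ms"
end benchTraversal
```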

--
 Richard Gaskin
 Managing Editor, revJournal
 _______________________________________________________
 Rev tips, tutorials and more: http://www.revJournal.com