Re: Chunks vs Arrays - surprising benchmarking results

Paul Looney Fri, 07 Aug 2009 10:51:55 -0700

Richard,
Very true. Especially regarding the limitations of the filter.

The multiple search works best with hardwired searches and searches where the user selects from a series of pop-up or radio buttons. It works well when you are generating canned lists.

PL


On Aug 7, 2009, at 10:22 AM, Richard Gaskin wrote:

Paul Looney wrote:
I have nothing to add directly to the chunk vs array discussion (Trevor's reply was very good) but I have often found it helpful to increase the speed of compound selections by breaking them into individual ones.

For instance if you have a large database of names and sexes and you want to select every female named "Jan" ("Jan" could be male or female).
Select all of the Jans first (this will run much faster than the compound selection).
Then select all of the females from the result of the first selection (this will run faster because it is searching only "Jan"s - a very small list).
This double selection will run faster than a single compound selection.

Obviously this requires a known data-set where one filter will eliminate a lot of records (selecting "female", then selecting "Jan" would be much slower in our example because, presumably, half of the list is female and a small portion is Jan).

On many lists this can create a much bigger speed difference than chunk vs array variance you noted.
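
The two approaches described above can be sketched as follows. This is a minimal illustration in Python rather than Rev script, and the sample records, field order, and function names are all assumptions made for the example. It shows only the shape of each selection and that both return the same rows; it says nothing about relative timing in Rev.

```python
# Hypothetical sample data: one "name,sex" record per line.
records = ["Jan,F", "Jan,M", "Ann,F", "Bob,M", "Jan,F"]

def compound_select(rows, name, sex):
    """Single pass: test both fields on every row."""
    out = []
    for row in rows:
        fields = row.split(",")
        if fields[0] == name and fields[1] == sex:
            out.append(row)
    return out

def two_pass_select(rows, name, sex):
    """Two passes: the first keeps rows with the rarer value (the name),
    the second tests the other field on that much smaller list."""
    first = [row for row in rows if row.split(",")[0] == name]
    return [row for row in first if row.split(",")[1] == sex]
```

The order of the passes matters: running the selective test first is what keeps the second pass short.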
One of the tough challenges with this sort of benchmarking is that different methods will favor different test cases.
But with delimited rows and columns, I haven't found a way to make a two-pass search run faster than one pass, except in very specialized cases as you noted.
There's a temptation to use the filter command for the first pass, but filter is only faster when testing the first few items; filtering on the 10th item is much slower, and attempting to test the 50th item in a sample data set caused Rev to hang. RegEx is a harsh mistress.
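
To illustrate why testing a later item with a wildcard pattern gets awkward, here is a rough Python sketch. Python's fnmatch module is only a stand-in here; it is not Rev's filter command, and Rev's matching may behave differently. The helper names and sample rows are hypothetical. The pattern needs one "*," per skipped item, and because "*" can also span commas, the pattern can match the value in a later item than intended, while an explicit split tests exactly one item.

```python
from fnmatch import fnmatchcase

def nth_item_pattern(n, value):
    """Build a wildcard pattern with n-1 leading "*," groups before `value`.
    Only handles items that are not the last on the line."""
    return "*," * (n - 1) + value + ",*"

def filter_lines(lines, n, value):
    """Keep lines matching the wildcard pattern (stand-in for a filter pass)."""
    pattern = nth_item_pattern(n, value)
    return [line for line in lines if fnmatchcase(line, pattern)]

def split_lines(lines, n, value):
    """Keep lines whose n-th comma-delimited item equals `value` exactly."""
    return [line for line in lines if line.split(",")[n - 1] == value]

# Hypothetical rows: "Jan" sits in item 3, item 2, and item 4 respectively.
rows = ["a,b,Jan,d", "a,Jan,c,d", "x,y,z,Jan,q"]
```

With these rows, split_lines(rows, 3, "Jan") keeps only the first row, while filter_lines(rows, 3, "Jan") also keeps the third, since the wildcard is free to absorb an extra item.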
In my case, I don't often know in advance which item will be searched. The queries I'm running usually come from a Search dialog in which the user can specify criteria. I could make the search function smart enough to special-case certain types of searches to use a two-pass method in which the first pass is the filter command where practical, but the overhead of analyzing both the query and the data to make such determinations may detract from the benefits of doing so, esp. since my continued testing on this is increasingly nudging me toward multi-dimensional arrays anyway.
Even with the data bloat and the surprising overhead of moving arrays in and out of storage, with a little extra work to deal with those the performance of arrays seems unbeatable in the broadest range of use cases I've run thus far.
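
As a rough illustration of the array-based approach with user-chosen criteria, here is a small Python sketch. A nested dict stands in for a multi-dimensional array; the column names, sample data, and function names are assumptions made for the example, not anything taken from the benchmarks discussed here.

```python
def to_records(text, columns):
    """Parse comma-delimited lines into {row_number: {column: value}}."""
    records = {}
    for i, line in enumerate(text.splitlines(), start=1):
        records[i] = dict(zip(columns, line.split(",")))
    return records

def search(records, criteria):
    """Return the row numbers whose fields equal every value in `criteria`.
    `criteria` maps column name to required value, so any column can be
    searched without knowing in advance which one it will be."""
    return [key for key, rec in records.items()
            if all(rec.get(col) == val for col, val in criteria.items())]

# Hypothetical data set with three columns.
data = "Jan,F,Boston\nJan,M,Denver\nAnn,F,Boston"
table = to_records(data, ["name", "sex", "city"])
```

Once the data is parsed, a query on any column costs a keyed lookup per row rather than a scan across delimiters, which is the property that makes the position of the searched item irrelevant.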
--
 Richard Gaskin
 Fourth World
 Revolution training and consulting: http://www.fourthworld.com
 Webzine for Rev developers: http://www.revjournal.com
_______________________________________________
use-revolution mailing list
[email protected]
Please visit this url to subscribe, unsubscribe and manage your subscription preferences:
http://lists.runrev.com/mailman/listinfo/use-revolution

