Re: [Jprogramming] kadane algorithm vs brute force timing and memory

Raul Miller Fri, 07 Apr 2023 21:37:17 -0700

Generally speaking, when thinking about performance, benchmarking is a
critical step. There's a *lot* of theoretical issues which are only
approximations, at best.


That said, in this case, there's also documentation (if you know where to look).

For example:

https://wiki.jsoftware.com/wiki/Vocabulary/SpecialCombinations#Virtual_Nouns

I hope this helps,

-- 
Raul

On Fri, Apr 7, 2023 at 11:05 PM Thomas McGuire <tmcguir...@gmail.com> wrote:
>
> OK thanks to you, Raul and Henry I think I have gotten this through my thick 
> head. The crux of my misunderstanding are the results you get from performing 
> the box verb with the suffixes adverb:
>
>    <\. _2 1 _3 4 _1 2 1 _5 4
> ┌─────────────────────┬──────────────────┬────────────────┬─────────────┬───────────┬────────┬──────┬────┬─┐
> │_2 1 _3 4 _1 2 1 _5 4│1 _3 4 _1 2 1 _5 4│_3 4 _1 2 1 _5 4│4 _1 2 1 _5 4│_1 2 
> 1 _5 4│2 1 _5 4│1 _5 4│_5 4│4│
> └─────────────────────┴──────────────────┴────────────────┴─────────────┴───────────┴────────┴──────┴────┴─┘
>
> From what you are saying this is made by a linear walk through the list. The 
> results being displayed are essentiall intermediate results from serial 
> appending going on inside of J.
>
> So when I broke down the ‘kad’ algorithm it was my ignorance that made me 
> line up the above with the ([ >. +)/ operation and think this was being done 
> more than a single pass through the original list.
>
> Wow.
>
> Thanks for being so patient with me.
>
> Tom McGuire
>
> > On Apr 7, 2023, at 9:51 AM, Henry Rich <henryhr...@gmail.com> wrote:
> >
> > (u/\. y) is defined as operating on successive suffixes, making it 
> > apparently O(n^2), but the implementation runs right to left in (n-1) 
> > applications of u.  The result of each suffix is fed into the next-larger 
> > suffix.
> >
> > (u/\ y), OTOH, runs in O(n^2) time unless u is known to be associative.
> >
> > Henry Rich
> >
> > On 4/7/2023 3:09 AM, Thomas McGuire wrote:
> >> In my haste to produce an email after finally figuring out how Pepe’s 
> >> elegant J implementation of the maximal sum of the subsequences worked ,I 
> >> can see how you might interpret this as I was disappointed in the 
> >> implementation as not being “pure” kadane. I meant only to classify the 
> >> implementation so that the CPU times I was recording made sense
> >>
> >> From the perspective of big O analysis. Pure kadane is O(n). Where Pepe’s 
> >> algorithm is being applied to a list (of length n)  of lists of varying 
> >> length 1..n, The operation performed 1+1+2+…+n-1 times. Which places it 
> >> about O(n^2) though it’s been a while since I have done any algorithm 
> >> analysis.
> >>
> >> In terms of a J implementation of the broader category of maximal sum of 
> >> the subsequences of a list of numbers, Pepe’s algorithm is fantastic. From 
> >> the perspective of my tutorial it is an incredibly insightful jump from 
> >> the naive brute force implementations I had used.  It is such a concise J 
> >> implementation that highlights a significant portion of J’s advanced 
> >> techniques. I intend on adding it to my tutorial I just hope I can explain 
> >> it well enough to make it accessible to my target audience.
> >>
> >>
> >>> On Apr 6, 2023, at 10:55 PM, Henry Rich <henryhr...@gmail.com> wrote:
> >>>
> >>> I think that when you say Pepe's code is not the Kadane algorithm you are 
> >>> conflating algorithm and implementation.
> >>>
> >>> Kadane's idea is that (1) at each new prospective starting point for the 
> >>> maximal sequence, you have the option of starting at 0 or the previous 
> >>> best total; (2) every position is a prospective ending point.  Pepe's 
> >>> algorithm follows these observations.
> >>>
> >>> The usual implementations keep two values through the loop: (a) the 
> >>> maximum value found on sequences that have ended; (b) the current total 
> >>> on sequences that the new value may add onto.
> >>>
> >>> Well, that's one way to do it.  Another is to produce (a) as an 
> >>> intermediate result for each possible ending position, and then at the 
> >>> end see which of those values is the largest.  That's what Pepe did.  I 
> >>> approve his decision, since keeping two values through the scan would 
> >>> require ponderous indexing operations. This is the sort of implementation 
> >>> decision that surprises users of compiled languages.
> >>>
> >>> Pepe's implementation in no way changes the algorithm.  It's merely a 
> >>> reordering of when comparisons are made.  Pepe's algorithm is O(n), just 
> >>> as the usual algorithms are.
> >> As I tried to explain above the ([ >. +)/ is O(n) but when used on 
> >> suffixes with the ‘\.’ operator it becomes O(n^2). the kadane”0 
> >> implementation that Elijah proposed operates through the array one at a 
> >> time in O(n) producing a list of possible maximal sums which is extracted 
> >> with >./ also in O(n). That kadane algorithm uses your technique in 
> >> Chapter 25 Loopless Code VI, to perform a single run through the array.
> >>
> >> Just trying to be accurate so I can explain the implementations correctly 
> >> in my essay.
> >>> Pepe knew that >./ is very fast, probably 10% of the time of the other 
> >>> bit.  The time spent in (([ >. +)/\.) is O(n), not worth our discussion; 
> >>> but if you want to blame someone for the speed, blame the implementation 
> >>> (i. e. me), not the algorithm. Perhaps (+ 00&>.)~/\. would be faster; I 
> >>> haven't tested it, but we're down to minute questions of instruction 
> >>> ordering.
> >> No complaints about J’s speed. Even the naive brute force implementation 
> >> runs in less than a second on a 10000 element array. I have been using the 
> >> timings as a relative guage that the various implementations work the way 
> >> I think they are supposed to.
> >>
> >> The one thing I did have a question on now that I’ve drawn you into this 
> >> thread. Is that the memory requirement for Pepe’s algorithm is about 1/5 
> >> the size of the kadane”0 algorithm. The global variable assignment appears 
> >> to be the operation that causes the large amount of memory to be used. 
> >> Elijah tried to explain it and hsi explanation is valid for some of the 
> >> implementations such as the use of ‘\’ and Fold. But I don’t understand in 
> >> the setting of a global variable.
> >>
> >> Tom McGuire
> >>> Henry Rich
> >>>
> >>>
> >>>
> >>>
> >>> On 4/6/2023 8:12 PM, Thomas McGuire wrote:
> >>>>> On Apr 3, 2023, at 1:28 PM, Jose Mario Quintana 
> >>>>> <jose.mario.quint...@gmail.com> wrote:
> >>>>>
> >>>>> For what it is worth...
> >>>>>
> >>>>> kad=. >./@:(([ >. +)/\.)
> >>>>>
> >>>> OK that is worth a lot. But it’s not really the kadane algorithm. It 
> >>>> inserts a kadane-like summation in between elements for each suffix 
> >>>> generated. Kadane obtains the sum on a single pass through the array. 
> >>>> However it’s an incredibly elegant refactoring of the brute force 
> >>>> example and saves performing partial sums on the suffixes.
> >>>>
> >>>> I will say after figuring out exactly what is was doing it wasn’t 
> >>>> intuitively obvious (to me anyway) that this should work. A naive view 
> >>>> is that the largest sum in the example below is produced by the 
> >>>> subsequence :
> >>>> 4 _1 2 1
> >>>>
> >>>> So that sequence is never generated in Jose’s version of maximum sum. 
> >>>> However due to it’s Kadane-like properties it is generating the highest 
> >>>> sum for each suffix as if the partial sum started at the first element. 
> >>>> In essence using a kadane-like approach in place of performing partial 
> >>>> sums. Therefore in one pass on each suffix you find the greatest partial 
> >>>> sum without having to calculate each partial sum individually.
> >>>>
> >>>> Jose’s version is fairly quick coming in at about double the processing 
> >>>> time of Jose’s kadane”0 version and well below the brute force method of 
> >>>> taking the partial sums of all the suffixes.
> >>>>
> >>>> Tom McGuire
> >>>>
> >>>>> For example, the maximum sum subarray,
> >>>>>
> >>>>>
> >>>>>  kad _2 1 _3 4 _1 2 1 _5 4
> >>>>> 6
> >>>>>
> >>>>>
> >>>>>   kad i.0
> >>>>> __
> >>>>>
> >>>>>
> >>>>>
> >>>>>
> >>>>>
> >>>>> On Mon, Apr 3, 2023 at 5:22 AM Elijah Stone <elro...@elronnd.net> wrote:
> >>>>>
> >>>>>> Your kadane routines are all bound to have asymptotically poor space 
> >>>>>> usage
> >>>>>> under the present interpreter because >./ is not fused; so they all 
> >>>>>> must
> >>>>>> create an intermediate result array whose length is the same as ranvec,
> >>>>>> and
> >>>>>> then apply >./.  I think (haven't looked very closely) that locmax is 
> >>>>>> the
> >>>>>> same
> >>>>>> as your final answer; so you could skip generating the intermediate 
> >>>>>> array
> >>>>>> and
> >>>>>> simply look at locmax.  Better yet, get rid of the global variable, and
> >>>>>> turn
> >>>>>> this all into a fold single (rather than fold multiple), where locmax
> >>>>>> becomes
> >>>>>> the running value.
> >>>>>>
> >>>>>> On Mon, 3 Apr 2023, Elijah Stone wrote:
> >>>>>>
> >>>>>>>  7!:2 '>./ 1 kadane\ ranvec' [ locmax=: __
> >>>>>>> 1412576
> >>>>>>>   7!:2 '>./ kadane"0 ranvec' [ locmax=: __
> >>>>>>> 772960
> >>>>>>>   7!:2 '>./ kadane"1 ranvec2' [ locmax=: __ [ ranvec2=: ,.ranvec
> >>>>>>> 1412960
> >>>>>>>   1 <@$@$\ i.5
> >>>>>>> ┌─┬─┬─┬─┬─┐
> >>>>>>> │1│1│1│1│1│
> >>>>>>> └─┴─┴─┴─┴─┘
> >>>>>>>   <@$@$"0 i.5
> >>>>>>> ┌─┬─┬─┬─┬─┐
> >>>>>>> │0│0│0│0│0│
> >>>>>>> └─┴─┴─┴─┴─┘
> >>>>>>>
> >>>>>>> 1 kadane\ y applies kadane to length-1 subsequences of y--that is, in
> >>>>>> this
> >>>>>>> case, vectors, which have length 1.  Whereas kadane"0 applies to cells
> >>>>>> of y,
> >>>>>>> which have rank 0.  More space is required to store a length-1 vector
> >>>>>> than a
> >>>>>>> scalar, since in the former case the length also needs to be stored.
> >>>>>>>
> >>>>>>> On Mon, 3 Apr 2023, Thomas McGuire wrote:
> >>>>>>>
> >>>>>>>> I’m wondering if someone with more J internals knowledge can tell me
> >>>>>> why my
> >>>>>>> Kadane algorithm implementation is taking up more memory than my naive
> >>>>>> brute
> >>>>>>> force method that generates all subsequences then sums them. The time 
> >>>>>>> is
> >>>>>>> about what I would expect except for the FOLD operation. The time 
> >>>>>>> space
> >>>>>> data
> >>>>>>> were generated on a MacBook Pro 2019 era I9 processor 2.3GHz.
> >>>>>>>>  ranvec =: _5 + ?10000$10
> >>>>>>>>  kadane =: 3 : 'locmax =: y >. locmax + y'
> >>>>>>>>  kadane1 =: 3 : 'locmax =: (] >. locmax&+) y'
> >>>>>>>>
> >>>>>>>>  NB. first brute force implementation
> >>>>>>>>  timespacex '>./(>./@(+/\))\.ranvec'
> >>>>>>>> 0.032948 396448
> >>>>>>>>
> >>>>>>>>  NB. fork version brute force
> >>>>>>>>  timespacex '>./([: >./ (+/\))\. ranvec'
> >>>>>>>> 0.034208 396448
> >>>>>>>>
> >>>>>>>>  NB. first kadane implementation
> >>>>>>>>  locmax =: __
> >>>>>>>>  timespacex '>./ 1 kadane\ranvec'
> >>>>>>>> 0.004034 1.41251e6
> >>>>>>>>
> >>>>>>>>  NB. fork kadane version
> >>>>>>>>  locmax =: __
> >>>>>>>>  timespacex '>./ 1 kadane1\ranvec'
> >>>>>>>> 0.005267 1.41277e6
> >>>>>>>>
> >>>>>>>>  NB. new dnfs anonymous kadane function
> >>>>>>>>  locmax =: __
> >>>>>>>>  timespacex '>./ 1 {{locmax =: y >. locmax + y}}\ranvec'
> >>>>>>>> 0.003625 1.41347e6
> >>>>>>>>
> >>>>>>>>  NB. fork dnfs anonymous kadane function
> >>>>>>>>  locmax =: __
> >>>>>>>>  timespacex '>./ 1 {{locmax =: (] >. locmax&+) y}}\ranvec'
> >>>>>>>> 0.004776 1.41386e6
> >>>>>>>>
> >>>>>>>>  NB. Fold kadane implementation
> >>>>>>>>  timespacex '>./ __ ] F:. ([ >. +) ranvec'
> >>>>>>>> 0.017393 905792
> >>>>>>>>
> >>>>>>>> This was all part of a J Tutorial I have written
> >>>>>>> https://code.jsoftware.com/wiki/User:Tom_McGuire/KadaneAlgorithmTutorial
> >>>>>>>> Tom McGuire
> >>>>>>>> ----------------------------------------------------------------------
> >>>>>>>> For information about J forums see 
> >>>>>>>> http://www.jsoftware.com/forums.htm
> >>>>>>> ----------------------------------------------------------------------
> >>>>>>> For information about J forums see http://www.jsoftware.com/forums.htm
> >>>>>>>
> >>>>>> ----------------------------------------------------------------------
> >>>>>> For information about J forums see http://www.jsoftware.com/forums.htm
> >>>>>>
> >>>>> ----------------------------------------------------------------------
> >>>>> For information about J forums see http://www.jsoftware.com/forums.htm
> >>>> ----------------------------------------------------------------------
> >>>> For information about J forums see http://www.jsoftware.com/forums.htm
> >>> ----------------------------------------------------------------------
> >>> For information about J forums see http://www.jsoftware.com/forums.htm
> >> ----------------------------------------------------------------------
> >> For information about J forums see http://www.jsoftware.com/forums.htm
> >
> > ----------------------------------------------------------------------
> > For information about J forums see http://www.jsoftware.com/forums.htm
>
> ----------------------------------------------------------------------
> For information about J forums see http://www.jsoftware.com/forums.htm
----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm

Re: [Jprogramming] kadane algorithm vs brute force timing and memory

Reply via email to