If I understand correctly, you're ultimately looking for a general way
that you can write this kind of record processing code simply in the
future. And that, right now, you're investing some one-time
experimental effort to assess feasibility and to find an approach and
guidelines that you can apply more simply in the future.
Regarding feasibility, unless I misunderstand this pilot application, I
think that it could be done in Racket in a way that scales almost
perfectly with each additional core, limited ultimately only by
filesystem I/O. That might involve Places, perhaps with larger work
units or more efficient communication and coordination; or, as I think
you said earlier, you could simply fork separate processes after
partitioning (just via file positions) the input file. These approaches
could get towards the shallow end of what people do when writing really
performance-sensitive systems, like some Internet or transaction
processing servers (or database engines, or operating system kernels) --
they are not necessarily simple. Though you can generalize the hard
work for simple future use (e.g.,
`for-each-bytes-line-from-file/unordered/indexes`,
`for-each-foo-format-record`, `query-from-foo-format-file-to-bar-file`,
`define-foo-file-transformation`, etc.). Sometimes people on this list
will spend 5-30 minutes figuring out a code solution to a problem posted
on the list, but harder performance-sensitive stuff can take days or
longer to do well, and it has to be done well, by definition.
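As one concrete sketch of the partition-by-file-position idea above
(every name here is invented for illustration, and it assumes
newline-delimited records):

```racket
#lang racket/base

;; Hypothetical sketch: compute `n` start positions for a
;; newline-delimited file, each aligned to the byte just after a
;; newline, so that separate forked processes (or Places) can each
;; handle one range independently.
(define (partition-positions path n)
  (define size (file-size path))
  (call-with-input-file path
    (lambda (in)
      (cons 0
            (for/list ([i (in-range 1 n)])
              ;; Seek to an evenly-spaced guess, then advance past the
              ;; next newline so the range starts on a record boundary.
              (file-position in (quotient (* i size) n))
              (void (read-bytes-line in))
              (file-position in))))))
```

Each returned position begins a range that one worker would scan up to
the next position; in real use you'd also deduplicate positions, since
two guesses can land inside the same line near end of file.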
Going back to "simply", rather than
simply-after-upfront-hard-work-done-by-application-programmer, maybe
there's opportunity for
simply-after-further-hard-work-done-by-core-Racket-programmers... For
example, perhaps some of the core Racket string routines could be
optimized further (I imagine they already got a working-over when
Unicode was added), so that even simple programs run faster. And maybe
there are Places facilities that could be optimized further, or some new
facility added. And maybe there's a research project for better
parallelizing support.
BTW, there might still be a few relatively simple efficiency tweaks in
your current approach (e.g., while skimming, I think I saw a snippet of
code doing something like `(write-bytes (bytes-append ...))`, perhaps to
keep a chunk of bytes contiguous so that it isn't interleaved with other
threads' writes).
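One such tweak, sketched here under the assumption that the
`bytes-append` exists only to make the write atomic: guard the output
port with a semaphore and write the pieces directly, which avoids the
intermediate copy (`write-record` and `out-sema` are made-up names):

```racket
#lang racket/base

;; Serialize access to the shared output port, so several `write-bytes`
;; calls from one record can't interleave with another thread's writes,
;; without first allocating a combined byte string via `bytes-append`.
(define out-sema (make-semaphore 1))

(define (write-record out . chunks)
  (call-with-semaphore out-sema
    (lambda ()
      (for ([c (in-list chunks)])
        (write-bytes c out)))))
```

Whether this wins depends on contention: under heavy contention the
single allocated-and-appended write can be cheaper than holding the
semaphore across several writes, so it's worth measuring both.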
> If the work was parsing HTML or JSON, then the places version would
> probably be worth it on a 4 core machine.
For HTML and JSON parsing, unlike your records application, I think the
parser itself has to be one thread, but you could probably put some
expensive application-specific behavior that happens during the parse in
other threads. Neither my HTML parser nor my JSON parser was designed to be used
that way, but my streaming JSON parser might be amenable to it. The HTML
parser is intended to build a potentially-big AST in one shot, so no
other threads while it's working, though it should be reasonably fast
about it (it was written on a 166MHz Pentium laptop with 48MB RAM,
usually on battery power).
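For the streaming-JSON case, the offloading might take roughly the
following shape. This sketch uses ordinary Racket threads and an async
channel, which shows the coordination but not true parallelism; to use
another core, the same pattern would move to a Place and place channels.
`handle-event` and the parser-side calls are stand-ins, not the actual
parser's API:

```racket
#lang racket/base
(require racket/async-channel)

;; A single parser thread hands each parsed event to a worker over an
;; async channel, so expensive application-specific work happens off
;; the parsing thread.
(define events (make-async-channel))

(define (handle-event ev)  ; hypothetical expensive application hook
  (void ev))

(define worker
  (thread
   (lambda ()
     (let loop ()
       (define ev (async-channel-get events))
       (unless (eof-object? ev)
         (handle-event ev)
         (loop))))))

;; Parser side: (async-channel-put events ev) for each parsed event,
;; then (async-channel-put events eof), and finally (thread-wait worker).
```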
Neil V.
--
You received this message because you are subscribed to the Google Groups "Racket
Users" group.