Re: How to Run a line of code in external program

Bruce Van Allen Wed, 13 Jan 2010 05:49:48 -0800

On 2010-01-12 at 11:31 AM, chriscorb...@gmail.com (Chris) wrote:

Just curious, are people also using the "shell worksheet" in BBEdit?


Short answer:

Yes.

Long answer:

Oh, yes.

Here's one of my uses:

I prepare voter data for election campaigns. I run large sets ofdata through a series of processes to make a useful "voter file"for campaign planning and voter contact.

The problem: how to manage a complex and time-consuming dataprocessing task, as follows:

Each county has its own format for voter registration data, anda county's data structures and database field names may changefrom one election to the next. Voter data churns constantly --people move, die, and change name, party, gender, and so on.Jurisdiction boundaries, precinct lines, and even zip code areaschange, too.

In this environment, each time I set up a voter file, I have tostart from the beginning, building from raw data about voters,streets, districts, past elections, and other information, anyof which might have changed in content or structure since thelast time I processed it.

A typical county's voter roll requires 17 processes, whichcumulatively clean, standardize, and cross-tabulate the datainto final form. Each process ends with tests, whose resultsmust be checked ("bio-optically" ;-) before the next process maybegin. Some times it's necessary to back up one or more steps inthe processing when a problem is found.

Running the complete processing series with no interruptionstakes five to eight hours (I can do other work during most ofthat time).

Is that enough of a problem statement? Add to it the obviousneed to keep written track of things both during the processingand between processing occasions.


My solution:

For each election, each county gets a data processing directoryinto which I copy a set of BBEdit shell worksheets, one for eachof the 17 processes, plus a few others.

Each worksheet is named for its process; the content of theworksheet is one or more lines of input arguments, followed bya call to the script the does the processing. When the script isexecuted, its output prints out on the worksheet.

For my processes, the output includes progress indicators asfiles are read or written, counts of things found, samplings ofin-process data, and finally the test results from the processand the paths to the data file(s) that the process yielded.

Here's an example of one of these worksheets, down to andincluding the line with #-#-#:


A='Project=OCT2009'
B='base_dir=/Volumes/Campaigns/2009/CO_01'
C='source_file=voter_tabs.txt'
D='criteria=all' # 'criteria=age<50'
E='crosstab=gender pty_group age_cohort zip'

perl /Volumes/LIB/make_cross_tab_summaries "$A" "$B" "$C" "$D" "$E"
#-#-#

The above worksheet sample uses a format that works with thestandard bash shell under OS X Snow Leopard. My scripts parsestandard input as name=value pairs.

Select all lines from A= down to and including the line with#-#-#. When you press Enter, the output will print below the#-#-# line. I have an Applescript that clears the sheet belowthe #-#-# line and then re-selects the top lines and #-#-# line,ready for me to press Enter again to re-run the process.

With its own dedicated shell worksheet, each process and itsinput parameters, progress reports, and outcomes may bereviewed, re-run, checked, and annotated for future reference.Multiple worksheets may be opened and their processes executedsimultaneously (assuming non-dependence).

There is only one copy, in a central library, of the actualscript for each processing step; it may be pointed to bymultiple shell worksheets each with its own parameters.

During script development, I start using the shell worksheet tocall the script from the very beginning. Reflecting this, thefirst line output from the scripts I'm describing here simplyshows that the script initialized and loaded its needed modules:

Tue Jan 12 19:21:29 2010 Initializing... Process 5396 usingBVA::XDATA 3.90, BVA::XUI 2.9, BVA::XACT 1.11,Spreadsheet::WriteExcel 2.25

If a script has a problem, warnings and error messages spill outdown the worksheet (yes, you can cancel a worksheet process),becoming breadcrumbs for the "warnings are friends" path back tofunctioning code.

Perl, the language I most enjoy working in, provides a strongset of debugging, profiling, and testing tools. I can invokethese with a few lines kept on the worksheet but normallycommented out. Again, the results from the profiler or testsuite print out on the worksheet for study.

There's more to how I do all this, but I think you can see thatthis satisfies the requirements of my problem statement quite well.

Most of what I describe could be done on the command line,especially by someone adept at using all the tools of thatenvironment to pipe output to files, capture warnings, tweakinput variables, etc.

But I've come to enjoy how handy it is to have an executableinvocation, together with its most recent input parameters,outputs, and system messages, plus comments and alternativeinputs, all encapsulated in a single file, which may be one ofseveral such files that together allow me to direct an entirelibrary of ruthless code at unsuspecting data.

All happening, of course, in an application highly likely toalready be running on any machine I'm working on, BBEdit.

Whoa, how did I just write so much? Consider that a sign of howmuch I appreciate shell worksheets for enabling me to handle oneof my crucial workflows so well.


Best,




   - Bruce

_bruce__van_allen__santa_cruz_ca_

-- 
You received this message because you are subscribed to the 
"BBEdit Talk" discussion group on Google Groups.
To post to this group, send email to bbedit@googlegroups.com
To unsubscribe from this group, send email to
bbedit+unsubscr...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/bbedit?hl=en
If you have a feature request or would like to report a problem, 
please email "supp...@barebones.com" rather than posting to the group.

Re: How to Run a line of code in external program

Reply via email to