[Toybox] [CLEANUP] uuencode.c, pass 1, traditional encoding

Rob Landley Thu, 11 Apr 2013 21:44:19 -0700

Now let's look at traditional encoding. (Same bat-file, samebat-changeset. Oh _wow_ I'm dating myself with that reference, althoughin my defense I only saw it in reruns.)

uu_3bytes() does the same "shift input into an int, output encoded 6bit character" but the encoding is just add 32 to the value. (That'sthe space character, and the next 63 characters after that are allprintable, so...) Just a really quick cleanup pass on this function:remove curly brackets around single lines, and replace the assignmentinto out[] with an xputc().

uu_line(): take out the special case to print something for a length 0line (the standard doesn't require it). Instead wrap the whole thing inif (len > 0) line the b64_line() does. The big xprintf() went awaybecause 3bytes is outputting stuff itself.

Traditional uuencoded lines start with the length of the line, so thetuples are always 3 bytes encoded as 4 characters, and that initiallength tells you when to ignore bits of the end. That goes inside theif() statement, along with a simple for() loop calling uu_3bytes() onevery 3 bytes of output until we're done. (We don't care about fallingoff the end because we assume the input is big enough, and whatevertrailing garbage potentially winds up in those last couple bytes won'tget decoded at the far end due to the length saying not to.)

Hmmm, although now that I think about it this implies that encoding thesame file twice could produce slightly different results, even thoughthey decode to the same thing. But this would only actually happen if asignal handler called us and crapped out the stacknon-deterministically, and modern linux actually has a separate signalstack, so it can't actually happen. Otherwise buf[] contains either theprevious line or zero, deterministically. (That's black magic enoughI'm tempted to throw a memset() in there, but it's not worth the extracode.)

Oh wait, we pass along len all the way to uu_3bytes() so it'll onlyshift/load the bytes we give it into the integer output is producedfrom, it's zero initialized and the rest remain zeroed. So nevermind,already handled. :)


Where was I?

uuencode_uu() does the same inbuf/outbuf setup using toybuf, and againthe output buffer went away and we can replace both with a single inputbuffer on the stack. The size of the xread was 45 bytes, and the specsays:

The maximum number of octets to be encoded on each line shall be 45.

So that's actually already correct. Adjust the whitespace (I tend to dospaces after commas and statements that aren't function names like if() vs func(), and around assignment characters. Habit I picked upsomewhere, more important to be consistent than right with whitespace.I sometimes cheat and remove spaces to fit in 80 chars, but space aftercommas is less important than space after if or before curly brackets...


Finally we get to uuencode_main():

The variable declarations got redone based on the needs of the code inthe function, so let's skip that except to note that I renamedencode_filename to name because it was an unnecessarily long localvariable name. (A three line function needs less descriptive names thana twenty line function. I try not to use a name like "k" if the scopeit lives in is longer than 10 lines or so, but I do note that "i" as aloop index is tradithional! (Lightning strike! Yeah, discworldreference.)

So: toybuf gets filled with a base64 table via a loop. (I could do thatonly for -m but didn't bother, compared to the exec a for loopinitializing a 64-entry table with code that fits in a cache line istrivial.)

The remaining cleanups are whitespace and changing the name ofencode_filename to just name.


And that's pass 1!

Rob
_______________________________________________
Toybox mailing list
[email protected]
http://lists.landley.net/listinfo.cgi/toybox-landley.net

[Toybox] [CLEANUP] uuencode.c, pass 1, traditional encoding

Reply via email to