Re: [M100] Loading cross platform .CO files

Brian K. White Sun, 08 Mar 2026 17:17:21 -0700

On 3/8/26 15:35, B 9 wrote:

On Sun, Mar 8, 2026 at 6:03 AM Stephen Adolph <[email protected]<mailto:[email protected]>> wrote:
    Two extensions that would be nice to have.  What do you think?

    1.  Some way to identify and adjust addresses to enable relocatable
    code.  I know teeny has this feature.  Can we have another encoding
    element for that?
/Having/ an encoding element is easy since there are so many “invalid”codes — for instance, !r could be prefixed before two byte addresses.
/Using/ it, however, is another matter. As Brian points out, the programto generate such code may be tricky for arbitrary programs. I don’t knowmuch about doing that yet, but it seems to me you’d need to start fromthe source assembly code, not just a .CO file. Also, there’s a beauty tothe encoding right now: there is exactly one escape character (|!|) andit does exactly one thing (flip the high bit of the next character). Iknow all real-world programs differ from the crystalline beauty of theoriginal concept, but there’s also something to be said for havingsmall, sharp tools that do only one thing but do them remarkably well.
An optional extension would also be contrary to the direction Brian washeading, with his data and loader being separable. My program alreadyties the loader and data together as I’m just seeing this encoding as away to transmit a file to a Model T, not an archival format, but Irecognize the loss of generality. .
Investigating HXFER <https://github.com/LivingM100SIG/Living_M100SIG/blob/main/M100SIG/Lib-10-TANDY200/HXFER.DOC> is on my to do list.Instead of marking the changes needed for relocation inline, as we wouldbe doing, it simply appends bytes at the end of a hex file. While it isinefficient compared to your !-encoding, it has some very nice featuresincluding being future-proof: I was able to easily convert the HXFERfiles to .CO without knowing anything about HXFER by simply using astandard hex-to-binary tool (|xxd -r -p|) and truncating the file theLEN specified in the header. Will a future computer-archaeologist curseus for inventing a new format? I think not since, unlike HXFER, ourmethod has the benefit of including the decoder with the data, but it’sworth considering.
    2.  Adjust ROM calls per platform.   This might require another
    method altogether.  Maybe there would need to be a library of calls
    and some code to identify the call.  More ambitious.
Not as crazy as it sounds. I’ve been struck repeatedly by how theofficial ROM calls have persisted from the Kyotronic 85 to its kin,usually with just the address changed. The main incompatibility that Irecall so far is the RS232 calls for the NEC PC-8201A which have a verypeculiar initialization string. That said, it is definitely moreambitious and probably shouldn’t even be part of the encoding. At leastnot at first. I’d want to see a proof of concept that’s able to converta non-trivial program (from assembly source code) to .CO files for morethan one machine. I see Model 100 / Tandy 200 conversion as the mostlikely to be successful. I believe Kyotronic to Olivetti M10 should befairly similar, too.
I’d also been considering using |!| followed by digits to represent RunLength Encoding (RLE) of the next non-digit character, similar to theSixel image protocol. However, that’s probably not a terribly usefulthing to do given that very few .CO files have the same byte repeatedmore than three times.
I’m not quite ready to work on any of these extensions yet as I’d liketo finish up my loader (100% functional, proper sanity checks,documentation, and testing). Please bring up relocating in the encodingand automagic translation again as they may get more traction in mybrain later.
    Also have we landed on an agreed encoding?

Until we find the next hiccup, I think yes. The main features I believe are:

  * All characters represent themselves except |!| and a character
    preceded by |!|.
  * A character preceded by |!| will have its high-bit flipped and the
    |!| discarded.
  * All Model T computers share a set of characters which should be encoded:
      o A literal |!| (bang, 33) is encoded as characters 33 and 161,
        which display as |!à| on a Model 100. ⁰
      o |"| (double quote, 34) confuses BASIC when included in DATA
        statements.
      o |^Z| (control-Z, 26) signals the End Of File and cannot be
        received over the serial port to a text file.¹
      o All control characters (anything less than |Space|, 32) are
        removed by the BASIC tokenizer with the exception of |Tab|.²
      o |DEL| (delete, 127) is removed when a file or program is opened
        with EDIT.
⁰ Other Kyotronic kin may not show anything for “high ASCII”, but thecharacters are preserved.
¹ In fact, after a ^Z, the rest of the file will be missing!


 ² For simplicity, Brian’s co2ba also encodes |Tab| and |Space|, but

that’s still valid by this encoding.


Actually I stopped doing that since it's silly.

The code is nothing at all, and actually more generic without usingdirect selection logic like "is the value less than 34".

It's far more useful to have it ask "is the value in the unsafe list ?"and then the list can change any time for any reason in any way withoutchanging the user needing to change the code.

So the unsafe list, a special option that's just a shorthand for adding127 to the unsafe list, the choice of "!", the xor value, the length ofthe output lines, the line numbering, are all actually configurablesimply because there is no reason not to.

Like if I hadn't known about the special case of 127 because it neverbit me personally, and so I never added the EDITSAFE option just forthat, the script would still work for someone else who discovered theyneeded that. They could just use the UNSAFE option to customize the listof unsafe values.

One thing that's fixed right now is the fact that the shift operation isxor and not whatever you want like the way I originally just had +/-64

But for instance, xor 64 actually works too, and has the property that127 doesn't become 255, nor would 255 become 127. I guess that doesn'tmatter but I just always thought 255 was sometimes a problem too, maybein other venues that need to handle the file. I guess the utility ofbeing able to change the xor value would be if you need to encodesomething that would end up making another value you can't have, so youcan shift the whole mess around and find something that works for yourparticular case. like idk some old system 7 mac software that forwhatever reason doesn't like some byte, or maybe doesn't like some byte*combo* like it interprets it. Like in fact bash by default interprets !itself when you type it in manually or paste into a command line. So Ifyou were going to be dealing with encoded data on the command line, thatis a case where you might wish it were not any of the meaningful bashcharacters.

I keep changing my mind about some things. Originally I was includingthe ! and the 64 or 128 in the header line so that the header says howto decode the payload along with it's size and exe address and name.Then made those not configurable and removed them from the header. Butthey are configurable again and so they should probably be consideredmetadata and be defined in the header. (the header data line)

I do consider the user free to generate a loader with whatever encodingoptions they may want for whatever their own reasons may be. So as faras I'm concerned there doesn't really need to be a religious consensusfor the payload format, because the payload and it's decoder aregenerated together. It's good enough for me that the generator is itselfa published and easily available thing after the fact, unlike so many ofthe exotic loaders that currently exist. Sure they technically includethe code to decode, but the best ones include a binary blob that isessentially inscrutable. It exists and is physically possible to traceit's execution with a 80c85 datasheet and m100 reference manual...



I also now added a METHOD option that generates different encoding types.

So you can generate a loader that uses the quasi hex pair method JamesYio and Kurt McCullum use.


And the even cruder simple direct csv ints.

There is not really any need I can see to use them for real, but it'sinteresting to generate the exact same payload the different ways wherethat is the only difference, just to compare them.

Basically they all end up taking the exact same full run time, becausethe faster ones are exactly offset by there increased transfer time(though that may not be true if you're not transferring as slow as I am)But of course the smaller storage size is an important difference so thenew way is definitely the way to go.

METHOD A (default) = What do we call it? I was going to call it Adolphencoding, or "the quite ok encoding" (a joke referencing an imagecompression that came out a few years ago), but actually it's the resultof all 3 of us at this point. We aren't doing exactly what Steve was andwe each changed something. I wrote Adolph/B9/White in a comment in thescript just to have some sort of label and get both your names in there.


B = Identical data as A but the loader code is implemented a different way.

H = hex pairs - but like James Yi and Kurt McCullum use, where thealphabet is like a-p and treated as "byte-97*256 + nextbyte-97".

I = The good old practically no code required ints. Just read & poke ina loop once for every byte. The loop iterator is just the addressitself. The entire loop is just the tail end of the first and only lineof code! But each byte takes 2 to 4 bytes...

And so at some point I hope to be able to add other methods like a fancymachine language option based on whatever you develop, and that shouldprobably also advance over time to include yet more options to useactual compression.

I had thought about RLE too. I think it's been shown to more than payfor itself even though it's so simple and doesn't usually gain much.It's like low hanging fruit so I was probably going to do it sometimejust to see how it works out.

Basically it's cool to have a loader generator instead of a loader, soyou can spit out variations and test & compare all kinds of thingseasily. It's more work to get from zero to the first level of indirectlygetting loader generator produce the final output you want (vs say justhandcrafting a loader for one file directly)but once you do have a generic loader generator working even just forthe simple easy cases where the binary isn't weird, it's easy to addtiny incremental additions to that and after a while you really havesomething cool without ever having to invest in some big project thatwouldn't seem worth it.

Note that all high-ASCII codes are preserved and do not need to be escaped.
The main unknown about this encoding is what to refer to it as. We cankeep calling it Stephen Adolph’s encoding, since you suggested it, but I


hah! what was I just saying... hehe

know I personally wouldn’t like having any idea named after me unless Iwas done coming up with new ideas. I have a feeling you’re not done yet,so in my head I’ve been calling it !-encoding (pronounced “bangencoding”). What do you think it should be called?

bangcode/!code works for me even though my generator will spit outanything you want in place of !. It needs to be short, so describing allthe actual variable details is out. We could be bold and claim t-codefor the rest of time. The encoding needed for Model T's, because it'sthe list of which are the illegal bytes that's different. Or A-code.WHat encoding is that? Oh it's just a code.

Maybe it could be the first ever legitimate reason to say "exscape"since it's exclamation escape coding.

I just realized... when using ! in particular, the coding is also quiteliteral. As you read the data, !a is in fact literally not a.

I was thinking of using space as the prefix for relocatableplaceholders. It sounds bad but one it's one of the other free valuesbelow 34 and better than tab, and I think I kind of like the idea thatit would make all the relocate objects stand out. tab would actuallymaybe be even better for that, not just because they stand out evenmore, but because it would mean all the normal tabs would get encodedand collapsed.


Then maybe space could be for rle?



--
bkw

Re: [M100] Loading cross platform .CO files

Reply via email to