On Wed, 2008-02-20 at 17:23 -0500, Ben Scott wrote:
> On Wed, Feb 20, 2008 at 3:58 PM, Alex Hewitt <[EMAIL PROTECTED]> wrote:
> >  A friend of ours wrote a bunch of recipe files using something called
> >  Microsoft Write.
> 
>   Yah, "Windows Write" is/was one of the "accessories" that came with
> Windows 3.x.  It morphed into "WordPad" in Windows 95 and later.
> WordPad still exists.  It won't write the Write (hah) format anymore,
> but it can read it, and save in some variant of the RTF format.
> 
> >  Theoretically Microsoft Word is supposed to be able to read such files
> >  but I found that the version I was using (Word 2003) wouldn't.
> 
>   Curious.  My install of Word 2003 can.  Are you sure you installed
> all the import/export filters?  If you did a "Minimal" or "Custom"
> install (instead of the mondo-huge "Full"), I don't think those are
> all included by default.

I sit corrected! ;^) Word 2003 complains about the file saying in effect
"this might be a virus but I have a converter that will convert it" and
it does. I think the original reason I wrote the filter was because our
friend didn't have Word and I didn't want to manually edit her 83 files.
I'll see if Word can be called from the command line to do the
converting.

-Alex

> 
> > Writing a filter in Python was trivial and I was  able to convert the
> > files to plain text.
> 
>   For future reference, the strings(1) command can be used to much the
> same effect.
> 
> > ... the file itself in ASCII, a series of bytes  again in non-ASCII,
> > followed by a repeat of some of the original ASCII.
> 
>   That sounds very similar to the MS Word .DOC format, and I bet
> they're related.  DOC files do not interleave the formatting with the
> text, as (for example) HTML or Word Perfect did.  Instead, all the
> plain text is stored in one blob, and then the formatting information
> is stored in a different blob.  The formatting directives have
> "pointers" to the position of the text they effect.
> 
>   The "repeat" you describe is not actually a repeat, but a follow-on
> save.  Word and friends work in an interesting fashion.  You open the
> file, and it loads the base text blob described above.  You start
> making your changes.  Those changes go into an undo buffer.  That undo
> buffer is actually backstored on the disk in temporary files.
> (That's why a directory containing Word files people are busy editing
> accumulates lots of odd temp files until they close the original.)
> 
>   When you invoke "Save", the undo buffer -- essentially like a "diff"
> -- gets tacked on to the end of the main file.  This made saves fast
> on slow computers already overburdened by Microsoft bloatware.  Loads
> were slower, of course, but the reasoning was that people care about
> save speed more than load speed.    As you can imagine, if there are
> lots of saves, rebuilding the text is not so easy as running
> strings(1) on it.
> 
>   In Word, if you turn off "Fast Saves", it writes out a full, unified
> version of the text instead.  This became the default at some point --
> I have no idea when.
> 
> > But the interesting thing was that I  couldn't easily find a Microsoft tool 
> > that
> > understood the format which originated with Windows 95 or an earlier version
> > of Windows.
> 
>   Start -> Programs -> Accessories -> WordPad
> 
>   My copy of Win XP Pro opens .WRI files automatically in WordPad.  I
> just double-click the file.
> 
>   WordPad is an optional component for Windows.  Perhaps the computer
> was installed with a "minimalist" attitude, so various optional tools
> were not there when you needed them?
> 
> -- Ben
> _______________________________________________
> gnhlug-discuss mailing list
> gnhlug-discuss@mail.gnhlug.org
> http://mail.gnhlug.org/mailman/listinfo/gnhlug-discuss/

_______________________________________________
gnhlug-discuss mailing list
gnhlug-discuss@mail.gnhlug.org
http://mail.gnhlug.org/mailman/listinfo/gnhlug-discuss/

Reply via email to