So here's what I'm planning:
Read all of stdin until we hit end-of-file, treat it as whatever type
is specified on the command line, and process it as the home document.
Probably need two new command-line switches, --stdin-type=foo and
--stdin-url=foo, to allow you to specify the stdin
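A rough sketch of how the stdin plan above might look, purely for illustration. The two switch names come from the message; everything else (the helper names parse_stdin_options and read_home_document, the default values, and the dictionary layout) is an assumption and is not taken from the actual parser.

```python
import argparse
import sys


def parse_stdin_options(argv):
    """Parse the two proposed switches. The defaults are assumptions for this sketch."""
    parser = argparse.ArgumentParser(description="stdin-as-home-document sketch")
    parser.add_argument("--stdin-type", default="text/html",
                        help="MIME type to assume for the data read from stdin")
    parser.add_argument("--stdin-url", default="stdin:",
                        help="URL to assign to the document read from stdin")
    return parser.parse_args(argv)


def read_home_document(stream, options):
    """Read the stream to end-of-file and bundle it with its declared type and URL."""
    data = stream.read()  # read() with no size argument reads until EOF
    return {"url": options.stdin_url, "type": options.stdin_type, "data": data}


def main(argv=None):
    """Hypothetical entry point: read binary stdin and report what was received."""
    options = parse_stdin_options(sys.argv[1:] if argv is None else argv)
    doc = read_home_document(sys.stdin.buffer, options)
    print("%s (%s): %d bytes" % (doc["url"], doc["type"], len(doc["data"])))
```

Reading from sys.stdin.buffer keeps the data as bytes, so no character set is assumed before the declared type has been looked at.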
Nori,
Is this for text documents, or do you also process S-JIS HTML? Does
the parser work OK on S-JIS HTML?
Bill
Hi Bill,
At Wed, 31 Oct 2001 15:59:29 PST,
Bill Janssen wrote:
There's a question here of what the output character set should be for
a Plucker doc, something we really
I see that a link to a missing tag (a URL such as
http://foo.bar.com/bletch.html#tag
where bletch.html contains no such tag) is currently pointed at the
beginning of the page which presumably would have contained the
tag (in the above example, paragraph 0 of bletch.html).
This is extremely counter-intuitive. The user is jumped to some place
that may be wildly out of context (for a long page). I'm going to
change the parser to treat such URLs as excluded, just as with any other
URL which doesn't exist.
I have seen pages do this *INTENTIONALLY* by
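For illustration only, here is one way the "treat a missing tag as excluded" check proposed above could be sketched. The class AnchorCollector, the function link_is_excluded, and the fetched_pages mapping are hypothetical names, not the parser's real code, and the sketch assumes that both <a name="..."> and id="..." attributes count as valid link targets.

```python
from html.parser import HTMLParser
from urllib.parse import urldefrag


class AnchorCollector(HTMLParser):
    """Collect every name that a '#fragment' could legitimately point at."""

    def __init__(self):
        super().__init__()
        self.anchors = set()

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            # Assumption: <a name="..."> and any element's id="..." are targets.
            if value and (name == "id" or (tag == "a" and name == "name")):
                self.anchors.add(value)


def link_is_excluded(url, fetched_pages):
    """Return True if the page, or the tag named in the URL, does not exist.

    fetched_pages is a hypothetical mapping from fragment-free URL to HTML text.
    """
    base, fragment = urldefrag(url)
    html = fetched_pages.get(base)
    if html is None:
        return True  # the page itself was never fetched
    if not fragment:
        return False  # a plain page link, nothing further to check
    collector = AnchorCollector()
    collector.feed(html)
    return fragment not in collector.anchors
```

With this policy a link to bletch.html#tag is dropped when bletch.html has no such tag, instead of silently landing on paragraph 0. Pages that use a missing tag on purpose would lose those links, so that case would need separate handling.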
I'm still not sure how this would work. The user already sets the font
preferred to read documents in via the Preferences panel.
No, they don't set the font, they *CHANGE* the font via the
Preferences panel. What I'm suggesting is that we have a way to set a
default font for that pdb
The MSN change has affected Slate.com, an online magazine owned by MS.
The re-styling is so bad that I figured I'd start plucking it instead
of looking at it in a browser. Unfortunately, it's in UTF-8 and
XHTML, and contains a number of the standard odd characters. I
wrote a little csh/Python
Hi Bill,
At Fri, 2 Nov 2001 19:31:30 PST,
Bill Janssen wrote:
Is this for text documents, or do you also process S-JIS HTML? Does
the parser work OK on S-JIS HTML?
What do you mean by S-JIS HTML?
# Sorry, I am not so familiar with HTML.
--
Nori
Code has been added to CVS, for the viewer and the parser to support color
elements.
A somewhat difficult merge--probably missed at least one thing somewhere
along the way.
If someone could test it out and see what things need correcting, especially
that the charset work by Bill still is