Plucking stdin

2001-11-03 Thread Bill Janssen
So here's what I'm planning: Read all the stdin till we hit end-of-file, treat it as whatever type is specified on the command line, and process it as the home document. Probably need two new command-line switches, --stdin-type=foo and --stdin-url=foo, to allow you to specify the stdin
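
A minimal sketch of the plan described above, not the actual Plucker parser code. The two switch names come from the message; the function names, the default type of text/html, and the returned dictionary layout are assumptions made for illustration.

```python
import argparse


def build_arg_parser():
    """Parser for the two proposed switches. Defaults are assumptions."""
    parser = argparse.ArgumentParser(
        description="Sketch: treat stdin as the home document")
    parser.add_argument("--stdin-type", default="text/html",
                        help="content type to assume for the stdin data")
    parser.add_argument("--stdin-url", default=None,
                        help="URL to associate with the stdin document")
    return parser


def read_home_document(stream, argv):
    """Read the stream until end-of-file and tag it with type and URL."""
    args = build_arg_parser().parse_args(argv)
    data = stream.read()  # read() with no size argument returns all data up to EOF
    return {"url": args.stdin_url, "type": args.stdin_type, "data": data}


# Typical use (hypothetical): read_home_document(sys.stdin.buffer, sys.argv[1:])
```

Reading from a binary stream keeps the bytes untouched, so any character-set handling can be decided later from the declared type.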

Re: Support for Japanese characters.-2

2001-11-03 Thread Bill Janssen
Nori, Is this for text documents, or do you also process S-JIS HTML? Does the parser work OK on S-JIS HTML? Bill
> Hi Bill, At Wed, 31 Oct 2001 15:59:29 PST, Bill Janssen wrote:
>> There's a question here of what the output character set should be for a Plucker doc, something we really

behavior of missing tag?

2001-11-03 Thread Bill Janssen
I see that a missing tag (for example, a URL like http://foo.bar.com/bletch.html#tag when there's no such tag in bletch.html) is currently handled by pointing the link to the beginning of the page which presumably would have contained the tag (in the above example, to paragraph 0 of bletch.html).
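
The two ways of handling a missing tag discussed in this thread can be sketched roughly as follows. This is an illustration only, not the parser's real code: resolve_link, anchors_by_page, and exclude_missing are hypothetical names, and the (page, paragraph index) result is an assumed representation.

```python
from urllib.parse import urldefrag


def resolve_link(url, anchors_by_page, exclude_missing=False):
    """Map a URL to (page, paragraph index), or None if the link is excluded.

    anchors_by_page maps a page URL to a dict of {anchor name: paragraph index}.
    """
    page, fragment = urldefrag(url)
    anchors = anchors_by_page.get(page)
    if anchors is None:
        return None  # the page itself does not exist, so the link is excluded
    if not fragment:
        return (page, 0)  # plain link to the page: start at the top
    if fragment in anchors:
        return (page, anchors[fragment])
    # The page exists, but the named tag does not.
    if exclude_missing:
        return None  # alternative policy: treat the URL as excluded
    return (page, 0)  # fallback policy: jump to paragraph 0
```

With exclude_missing left at False the link lands on paragraph 0 of the page; setting it to True drops the link, just as for a URL that doesn't exist.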

Re: behavior of missing tag?

2001-11-03 Thread David A. Desrosiers
> This is extremely counter-intuitive. The user is jumped to some place that may be wildly out of context (for a long page). I'm going to change the parser to treat such URLs as excluded, just as with any other URL which doesn't exist.
I have seen pages do this *INTENTIONALLY* by

Re: While we're gutting the python parser...

2001-11-03 Thread David A. Desrosiers
> I'm still not sure how this would work. The user already sets the font preferred to read documents in via the Preferences panel.
No, they don't set the font, they *CHANGE* the font via the Preferences panel. What I'm suggesting is that we have a way to set a default font for that pdb

Plucking Slate.com, a Python example

2001-11-03 Thread Bill Janssen
The MSN change has affected Slate.com, an online magazine owned by MS. The re-styling is so bad that I figured I'd start plucking it instead of looking at it in a browser. Unfortunately, it's in UTF-8 and XHTML, and contains a number of the standard odd characters. I wrote a little csh/Python
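
The script itself is not shown in this snippet, so the following is an independent sketch of one way to tame UTF-8 input with "odd characters", not a reconstruction of it. It assumes the target is ISO-8859-1; the substitution table and the function name are made up for illustration.

```python
# Hypothetical substitution table: typographic characters that have no
# ISO-8859-1 equivalent, mapped to plain ASCII stand-ins.
SUBSTITUTES = {
    "\u2018": "'",    # left single quotation mark
    "\u2019": "'",    # right single quotation mark
    "\u201c": '"',    # left double quotation mark
    "\u201d": '"',    # right double quotation mark
    "\u2013": "-",    # en dash
    "\u2014": "--",   # em dash
    "\u2026": "...",  # horizontal ellipsis
}


def utf8_to_latin1(raw):
    """Decode UTF-8 bytes, replace known odd characters, encode as ISO-8859-1."""
    text = raw.decode("utf-8")
    text = text.translate(str.maketrans(SUBSTITUTES))
    # Anything still outside ISO-8859-1 becomes a numeric character reference,
    # which an HTML parser can still interpret.
    return text.encode("iso-8859-1", errors="xmlcharrefreplace")
```

Using numeric character references for the leftovers avoids losing characters silently, at the cost of leaving the decision to whatever parses the HTML afterwards.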

Re: Support for Japanese characters.-2

2001-11-03 Thread Nori Kanazawa
Hi Bill, At Fri, 2 Nov 2001 19:31:30 PST, Bill Janssen wrote:
> Is this for text documents, or do you also process S-JIS HTML? Does the parser work OK on S-JIS HTML?
What do you mean by S-JIS HTML? # Sorry, I am not very familiar with HTML. -- Nori

Color support in CVS

2001-11-03 Thread Robert O'Connor
Code has been added to CVS, for the viewer and the parser to support color elements. A somewhat difficult merge--probably missed at least one thing somewhere along the way. If someone could test it out and see what things need correcting, especially that the charset work by Bill still is

Re: behavior of missing tag?

2001-11-03 Thread MJ Ray
http://foo.bar.com/bletch.html#tag but there's no tag in bletch.html) is handled is to point the link to the beginning of the page which presumably would have contained the link (in the above example, to paragraph 0 of bletch.html). [...] change the parser to treat such URLs as excluded,