newsclipperdevlist  

Re: Possible Bug in GetText?

David Coppit
Wed, 7 Jul 1999 04:56:04 -0700


On Tue, 6 Jul 1999, Shalom Craimer wrote:

> When I call GetText on a document, it says: "[TABLE NOT SHOWN]"
> I think this is TreeBuilder's fault (or the conversion back to text). You
> see, when I call GetHtml with the same parameters, I get a partial table
> (without a last </TR> and </TABLE>), which is what I want (all the text in
> that zone is what I care about, nothing more). So I guess GetText gets
> mixed up, either trying to display a table in text and failing because the
> table isn't "closed", or just failing because it sees a table.

Yeah, GetText uses TreeBuilder & FormatText to remove tags and do minimal
formatting, and FormatText doesn't support HTML tables, so it prints
"[TABLE NOT SHOWN]" in their place. :( Are you trying to get the table in
textual form? Remember, GetText is meant to grab non-HTML text, or to
convert HTML to plain text. Anyway, GetHtml followed by some form of
StripTags is probably the right thing to do here.
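
To illustrate the "get the HTML, then strip the tags" idea, here is a minimal
sketch. It is written in Python with only the standard library, not in News
Clipper's own Perl, and it is not the News Clipper StripTags handler. The
names TagStripper and strip_tags, and the choice to join cells with " | ",
are assumptions made for this example. It shows that a tag stripper can keep
the text of a table even when the closing </TR> and </TABLE> are missing.

```python
import re
from html.parser import HTMLParser


class TagStripper(HTMLParser):
    """Collect text content and drop tags (hypothetical helper class)."""

    # Assumption: these tags start a new line of output.
    LINE_BREAK_TAGS = {"table", "tr", "p", "br", "div"}
    # Assumption: these tags start a new table cell.
    CELL_TAGS = {"td", "th"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_starttag(self, tag, attrs):
        # Only opening tags matter here, so unclosed rows or tables are fine.
        if tag in self.LINE_BREAK_TAGS:
            self.parts.append("\n")
        elif tag in self.CELL_TAGS:
            self.parts.append("\t")

    def handle_data(self, data):
        # Collapse whitespace runs so source newlines do not split rows.
        self.parts.append(re.sub(r"\s+", " ", data))


def strip_tags(html):
    """Return the text of an HTML fragment, one line per row or block."""
    parser = TagStripper()
    parser.feed(html)
    parser.close()  # flush any text still buffered at end of input
    lines = []
    for line in "".join(parser.parts).splitlines():
        cells = [" ".join(cell.split()) for cell in line.split("\t")]
        cells = [cell for cell in cells if cell]
        if cells:
            lines.append(" | ".join(cells))
    return "\n".join(lines)


if __name__ == "__main__":
    # A partial table like the one described: no final </TR> or </TABLE>.
    partial = "<table><tr><td>Name</td><td>Score</td></tr>\n<tr><td>Ann</td><td>42</td>"
    print(strip_tags(partial))
```

Because the sketch reacts only to opening tags, it never needs the table to be
"closed", which is the case that trips up a formatter that tries to lay out
the whole table.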

David

_________________________________________________________________________
David Coppit - Graduate Student        [EMAIL PROTECTED]      
The University of Virginia             http://coppit.org/
    "Yes," said Piglet, "Rabbit has Brain." There was a long silence.
"I suppose," said Pooh, "that that's why he never understands anything."


