Re: Standalone html parser

2001-06-29 Thread Hrvoje Niksic
Anees Shaikh [EMAIL PROTECTED] writes: I'm trying to use the code in html-parse.c (v1.7) in standalone mode Excellent! For some reason, img src=... tags are recognized but then skipped almost every time they are encountered. When using the full program and recursive retrieve, the images

Re: Q: (problem) wget on dos/win: question marks in url

2001-06-29 Thread Rick Palazola
Did you try putting quotes around the URL? Don't have time to test at the moment, but my memory is that I used quoted URL in (this or another) DOS app and it did work. Rick On Fri, 29 Jun 2001, Reto Kohli wrote: hello list, i ran across a problem today which is so obvious that i'm

Re: Q: (problem) wget on dos/win: question marks in url

2001-06-29 Thread Ian Abbott
On 29 Jun 2001, at 8:13, Rick Palazola [EMAIL PROTECTED] wrote: On Fri, 29 Jun 2001, Reto Kohli wrote: consider the following wget call (and think dos:) wget http://mydomain.org/index.html?foo=bar well? -- dos (and your average windows, too) does of course not allow you to write

Re: Standalone html parser

2001-06-29 Thread Anees Shaikh
So I think the problem is with malformed img tags. The parser fails if the tag is of this form: img src=/library/homepage/images/curve.gif alt= border=0 / Note the end of the tag is closed with / instead of just as in the spec. When the parser finds the / it thinks it sets

Re: Standalone html parser

2001-06-29 Thread Hrvoje Niksic
Anees Shaikh [EMAIL PROTECTED] writes: So I think the problem is with malformed img tags. The parser fails if the tag is of this form: img src=/library/homepage/images/curve.gif alt= border=0 / [...] This problem with img tags seems to be quite common (redhat.com, ibm.com,

Re: xhtml? Re: Standalone html parser

2001-06-29 Thread Hrvoje Niksic
Anees Shaikh [EMAIL PROTECTED] writes: Hrvoje, you mentioned that you planned to modify the parser to handle these tags. Any ideas on timetable? How about now? :-) I have created a simple patch that deals with this. However, preliminary testing indicated a problem with the semantics.