According to [EMAIL PROTECTED]:
> > > 8:12:1:http://www.senga.org/uri/html>: not found
> ..
> >
> > The CVS version is much more lenient about URLs. If you read the
> > messages, it's trying to connect to the URLs
> > "http://www.senga.org/uri/html>" or "http://www.senga.org/support.html>"
> > which are incorrect links.
>
> I thing this is because the quotes are missing :
>
> <a href=uri/html>uri</a>
>
> Do you think htdig should permanently consider this an incorect href ?
> If so it will have troubles with a lot of existing web sites.
I had a feeling this might crop up after the changes to HTML.cc. Here's
the fix, which I just committed to the CVS source tree:
Fri Jul 16 13:04:27 1999 Gilles Detillieux <[EMAIL PROTECTED]>
* htdig/HTML.cc(parse): fix to prevent closing ">" from being passed
to do_tag().
Index: htdig/HTML.cc
===================================================================
RCS file: /opt/htdig/cvs/htdig3/htdig/HTML.cc,v
retrieving revision 1.48
diff -u -p -r1.48 HTML.cc
--- htdig/HTML.cc 1999/07/13 20:58:06 1.48
+++ htdig/HTML.cc 1999/07/16 17:19:49
@@ -276,9 +276,9 @@ HTML::parse(Retriever &retriever, URL &b
q = (unsigned char*)strchr((char *)position, '>');
if (!q)
break; // Syntax error in the doc. Tag never ends.
- tag = 0;
- tag.append((char*)position + 1, q - position);
position++;
+ tag = 0;
+ tag.append((char*)position, q - position);
while (isspace(*position))
position++;
if (!in_space && spacebeforetags.CompareWord((char *)position)
@@ -328,8 +328,9 @@ HTML::parse(Retriever &retriever, URL &b
q = (unsigned char*)strchr((char *)position, '>');
if (q)
{
+ position++;
tag = 0;
- tag.append((char*)position + 1, q - position);
+ tag.append((char*)position, q - position);
do_tag(retriever, tag);
position = q+1;
}
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the SUBJECT of the message.