On Tue, 20 Mar 2001, Joe R. Jah wrote:
> Date: Tue, 20 Mar 2001 14:28:54 -0800 (PST)
> From: "Joe R. Jah" <[EMAIL PROTECTED]>
> To: Geoff Hutchison <[EMAIL PROTECTED]>
> Cc: [EMAIL PROTECTED]
> Subject: Re: [htdig-dev] Re: HtRegexList problems
>
> On Mon, 19 Mar 2001, Geoff Hutchison wrote:
>
> > Date: Mon, 19 Mar 2001 12:38:54 -0500 (EST)
> > From: Geoff Hutchison <[EMAIL PROTECTED]>
> > To: [EMAIL PROTECTED]
> > Subject: [htdig-dev] Re: HtRegexList problems
> >
> >
> > OK, the patch at the end of this message should fix most of the problems
> > people have seen with the 3.2.0b4-031801 snapshot. I still cannot
> > reproduce Alexander Cohen's bug with HtRegexList:
>
> Yes it has solved, at least my problem on BSDI. I do not know how it has
> affected indexing speed, though; I'll know in a day or two;)
I take it back;( It just took a little longer; it went somewhat further
in the web tree before:
gdb htdig htdig.core
GNU gdb
This GDB was configured as "i386-unknown-bsdi4.0.1"...
Core was generated by `htdig'.
Program terminated with signal 11, Segmentation fault.
#0 0x18254139 in p_b_term ()
(gdb) bt
#0 0x18254139 in p_b_term ()
#1 0x18253d95 in p_bracket ()
#2 0x18254543 in bothcases ()
#3 0x182545bc in ordinary ()
#4 0x1825341d in p_ere_exp ()
#5 0x18252ffa in p_ere ()
#6 0x18252efd in regcomp ()
#7 0x18123409 in HtRegex::set (this=0xa616c00,
str=0xc038000
"http://www\\.ccsf\\.org/Guardsman/|http://www\\.ccsf\\.cc\\.ca\\.us/Guardsman/",
case_sensitive=0)
at HtRegex.cc:51
#8 0x18123a5e in HtRegexList::setEscaped (this=0x804633c,
list=@0x80463fc, case_sensitive=0) at HtRegexList.cc:77
#9 0x8053b56 in Retriever::IsValidURL (this=0x804744c, u=@0x9151d00) at
Retriever.cc:911
#10 0x8055b38 in Retriever::got_href (this=0x804744c, url=@0x9151d00,
description=0x91563c0 "Online Courses", hops=1)
at Retriever.cc:1419
#11 0x804f5d9 in HTML::do_tag (this=0x8258900, retriever=@0x804744c,
tag=@0x8258968) at HTML.cc:552
#12 0x804ea60 in HTML::parse (this=0x8258900, retriever=@0x804744c,
baseURL=@0x806fc00) at HTML.cc:324
#13 0x805372b in Retriever::RetrievedDocument (this=0x804744c,
doc=@0x8258200, url=@0x8046f1c, ref=0x8258800) at Retriever.cc:802
#14 0x80532dc in Retriever::parse_url (this=0x804744c, urlRef=@0x8258500)
at Retriever.cc:647
#15 0x80529af in Retriever::Start (this=0x804744c) at Retriever.cc:423
#16 0x8059ab5 in main (ac=5, av=0x80479bc) at htdig.cc:317
#17 0x804c800 in __start ()
(gdb)
Regards,
Joe
--
_/ _/_/_/ _/ ____________ __o
_/ _/ _/ _/ ______________ _-\<,_
_/ _/ _/_/_/ _/ _/ ......(_)/ (_)
_/_/ oe _/ _/. _/_/ ah [EMAIL PROTECTED]
_______________________________________________
htdig-dev mailing list
[EMAIL PROTECTED]
http://lists.sourceforge.net/lists/listinfo/htdig-dev