On Tue, 20 Mar 2001, Joe R. Jah wrote:

> Date: Tue, 20 Mar 2001 14:28:54 -0800 (PST)
> From: "Joe R. Jah" <[EMAIL PROTECTED]>
> To: Geoff Hutchison <[EMAIL PROTECTED]>
> Cc: [EMAIL PROTECTED]
> Subject: Re: [htdig-dev] Re: HtRegexList problems
> 
> On Mon, 19 Mar 2001, Geoff Hutchison wrote:
> 
> > Date: Mon, 19 Mar 2001 12:38:54 -0500 (EST)
> > From: Geoff Hutchison <[EMAIL PROTECTED]>
> > To: [EMAIL PROTECTED]
> > Subject: [htdig-dev] Re: HtRegexList problems
> > 
> > 
> > OK, the patch at the end of this message should fix most of the problems
> > people have seen with the 3.2.0b4-031801 snapshot. I still cannot
> > reproduce Alexander Cohen's bug with HtRegexList:
> 
> Yes it has solved, at least my problem on BSDI.  I do not know how it has
> affected indexing speed, though; I'll know in a day or two;)

I take it back;(  It just took a little longer; it went somewhat further
in the web tree before:

gdb htdig htdig.core
GNU gdb
This GDB was configured as "i386-unknown-bsdi4.0.1"...
Core was generated by `htdig'.
Program terminated with signal 11, Segmentation fault.
#0  0x18254139 in p_b_term ()
(gdb) bt
#0  0x18254139 in p_b_term ()
#1  0x18253d95 in p_bracket ()
#2  0x18254543 in bothcases ()
#3  0x182545bc in ordinary ()
#4  0x1825341d in p_ere_exp ()
#5  0x18252ffa in p_ere ()
#6  0x18252efd in regcomp ()
#7  0x18123409 in HtRegex::set (this=0xa616c00,
    str=0xc038000
"http://www\\.ccsf\\.org/Guardsman/|http://www\\.ccsf\\.cc\\.ca\\.us/Guardsman/",
case_sensitive=0)
    at HtRegex.cc:51
#8  0x18123a5e in HtRegexList::setEscaped (this=0x804633c,
list=@0x80463fc, case_sensitive=0) at HtRegexList.cc:77
#9  0x8053b56 in Retriever::IsValidURL (this=0x804744c, u=@0x9151d00) at
Retriever.cc:911
#10 0x8055b38 in Retriever::got_href (this=0x804744c, url=@0x9151d00,
description=0x91563c0 "Online Courses", hops=1)
    at Retriever.cc:1419
#11 0x804f5d9 in HTML::do_tag (this=0x8258900, retriever=@0x804744c,
tag=@0x8258968) at HTML.cc:552
#12 0x804ea60 in HTML::parse (this=0x8258900, retriever=@0x804744c,
baseURL=@0x806fc00) at HTML.cc:324
#13 0x805372b in Retriever::RetrievedDocument (this=0x804744c,
doc=@0x8258200, url=@0x8046f1c, ref=0x8258800) at Retriever.cc:802
#14 0x80532dc in Retriever::parse_url (this=0x804744c, urlRef=@0x8258500)
at Retriever.cc:647
#15 0x80529af in Retriever::Start (this=0x804744c) at Retriever.cc:423
#16 0x8059ab5 in main (ac=5, av=0x80479bc) at htdig.cc:317
#17 0x804c800 in __start ()
(gdb)

Regards,

Joe
-- 
     _/   _/_/_/       _/              ____________    __o
     _/   _/   _/      _/         ______________     _-\<,_
 _/  _/   _/_/_/   _/  _/                     ......(_)/ (_)
  _/_/ oe _/   _/.  _/_/ ah        [EMAIL PROTECTED]


_______________________________________________
htdig-dev mailing list
[EMAIL PROTECTED]
http://lists.sourceforge.net/lists/listinfo/htdig-dev

Reply via email to