OK, htdig 3.1.6-012702 indexes www.aicha.com without a hitch (13 pages) ...
My 3.2 is currently indexing the rest (I've for the moment excluded aicha.com) so I don't want to stop it ; although maybe I can kill it and relaunch it through gdb, what do you think ? I'm at 14,305 out of roughly 40,000 documents ... by the way when you send a kill statement, it would be great to have on the stdout something like "kill received, dumping progress log to /path/file.log" Are there DB differences between 3.1.6 and 3.2.0 ? In other words, if this crawl finishes, can I share the databases between the two versions, like in crawl with 3.1.6 and search with 3.2.0 ? Thanks a bunch >According to Franck Horlaville: >> OK will do - thanks ! >> >> Is there a way I can install 3.1.6 along my 3.2 without overwriting >> my 3.2 cgi's ? I think I don't need the cgi's at all for the crawl >> alone. So is there a "make install --nocgi's" ? > >There's no such option for make install, so the only way to avoid having >a make install clobber your 3.2 binaries would be to specify different >target directories for them using ./configure options (see >./configure --help). > >However, you don't need to install the 3.1.6 binaries in order to try >them. From your htdig-3.1.6 source directory, you can run htdig directly: > >$ htdig/htdig -vvv -s -i -c /path/to/test/htdig.conf -m - >http://www.aicha.com/lacollec/c/montres_010430_2.htm ><Ctrl-D> > >This runs htdig in minimal mode, indexing only the URLs you give on >the standard input. I've tried this myself, but it didn't hang for me. >However, it may be a system-dependent problem. You mentioned running >on Darwin, but what's the specific version number? Do you have similar >problems on other platforms, and if so, what are the details of the >other platforms you're running on (hardware, OS version, distribution >if applicable)? > >With 3.2.0b4, this test is only slightly more difficult, as the -m >option requires an actual file name, not "-" for stdin, so you either >need to change the start_url in your htdig.conf file, or set up a text >file with the URL(s) you want to index and give that file name as the >argument after -m on the command line. > >> >According to Franck Horlaville: >> >> Running 3.2.0b4-123001 on Darwin >> >> >> >> the dig hangs forever on the following page: >> >> >> >> http://www.aicha.com/lacollec/c/montres_010430_2.htm >> >> >> >> when I look at the page I see that it's not finished (i.e. no closing >> >> tags). Could there be some sort of timeout mechanism to avoid this >> >> situation ? Or is there a setting somewhere I overlooked ? >> > >> >Would you mind trying the latest 3.1.6 snapshot of htdig and see if it >> >also hangs on the same page? I've backported a lot of 3.2.0b4 parser >> >changes to 3.1.6, so it would be good to know, before 3.1.6 is released, >> >if I've backported some bugs too. If it also hangs, could you run htdig >> >under gdb, and when it hangs hit Ctrl-C and get a stack backtrace using >> >gdb's "bt" command? We'd like to know where exactly it is hanging. >> > >> >If 3.1.6 doesn't hang, but only 3.2.0b4 does, then try getting a backtrace > > >from 3.2.0b4, as described above. Thanks. > > >-- >Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]> >Spinal Cord Research Centre WWW: >http://www.scrc.umanitoba.ca/~grdetil >Dept. Physiology, U. of Manitoba Phone: (204)789-3766 >Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930 -- Franck Horlaville Athena Online +212 (0) 37 68 28 08 http://www.athena.online.co.ma/ mailto:[EMAIL PROTECTED] _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

