After 40 hours of indexing, I got a segmentation fault.  The last 2
lines are: 
title: Converted from orginal_doc
Segmentation fault

That is all it says.
I ran htdig -i -vvv -s so I would have expected a little more info with
the -vvv switch.

I have the following files it made:
db.docdb
db.docs.index
db.excerpts
db.words.db

How can I get up and running until I can finish an -a job using these
files?  I have to be up and running in the AM!  Any suggestions?

htdig 3.2.0b4-001302
Redhat 7.3
Apache 1.3.23-11

CONFIG FILE EXCERPTS:
database_dir:           /var/lib/htdig
start_url:      http://myserver.company.org/docs/
local_urls:     http://myserver.company.org/docs/=/var/PDF/
#local_urls_only:       true
timeout:                60
wordlist_cache_size:    50000000
wordlist_compress:      false
limit_urls_to:          ${start_url}
exclude_urls:           /cgi-bin/ .cgi /icons/
bad_extensions:         .gz .tar .jpg .htm .html .tgz .rpm .gif .png
.pl .sh
bad_querrystr:          ?D=A ?D=D ?M=A ?M=D ?N=A ?N=D ?S=A ?S=D
external_parsers: application/rtf->text/html /usr/local/bin/doc2html.pl
\
                  text/rtf->text/html /usr/local/bin/doc2html.pl \
                  application/pdf->text/html /usr/local/bin/doc2html.pl
\
                  application/postscript->text/html
/usr/local/bin/doc2html.pl \
maintainer:             [EMAIL PROTECTED]
max_head_length:        500000
max_stars:              5
robotstxt_name:         localdig
max_doc_size:           100000000
matches_per_page:       20
maximum_word_length:    25
minimum_word_length:    3
no_excerpt_show_top:    true
search_algorithm:       exact:1 synonyms:0.1 endings:0.1
template_map: Long long ${common_dir}/long.html \
                Short short ${common_dir}/short.html
template_name: long





Bill Akins, CNE
Sr. OSA
Emory Healthcare
(404) 712-2879 - Office
12674 - PIC
[EMAIL PROTECTED]


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
CONFIDENTIALITY NOTICE:

This message may contain legally confidential and privileged information
and is intended only for the named recipient(s).  No one else is 
authorized to read, disseminate, distribute, copy, or otherwise disclose
the contents of this message.  If you have received this message in 
error, please notify the sender immediately by e-mail or telephone and 
delete the message in its entirety. Thank you.
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
<<<<GWIASIG 0.06c>>>>

_______________________________________________________________

Don't miss the 2002 Sprint PCS Application Developer's Conference
August 25-28 in Las Vegas -- http://devcon.sprintpcs.com/adp/index.cfm

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to