I'm having some problems indexing some files on my site. If I run htdig
with -v -v -v, I get lines like these in the output:
+A tag: pos = 2, position =
="http://www.jhuccp.org/centerpubs/impact/number13/pdf/Impact13.p
image: http://localhost/centerpubs/impact/pdf-icon.gif
href: http://www.jhuccp.org/centerpubs/impact/number13/pdf/Impact13.pdf (PDF format )

   Rejected: URL not in the limits!
url rejected: (level 1)http://www.jhuccp.org/centerpubs/impact/number13/pdf/Impact13.pdf

I can't figure out why this file would be rejected. My htdig.conf file
has these pertinent lines:
start_url:              http://localhost/ 
limit_urls_to:          ${start_url}
exclude_urls:           /cgi-bin/ .cgi /test/
bad_extensions:         .wav .gz .z .sit .au .zip .tar .hqx .exe .com .gif \
                .jpg .jpeg .aiff .class .map .ram .tgz .bin .rpm .mpg .mov .avi
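
For comparison, here is a rough Python sketch of how a limit check would treat the two URLs from the output above. It assumes limit_urls_to is applied as a plain substring match against each candidate URL, which is how I read the ht://Dig documentation; the function name within_limits is made up for illustration and is not part of htdig.

```python
# Hypothetical sketch only: assumes limit_urls_to holds substring patterns
# and that a URL is kept when at least one pattern occurs in it.
def within_limits(url, limit_patterns):
    """Return True if any limit pattern appears somewhere in the URL."""
    return any(pattern in url for pattern in limit_patterns)


# Values taken from the configuration above; ${start_url} expands to start_url.
start_url = "http://localhost/"
limit_urls_to = [start_url]

# The two URLs that appear in the verbose output.
icon_url = "http://localhost/centerpubs/impact/pdf-icon.gif"
pdf_url = "http://www.jhuccp.org/centerpubs/impact/number13/pdf/Impact13.pdf"

for url in (icon_url, pdf_url):
    verdict = "in the limits" if within_limits(url, limit_urls_to) else "not in the limits"
    print(url, "->", verdict)
```

Under that assumption, the link written with the www.jhuccp.org hostname never contains the string http://localhost/, while the icon URL does.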

I can't think of any other configuration settings that would affect this.

Any suggestions as to what I'm doing wrong? Is there any way to get more
explicit diagnostics from htdig? I haven't tried running htdig with four
-v flags yet, but I'll try that next.

Thanks for your help and suggestions.

-Kevin Zembower

-----
E. Kevin Zembower
Unix Administrator
Johns Hopkins University/Center for Communications Programs
111 Market Place, Suite 310
Baltimore, MD  21202
410-659-6139

