Hi list,

I have a problem with exclude_urls. I want to exclude a directory
(/~user1/dir/) from being indexed, because it contains broken PDF
files and I get an "Error: Uknown Type 0 character set:
Adobe-Identity" mail every time cron calls rundig.

I use this configuration:

start_url:              http://www.domain.com/
limit_urls_to:          ${start_url}
exclude_urls:           /~user1/dir/

Nevertheless, rundig still tries to index /~user1/dir/, because I get:

1364:1364:4:http://www.domain.com/~user1/dir/test.pdf: Error: Uknown Type 0 character set: Adobe-Identity
 size = 207829

If I change the configuration to:

start_url:              http://www.domain.com/~user1/
limit_urls_to:          ${start_url}
exclude_urls:           /~user1/dir/

everything works fine, and the PDF files in /~user1/dir/ are not
indexed, even though exclude_urls is unchanged.
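
To spell out what I expected: as far as I understand the documentation,
both limit_urls_to and exclude_urls are matched as plain substrings of
the URL. That is my assumption, not something I have verified in the
htdig source. A minimal Python sketch of that assumption (is_in_limits
and is_excluded are hypothetical helpers for illustration, not htdig
functions):

```python
def is_in_limits(url, limit_patterns):
    """Assumed rule: a URL is accepted if it contains any limit pattern."""
    return any(pattern in url for pattern in limit_patterns)


def is_excluded(url, exclude_patterns):
    """Assumed rule: a URL is rejected if it contains any exclude pattern."""
    return any(pattern in url for pattern in exclude_patterns)


url = "http://www.domain.com/~user1/dir/test.pdf"
exclude = ["/~user1/dir/"]

# Try both start_url values from my two configurations.
for start_url in ("http://www.domain.com/", "http://www.domain.com/~user1/"):
    wanted = is_in_limits(url, [start_url]) and not is_excluded(url, exclude)
    print(start_url, "->", "index" if wanted else "skip")
```

Under that assumption the PDF should be skipped with either start_url,
which is why the behaviour of the first configuration surprises me.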

Any ideas?

Thanks a lot

Florian

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html
