I take it that ignores links that match (any of?) those urls... but isn't there a way to index /wotd/data (which produces a listing of the .html files in that directory) without haing that index page itself get indexed?
it's almost as though a exclusive recursion level ==1 should need to be specified. -----Original Message----- From: David Adams [mailto:[EMAIL PROTECTED] Sent: Friday, March 19, 2004 5:45 AM To: David Adams; Erick Calder; [EMAIL PROTECTED] Subject: Re: [htdig] how to suppress indexing / Sorry, that should of course be: bad_querystr: C=D&O=A C=D&O=D \ C=M&O=A C=M&O=D \ C=N&O=A C=N&O=D \ C=S&O=A C=S&O=D or to include Apache 1 servers as well: bad_querystr: D=A D=D \ M=A M=D \ N=A N=D \ S=A S=D \ C=D&O=A C=D&O=D \ C=M&O=A C=M&O=D \ C=N&O=A C=N&O=D \ C=S&O=A C=S&O=D David Adams ----- Original Message ----- From: "David Adams" <[EMAIL PROTECTED]> To: "Erick Calder" <[EMAIL PROTECTED]>; <[EMAIL PROTECTED]> Sent: Friday, March 19, 2004 1:22 PM Subject: Re: [htdig] how to suppress indexing / > It looks as though you have an Apache 2 server. You can reduce those > multiple /wotd/data/ entries to one by including this in your configuration > file: > > bad_querstr: C=D&O=A C=D&O=D \ > C=M&O=A C=M&O=D \ > C=N&O=A C=N&O=D \ > C=S&O=A C=S&O=D > > David Adams > Corporate Information Services > Information Systems Services > University of Southampton > > ----- Original Message ----- > From: "Erick Calder" <[EMAIL PROTECTED]> > To: <[EMAIL PROTECTED]> > Sent: Thursday, March 18, 2004 8:35 PM > Subject: [htdig] how to suppress indexing / > > > > hello everyone, > > > > I publish an index of the word-of-the-day from yourdictionary.com which > may > > be found at: http://www.arix.com/wotd/ > > > > I create the index by grabbing the daily WOTD and writing a .html file > into > > /var/www/html/wotd/data. I create a config file (today.conf) to index the > > new file and call "htdig -c today.conf; htmerge -c today.conf" - a sample > > config file is included below. > > > > my question is: when I search for a word I get a bunch of hits like: > > > > Index of /wotd/data > > > > try it yourself by searching for "prince". why is this and how can I > > suppress it? > > > > tia - erick > > > > --- today.conf --- > > > > common_dir: /var/www/html/wotd > > database_dir: ${common_dir}/db > > start_url: http://www.arix.com/wotd/data/prince.html > > limit_urls_to: ${start_url} > > max_head_length: 10000 > > max_doc_size: 200000 > > maintainer: [EMAIL PROTECTED] > > no_excerpt_show_top: true > > excerpt_length: 300 > > template_map: Long long ${common_dir}/long.html > > template_name: long > > search_algorithm: exact:1 synonyms:0.5 endings:0.1 > > search_results_header: ${common_dir}/header.html > > search_results_footer: ${common_dir}/footer.html > > nothing_found_file: ${common_dir}/nichts.html > > > > > > > > ------------------------------------------------------- > > This SF.Net email is sponsored by: IBM Linux Tutorials > > Free Linux tutorial presented by Daniel Robbins, President and CEO of > > GenToo technologies. Learn everything from fundamentals to system > > administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click > > _______________________________________________ > > ht://Dig general mailing list: <[EMAIL PROTECTED]> > > ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html > > List information (subscribe/unsubscribe, etc.) > > https://lists.sourceforge.net/lists/listinfo/htdig-general > > > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: IBM Linux Tutorials > Free Linux tutorial presented by Daniel Robbins, President and CEO of > GenToo technologies. Learn everything from fundamentals to system > administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click > _______________________________________________ > ht://Dig general mailing list: <[EMAIL PROTECTED]> > ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html > List information (subscribe/unsubscribe, etc.) > https://lists.sourceforge.net/lists/listinfo/htdig-general > ------------------------------------------------------- This SF.Net email is sponsored by: IBM Linux Tutorials Free Linux tutorial presented by Daniel Robbins, President and CEO of GenToo technologies. Learn everything from fundamentals to system administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click _______________________________________________ ht://Dig general mailing list: <[EMAIL PROTECTED]> ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html List information (subscribe/unsubscribe, etc.) https://lists.sourceforge.net/lists/listinfo/htdig-general

