Re: [htdig] Going for the big dig

2000-12-18 Thread Geoff Hutchison
At 10:14 AM +1100 12/19/00, Terry Collins wrote: And make sure you don't ignore robots.txt Yes, though someone would need to alter the code to do this. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig

[htdig] Re: Phrases

2000-12-17 Thread Geoff Hutchison
See the FAQ: http://www.htdig.org/FAQ.html#q1.9 -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ At 6:06 AM -0800 12/17/00, Bill Vick wrote: I've appreciated your many good comments on the HTDIG list. Can HTDIG support phrases such as the phrase Hospital Administrators

Re: [htdig] indexing mySQL table

2000-12-16 Thread Geoff Hutchison
less time (since it could do the SQL lookups directly) to perform the search and less space. It doesn't sound like you really want to index the pages themselves, just the category scheme. So running htdig over the site would be overkill. -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] Large Websites Indexing vs Dynamic Databasequery...pros cons...

2000-12-16 Thread Geoff Hutchison
ot; are not mutually exclusive. Well, there's the excellent SearchTools site: http://www.searchtools.com/ -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You wi

Re: [htdig] METATAGS in Search Results

2000-12-16 Thread Geoff Hutchison
, a la the META description, but it hasn't been a common request. If you'd like, I could give you tips on how to code such a modification. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send

Re: [htdig] configure help

2000-12-16 Thread Geoff Hutchison
the package is installed, you can make as many config files as you want to specify databases. All you'll want to do is make sure you specify the -c flag to htdig/htmerge and the config field in the htsearch form. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

RE: [htdig] Words and files not being found or indexed

2000-12-15 Thread Geoff Hutchison
more than the server spits up. On the other hand, in the 3.2 code, there's support for the file:// "protocol" and in the 3.2.0b3 snapshots, it automatically generates directory listings. -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] local access to files for indexing

2000-12-15 Thread Geoff Hutchison
. If you're concerned about "counting," most statistics programs have ways to exclude based on browser or IP address. Excluding hits from your server or the htdig user-agent is one way of getting a more accurate "count" if it concerns you. -- -Geoff Hutchison Williams

RE: [htdig] Words and files not being found or indexed

2000-12-14 Thread Geoff Hutchison
htdig.conf file). But this has not been successful. You can list as many URLs as you want in the start_url attribute, or you can also include a file into the htdig.conf. e.g.: start_url: `/path/to/urls.txt` -- -Geoff Hutchison Williams Students Online http://w

RE: [htdig] Words and files not being found or indexed

2000-12-14 Thread Geoff Hutchison
environment variable, or /tmp if TMPDIR is not defined. The rundig script sets TMPDIR to the same directory as your databases before running. Alternatively, you can point it to someplace where there won't be any sort of quota. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] keywords in meta tags?

2000-12-14 Thread Geoff Hutchison
? If not, it would help to see your config file--just so we can check if there's something you're missing. (Eliminate the impossible...) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message

Re: [htdig] Installation Problem under HPUX 10.20

2000-12-14 Thread Geoff Hutchison
. Now the question: what do you see in the config.log file? It should show you the program it tried and the error message it got. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message

Re: [htdig] Hi, need help with searching database.

2000-12-14 Thread Geoff Hutchison
t question is whether you get results if you run the htsearch CGI from the command-line. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message

Re: [htdig] ht://Dig version 3.1.5 with Weblogic 5.1

2000-12-14 Thread Geoff Hutchison
t's a good idea to either reindex from scratch or to update the timestamps on the web files. (The latter is good because it clears caching servers elsewhere.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htd

Re: [htdig] Hi, I really need your help!

2000-12-14 Thread Geoff Hutchison
ppreciated. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/men

Re: [htdig] PDF Problem

2000-12-14 Thread Geoff Hutchison
with you). The author, of course, is Derek Noonburg [EMAIL PROTECTED] -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm

Re: [htdig] configure help

2000-12-12 Thread Geoff Hutchison
http://www.htdig.org/install.html I am not interested in doing anything fancy with the program - that means I have read the alphabetical list of switches and most of the defaults will serve me fine. What platform are you using? There may already be a binary distribution for it. -- -Geoff

Re: [htdig] Installation Problem under HPUX 10.20

2000-12-12 Thread Geoff Hutchison
to compile "hello world" and if it didn't work, then there's something wrong with the compiler. You can get more info from the config.log file. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdi

Re: [htdig] test

2000-12-10 Thread Geoff Hutchison
ould I get IMHO, emacs is not hard to learn, but I will refrain from a emacs/vi debate. (This is not the list.) For emacs, you may find it helpful to type control-h then control-t to get the tutorial. Cheers, -- -Geoff Hutchison Williams Students Online http://wso.wi

[htdig] Re: htsearch: No title in search results (htdig-3.2.0b3-112600)(PR#964)

2000-12-10 Thread Geoff Hutchison
. Thanks, Robert -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.or

Re: [htdig] How can we add our url to your site

2000-12-09 Thread Geoff Hutchison
At 4:16 PM +0600 12/9/00, Kumar Melvani wrote: site and how can I use this software to find email address of all the university's and collages around the world You can't. It's not that sort of software. It is intended mainly for indexing you own server. Cheers, -- -Geoff Hutchison Williams

Re: [htdig] error messages in web error_log

2000-12-09 Thread Geoff Hutchison
htdig? If you don't need/want the synonyms algorithm, you will want to change the search_algorithm config attribute accordingly http://www.htdig.org/attrs.html#search_algorithm -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe

Re: [htdig] PDF problem

2000-12-08 Thread Geoff Hutchison
On Fri, 8 Dec 2000 [EMAIL PROTECTED] wrote: I am using htdig 3.1.5 on Linux. I get these errors when I try to index the files Have you checked the FAQ? http://www.htdig.org/FAQ.html#q5.2 -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] htdig logo

2000-12-08 Thread Geoff Hutchison
On Fri, 8 Dec 2000, Céline Scheidecker wrote: If I get htdig search on my website, do I must put ht//Dig logo and link to http://www.htdig.org/? No, that's not necessary. It is, of course, appreciated, but there are no limitations on use. -- -Geoff Hutchison Williams Students Online http

Re: [htdig] core dump--help

2000-12-08 Thread Geoff Hutchison
self, what compiler did you use? (You can get this information usually from gcc -v and g++ -v or similar.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROT

Re: [htdig] multiple word synonyms

2000-12-08 Thread Geoff Hutchison
. But in the 3.1.x code, phrase searching isn't supported and at the moment in the 3.2 code, the synonym code hasn't been rewritten to allow phrase synonyms. (On the scale of things to do, it wouldn't be hard, but unless someone volunteers to help out with that, it will have to wait.) -- -Geoff Hutchison

Re: [htdig] can't index external urls

2000-12-08 Thread Geoff Hutchison
.: start_url: http://www.foo.com/ http://www.bar.com/ [...] Yes, you should upgrade ASAP to 3.1.5, but this sounds like a mistake in the config file. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list

Re: [htdig] htdig database related questions

2000-12-06 Thread Geoff Hutchison
oblem, if a document is added to the robots.txt file. In both cases, the code is upholding the "letter of the law," but it's a bit hazy. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list,

Re: [htdig] Pb indexing HTML with htdig 3.1.5

2000-12-06 Thread Geoff Hutchison
ther idea ? What can I do ? Can you edit the document as an initial workaround? If not, you (or someone else) will need to edit the HTML.cc file to make the comment patterns less picky. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] htdig database related questions

2000-12-06 Thread Geoff Hutchison
seen is 3.2b2 ... can this be considered as stable enough for a production environment? A 3.2.0b3 release will be coming out soon--until that point I'd suggest using the development snapshots in preference to 3.2.0b2, which has many known bugs. -- -Geoff Hutchison Williams Students Online http

Re: [htdig] Can htdig kill Linux?

2000-12-06 Thread Geoff Hutchison
or heard reports of this sort of behavior. So my first question would be "how long have you had this server running?" -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message

Re: [htdig] Can htdig kill Linux? (redux)

2000-12-06 Thread Geoff Hutchison
ive? fsck is usually just fine. If you see repeated disk problems, then you may want to do a reformatting with options to get rid of bad sectors. With the current prices of disks, it's also a reasonable option to just buy a new disk if there seem to be media problems. -- -Geoff Hutchison Williams Stude

Re: [htdig] Htdig in spanish

2000-12-06 Thread Geoff Hutchison
ep in mind, even with uploading, someone will still need to move it into place.) It should be mirrored within an hour or so. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message

Re: [htdig] C++

2000-12-05 Thread Geoff Hutchison
On Tue, 5 Dec 2000, Bill Vick wrote: How can we allow the user to search for the word 'C++'? See http://www.htdig.org/attrs.html#extra_word_characters e.g. extra_word_characters: + -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] Indexing never ends ...

2000-12-05 Thread Geoff Hutchison
To htdig, these are different, but these are probably the same to your code. But from your description, you haven't given any sense that this is happening, just that this seems to be taking longer than you expect. -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] perl interface.

2000-12-05 Thread Geoff Hutchison
ytime soon. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/men

[htdig] Re: detailed information

2000-12-05 Thread Geoff Hutchison
and is free software. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ On Tue, 5 Dec 2000, Pan, Belinda wrote: Hello, We are looking for a powerful search engine application based on UNIX platform and Oracle. please send us detailed information about your products. We

Re: [htdig] Re: detailed information

2000-12-05 Thread Geoff Hutchison
On Tue, 5 Dec 2000, Geoff Hutchison wrote: If, on the other hand, you're looking for a general-purpose, open-source* web search package, feel free to browse the information on ht://Dig at: http://www.htdig.org/ Sorry, I couldn't resist the urge to throw in some buzzwords. :-) -- -Geoff

[htdig] Re:

2000-12-05 Thread Geoff Hutchison
At 4:55 PM +0100 12/5/00, Roberta Minneci wrote: How do I restrict a search to word out script language="JavaScript" /script? See http://www.htdig.org/attrs.html#noindex_start -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] digging with apache auth

2000-12-04 Thread Geoff Hutchison
On Mon, 4 Dec 2000, Stephen L Arnold wrote: Is there a way to pass username/password info through htdig, or some other way to let it follow the appropriate URLs? See http://www.htdig.org/attrs.html#authorization http://www.htdig.org/htdig.html (specifically the -u flag) -- -Geoff Hutchison

Re: [htdig] 3.20b2 ${common_dir} -- reliably alterable?

2000-12-03 Thread Geoff Hutchison
e override be in the FIRST encountered ".conf" file? No. You could have it in a config file that's included in another one. When the config parser hits an include declaration, it includes the file, parses it and continues on--top down, so to speak. -- -Geoff Hutchison Williams

Re: [htdig] Looking for start_url strategies

2000-11-30 Thread Geoff Hutchison
re reading anything in or starting.) Does this sound like a slightly better solution? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message

Re: [htdig] query syntax and searching for exact meta

2000-11-29 Thread Geoff Hutchison
hn". i've found option keyword_meta_tag_names, but i'm not sure whether it's right one and how to use it. Not at the moment. The current development code recognizes certain tags and stores this information in the database, but the htsearch code has not yet been rewritten to accept filters l

Re: [htdig] 3.20b(2/3)?

2000-11-29 Thread Geoff Hutchison
al disk overhead. C. (Sorry to be repeating myself, but I sure can't find anything relevant in the FAQ); is 3.20b3 going to include any form of phrase searching? I dunno, FAQ 1.9 sounds about right: http://www.htdig.org/FAQ.html#q1.9 Cheers, -- -Geoff Hutchison Williams Students

Re: [htdig] Same problem with ~s

2000-11-26 Thread Geoff Hutchison
make sure there are links from the start_url to the appropriate sections or make sure they're included in the start_url. Keep in mind you can insert files into the config file like so: start_url: `/path/to/url.file` -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] Rundig

2000-11-26 Thread Geoff Hutchison
config files, and run htdig - to get some debugging output. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm

Re: [htdig] Same problem with ~s

2000-11-26 Thread Geoff Hutchison
/attrs.html#start_url This means you can separate strings by whitespace: http://www.htdig.org/cf_types.html -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You

Re: [htdig] problems building htdig on cygwin

2000-11-26 Thread Geoff Hutchison
for the systems that don't have it (e.g. BSDI and cygwin evidently), I'll use it instead. Until then, you are correct, I don't know of a way of resolving it. (And as you say, including other header files might continue ad infinitum, which seems silly to me.) -- -Geoff Hutchison Williams Students

Re: [htdig] Does htmerge remove URL from database ?

2000-11-25 Thread Geoff Hutchison
encodings are shared between your site.conf files. Personally, I make up a "main.conf," include that in the other files and only set the start_url and a minimal number of things in the individual site.conf files. In particular, it makes it easy to change something in all config files

Re: [htdig] different search results

2000-11-20 Thread Geoff Hutchison
quot; tags for the anchors? This is on the right track. Basically, you can pass along information to Acrobat to open to a particular page. So AFAIK, it works with all browsers that support the Acrobat PDF plugin. -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] configuration problem

2000-11-20 Thread Geoff Hutchison
and http://www.htdig.org/FAQ.html#5.1 -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http

Re: [htdig] configuration problem

2000-11-20 Thread Geoff Hutchison
on. I assume you've tried running htdig with say -vvv for debugging and taken a look at the output? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You

Re: [htdig] [off topic] -- how to reset STDOUT Assignment

2000-11-18 Thread Geoff Hutchison
asier ways of doing this and perhaps someone more knowledgeable in Perl can reply.) On the other hand, I can answer your question. On every form of UNIX I've seen, /dev/stdin, /dev/stdout and /dev/stderr correspond to the appropriate I/O. -- -Geoff Hutchison Williams Students O

Re: [htdig] Unsatisfied Symbol - HP10.20

2000-11-16 Thread Geoff Hutchison
On Thu, 16 Nov 2000, J. op den Brouw wrote: Unsatisfied Symbols: L$BE0106 (data) Hmm, this is strange. I've compiled/linked 3.1.5 and some 3.2betas on a HP-UX 10.20 box without trouble. Are you using GCC/G++ and if so, which version. Can you send us a snippet of the output surrounding

Re: [htdig] 3.2b2 -- include:, config_dir

2000-11-15 Thread Geoff Hutchison
config_dir. I don't remember if I tried it with an absolute path to the include or not... There was a bug in the 3.2.0b1 and 3.2.0b2 releases as far as the include: function. AFAIK, it is fixed in the latest snapshots. -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] not showing child frames?

2000-11-15 Thread Geoff Hutchison
provide one. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html

Re: [htdig] 3.2b2 -- include:, config_dir

2000-11-15 Thread Geoff Hutchison
or security reasons. If you think about it, specifying config_dir in a config file is nonsensical. (What, are you saying that the config file that should be read is one with the same name in a different directory?) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] bibtex format

2000-11-15 Thread Geoff Hutchison
to index? The whole thing? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail

Re: [htdig] not showing child frames?

2000-11-15 Thread Geoff Hutchison
probably not as common as parts of FAQ, but it's certainly a good thing to have there. One of these days the docs should be restructured to have a "tips and tricks" section or HOWTO or somesuch. -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] Dummies Guide to Restricting searchs

2000-11-15 Thread Geoff Hutchison
HOW to go about this? Can anyone point me to some further info/how-to/etc? See http://www.htdig.org/hts_form.html (esp. the "restrict" and "exclude" portions) Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Additional variables for htsearch

2000-11-14 Thread Geoff Hutchison
#allow_in_form Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail

Re: [htdig] How to exclude part of a html page ?

2000-11-13 Thread Geoff Hutchison
At 11:38 AM +0100 11/13/00, raphael hoffner wrote: My html page is in two part : the navigation (link in the site) and the content, each are in a table I search how I can exclude the table who content the navigation See http://www.htdig.org/attrs.html#noindex_start Cheers, -- -Geoff Hutchison

Re: [htdig] question about 3D

2000-11-11 Thread Geoff Hutchison
At 9:28 AM + 11/11/00, won-gu Oh wrote: My question is that your university has a course for 3D and after finishing your course... your university supports students to get jobs? I don't know where you got this address, but it's certainly not a university. Cheers, -- -Geoff Hutchison

[htdig] Re: WELCOME to htdig@htdig.org

2000-11-11 Thread Geoff Hutchison
At 8:18 AM -0500 11/11/00, Steve Knoblock wrote: Hello, can anyone tell me if htsearch ignores two letter combinations when searching? Such as See http://www.htdig.org/attrs.html#minimum_word_length -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] internal server error on new install on Solaris(tried the FAQ already)

2000-11-09 Thread Geoff Hutchison
At 2:20 PM +0100 11/9/00, Dean Flanders wrote: I have looked at the FAQ and tried every scenario to fix this problem. However, when I go to do the “get” from the form, I always get “internal server error”. See http://www.htdig.org/FAQ.html#q5.7 -- -Geoff Hutchison Williams Students Online http

Re: [htdig] 3.20b2 -- subsequent-page-locate HTML.

2000-11-09 Thread Geoff Hutchison
likely to suggest the snapshots since they're much more likely to have the bugs fixed in a timely manner. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You

Re: [htdig] Digital Certificates

2000-11-07 Thread Geoff Hutchison
to index through HTTPS. In any case, the 3.2 code introduces the idea of "external transport" allowing you to write your own scripts to d/l documents in much the same way as the "external parsers" in the current code. -- -Geoff Hutchison Williams Students Online htt

Re: [htdig] Following links, not indexing a doc

2000-11-07 Thread Geoff Hutchison
rules? Are you sure you're using a recent version of ht://Dig? This functionality was added in the 3.1 betas and there were a few bugs that cropped up along the way as well. The latest stable version is 3.1.5: http://www.htdig.org/RELEASE.html -- -Geoff Hutchison Williams Students Online http

Re: [htdig] 3.1.5 -- Wordlist files / space occupancy.

2000-11-07 Thread Geoff Hutchison
will vary considerably, esp. if you have a large max_head_size and store almost all of your documents as excerpts. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROT

Re: [htdig] excerpt problem

2000-11-03 Thread Geoff Hutchison
://www.htdig.org/attrs.html#use_meta_description -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives

Re: [htdig] Compilation error 3.2.0b2

2000-11-03 Thread Geoff Hutchison
. sunfreeware.com) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html

Re: [htdig] No matches... SOLUTION: db.words.db_weakcmpr has to beWRITEABLE

2000-11-03 Thread Geoff Hutchison
ase. While there isn't really anything important in this file, it's also not such a good thing to have something writeable by the webserver. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message

Re: [htdig] Documentation: misprint

2000-11-02 Thread Geoff Hutchison
At 8:35 AM + 11/2/00, Nagy Tamas wrote: It seems that the documentation of use_star_image attribute has a misprint. This is not a misprint. See: http://www.htdig.org/cf_types.html Valid expressions for boolean types are: yes, no, true, false, 1, and 0 Cheers, -- -Geoff Hutchison Williams

Re: [htdig] using 2 languages at the same time?

2000-11-02 Thread Geoff Hutchison
English ones unused, no? That is correct. Of course you can perform searches on all languages at the same time--the only restriction is that most fuzzy algorithms won't work well. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe fr

Re: [htdig] Re: SSL patch for ht://Dig 3.1.5

2000-11-02 Thread Geoff Hutchison
on your server, you can probably just cut out some of that code. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm

Re: [htdig] Compilation error 3.2.0b2

2000-11-02 Thread Geoff Hutchison
D] to help them fix the bug. (And if you pick a stable gcc release, these errors are quite rare.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You wi

[htdig] Re: [htdig3-dev] ht://Dig

2000-11-02 Thread Geoff Hutchison
. Many users have setups of hundreds of thousands or millions of pages. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ At 6:43 PM -0500 11/2/00, Janet L. Gdog G wrote: Hi, I have just taken over the development of a website that will require a search. I found a copy

Re: [htdig] Reindexing, customization

2000-11-01 Thread Geoff Hutchison
bdirectory/main". There should be a totally different result. I'm not sure what you mean by "a totally different result." It sounds to me that you have reindexed properly--it is showing the new URLs after all. What else should be different? -- -Geoff Hutchison Williams Students

Re: [htdig] 3.1.5 vs 3.2.0B2.

2000-11-01 Thread Geoff Hutchison
the door without one pretty major change (in the database backend) and so we'll probably have at least a 3.2.0b4 before a full 3.2.0 release. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list,

Re: [htdig] Indexing PDF Files

2000-11-01 Thread Geoff Hutchison
/file.pdf /etc/htdig.conf (that should all be on one line.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List

Re: [htdig] hiding text from indexer

2000-10-31 Thread Geoff Hutchison
On Tue, 31 Oct 2000 [EMAIL PROTECTED] wrote: I'm snarfing from moreover.com, but I want to index the rest of the document. Is there something like: htdig-keepoutFoo ... Bar/htdig-keepout Sure. See: http://www.htdig.org/attrs.html#noindex_start -- -Geoff Hutchison Williams

Re: [htdig] Berkeley / MySQL

2000-10-30 Thread Geoff Hutchison
would be a good idea to replace it. On the other hand, I can see very good reasons people might want the option of placing the DocumentDB in various databases. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing

Re: [htdig] hardware requirements

2000-10-30 Thread Geoff Hutchison
of RAM depending on what other services you are running. But remember, YMMV especially because that could be 30,000 pages with lots of text or 30,000 pages of a photo library or something with not much text. You can also set many attributes that change the amount of disk space used. -- -Geoff Hutchison

Re: [htdig] htsearch hangs

2000-10-30 Thread Geoff Hutchison
an English query?) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/men

Re: [htdig] spell check - python wrapper script

2000-10-30 Thread Geoff Hutchison
as well. One reason for holding off on this is that I thought it might be better to use code from ispell rather than calling it directly (more portable, almost definitely faster). (N.B. In 3.2, you could also just use htdump to get a db.wordlist and go from there...) -- -Geoff Hutchison Williams

Re: [htdig] Named and numeric character support in Ht://Dig

2000-10-29 Thread Geoff Hutchison
At 3:29 PM +0100 10/29/00, Tamas Nagy wrote: Is Ht://Dig supports named (aring;) and numeric (#229;) characters in HTMLs? Yes. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message

Re: [htdig] Can't find V8

2000-10-28 Thread Geoff Hutchison
quot; you'd need to set the minimum word length: http://www.htdig.org/attrs.html#minimum_word_length -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will

RE: [htdig] Changing httpd.conf

2000-10-28 Thread Geoff Hutchison
cases, it is not permitted to have .htaccess files in the cgi-bin directory. (Too easy to have security problems.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED

Re: [htdig] Is the site down?

2000-10-27 Thread Geoff Hutchison
On Fri, 27 Oct 2000, Catherine Litten wrote: I was trying to access the site www.htdig.org and we get the message: One point here--this listserver is probably *not* the pace to ask if the machine is down. After all, it's on the same domain and server. But thanks for the heads-up. -- -Geoff

Re: [htdig] spell check - python wrapper script

2000-10-27 Thread Geoff Hutchison
Of course this idea also got lost in the shuffle. Anyone interested in working on this sort of thing (as you have in a sense) would be doing us all a great favor. Thanks for the script! -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubs

Re: [htdig] Newbie question Hard drive space

2000-10-26 Thread Geoff Hutchison
of them all at once. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail

Re: [htdig] porblems with cfm files on 3.1.5

2000-10-25 Thread Geoff Hutchison
http not the local file system. Any ideas? You are correct that these should be fixed in 3.1.5. Would it be possible to see some debugging output from one of your runs? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from

Re: [htdig] available variables returned by htdig+creating sitemap

2000-10-25 Thread Geoff Hutchison
rm.html 3- In the tutorial in devshed they write that htdig can be used to create a site map dynamically. Where can I get more info on this? Contact the author of the article. I'm not sure what he had in mind--I'd assume it would be by parsing the output from indexing. -- -Geoff Hutchison William

Re: [htdig] Search engine for private page

2000-10-25 Thread Geoff Hutchison
For example, this is how the ht://Dig bug database is protected for developers. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message

Re: [htdig] max_doc_size

2000-10-25 Thread Geoff Hutchison
On Wed, 25 Oct 2000, Martin Mielke wrote: is there any upper limit for max_doc_size? what does it depend on? It's there to prevent you from using too much memory... (For example, someone might try a denial of service by spitting out an infinite stream of data otherwise.) -- -Geoff Hutchison

Re: [htdig] fast

2000-10-25 Thread Geoff Hutchison
, it will be slower than if you do it off-peak. If you have plenty of RAM, a fast disk, etc. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive

Re: [htdig] Valid Punctiation Question

2000-10-25 Thread Geoff Hutchison
: http://www.htdig.org/attrs.html#extra_word_chars -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives

Re: [htdig] Newbie question

2000-10-25 Thread Geoff Hutchison
problem. It indexes through the web. So if they look the same to a web browser, htdig won't know the difference when it indexes. (3) How can I make htdig work with xpdf ? See the FAQ: http://www.htdig.org/FAQ.html#q4.9 -- -Geoff Hutchison Williams

Re: [htdig] Indexing past the question mark

2000-10-25 Thread Geoff Hutchison
in previous versions, but I know of no problems with the current 3.1.5 version. (Assuming of course that you're not trying to index dynamic content like this through local_urls.) What version are you using? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] More no server running

2000-10-24 Thread Geoff Hutchison
on the pile to be indexed, but the server was marked as dead. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives

<    1   2   3   4   5   6   7   8   9   10   >