Re: [htdig] Sort by Date from Meta Tags [patch]

2000-04-11 Thread Geoff Hutchison
. This is a good point. The original request didn't care about this since the results would be consistent with other pages and the results would still sort correctly. And yes, 3.2.0b2 should be coming out in about 2-3 hrs. -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] Problems with GET URLS

2000-04-11 Thread Geoff Hutchison
. So picking up a header would be a nice way of doing it. Finding a meta tag is also nice, but it's not guaranteed. For example, not every doc will have these. This is a good direction, however. Anyone know of other document specifications or good ways of identifying duplicates? -Geoff Hutchison

[htdig] [ANNOUNCE] ht://Dig 3.2.0b2

2000-04-11 Thread Geoff Hutchison
, etc. To download, see http://www.htdig.org/files/htdig-3.2.0b2.tar.gz For documentation and Release Notes, see http://dev.htdig.org/htdig-3.2/ For the ChangeLog, see http://dev.htdig.org/htdig-3.2/ChangeLog Feedback on the release should be primarily directed to [EMAIL PROTECTED] -Geoff Hutchison

Re: [htdig] Java Servlet Wrapper

2000-04-11 Thread Geoff Hutchison
that was on there? Hopefully others will find it useful and/or provide suggestions. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this.

Re: [htdig] The document contained no data?! Help

2000-04-11 Thread Geoff Hutchison
ly calling the htsearch program and so it's not generating a results page. I'd need to know a little more about your configuration. Did you compile WWWoffle yourself? If so, did it include ht://Dig with it? Thanks, -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] How to do incremental indexing, and how to estimate indexing time

2000-04-11 Thread Geoff Hutchison
usually takes a few minutes. (It also has local_urls set, which helps a fair amount.) In general, updates will be *much* faster than the initial dig. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] performance

2000-04-10 Thread Geoff Hutchison
. But I think the proper person to ask is the author of the patch. Any input Zoran? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Hit: Abstract instead of text passage

2000-04-10 Thread Geoff Hutchison
On Mon, 10 Apr 2000, Michael Schulz wrote: I think with use_meta_description:true this would be possible? Do i have to reindex (rundig ...)? No, they're already in your database--the attribute only sets whether or not htsearch displays them. -Geoff Hutchison Williams Students

Re: [htdig] java servlet wrapper

2000-04-10 Thread Geoff Hutchison
r anything along those lines. That said, it is there, so if it's something that interests you, you should probably take a look. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Re: COMDEX/Spring

2000-04-10 Thread Geoff Hutchison
At 2:22 PM -0500 4/10/00, Douglas S. Davis wrote: So, what is a BOF session anyway? - let me know BOF = Birds of a Feather. Essentially, I'm suggesting we gather for lunch or dinner or a beer or something during the conference, provided there's sufficient interest. -- -Geoff Hutchison

[htdig] Re: COMDEX/Spring

2000-04-10 Thread Geoff Hutchison
and I'll work something out. If that doesn't work out (the late notice kinda hurts), I will post a survey so we can figure out if some sort of gathering is desired and if so, where it should be. If you're willing to help organize such a thing, please e-mail me. Thanks, -Geoff Hutchison Williams Students

Re: [htdig] New Release

2000-04-09 Thread Geoff Hutchison
the searches.) I hope that answers your question! (I hate giving out actual dates. Since the main developers are all volunteers, our schedules slip. Plus we don't like to ship something before it's ready.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] html-searchform

2000-04-09 Thread Geoff Hutchison
to fill this in depending on the results of the pop-up menus (with different names). -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Searching with prefix

2000-04-09 Thread Geoff Hutchison
they'll be any better--it'll just pick the first X matches. If you want endings that roughly match a language, you'll want the endings algorithm. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Finding Exact Words

2000-04-07 Thread Geoff Hutchison
on... All I want it to locate is Training !! Sure. Set the search_algorithm attribute in your htdig.conf to be only exact instead of whatever you have now. Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Boolean Searching

2000-04-07 Thread Geoff Hutchison
if you just want two option boxes, set one to be the words field and the other to be the keywords field. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] url separators

2000-04-07 Thread Geoff Hutchison
the code. You'll want to edit the createURL procedure in htsearch/Display.cc -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] We're looking for a web pages monitoring agent, cudu tell us if HTDig can do what follows ?

2000-04-07 Thread Geoff Hutchison
e of disk space for you. I suggest checking freshmeat.net for agent software--I can't think of any offhand, but there are programs designed to do this sort of thing. Good luck, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] A problem in the HTML parser of htdig 3.1.5

2000-04-07 Thread Geoff Hutchison
expressly forbid this, but it also says that character entities should be interpreted as the appropriate characters. So if you wanted to include a < character, you should use &lt;, etc. At least that's my interpretation. -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] Errors to take note of ?

2000-04-06 Thread Geoff Hutchison
in there. On a former topic : 23000:35506:2:http://xxx.yyy.zz/index.html: ***-+--++***+ size = 4056 What does the ***-+--++***+ mean ? + = new URL, - = rejected URL, * = URL already visited. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu
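
The marker legend in the reply above can be tallied mechanically. What follows is only a minimal Python sketch: `count_markers` and `LEGEND` are hypothetical names introduced for illustration and are not part of ht://Dig, and the sketch assumes the marker run has already been cut out of the verbose line.

```python
from collections import Counter

# Meaning of each marker, as given in the reply above.
LEGEND = {"+": "new URL", "-": "rejected URL", "*": "URL already visited"}


def count_markers(markers):
    """Tally the link markers from one line of verbose htdig output.

    Hypothetical helper: characters outside the legend are ignored.
    """
    counts = Counter(ch for ch in markers if ch in LEGEND)
    return {LEGEND[ch]: counts.get(ch, 0) for ch in LEGEND}


if __name__ == "__main__":
    # The marker run quoted in the question.
    print(count_markers("***-+--++***+"))
```

For the run quoted in the question this counts 4 new URLs, 3 rejected URLs and 6 URLs already visited.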

Re: [htdig] URGENT! help on setting up htdig

2000-04-06 Thread Geoff Hutchison
/htdig.conf' The htdig.conf does lie in the above directory, but then why the error. The search CGI must be able to access the config file. From what you've said, there are permission problems in either the file itself or the directory tree leading to it. Cheers, -- -Geoff Hutchison Williams Students

Re: [htdig] multiply defined

2000-04-06 Thread Geoff Hutchison
libg++ has rx built-in: most of us use libstdc++ instead now. I would remove the regex.h file from htlib and remove regex.o from the Makefile.in in the htlib directory as well. Run the config.status script in the top-level to regenerate your Makefiles and you should be OK. -Geoff Hutchison Williams

Re: [htdig] Htdig lost in frames

2000-04-06 Thread Geoff Hutchison
. The documentation at http://www.htdig.org/attrs.html#exclude_urls tells you that the default is /cgi-bin/ .cgi So you'll want this to "clear" the attribute. exclude_urls: -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] What are the numbers meaning in verbose mode

2000-04-05 Thread Geoff Hutchison
? The first number is indeed the number of the document parsed. The second is the DocID for this document and the third is the hopcount. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/
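
The three numbers described above can be split off programmatically. This is a rough Python sketch under one assumption: that a verbose line has the colon-separated shape `parsed:docid:hopcount:url: ...`, as in the sample line quoted elsewhere in this archive. `parse_verbose_prefix` is a hypothetical helper, not an ht://Dig tool.

```python
def parse_verbose_prefix(line):
    """Split the three leading numeric fields from a verbose htdig line.

    Assumes the line looks like 'parsed:docid:hopcount:url: ...'.
    Only the first three colons are treated as separators, so the
    colons inside the URL are left alone.
    """
    parsed, doc_id, hopcount, rest = line.split(":", 3)
    return {
        "documents_parsed": int(parsed),  # first number: document count
        "doc_id": int(doc_id),            # second number: DocID
        "hopcount": int(hopcount),        # third number: hopcount
        "rest": rest,                     # URL and everything after it
    }


if __name__ == "__main__":
    sample = "23000:35506:2:http://xxx.yyy.zz/index.html: ***-+--++***+ size = 4056"
    print(parse_verbose_prefix(sample))
```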

Re: [htdig] htdig-3.2.0b1 - htdig doesn't follow links

2000-04-05 Thread Geoff Hutchison
:-) ) There is a stable release: 3.1.5. Last I checked, it was the stable package for Debian. All previous releases (including 3.2.0b1) have the security hole. If you missed the details of the hole, see the Debian security updates. Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] htdig-3.2.0b1 - htdig doesn't follow links

2000-04-05 Thread Geoff Hutchison
code, so it's entirely possible something is amiss. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Request for new htdig META property: htdig-description

2000-04-05 Thread Geoff Hutchison
good idea or not. You obviously think it's a good idea. What do other people think? After all, it's not called "free software" for any old reason. Good idea? Bad idea? -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] bad words list external parser

2000-04-04 Thread Geoff Hutchison
to the ExternalParser code offhand. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Searching for All versus Any

2000-04-04 Thread Geoff Hutchison
? Littérature francophone francophone virtuelle I guess Littérature virtuelle might be interesting too, but the parser works pairwise, so these two should be the key queries. As far as htfuzzy with "ifrench," see http://www.htdig.org/FAQ.html#q4.10 Thanks for your help! -- -Geoff Hutchiso

Re: [htdig] Site has multiple names

2000-04-03 Thread Geoff Hutchison
/attrs.html#server_aliases Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Searching for All versus Any

2000-04-03 Thread Geoff Hutchison
The other thing to try is to set match_method each way and run htsearch -vvv from the command line and take a look at the info that comes up--it's not much, but it should show how the fuzzy algorithms are working. -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] Using htdig for a What's New page

2000-04-03 Thread Geoff Hutchison
. It won't make the 3.2.0b2 beta which is coming out soon, but the development code will soon return all documents for a query of *. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] What's possible with htdig?

2000-04-03 Thread Geoff Hutchison
: htdig -v htdig.log -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] How to Rename Config file at MindSpring

2000-04-03 Thread Geoff Hutchison
if it's a script or a binary. If it's a script, we might be able to work it out from there. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] config file parse error

2000-04-03 Thread Geoff Hutchison
. If you are running 3.1.5, it doesn't have the speling fuzzy algorithm. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] P: access docdb with perl

2000-04-03 Thread Geoff Hutchison
e, but you must be aware of it. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] P: access docdb with perl

2000-04-03 Thread Geoff Hutchison
way to read these databases. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Problem with search cgi-generated html-page

2000-04-01 Thread Geoff Hutchison
On Sat, 1 Apr 2000, Christian Grandl wrote: When i click the submit button in the search-form(method post) nothing happens: there is no new html-page in the browser showing the results. You have a server-configuration problem. See http://www.htdig.org/FAQ.html#q5.19 for some tips. -Geoff

Re: [htdig] Applixword Parse

2000-03-31 Thread Geoff Hutchison
with version 3.1.5 See the documentation (external_parser) and the comments in the script for more details. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] wildcard matching, 8-bit characters, and 2-letter words

2000-03-31 Thread Geoff Hutchison
g/attrs.html#locale 3. i never get any matches on 2-letter words. can this be fixed? This was mentioned previously. Set the minimum_word_length attribute. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] htdig -- infinite looping (3.1.5) and redirection

2000-03-31 Thread Geoff Hutchison
s to" condition). In this case, you will want to examine the url list very carefully. If you get stuck, you can also see the reasons for ignoring URLs by using the -vvv switch. (The more v's, the more verbose.) -- -Geoff Hutchison Williams Students Online htt

Re: [htdig] searches are slooow...

2000-03-30 Thread Geoff Hutchison
. So you will almost always have a bigger database for 3.1.x because it's indexing more pages. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] searches are slooow...

2000-03-30 Thread Geoff Hutchison
about an order of magnitude less time I believe Gilles pointed out backlink_factor and date_factor. While it's not a speed hit for some people (maybe I'm the only one in this category, I don't know), others see a noticeable change. See the FAQ. -- -Geoff Hutchison Williams Students Online http

Re: [htdig] How to build word2root.db and root2word.db

2000-03-29 Thread Geoff Hutchison
(Here the example is for German, but you should get the idea.) Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Repeated unsubscription requests have failed - please remove ALL *.paypc.com addresses from your list

2000-03-28 Thread Geoff Hutchison
, the addresses: [EMAIL PROTECTED] [EMAIL PROTECTED] are now unsubscribed. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Multiple excerpts.

2000-03-28 Thread Geoff Hutchison
to specify different parts of a PDF file through a URL. So even if it did, you wouldn't be able to "jump" like you can with HTML anchors. (e.g. http://www.foo.com/blah.html#part2 ) -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Made, installed and PRESTO! errors

2000-03-28 Thread Geoff Hutchison
systems usually need to have LD_LIBRARY_PATH set to include libstdc++. You will also need to do this through your server or htsearch will not run. See http://www.htdig.org/FAQ.html#q5.7 -Geoff Hutchison Williams Students Online http://wso.williams.edu

RE: [htdig] make error in Ver. 3.1.5

2000-03-28 Thread Geoff Hutchison
-around is to edit the Makefile.config in the top-level and then all the Makefiles in the subdirectories to point to the full path. Or you can compile GNU make. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Choice of cache size and wordlist_cache_size

2000-03-27 Thread Geoff Hutchison
scoring and templates would produce much greater performance improvements. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Phrase Search (two words together)

2000-03-27 Thread Geoff Hutchison
At 2:34 PM +0100 3/27/00, Simon Hyde - Webyte.co.uk wrote: What I want it to do is search for the two words i.e. Internet London as a phrase (together) not all documents with either Internet or London in them. See http://www.htdig.org/FAQ.html#q1.9 Cheers, -- -Geoff Hutchison Williams Students

Re: [htdig] Compile problems on SGI system

2000-03-27 Thread Geoff Hutchison
I've never had anything but trouble with them, so I invariably compile the latest version of GCC/G++ when working on an SGI box. I just worked with someone to track down a strange database problem that went away as soon as he upgraded to gcc-2.95.2. Cheers, -- -Geoff Hutchison Williams Stude

Re: [htdig] htsearch 3.1.5 word1 word2 searching?

2000-03-25 Thread Geoff Hutchison
earch form in the left-hand frame on http://www.htdig.org/ Cheers, -Geoff Hutchison Williams Students Online http://wso.williams.edu/

[htdig] Re: htDig conf help

2000-03-25 Thread Geoff Hutchison
words http://www.stargazette.com/) You probably want limit_urls_to as either a list of all the domains, or perhaps simply the string stargazette. The latter will index any URL with that string somewhere in it. -Geoff Hutchison Williams Students Online http://wso.wi
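
The substring behaviour described above ("any URL with that string somewhere in it") can be illustrated with a short Python sketch. It only mimics the matching rule as the reply states it and is not htdig's actual code; `within_limits` and the sample URLs are made up for the example.

```python
def within_limits(url, limit_urls_to):
    """Return True if any limit pattern occurs anywhere in the URL.

    Plain substring matching, as the reply describes for limit_urls_to.
    """
    return any(pattern in url for pattern in limit_urls_to)


if __name__ == "__main__":
    limits = ["stargazette"]
    for url in (
        "http://www.stargazette.com/",              # the site itself
        "http://www.example.com/",                  # unrelated site
        "http://www.example.com/?ref=stargazette",  # matches by accident
    ):
        print(url, within_limits(url, limits))
```

The third URL shows the trade-off: a bare string is convenient, but it also accepts any unrelated URL that happens to contain it, which is why listing the full domains is the stricter choice.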

Re: [htdig] core dump

2000-03-24 Thread Geoff Hutchison
he endings algorithm, remove it from your search_algorithm line of your config file and comment out that portion of the rundig script. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] 3.1.5 update drama

2000-03-23 Thread Geoff Hutchison
really don't want to run an insecure htsearch and for another I find this all very bizarre and I'd like to work out what's going on. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] core dump

2000-03-23 Thread Geoff Hutchison
a problem? If so, it's dying because of the htfuzzy problem. To answer your question about the regex problems, no it has not been fixed. We're still trying to work out a good test for the configure script to prevent this. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re[3]: [htdig] A language issue.. Could you give me a favor?

2000-03-23 Thread Geoff Hutchison
languages (Russian springs to mind) where people serve the same page in multiple character sets. Regards, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] doesn't find words

2000-03-23 Thread Geoff Hutchison
database. There's also the possibility that pages are excluded by noindex directives or by robots.txt files. Without more information, it's a bit hard to say. Good questions to ask are things like "what kinds of word searches aren't returning results?" (i.e. what words?) -- -Geoff

Re: [htdig] 3.1.5 update drama

2000-03-23 Thread Geoff Hutchison
permission issues would be in the other directories. Failing that, I guess I'd try running htsearch from the command-line as the webserver user itself. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Same pages in results

2000-03-22 Thread Geoff Hutchison
want to add in a few of them for exclude_urls. Secondly is there a way to search for "foo bar" and get pages that have these words sitting next to each other and not "foo" somewhere and "bar" somewhere ? See http://www.htdig.org/FAQ.html#q1.9 -- -Geoff Hutchison

Re: [htdig] synonym_dictionary in German?

2000-03-22 Thread Geoff Hutchison
files or bad_words files for any non-English language and would be willing to upload it to ftp.htdig.org or send it to the list, I think we'd all appreciate it. If a few people share their work, we can end up with a pretty good set. Thanks, -- -Geoff Hutchison Williams Students Online http

Re: [htdig] htdig errors

2000-03-22 Thread Geoff Hutchison
that there are library mismatches between RedHat 6 (which compiled the RPM) and Mandrake 7. Probably the best solution is to rebuild the RPM on your Mandrake system--you'll need a C++ compiler to do this. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] A language issue.. Could you give me a favor?

2000-03-22 Thread Geoff Hutchison
characters, so it cannot be used to index Chinese. Any help in adding multi-byte support would be greatly appreciated by everyone! Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] 3.1.5 update drama

2000-03-22 Thread Geoff Hutchison
, can you get the nomatch page? -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re[2]: [htdig] A language issue.. Could you give me a favor?

2000-03-22 Thread Geoff Hutchison
of the current i18n. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] 3.1.5 update drama

2000-03-22 Thread Geoff Hutchison
? I only ask because you said you had problems with conf/ earlier. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] htdig errors

2000-03-22 Thread Geoff Hutchison
sincere apologies! Please let us know if you have further problems! Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Databases -- Read-access modules. (3.1.5)

2000-03-21 Thread Geoff Hutchison
"CreateSearchDB" with the key fields being the DocID and the URL (the first two). -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this.

Re: [htdig] Passing the Config File

2000-03-21 Thread Geoff Hutchison
sense? Why restrict yourself to one header? See http://www.htdig.org/attrs.html#search_results_header Or edit the new search form in the header file as you see fit. You can, for example, add a <select></select> list to pick the config field for additional searches. -- -Geoff Hutchison Williams

Re: [htdig] Fonts

2000-03-21 Thread Geoff Hutchison
to your heart's content. Read the documentation for more information or see the website http://www.htdig.org/ For starters, read your htdig.conf (I'm assuming you're running 3.1.5, if not, you should upgrade yesterday if not sooner). See the comments about template_map. -- -Geoff Hutchison Williams

Re: [htdig] Duration of Htsearch Processing (3.1.5)

2000-03-20 Thread Geoff Hutchison
At 10:33 AM +0100 3/20/00, Mentos Hoffmann wrote: I am not quite sure how this would help for multiword searches. Any thoughts about this? Well the crux of the problem is this: Not only do you have to do scoring, but you have to perform some filtering on the results (e.g. restrict, exclude,

Re: [htdig] htdigtobeindexed/htdig

2000-03-20 Thread Geoff Hutchison
r activate on "htdig-description" as well. I don't know about making it a general feature--I hesitate when "adding" a new non-DTD tag. Regards, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] auto - indexing

2000-03-20 Thread Geoff Hutchison
sample rundig script a bit if you want to use this as an automatic script. (Or see the rundig.sh script at http://www.htdig.org/files/contrib/scripts/ for the script I use under cron.) Your crontab line will probably look something like this: 0 4 * * * /opt/htdig/bin/rundig.sh -- -Geoff Hutchis

Re: [htdig] Htdig/Htmerge -- When pre-existing databases are involved.

2000-03-20 Thread Geoff Hutchison
as above. If you merge in a changed database, duplicate documents will be merged (picking the most recent). Does this make sense? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/
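
The "picking the most recent" rule for duplicates can be pictured with a small Python sketch. It is purely illustrative: the dictionary layout, the use of a modification time to decide which copy is newer, and the name `merge_documents` are all assumptions, and htmerge itself works on its own database files rather than on Python objects.

```python
def merge_documents(base, incoming):
    """Merge two {url: (modified_time, document)} maps.

    When the same URL appears in both, keep whichever entry has the
    later modified_time (assumed here to define 'most recent').
    """
    merged = dict(base)
    for url, (mtime, doc) in incoming.items():
        if url not in merged or mtime > merged[url][0]:
            merged[url] = (mtime, doc)
    return merged


if __name__ == "__main__":
    old = {"http://www.foo.com/a.html": (100, "old copy of a")}
    new = {
        "http://www.foo.com/a.html": (200, "new copy of a"),
        "http://www.foo.com/b.html": (150, "b"),
    }
    print(merge_documents(old, new))
```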

Re: [htdig] Problem with searching for 2-letter words

2000-03-20 Thread Geoff Hutchison
At 5:12 PM +0100 3/20/00, [EMAIL PROTECTED] wrote: Is there some way to configure htdig to index 2-letter words? See http://www.htdig.org/attrs.html#minimum_word_length -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] HTdig and FastCGI ?

2000-03-19 Thread Geoff Hutchison
or the htdig3-dev mailing list. I'm sure there are probably a few people who'd be willing to beta test too. :-) Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Duration of Htsearch Processing (3.1.5)

2000-03-18 Thread Geoff Hutchison
words add very little information value to a query. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] matches_per_page

2000-03-18 Thread Geoff Hutchison
similar for htsearch. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Support javascript links

2000-03-17 Thread Geoff Hutchison
#q5.18 (thanks Gilles!) -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] databases (3.1.5)

2000-03-17 Thread Geoff Hutchison
with finals, I hope to get my hands on some of these simple tools for 3.2. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Servlets anyone?

2000-03-17 Thread Geoff Hutchison
there is the htservlet in the contrib section: http://www.htdig.org/files/contrib/templates/htservlet-0.1.tgz As I've never used it, I make no promises whatsoever. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Advice wanted: Multiple mailing lists

2000-03-16 Thread Geoff Hutchison
of them at once! -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Servlets anyone?

2000-03-15 Thread Geoff Hutchison
, but until then there is the CGI. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Nothing found

2000-03-15 Thread Geoff Hutchison
is not configured correctly--you need to make sure the htsearch program is recognized as a CGI. Usually this will happen if it's in a "cgi-bin" directory or something similar. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Is there anyone can tell me how to test..

2000-03-15 Thread Geoff Hutchison
figuration. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Configuration File

2000-03-15 Thread Geoff Hutchison
the $ character! Cheers, -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Htdig/Htmerge -- concurrent running.

2000-03-15 Thread Geoff Hutchison
sequentially later. I'd be very careful with the timing of this. You might want to have some sort of temporary lock file that implies that all the previous steps are done or something like that. -- -Geoff Hutchison Williams Students Online htt
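
One way to picture the "temporary lock file" idea above is a marker file that the earlier steps create when they finish and that the later step waits for. The Python sketch below shows that pattern only; the function names and the marker path are hypothetical, and nothing here is provided by htdig or htmerge.

```python
import os
import time


def mark_done(marker_path):
    """Create the marker once the earlier steps have finished."""
    with open(marker_path, "w") as fh:
        fh.write(str(time.time()))


def wait_for_marker(marker_path, timeout=60.0, poll=1.0):
    """Poll until the marker exists; return False if the timeout expires."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        if os.path.exists(marker_path):
            return True
        time.sleep(poll)
    return os.path.exists(marker_path)


def clear_marker(marker_path):
    """Remove the marker so the next run starts clean."""
    if os.path.exists(marker_path):
        os.remove(marker_path)
```

In this arrangement the script that runs the digs would call `mark_done` last, and the script that runs the merge would call `wait_for_marker` first and `clear_marker` when it is finished, so the merge never starts on half-written databases.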

Re: [htdig] indexing full text documents

2000-03-15 Thread Geoff Hutchison
we've tried the 3.1.5 HTML parser on XHTML, but I can't think of a problem. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Htdig 3.1.5 -- Infinite Loop Possible?

2000-03-15 Thread Geoff Hutchison
t's caused by a combination of an incorrect server configuration and a typo in the links. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] control of htmerge.

2000-03-15 Thread Geoff Hutchison
, is the whitespace, between the option letters and the filename, optional? I don't believe so, though I've never tried it. I'd leave the whitespace in anyway--it makes it easier to read (among other things). :-) -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] Identifying non-indexed URLs

2000-03-14 Thread Geoff Hutchison
RL seen, you can set create_url_list. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Databases on different Platforms?

2000-03-14 Thread Geoff Hutchison
in the CVS tree at the moment. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] WebHost htdig PDF $

2000-03-14 Thread Geoff Hutchison
jump ship? Beyond the recent security hole, which will let users read your files (even MindSpring's files), there are numerous bugs fixed between 3.0.8b2 and 3.1.5. Database corruption problems and bugs in the URL-matching code spring to mind. -Geoff Hutchison Williams Students Online http

Re: [htdig] Questions please

2000-03-14 Thread Geoff Hutchison
and Hebrew web sites? I haven't tried it myself, but if there's an 8-bit encoding for Hebrew and a valid locale, it might work. Others may know from experience. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Errors with keyword search

2000-03-13 Thread Geoff Hutchison
Segmentation fault(coredump) ~ I'm a bit surprised at the coredump. However, this tells me your synonym database is corrupted. So I'd delete that database and re-run htfuzzy synonyms and you should be OK. Cheers, -Geoff Hutchison Williams Students

Re: [htdig] Did anyone gets whatsnew.pl running?

2000-03-13 Thread Geoff Hutchison
elp to set up a separate mailing list for this purpose? -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Acquire pdftotext and pdfinfo

2000-03-13 Thread Geoff Hutchison
on a DEC Alpha, running Digital Unix 4.0 1229. Check the xpdf webpage. To download source and/or binaries, see http://www.foolabs.com/xpdf/download.html I don't know if there are any Alpha/Digital Unix binaries, but it's a decent bet. -Geoff Hutchison Williams Students Online http

Re: [htdig] Acquire pdftotext and pdfinfo

2000-03-13 Thread Geoff Hutchison
the person who's distributing it. -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Errors with keyword search

2000-03-11 Thread Geoff Hutchison
the htsearch program from the command-line (with the same words)? Does it spit up the "Content-type:" header at the beginning of the output from the command-line? -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Search from different databases

2000-03-11 Thread Geoff Hutchison
, e.g. http://www.foo.com/cgi-bin/secure-search.cgi -Geoff Hutchison Williams Students Online http://wso.williams.edu/
