[htdig] Re: Fw: doc2html

2001-01-22 Thread Leong Peck Yoke
Hi, Actually I was wrong, the code means replacing the soft hyphen \255 with \055. I didn't read it carefully. Sorry for the inconvenience caused. Regards, Peck Yoke David Adams wrote: When I wrote doc2html I copied this without change from conv_doc, and I think it is the same in the

Re: [htdig] Re: Fw: doc2html

2001-01-22 Thread Gilles Detillieux
According to Leong Peck Yoke: Actually I was wrong, the code means replacing the soft hyphen \255 with \055. I didn't read it carefully. Sorry for the inconvenience caused. Regards, Peck Yoke No problems. The octal code 055 is the ASCII hyphen (-), while 255 octal is the ISO-8859-1 code

[htdig] Re: htdig SSL problems

2001-01-19 Thread Jason Scharlach
Matt I'm still trying to get this to work. For some reason Solaris (and in your case SunOS) compiles of htDig (with patch) are not using the openssl libraries correctly. Are you able to actually retrieve pages by just using "./openssl s_client -host {hostname} -port 443" and then issuing the

[htdig] Re: Htdig ?

2001-01-18 Thread K
Thank you Yours Kamel. - Original Message - From: "Geoff Hutchison" [EMAIL PROTECTED] To: "K" [EMAIL PROTECTED] Cc: [EMAIL PROTECTED] Sent: Thursday, January 18, 2001 2:28 PM Subject: Re: Htdig ? At 3:14 PM + 1/18/01, K wrote: Hi, I've found your e-mail into htdig site. I

[htdig] Re: Htdig ?

2001-01-18 Thread Geoff Hutchison
At 3:14 PM + 1/18/01, K wrote: Hi, I've found your e-mail into htdig site. I hope you've got the solution to my pb. I've got a datbase of url. I would like to use htdig for a quick search to my database, how can I do to index all my url. Could I index an acces file for exemple Thank u

[htdig] Re: Reindex

2001-01-17 Thread Gilles Detillieux
According to Elsa Chan: We just launched a new site, but the search engine is indexing pages that don't exist anymore. I think I just need to restart htdig except I don't know how. I trying search for info on theb htdig web site but I couldnjt find anything. Would you be able to help me?

[htdig] Re: indexing Flash (was: excluding page section...)

2001-01-17 Thread nets
Theoretically, Flash is supposed to put links and text into the HTML file if you check those options. Unfortunately, it sticks them in comment fields. I've had inconsistent behavior with getting it to do even that! Macromedia did publish a Flash file access API or something, but it's not

[htdig] Re: Reindex

2001-01-17 Thread Gilles Detillieux
According to Elsa Chan: I try doing that, but only one file gets updated from htdig. /usr/local/htdig/db/db.docdb is the only file that gets updated. db.docs.index is still old and db.wordlist.new is created by it has 0 bytes When I try to run htmerge it gives me htmerge: Unable to

[htdig] Re: Problem with exclude_url

2001-01-17 Thread Gilles Detillieux
According to [EMAIL PROTECTED]: we have htdig 3.15. we wanted to index a big directory of the SAP-documentation the structure is as follows: directory1 directory2 directory3 directory4 content.html frameset.html

[htdig] Re:%20[htdig]%20Fw:%20confirm%20unsubscribe%20from%20htdig@htdig.orgreplyto=003

2001-01-15 Thread AddieADOBY
Unsubscribe me- I tried several times I still get mail, the same things over and over [EMAIL PROTECTED] To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives:

[htdig] Re:%20[htdig]%20Fw:%20confirm%20unsubscribe%20from%20htdig@htdig.orgreplyto=003

2001-01-15 Thread AddieADOBY
[EMAIL PROTECTED] To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ:http://www.htdig.org/FAQ.html

[htdig] Re: any suggestions for using 3.1.5 or 3.2.0b2?

2001-01-12 Thread Gilles Detillieux
According to Edward Lu: According to the release note for htdig-3.2.0b2. It added more functionality and fixed all known bugs after 3.1.5 But apparently it still has the relevance ($(PERCENT)) bug and not stable enough. I am asking for any suggestions about which version (3.1.5 or 3.2.0b2)

[htdig] Re: ht://dig on IRIX 6.5 (fwd)

2001-01-09 Thread David Adams
Here is part of an old email describing another way of compiling htdig under IRIX, it works for us. I used the following script to run the configure command. I try to always run configure from a script rather than by hand - that way I don't have to remember what options I had to

[htdig] Re: htdig.org outages

2001-01-09 Thread Geoff Hutchison
Hi there, I regret that I have some bad news. The htdig.org machine was abruptly disconnected from the net yesterday by its previous host. We are currently searching for a new location for the server. In the long-term, we will probably be migrating most, if not all, services to SourceForge. In

[htdig] Re: negative number for $(FIRSTDISPLAYED)

2001-01-08 Thread Juergen Peus
Hi again, i've fount my mistake :-/ There was a 'page=' parameter too much in the wrapper script. Greetings, Juergen --- Juergen Peus paderLinx - Neue Informationsmedien GmbH Geschaeftsfuehrer

[htdig] Re: Enhancement request (PR#991)

2001-01-08 Thread Gilles Detillieux
According to [EMAIL PROTECTED]: Is it possible for you to add a feature to the config file to allow custom information in anchor urls in excerpts. e.g. I would like to add the "target" attribute to the anchor urls so that I can direct the matching url to another frame on the page. A

Re: [htdig] Re: Enhancement request (PR#991)

2001-01-08 Thread Kapil Biyani
Instead of editing the .cc files from the source, just to add the target param, I guess you can even change the long.html file in the $commondir All one has to do is enable the long file in the source and then edit the particular file and add parameters required to it. See below what to add in

Re: [htdig] Re: Enhancement request (PR#991)

2001-01-08 Thread Gilles Detillieux
According to Kapil Biyani: Instead of editing the .cc files from the source, just to add the target param, I guess you can even change the long.html file in the $commondir All one has to do is enable the long file in the source and then edit the particular file and add parameters required

RE: [htdig] Re: can ht://dig run with iPlanet web server

2001-01-08 Thread Albertson, Chris
Can ht://dig run with iPlanet web server? Yes, I'm running on Solaris 8 with the Netscape server. To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives:

[htdig] Re: can ht://dig run with iPlanet web server

2001-01-07 Thread Geoff Hutchison
At 8:13 PM -0800 1/7/01, Edward Lu wrote: Thank you very much for your reply! one more question. Can ht://dig run with iPlanet web server? Looking forward your reply. Thanks! -Ed I don't know of anyone that's reported that, but I doubt it would be a problem. -- -Geoff Hutchison Williams

[htdig] Re: Suitability

2001-01-04 Thread Cormac Robinson
Hi, I'm a newbie. I've been recently charged with sticking together a search engine to search over a selected number of sites on the web. I've been looking at the FAQ and Help pages on htdig and it implies but does not actually state the following; That the search engine can search over a

Re: [htdig] Re: Suitability

2001-01-04 Thread Daniel Naber
On 2001-01-04 19:07, you wrote: over a number of different sites "those sites using different IP address' ". Yes, it can. Of cource it has nothing to do with IP addresses, you can just feed it different start URLs for different servers. regards Daniel -- Daniel Naber, Paul-Gerhardt-Str.

Re: [htdig] Re: Suitability

2001-01-04 Thread Cormac Robinson
Thanks Daniel, That's what I needed to know. Cormac. - Original Message - From: "Daniel Naber" [EMAIL PROTECTED] To: "Cormac Robinson" [EMAIL PROTECTED] Cc: [EMAIL PROTECTED] Sent: Thursday, January 04, 2001 4:31 AM Subject: Re: [htdig] Re: Suitability On 2

[htdig] Re: Help deciphering error message

2000-12-27 Thread Allan Trick
I'm trying to sort out our errors logfile and this appears to be the line generated by ht://Dig: ld.so.1: htsearch: fatal: libstdc++.so.2.8.1.1: open failed: No such file or directory [Wed Dec 27 16:13:22 2000] [error] [client 62.7.12.77] Premature end of script headers:

Re: [htdig] Re: Help deciphering error message

2000-12-27 Thread Geoff Hutchison
On Wed, 27 Dec 2000, Allan Trick wrote: ld.so.1: htsearch: fatal: libstdc++.so.2.8.1.1: open failed: No such file or directory See the FAQ: http://www.htdig.org/FAQ.html#q5.7 -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To

[htdig] Re: * Web Crawler Help!** (PR#982)

2000-12-23 Thread Geoff Hutchison
On Fri, 22 Dec 2000 [EMAIL PROTECTED] wrote: Ok... So the way I see it is that I am going to use a index robot and tell it to index the 6 websites and give me a data file. Then I'll tweak the search Install ht://Dig and set the start_url to point at all six websites. (Despite the name, it

[htdig] Re: Htdig - hint

2000-12-21 Thread Geoff Hutchison
This has been covered before. You can use the url_part_aliases attribute to set up a replacement for the purpose of the htsearch CGI only (but htdig will still have the correct internal URLs): http://www.htdig.org/attrs.html#url_part_aliases in your htdig.conf: url_part_aliases: 200.0.0.7 *1

[htdig] Re: Phrases

2000-12-17 Thread Geoff Hutchison
See the FAQ: http://www.htdig.org/FAQ.html#q1.9 -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ At 6:06 AM -0800 12/17/00, Bill Vick wrote: I've appreciated your many good comments on the HTDIG list. Can HTDIG support phrases such as the phrase Hospital Administrators or

[htdig] Re: I need your help [from ellenliu]

2000-12-13 Thread Gilles Detillieux
Hi, Ellen. First of all, you should always send these questions to the list, and not to me personally. I don't have all the answers. See http://www.htdig.org/FAQ.html#q1.16 According to ellenliu: Dear Gilles R. Detillieux: I'm very grateful for your kind help last time.

[htdig] Re: htsearch: No title in search results (htdig-3.2.0b3-112600)(PR#964)

2000-12-10 Thread Geoff Hutchison
Robert, This has *always* been the case with server_max_docs. It controls the number of documents retrieved from the server, not necessarily the number of documents in your databases. As links are encountered, "stubs" are added to the document with the information so far (like link text). In

[htdig] Re: detailed information

2000-12-05 Thread Geoff Hutchison
Hi there, I'm assuming you picked my name as the contact for the ht://Dig search engine package. It is a UNIX search engine, but it is not based on Oracle. In most cases, if you're looking for a way to search an Oracle database, it's often better to hire an Oracle consultant to write a custom

Re: [htdig] Re: detailed information

2000-12-05 Thread Geoff Hutchison
On Tue, 5 Dec 2000, Geoff Hutchison wrote: If, on the other hand, you're looking for a general-purpose, open-source* web search package, feel free to browse the information on ht://Dig at: http://www.htdig.org/ Sorry, I couldn't resist the urge to throw in some buzzwords. :-) -- -Geoff

[htdig] Re:

2000-12-05 Thread Geoff Hutchison
At 4:55 PM +0100 12/5/00, Roberta Minneci wrote: How do I restrict a search to word out script language="JavaScript" /script? See http://www.htdig.org/attrs.html#noindex_start -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To

[htdig] Re: extra_word_characters (PR#952)

2000-11-24 Thread Gilles Detillieux
According to Tomas Frydrych ([EMAIL PROTECTED]): Version: 3.1.5 I need to add '+' to the list of valid word characters; after doing so htdig will index all words that contain '+' inside, but refuses to index words that start with '+' (and I suspect also words that end with it). OK, I was

[htdig] Re: valid_punctuation setting (was: extra_word_characters (PR#952))

2000-11-24 Thread Gilles Detillieux
According to Tomas Frydrych: I do have one question though; when defining valid_punctuation, do I have to include ' ' (i.e. space), or is ' ' always included, and if I have to include it explicitely, where/how do I put into in the string? No, white space characters (space, tab, newline) are

[htdig] Re: htdig-problem

2000-11-16 Thread Gilles Detillieux
According to [EMAIL PROTECTED]: Hi Gilles, first of all - sorry abour my englisch, it could be better. I saw your different answers in the FAQ-Site of HTDIG. It really pleased me verry. I have a problem with HTDIG an i think, that you are able to help me soon. It seems you missed question

[htdig] Re: WELCOME to htdig@htdig.org

2000-11-11 Thread Steve Knoblock
Hello, can anyone tell me if htsearch ignores two letter combinations when searching? Such as SB We have a number of these as META keywords. However, a search returns no match for these codes. Thank you, Steve phphelp.com - Help for PHP developers. Steve Knoblock, ed. [EMAIL PROTECTED]

[htdig] Re: WELCOME to htdig@htdig.org

2000-11-11 Thread Geoff Hutchison
At 8:18 AM -0500 11/11/00, Steve Knoblock wrote: Hello, can anyone tell me if htsearch ignores two letter combinations when searching? Such as See http://www.htdig.org/attrs.html#minimum_word_length -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Re: WELCOME to htdig@htdig.org

2000-11-11 Thread Steve Knoblock
Thanks. That's it. I will try temporarily setting this to two chars. I hope to get the site to change to keywords with more characters. Steve searching? Such as See http://www.htdig.org/attrs.html#minimum_word_length Steve Knoblock popular history

Re: [htdig] Re: SSL patch for ht://Dig 3.1.5

2000-11-03 Thread Joe R. Jah
On Fri, 3 Nov 2000, Will Ballantyne wrote: Date: Fri, 03 Nov 2000 18:46:19 -0500 From: Will Ballantyne [EMAIL PROTECTED] To: "Joe R. Jah" [EMAIL PROTECTED] Cc: "Brian W. Spolarich" [EMAIL PROTECTED], "J. op den Brouw" [EMAIL PROTECTED], [EMAIL PROTECTED]

Re: [htdig] Re: SSL patch for ht://Dig 3.1.5

2000-11-02 Thread J. op den Brouw
On Wed, 1 Nov 2000, Joe R. Jah wrote: No, of course you didn't; it was originally posted to the list with tabs already converted. I am sure Jesse used "you" as a general pronoun; he didn't mean to say that _you_, Brian, has copied it off the screen. Any way I moved the patch to

Re: [htdig] Re: SSL patch for ht://Dig 3.1.5

2000-11-02 Thread wballant
it looks like the process of mailing the patch converted my tabs to spaces (note you should be able to use "patch -l" to ignore whitespace issues). I am unsure what added those spaces. I am not a regular contributor. If someone lets me know where I can ftp the patch to avoid the conversion

Re: [htdig] Re: SSL patch for ht://Dig 3.1.5

2000-11-02 Thread Joe R. Jah
[EMAIL PROTECTED] Subject: Re: [htdig] Re: SSL patch for ht://Dig 3.1.5 it looks like the process of mailing the patch converted my tabs to spaces (note you should be able to use "patch -l" to ignore whitespace issues). I am unsure what added those spaces. I am not a regular contri

Re: [htdig] Re: SSL patch for ht://Dig 3.1.5

2000-11-02 Thread Brian W. Spolarich
On Thu, 2 Nov 2000 [EMAIL PROTECTED] wrote: | it looks like the process of mailing the patch converted my tabs to | spaces (note you should be able to use "patch -l" to ignore whitespace | issues). I am unsure what added those spaces. I am not a regular | contributor. If someone lets me know

Re: [htdig] Re: SSL patch for ht://Dig 3.1.5

2000-11-02 Thread Geoff Hutchison
On Thu, 2 Nov 2000, Brian W. Spolarich wrote: start_url: https://myserver:443/ Now I connect but I see eventually that htdig tries to index: New server: myserver, 80 Keep in mind that it's not my code. But my guess is that it's trying to get the robots.txt file (in Server.cc). In

[htdig] Re: [htdig3-dev] ht://Dig

2000-11-02 Thread Geoff Hutchison
Hi there, You probably want to take a look at the installation instructions even though the program is probably already installed. If they don't make much sense to you, talk to your sysadmin or local techie. For example it sounds like you will probably want to have things updated on a

Re: [htdig] Re: SSL patch for ht://Dig 3.1.5

2000-11-01 Thread Joe R. Jah
On Wed, 1 Nov 2000, Brian W. Spolarich wrote: Date: Wed, 1 Nov 2000 11:36:16 + (GMT) From: "Brian W. Spolarich" [EMAIL PROTECTED] To: "J. op den Brouw" [EMAIL PROTECTED] Cc: [EMAIL PROTECTED] Subject: Re: [htdig] Re: SSL patch for ht://Dig 3.1.5 Did not! :

Re: [htdig] Re: SSL patch for ht://Dig 3.1.5

2000-11-01 Thread Brian W. Spolarich
On Wed, 1 Nov 2000, Joe R. Jah wrote: | Any way I moved the patch to | ftp://ftp.ccsf.org/htdig-patches/3.1.5/0ld/ssl.0 because it obviously | does not apply correctly. Bless you. :-) -bws -- Brian W. Spolarich - Manager, Network Systems - WALID, Inc. - [EMAIL PROTECTED]

Re: [htdig] Re: SSL patch for ht://Dig 3.1.5

2000-10-31 Thread J. op den Brouw
I think that's what happens when you copy off the screen ;-) "Brian W. Spolarich" wrote: On Tue, 31 Oct 2000, Joe R. Jah wrote: | I am forwarding your message to the patch author and htdig users | mailing list, to which the patch was originally posted. Maintainer of | the patch site

Re: [htdig] re: Problems with iso characters

2000-10-26 Thread Gilles Detillieux
Hmmm, this is a bit of a stumper. I can think of a few possibilities, which are long-shots, but one might point to a solution: 1) Are you sure the /www/cgi-bin/htsearch binary is the same one as the /home/httpd/cgi-bin/htsearch binary that the RPM installs? E.g. is /www a symlink to

Re: [htdig] re: Problems with iso characters

2000-10-24 Thread Petri Lankoski
Peter Peltonen writes: Petri Lankoski wrote: I have bit problems with htdig and iso characters and I can't find solution from FAQ to my problem. Htdig DB contains 8bit Here's how I got htdig working in Finnish (with ISO characters, that is): 1. Configured my htdig.conf: locale:

[htdig] Re: another query string problem - endless

2000-10-24 Thread Glen Davies
Hi Please ignore my previous message about the endless loop problems. I overlooked some messages in the archive that covered this. It is obviously a problem with the web site setup. Regards Glen To unsubscribe from the htdig mailing list, send a message to

[htdig] re: Problems with iso characters

2000-10-18 Thread Petri Lankoski
I forget to mention that i am using htdig-3.1.5-0glibc21. I have bit problems with htdig and iso characters and I can't find solution from FAQ to my problem. Htdig DB contains 8bit characters alright but when using then in search htdig don't find any matches. F.e if I use word -- Petri

Re: [htdig] re: Problems with iso characters

2000-10-18 Thread Peter Peltonen
Petri Lankoski wrote: I have bit problems with htdig and iso characters and I can't find solution from FAQ to my problem. Htdig DB contains 8bit Here's how I got htdig working in Finnish (with ISO characters, that is): 1. Configured my htdig.conf: locale: fi_FI.ISO-8859-1

[htdig] Re: Ornamentation?

2000-10-16 Thread Geoff Hutchison
At 7:51 AM + 10/16/00, crosstar wrote: I was interested in the ht/dig search engine, but all of the examples I came acress had the logo and advertising on it (which we could not use, insofar as we are non-profit and non-commercial). Is there a version or configuration available which does

Re: [htdig] Re: Ornamentation?

2000-10-16 Thread Gilles Detillieux
According to Geoff Hutchison: At 7:51 AM + 10/16/00, crosstar wrote: I was interested in the ht/dig search engine, but all of the examples I came acress had the logo and advertising on it (which we could not use, insofar as we are non-profit and non-commercial). ... There is no

Re: [htdig] Re: Ornamentation?

2000-10-16 Thread Paul Wolstenholme
On Mon, 16 Oct 2000, Gilles Detillieux wrote: According to Geoff Hutchison: At 7:51 AM + 10/16/00, crosstar wrote: I was interested in the ht/dig search engine, but all of the examples I came acress had the logo and advertising on it (which we could not use, insofar as we are

Re: [htdig] Re: Ornamentation?

2000-10-16 Thread Gilles Detillieux
According to crosstar: Thanks. Glad you understand. Sure, if we don't have to use that, we'd like to try it. Can you tell me where to download the source with installation instructions? THanks, again. Everything you need is on the http://www.htdig.org/ web site. See especially these

Re: [htdig] Re: Ornamentation?

2000-10-16 Thread Chris Tubutis
On 16 Oct, Paul Wolstenholme wrote: I was in a similar situation at a university. Not that anyone at the university would have ever noticed but I replaced the image with a text link. I have had no problem. So while it may not be a Ht:/Dig requirement, I believe that even a small notice is

[htdig] Re: 3.1.5 strange freeze problem (fwd)

2000-10-09 Thread Peter L. Peres
Hello, I run 3.1.5 with my own little patch (see htdig archives for prune-parent-dir patch) on a Linux system (Suse base, modified). Recently I have run into a snag: htdig stops at some point in the search (after a few hours - this is a 233 MHz K6 machine doing nothing else). lsof on the

Re: [htdig] Re: 3.1.5 strange freeze problem (fwd)

2000-10-09 Thread Geoff Hutchison
On Mon, 9 Oct 2000, Peter L. Peres wrote: gdb attached to the running process yielded this stack trace: ... 0x400c86b0 in chunk_free (ar_ptr=0x40122300, p=0xa04af30) at malloc.c:2969 malloc.c:2969: No such file or directory. (gdb) bt #0 0x400c86b0 in chunk_free (ar_ptr=0x40122300,

Re: [htdig] Re: 3.1.5 strange freeze problem (fwd)

2000-10-09 Thread Peter L. Peres
swapping No, it was not swapping much just then but it is swapping a LOT when it runs. I have 32 MB RAM and 150 MB swap (on several disks). The swap space gets used up to 80-90 MB at times (when running X11 concurrently, with surprising performance under the circumstances, and load 2.4 ;-). Of

Re: [htdig] Re: 3.1.5 strange freeze problem (fwd)

2000-10-09 Thread Andrew Scherpbier
"Peter L. Peres" wrote: swapping No, it was not swapping much just then but it is swapping a LOT when it runs. I have 32 MB RAM and 150 MB swap (on several disks). The swap space gets used up to 80-90 MB at times (when running X11 concurrently, with surprising performance under the

[htdig] Re: 3.1.5 strange freeze problem

2000-10-08 Thread Peter L. Peres
Hello, I run 3.1.5 with my own little patch (see htdig archives for prune-parent-dir patch) on a Linux system (Suse base, modified). Recently I have run into a snag: htdig stops at some point in the search (after a few hours - this is a 233 MHz K6 machine doing nothing else). lsof on the

[htdig] Re: htdig things

2000-09-28 Thread Geoff Hutchison
At 1:23 PM +0200 9/28/00, Francesc Cosp wrote: I am using htdig, i think is a powerful tool. But i have one question: it's working fine the synonyms db? I created with htfuzzy a database of synonyms, but when i use it with the htsearch, the synonynyms never are found, i don't know what exactly

[htdig] Re: htdig: Modified Date

2000-09-21 Thread Geoff Hutchison
On Thu, 21 Sep 2000, Erik Lyons wrote: I am using 3.1.5. As I understand it, modification_date_is_now provides the indexing date, which (as others have hinted) is not terribly useful for the end user. But isn't this the most appropriate date for truly dynamic content? It represents the

Re: [htdig] Re: htdig: Modified Date

2000-09-21 Thread Erik Lyons
OK, to breifly summarize, I have a large archive of totally static (primarily HTML/PDF) documents generated over the past, oh, eight years or so. I would like htsearch to show the system date of each file. This is the only date which could be significant or useful to users of the search. The

Re: [htdig] Re: Guess this is a FAQ

2000-09-15 Thread Peter Peltonen
Clint Gilders wrote: http://www.apache.org/docs/mod/mod_autoindex.html (Compile in mod_autoindex, and set IndexOptions as you want.) mod_autoindex is compiled into apache by default so unless whomever compiled apache specifically disabled it it is most likely there. Could you send

[htdig] Re: sunos 5.6 / htdig?

2000-09-14 Thread Geoff Hutchison
First off, please read http://www.htdig.org/FAQ.html#q1.16 In your case, I would think there is some fairly serious problem with your build environment. Either your gcc/g++ is not set up properly, or you are missing some header files or the configure script did not complete properly. Are you

Re: [htdig] Re: sunos 5.6 / htdig?

2000-09-13 Thread Anthony Peacock
Hi, Did you ever get any answers to your problems with htdig and sunos 5.6? Im having trouble building it also... I haven't had problems compiling htdig on Solaris--that is I don't compile it on Solaris. What sort of trouble are you having? Can you be more precise? What

[htdig] Re: Problems with htdig 3.2b2

2000-09-13 Thread Geoff Hutchison
At 1:21 PM +0200 9/13/00, Michael Schulz wrote: using the latest version of ht://Dig (Version 3.2b2) i have a problem while indexing the url http://www.umwelt-deutschland.de . I set start_url: http://www.umwelt-deutschland.de/y/ and limit_urls_to:

[htdig] Re: Antwort: Re: [htdig] [htdig] DB2 problem ...: missing orempty key value specified

2000-09-08 Thread Geoff Hutchison
At 1:07 PM +0100 9/8/00, [EMAIL PROTECTED] wrote: Thanx for the quick answer. You're assumption is right (Shouldn't htdig understand a bit javascript??) Only enough to ignore it. There is absolutely no reason for htdig to parse JavaScript or Java or... For one, it would contribute

[htdig] Re: htnotify - feature request (PR#911)

2000-09-07 Thread Gilles Detillieux
According to Richard Beton: Gilles Detillieux wrote: According to Richard Beton: ... There are two simple enhancements that would make htnotify much more useful on company intranets. ... ... we do appreciate any submissions, so if you can implement some of this yourself, or get

Re: [htdig] Re: [htdig3-dev] htdig 3.1.5 indexing

2000-09-05 Thread Geoff Hutchison
At 6:46 AM + 9/5/00, [EMAIL PROTECTED] wrote: while experimenting with htdig, i notice the following in my apacheØs error_log: DB2 problem...: /opt/www/htdig/common/word2root.db: file size not a multiple of the pagesize DB2 problem...: /opt/www/htdig/common/word2root.db: file size not a

Re: [htdig] Re: results.php

2000-09-04 Thread Torsten Neuer
Geoff Hutchison wrote: 1) Without calling exec/passthru/system, is there any way to capture the htsearch output so I can parse it myself? I don't believe so. Not sure, but you could try using a virtual HTTP request. It is much slower than using exec, but it should work, too. hth,

[htdig] Re: results.php

2000-09-03 Thread Geoff Hutchison
At 3:22 PM -0400 9/2/00, Arnold Kamis wrote: Thanks, I found it. My problem remains, though, that both Torsten's and Colin's code require a call to exec. My web account is hosted at Mindspring, who seem to have disabled exec, passthru, and system calls. They also have not upgraded htdig from

[htdig] Re: results.php

2000-09-02 Thread Geoff Hutchison
At 8:27 PM -0400 9/1/00, Arnold Kamis wrote: I am looking for results.php or some other wrapper to capture the output of htsearch. I can't find it in http://www.htdig.org/files/contrib/, though. Funny, I looked in the new(ish) wrapper directory: http://www.htdig.org/files/contrib/wrappers/

[htdig] Re: [htdig3-dev] htdig 3.1.5 indexing

2000-08-31 Thread Gilles Detillieux
According to [EMAIL PROTECTED]: Acording to Geoff Hutchison: On Wed, 2 Aug 2000, Gilles Detillieux wrote: If you can coax your web server into inserting a tag like the following into the head section of the pages it generates for indexes, that would be the solution. meta

Re: [htdig] Re: [htdig3-dev] htdig 3.1.5 indexing

2000-08-31 Thread Geoff Hutchison
On Thu, 31 Aug 2000, Gilles Detillieux wrote: http://www.apache.org/docs/mod/mod_autoindex.html#indexoptions You'd still need to make some sort of header file with the appropriate headers. I suspect you'd need to add these configuration directives in a .htaccess file in each

[htdig] Re: doc_parser.pl

2000-08-30 Thread Gilles Detillieux
According to Benjelloun Adnane: to make doc_parser.pl to work with accents please change this line : push @allwords, grep { length = $minimum_word_length } split /\W+/; to : push @allwords, grep { length = $minimum_word_length } split /[^a-zA-ZàâäÀÂÄéèêëÉÈÊËôÔüùûÜÙÛîïÎÏæÆçÇÿ]+/; Or

[htdig] Re: locale problem in 3.2.0b2 (PR#885)

2000-08-21 Thread Geoff Hutchison
At 11:09 AM -0500 8/14/00, Gilles Detillieux wrote: database that I have currently is 2,5 GB big. I am running it on RH 6.2, x86 . I am using Mylex RAID controller with 8 MB cache and 3x 18G 10.000 rpm drives with 4 MB cache on each, RAID level 5 for database. system drives are on

[htdig] Re: filter words

2000-08-21 Thread Gilles Detillieux
According to Sasa Mutic: is it possible to limit search hits by bad words? not limiting the dig itself, but just hit displays. For example, I would like to have option for user to take out all hits that include the word that he specifies. that means that htsearch should accept some sort

[htdig] Re: that is right htsearch does not stand the CGI standart

2000-08-16 Thread Roman Gavirilov
Please accept my apology, The problem was in Server configuration. This script is written in C so from server side cgi-handler should take care of it and not perl-handler which is by default the main cgi handler in most web servers. To all who facing this problem, Internal Server Error or

Re: [htdig] Re: that is right htsearch does not stand the CGI standart

2000-08-16 Thread Gilles Detillieux
According to Roman Gavirilov: To all who facing this problem, Internal Server Error or Unrecognized character \177 at /usr/local/apache/cgi-bin/htsearch line 1. , in the browser and from shell it work correctly. Those are the steps i did: I created another directory in web root and in

[htdig] RE: [HTdig] Apache errors on keyword search

2000-08-15 Thread Tony Crockford
Thanks for this, now I understand why it wasn't working as I expected. I've come up with another workaround: Since we're using HTdig on a closed intranet with Apache, I've set up a custom error page for error 500 (premature script error...) that informs the user that they have typed

[htdig] Re: [HTdig] Apache errors on keyword search

2000-08-15 Thread Gilles Detillieux
According to Tony Crockford: Thanks for this, now I understand why it wasn't working as I expected. I've come up with another workaround: Since we're using HTdig on a closed intranet with Apache, I've set up a custom error page for error 500 (premature script error...) that informs the

Re: [htdig] RE: [HTdig] Apache errors on keyword search

2000-08-15 Thread Geoff Hutchison
On Tue, 15 Aug 2000, Tony Crockford wrote: I'll wait for a new release binary to include the patch you mentioned (I assume it will?) as I have been unsuccessful in building HTdig from source on our version of Linux. Unless there is a truly critical bug in the 3.1.x series (i.e. a security

[htdig] Re: that is right htsearch does not stand the CGI standart

2000-08-15 Thread Gilles Detillieux
Please don't e-mail me directly with htdig questions. They should go to the [EMAIL PROTECTED] mailing list. According to Roman Gavirilov: I installed htdig and having the same problem. and of course my web server ("Apache_1.3.12") running well. The problem is in htsearch that the first

[htdig] RE: [3.2.0b2] AND operator not working as it should?

2000-08-09 Thread Arthur Prokosch
Hi, all. I spent a while with other folks on htdig-dev trying to figure out what was going on with the AND operator (and phrase matching). If you haven't been following the thread, then briefly, let's say you have one word, call it 'apple', that appears in at least one of the indexed documents,

[htdig] Re: Question About Htdig

2000-08-04 Thread Geoff Hutchison
On Fri, 4 Aug 2000, Larry Linville wrote: We're looking at recommending your site-search product to one of our clients, but first we wanted to verify platform support. Should we expect any problems running Ultraseek on a unix (SGI Origin 200) with customized Apache Web Server software? Not

[htdig] Re: Servlet use (was another setup issue)

2000-08-04 Thread Geoff Hutchison
On Thu, 3 Aug 2000, Dan Amarel wrote: Is there any work out there on running ht://dig in a servlet context? There are two contributed servlet programs in http://www.htdig.org/contrib/. I haven't used them myself, but I have heard good feedback from others. Cheers, -- -Geoff Hutchison

RE: [htdig] RE: [3.2.0b2] AND operator not working as it should?

2000-08-04 Thread Geoff Hutchison
On Thu, 3 Aug 2000, Arthur Prokosch wrote: The word in question (say, 'fluble') is _not_ in bad_words, and is not below 3 characters. It's not in _any_ list. The difference in behavior has to do with whether a word _was ever added to the words database_, as far as I can tell. So what

RE: [htdig] RE: [3.2.0b2] AND operator not working as it should?

2000-08-04 Thread Arthur Prokosch
(skip to (b) below for an executive summary) From: Geoff Hutchison [EMAIL PROTECTED]: On Thu, 3 Aug 2000, Arthur Prokosch wrote: The word in question (say, 'fluble') is _not_ in bad_words, and is not below 3 characters. It's not in _any_ list. The difference in behavior has to do with

[htdig] RE: [3.2.0b2] AND operator not working as it should?

2000-08-03 Thread Arthur Prokosch
I had written, 2:53 PM -0500, 08/02/00: In testing htdig, 3.2.0b2, with just one html file, the AND operator is working like OR, as far as I can tell. after a bit more testing, AND will work fine when both words are present in the wordlist. It appears that when one of the AND'ed words is

RE: [htdig] RE: [3.2.0b2] AND operator not working as it should?

2000-08-03 Thread Arthur Prokosch
From: Geoff Hutchison [EMAIL PROTECTED]: On Thu, 3 Aug 2000, Arthur Prokosch wrote: the wordlist. It appears that when one of the AND'ed words is _not_ present in the wordlist, that word is simply ignored (order doesn't matter). It occurs to me that this could be by design - if so, I

Re: [htdig] Re-indexing

2000-08-01 Thread Geoff Hutchison
At 5:06 AM -0400 7/31/00, justin wrote: On looking at the output I find: 399:0:0:http://server/mail/: retrieved but not changed then about 20 lines of mail/?D=A with S and N and M etc all saying not changed. But I know this has been changed, Whats up? Ah. Interesting bug. The problem is that

[htdig] Re-indexing

2000-07-31 Thread justin
I've been playing some more with ht://dig and have it configured almost perfectly now The one thing I am still not getting to work is updating a database. I am using rundig.sh or the default rundig that comes with ht://dig. The main difference that I can see is that rundig.sh uses -a by

Re: [htdig] Re-indexing

2000-07-31 Thread Geoff Hutchison
On Mon, 31 Jul 2000, justin wrote: The main difference that I can see is that rundig.sh uses -a by default. This works good on updating the database without re-reading the whole archive, but it does not find any new files that were created. Without the -a option ht://dig finds all If you

Re: [htdig] Re: is this a bug?

2000-07-27 Thread Malcolm Austen
On Wed, 26 Jul 2000, Gilles Detillieux wrote: + According to inzli: + i'm using ht://Dig 3.1.0b3 on a windows platform. + seems it is working okay. but if i enter a search-term i receive the following +message: + "Documents 1-8 of 8 matches" + but it only shows two matches. + With another

Re: [htdig] Re: is this a bug?

2000-07-27 Thread Geoff Hutchison
On Thu, 27 Jul 2000, Malcolm Austen wrote: I saw just this symptom when I took over the htdig setup here. That was using a simlarly vintage 3.1.0beta version and the problem was that the dig was being done with a -i flag but the old database was not being dumped before calling htmerge which

  1   2   3   >