Re: [htdig] Q

2000-12-18 Thread Gilles Detillieux

According to ellenliu:
 Dear Sir:
   First ,I send  my great gratitude to Gilles R. Detillieux
 and Daniel Naber for their warmhearted help. 3.2.0b3 has been installed
 on my system successfully.

Which snapshot did you use?  3.2.0b3 is still a work in progress, and is
slowly but surely getting closer to be ready for beta release.  The
121700 snapshot is the most stable one so far.

   Here I have an another question :I had read through the source
 code before installing,but I want to trace some codes also now,would
 you please tell me which develope tool is good at debugging and/or
 tracing C/C++ program for Red Hat Linux platform?

I think most Red Hat Linux users would suggest gdb, or perhaps xxgdb.
If the C++ program you're debugging is htdig, I'd also suggest using
the debugging output already programmed into it, and activated with
multiple -v options, as you get a lot of feedback that way.  (I'm a
big believer in debugging trace prints in general, and do most of my
C/C++ debugging that way.)

   Moreover,I had run it on my LAN,but when I search some words,it
 always gave me " no found "page,(I run it like this command line:
 htsearch word).I'd like to know whether this problem is caused by my
 operation reason.

You should run htsearch from the command line either with no arguments
at all, and let it prompt you for the search words, or you should give
it a full CGI-style query string as an argument, e.g.:

   /opt/www/cgi-bin/htsearch 'words=butterfly+valvemethod=and'

Be sure to quote the query string if it contains any shell meta characters
such as "", ";", "*", etc.

-- 
Gilles R. Detillieux  E-mail: [EMAIL PROTECTED]
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




Re: [htdig] Q: Can I ...

2000-05-11 Thread Geoff Hutchison

At 1:28 AM +0300 5/12/00, Peter L. Peres wrote:
I'm outlining the patch. How can I go about having a db database indexed
by more than one key ? If I can't I'll need another db indexed by hashes,

I'm not quite sure why you want more than one key? I'd first use a 
Dictionary (hash table) object.

Otherwise, I'd have a database indexed by checksums pointing to 
DocIDs. This would be an analogue to the db.docs.index file which is 
a database of URLs pointing to DocIDs.

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.




[htdig] Q: htdig-3.1.5 allow_in_form feature

2000-05-09 Thread Johannes_Lorenz

Hello,

i have some trouble to get the feature "allow_in_form search_algorithm" working.
I would likely use this feature, so users can choose different search algorithms
from a drop-down menu within their html search form.

Is this feature not yet implemented in htdig 3.1.5 or i'm missing something
special in my configuration?

Yours Johannes




To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.




[htdig] Q: htdig-3.2.0b2 merge feature

2000-05-09 Thread Johannes_Lorenz

Hello,

i try to merge two files, e.g.
htmerge -v -s -c main.conf -m sub1.conf

Doing this, the htmerge process takes all cpu time, all memory, all swap space
and terminates with a database error and "out of memory". Has anybody seen this
effect? I'm using suse linux ver. 6.2, kernel 2.2.10

Yours Johannes



To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.




Re: [htdig] Q: htdig-3.1.5 allow_in_form feature

2000-05-09 Thread Gilles Detillieux

According to [EMAIL PROTECTED]:
 i have some trouble to get the feature "allow_in_form search_algorithm" working.
 I would likely use this feature, so users can choose different search algorithms
 from a drop-down menu within their html search form.
 
 Is this feature not yet implemented in htdig 3.1.5 or i'm missing something
 special in my configuration?

Nothing special is required in the configuration file, but you do need a
colon (:) for any attribute definition in the file.  Thus, you should have

allow_in_form: search_algorithm

in your configuration file, and define the drop-down menu in your search.html.
If you want the selected value to be propagated to the follow-up search forms
in nomatch.html, syntax.html and header.html (or wrapper.html), you should
read up on

http://www.htdig.org/attrs.html#build_select_lists

-- 
Gilles R. Detillieux  E-mail: [EMAIL PROTECTED]
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.




[htdig] [Q] mod-search

1999-09-08 Thread ChaTo (Carlos Castillo)



Hi.
I'm a computer science student at dcc.uchile.cl, looking for a thesis
subject (i will graduate in July, 2000).

I'm considering the posibility of implementing a mod-search plug-in for
Apache, using some search engine, posibbly HTDIG.

The advantages i've considered are:
- Faster searches (by including the search engine in a shared object
  library and linking it into Apache)
- Better interface (for searching and for BOLDing the keywords in
  the pages that are served as the result for a search)
- Better administrator interface
- On-demand indexing (if the client request a document that is marked
  expired by some meta-tag)
- Better user feedback (to rate the documents by the number of hits)

The disadvantages are:
- Larger memory requirements (BUT if the search engine is a shared
  object, the extra memory is shared by all the child servers) 

I will apreciate ideas about this subject. In particular, i will like
to know if you think it is a GOOD or a BAD idea, or more advantages/
disadvantages.

Thank you,

PS: please reply cc: [EMAIL PROTECTED] (i'm not suscribed to swish-e
mailing list). 

-- 
ChaTo Carlos Castillo   www.chato.cl

" Cc: dios
" dios ... User Unknown




To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word unsubscribe in
the SUBJECT of the message.



RE: [htdig] [Q] mod-search

1999-09-08 Thread Kevin J. Chen
Title: RE: [htdig] [Q] mod-search





Hi! There,
Would you please take me out of the email list because this is my email @ work and I have quota limit.
My email is :[EMAIL PROTECTED] 
   [EMAIL PROTECTED]


Thanks a lot!!



Kevin J. Chen
Hitachi Computer Products America
(408)5883460, 5883184
[EMAIL PROTECTED]


-Original Message-
From: Geoff Hutchison [SMTP:[EMAIL PROTECTED]]
Sent: Wednesday, September 08, 1999 3:32 PM
To: [EMAIL PROTECTED]
Cc: [EMAIL PROTECTED]
Subject: Re: [htdig] [Q] mod-search



At 11:06 AM -0400 9/8/99, ChaTo (Carlos Castillo) wrote:
 - Faster searches (by including the search engine in a shared object
  library and linking it into Apache)


This is probably not going to be a huge performance win. I talked to 
the Apache developers a while back.


 - Better interface (for searching and for BOLDing the keywords in
  the pages that are served as the result for a search)


This is an interesting idea and mentioned often. I hadn't thought 
about this particular route, which may be easier than others.


 - Better administrator interface


I don't know that the interface necessarily follows. However, it's a 
little easier to hide results forbidden by the server config.


 - Better user feedback (to rate the documents by the number of hits)


I don't know if this would be a great relevance rank. I know some 
people who'd hit a page repeatedly to increase the ranking...


 - Larger memory requirements (BUT if the search engine is a shared
  object, the extra memory is shared by all the child servers)


I don't know how to evaluate the tradeoff here. I'd guess it to be a 
loss since the hooks for the Apache module would chew up memory. You 
might be able to share the engine itself between child threads, but 
it still always in memory.


One advantage you *didn't* mention is that a module could more easily 
cache common searches.


PS: please reply cc: [EMAIL PROTECTED] (i'm not suscribed to swish-e
 mailing list).


I think you mean the ht://Dig mailing list. ;-)


If you'd like to discuss this further, feel free to contact me 
personally, or we can take it up on the [EMAIL PROTECTED] list.


-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/



To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word unsubscribe in
the SUBJECT of the message.





[htdig] [Q]: return every document in a database?

1999-07-21 Thread mike grommet


Just out of curiosity, is it possible to return every single document in the
database?
For instance, if I enter an empty wordlist, is it possible to return every
document?

This actually could be semi-useful for getting all entries that were indexed
between certain dates.





To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word unsubscribe in
the SUBJECT of the message.



Re: [htdig] [Q]: return every document in a database?

1999-07-21 Thread Geoff Hutchison


mike grommet wrote:
 Just out of curiosity, is it possible to return every single document in the
 database?
 For instance, if I enter an empty wordlist, is it possible to return every
 document?

It's been asked for, but it's not currently possible. IMHO, having this
feature accessible from a blank wordlist isn't the way to do it. I've
seen logs of too many people just hitting "Search." Granted, some of
them *might* have been trying to get all the documents.

Offhand, something like '*' might be a reasonable way to do it. As
always, open to suggestions...

-- 
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word unsubscribe in
the SUBJECT of the message.



[htdig] [Q]: templates and paging of results

1999-07-19 Thread mike grommet


I'm creating a simple wrapper for PHP, much like the "Search This" example.

I've gotten pretty far here, but my problem now is that results do not page,
ever.

Do I have to handle the paging myself, or will htdig take care of it for me?

I've specified a global conf file (called global.conf) that contains my
header/results templates
and it is included in my individual htdig.conf (which contains the text
information for next/prev page, etc)

unfortunately this doesnt work.  I've even tried adding this to the bottom
of my results-header.html
$(PREVPAGE) $(PAGELIST) $(NEXTPAGE)

but apparently the text values in my conf files are not being seen, as these
apparently
expand to blank strings...


How should I go about paging results from htdig when using templates?



To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the SUBJECT of the message.



FW: [htdig] [Q]: Hacking source, advice please (adding search dateranges)

1999-02-26 Thread mike grommet




-Original Message-
From: Mike Grommet [mailto:[EMAIL PROTECTED]]
Sent: Friday, February 26, 1999 3:14 PM
To: 'Geoff Hutchison'
Subject: RE: [htdig] [Q]: Hacking source, advice please (adding search
dateranges)


Ok, well, theres always the getdate method from htdig/Document.cc

// time_t Document::getdate(char *datestring)
//   Convert a RFC850 date string into a time value

Granted it would require something similar to be placed in the htsearch
code.
the only thing that would have to be done is to take the web input
and convert it into this format:
Tuesday, 01-Jul-97 16:48:02 GMT or,
Thu, 01 May 1997 00:40:42 GMT
^
this is what the function looks for

well, the getdate routine strips off the day name anyway, and according to
program source,
if there isnt a day name, it doesnt care.

you could put the variables hours,mins,secs
inside the web based form, or leave them as hidden variables with 00 values.

So, if someone needed to search for articles between minutes, they could,
but most likely this wouldnt be necessesary for your average application.

Of course there is still some definate error checking to be performed... but
hey,
thats small stuff I think.



So what do you think?




-Original Message-
From: Geoff Hutchison [mailto:[EMAIL PROTECTED]]
Sent: Friday, February 26, 1999 2:59 PM
To: mike grommet
Cc: [EMAIL PROTECTED]
Subject: Re: [htdig] [Q]: Hacking source, advice please (adding search
dateranges)


On Fri, 26 Feb 1999, mike grommet wrote:

 and of course making sure its in the time_t format, I'm really not quite
 sure where I need to place the check for acceptance and recjection based
on
 the time issue...

 Any ideas here?

Well if you know how to get it into a time_t, I'll be glad to code the
accept/reject code myself and post a full patch. I thought the hard part
was converting it into a time_t, since you don't know what the format is!
Maybe you've decided to restrcit the format?

 This seems to be a very useful concept, maybe this could be merged into
the
 regular distribution?

Most likely. That is if you contribute it. :-)

-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/




To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the SUBJECT of the message.



Re: htdig: Q: words.db size

1998-12-27 Thread Geoff Hutchison


 I'm usin' htdig v3.1.0b1 and it works fine. I'have question 'bout words.db size:
 
 I'm reindexing about 3600 documents every night. After one month words.db file was 
~72000K.

Perhaps this needs to become a FAQ. Versions prior to 3.1.0b3 had a bug
with the db.words.db file. Words from documents that were modified or
deleted were *not* removed from the word database.

This bug is fixed in versions 3.1.0b3 and later.

-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

--
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the body of the message.



htdig: [Q] French dictionary

1998-09-14 Thread Denis Barbier

Hi,

question of a newbie on this list : has someone databases for french
words? I am compiling them, but i do not know when it will finish.

Thanks.

   Denis

--
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the body of the message.