At 11:08 PM -0700 2/7/02, Neal Richter wrote:
>What is the future of this code?  I've read ill-understood (by
>me) references to new searching/parsing code in previous posts.
>...
>I'm wondering if it wouldn't be easier to start with qtest.cc and
>implement the 'hit' extraction if this code is going to get rewritten
>massively.

Bingo. Unfortunately, neither Quim or I have had the time to devote 
to this project. Somewhere around here I have the additions to 
DocMatch which added the scoring functionality for the new htsearch. 
I'll find those and commit them soon.

The basic idea is that qtest.cc will form the framework for a new 
htsearch. Much of the CGI input and command-line parsing in 
htsearch.cc is fine and can be moved as-is. Additionally qtest.cc 
needs to parse the search_algorithms attribute and set the 
OrFuzzyExpander appropriately.

I should have time tomorrow to sit down and do that much since people 
seem to be interested in picking this up.

For now, I'm going to ignore the collections and plug the ResultList 
directly into the current Display code. What else has been discussed 
as far as necessary cleanups and coding on the htsearch side?
* Re-think collections (i.e. how should they fit into the framework cleanly)
  -- We're all willing to break collections for some period while we 
redo the rest of htsearch.
* Improve the sorting algorithm for results
  -- Right now the entire list of results is sorted, regardless of how 
many are displayed. Much better to make a heap and pull them off as 
needed.
* Clean up the Display class (not many specific details on this yet)
* Add query and results caching
  -- Quim implemented some simple caching already, but an on-disk 
cache of recent queries would improve performance tremendously.

-Geoff

_______________________________________________
htdig-dev mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/htdig-dev

Reply via email to