Installation and configuration of Aspseek 1.2.5 are much the same as for 1.2.4; that part went OK.
My test environment (P3, 256 MB RAM) is currently not working because of UtfStorage (see item 6):

   http://pandora.rz.uni-osnabrueck.de/cgi-bin/s.cgi (simple search)
   http://pandora.rz.uni-osnabrueck.de/cgi-bin/s2.cgi (advanced search)
   http://pandora.rz.uni-osnabrueck.de/cgi-bin/maketable.cgi (my statistics script, work in progress)

I found that these problems are still pending:

1. make install vs. make install-strip
   On RedHat, the binaries vary from ~10 MB (install) to ~600K (install-strip). Installing stripped binaries should be the default; otherwise, 10 index/searchd processes will slow down even a big box ...

2. Once selected/checked, always selected/checked
   I did not succeed in keeping options checked between subsequent calls of s.cgi. I uncommented the s.htm-dist template lines for the date selection, but this does not work, although it is suggested by the authors. Maybe somebody can drop a line on how this can be solved.

3. Not indexed yet (status 0)
   After successful indexing (not interrupted, everything runs fine), there are still URLs with status 0, meaning 'Not indexed yet'. Why do they appear if indexing has finished successfully? What can I do to make them disappear? There is a thread dealing with this topic, in which it was suggested to use 'Follow no'. For me, that makes no sense, as the indexer should index everything before stopping. Should I use 'index -q ...'?

4. Phrase search vs. stopwords
   When searching for a phrase which contains a stopword, the results are not satisfactory. It seems as if phrase search falls back to a simple word search, combined with AND.

5. Order of 'xx -yy' and '-yy xx' should not be relevant, but it is ...
   With the wrong order, i.e. specifying the -words first and then the normal words, the search is not successful ('Empty query' message).

6. 'UtfStorage yes' and consequences ... [URGENT!]
   My old database from 1.2.4 was updated to 1.2.5 format when using 'index -b' (see below).
   The new config directive 'UtfStorage yes' for aspseek.conf and searchd.conf was suggested, and the new table 'WordurlU' should be renamed to the still existing name 'WordUrl' in the aspseek12 database. This sounds strange to me, as there is no further explanation.

   Output of index -b:

   [wwwrun@pandora etc]$ ../sbin/index -b
   Loading configuration from /usr/local/aspseek/etc/db.conf
   Loading charset [iso88591] from table charsets/iso88591
   Loading configuration from /usr/local/aspseek/etc/charsets.conf
   Loading unicode charset [iso-8859-1] from table tables/iso8859-1.txt
   Loaded 283 ngrams from /usr/local/aspseek/etc/langmap/german-iso8859-1.lm
   Loading unicode charset [iso-8859-1] from table tables/iso8859-1.txt
   Loading configuration from /usr/local/aspseek/etc/ucharset.conf
   Loading stop words from /usr/local/aspseek/etc/stopwords/german
   Loading configuration from /usr/local/aspseek/etc/stopwords.conf
   Loading configuration from /usr/local/aspseek/etc/server.conf
   Loading configuration from /usr/local/aspseek/etc/aspseek.conf
   Error: Unknown table 'wordurlu' in <drop table wordurlu>
   728000 records of 728323 total inserted
   Creating index
   Table "wordurlu" with UTF8 encoded field "word" has been created
   Rename this table to "wordurl" and put command "UtfStorage yes" to "searchd.conf" and "aspseek.conf"
   Segmentation fault (core dumped)

   Which MySQL monitor (mysql) command accomplishes the renaming from wordurlu to wordurl? I used:

   mysql> drop table wordurl;
   Query OK, 0 rows affected (0.03 sec)
   mysql> alter table wordurlu rename wordurl;
   Query OK, 0 rows affected (0.00 sec)

   But searchd will not start ... (<defunct>).

7. Ranking algorithm
   There is no high-level description of the ranking algorithm.

8. Some parameters of s.cgi cannot be used in s.htm (?)
   There is no cross reference to the s.htm description for the following s.cgi parameters:

   ln=url - shows all links to a given url (referring urls)
   o=#    - switches between output templates; no example is given of which names should be used (res-top, res-bottom, res? for the first template; which names for the second and third templates?)

9. ch=url does not work for cached PDF files
   When I click on the cache link [Cached] for PDF files, my server yields an 'internal server error'. What mime types must be configured to output a cached PDF file (which is plain text by then) appropriately?

10. M2 and moreurls
   By chance, I found that $M2 refers to 'More from this site'. Is there a $M or $M1, too?

----

I know this is a lot of questions, but I think it may be welcome to 'brainstorm' first impressions of a new release. Any help and comments are welcome. I will do my part, too ;:)

Frank

#-------------------------------------------------------#
Dipl.-Math. Frank Elsner
Universitaet Osnabrueck (University of Osnabrueck)
- Rechenzentrum - (Computing Center)
Albrechstrasse 28, AVZ
D-49076 Osnabrueck
Deutschland (Germany)
Tel. (Phone): ++49 (0)541/969-2343  Fax: -2470
E-Mail: [EMAIL PROTECTED]
#-------------------------------------------------------#
