Hello, Ferenc
Below are my answers

----- Original Message -----
From: "Ferenc VERES" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Friday, February 09, 2001 7:33 PM
Subject: Re: [aseek-users] national characters


> Hello There!
>
> First I must thank you for the fast reply to my question, Kir.
>
> You wrote:
> > I don't know what codepage Hungarian characters needed, but if it's ISO
8859-2,
> > it is supported by ASPSeek 1.0.2. All you need to do is uncomment it's
definition
> > in etc/charsets.conf, s.htm, and sets LocalCharset to this charset in
both
> > files. Then you should clear database and reindex everything, and you'll
be
> > able to have your national characters in search. And yes, don't forget
to restart
> > searchd, so it will reread config files.
> >
> > Please report your success (or failure) here.
>
> The comment above the 88592 is Czech, I think that is not only Czech but
all
> CE fonts, isn't it? So please include Hungarian in the comment :) If true.
>
> I had to install 1.0.2 instead 1.0.1 to test this, that's the reason for
the
> delay. Test results are partly success partly failure.
>
> I enter hungarian word "vil&aacute;g" in the search box, I use the real
char,
> not encoding ofcurse!! I get three documents only, while I know there are
a
> lot of others whose are also indexed. (because I can find the other
documents
> using words next to "vilag".
>
> The sent query contains %E1 for the á char, BOTH the found and not found
files
> contain the same codetable, both contains %e1 in that location.
> http://c64.rulez.org/c64lista/feb1998.txt - this one works
I looked into this file and it uses latin "a" letter in the word "vilag"
I couldn't even find %E1 in it.

> http://c64.rulez.org/mags/ - this does not (it's a totally different
context for
> the same word)
>
Results are strange
Anyway, do you use charset guesser? If not try to build ASPseek with it.
Use ./configure --enable-charset-guesser, then make.
Server c64.rulex.org doesn't return any encoding for those documents, and
without knowing of encoding ASPseek will
consider only latin characters in those documents as alphanumeric
characters, that is it will store two words: vil g


> Anyway, this is the first try I indexed 2 servers, and I see that you list
> only 1-2 of each server and I can ask explicitly ask for other match. This
> feature is VERY GOOD! Congratulations! :)
>
> Another question is, that I have the idea setting up a search engine for
> more than one subjects. For example "Search hungary only" "Search the
world"
> (I am going to index Commodore related web pages). Is there a way to have
> a groupname for each server, and switch the search mode:
> 1. search servers belonging to group: a1
> 2. search servers belonging to groups: a1,a2
> 3. search all indexed servers
>
> Well, I think this is not implemented yet, I just say it as an idea...
> Or maybe I will get an aswer containing a different solution to this
problem

We already implemented it and this is one of the most advanced features of
ASPseek.
However we don't have good frontend for this yet.
To create web spaces (groups) and assign sites to them perform the following
steps
1) Assign numeric ID to spaces, so lets group a1 will have  ID 1 and group
a2 will have ID 2.
2) Insert into database table "spaces" records with appropriate fields
space_id (1 or 2) and site ID (site ID can be obtained from field
sites.site_id)
using SQL INSERT command, example: INSERT INTO spaces(space_id, site_id)
VALUES (1, 1) and so on.
3) Run "index -B". This command will create binary files spN, where N is the
space ID in the directory var/<DBName>/subsets.
These files are used by "searchd" if search is restricted by space(s). Note,
that these files are also created after saving of delta files (index -D)
Now database is ready to search within web spaces

To search within particular web space(s) use spN=on parameter of s.cgi,
where N is the number of web space
You can use multiple spN parameters. If values of all spN parameters is off,
then search will not be restricted by any space.
Examples:
s.cgi?sp1=on&sp2=off&...
s.cgi?sp1=on&sp2=on&...
s.cgi?sp1=off&sp2=off&...

Thank you for interest in ASPseek

Alexander.

Reply via email to