On 12 Aug 2010, at 09:17 PM, Ann-Marie Horcher <[email protected]>
wrote:
I am one of the other GSOC students. I would be happy to help my
compatriot with testing.
Hi Ann,
Thank you very much for your kindness.
Mainly we are in need of verifying the search results for languages
other than English. Currently webhelp has extensive support for
English, French, German and CJK languages. As both David and I not
familiar with these languages, it is little hard to verify the search
output.
And we need to verify that build process we specified in the doc is
precise, and easy to follow.
If you can try to build the webhelp and make sure it works perfectly
with one of *your* docbook XML file, it is greatly appreciated.
And if you are familiar with one of these languages I stated above,
and have docbook files to test them, it would be great.
Any feedback about this is welcome!
Ann, I hope you did a great work for this summer, and best of luck for
your project! :)
David, if you can find some docbook files which doesn't have any
confidential issues, please send them to the list.
Regards,
Kasun Gajasinghe
On Thu, Aug 12, 2010 at 10:56 AM, Cramer, David W (David) <[email protected]
> wrote:
Hi Robert,
Kasun knows more about the details of the stemmer, but I can point
you to the documentation for the porter stemmer we used:
http://snowball.tartarus.org/algorithms/porter/stemmer.html
Currently, English, French, and German are supported.
You are correct search does not support wildcards in searches, and I
don't believe that the algorithm would return results with the same
base but different prefixes (i.e. searching for "inhibit" won't show
pages with "exhibit"), but I think that's normal for any search
engine.
I'll add "Support wildcards in query string" to the list of future
features. I thought I had added it there already but I see now that
it's not listed.
Thanks,
David
-----Original Message-----
From: Robert Fekete [mailto:[email protected]]
Sent: Thursday, August 12, 2010 7:14 AM
To: [email protected]
Subject: Re: [docbook-apps] Help needed testing CJK search support
in webhelp
Hi David,
First of all, thank you for both of you for your work, it looks very
promising!
I have a few questions about how search and stemming works:
- Is it possible to add partial matches to the search results? For
example, now
if you search for install, installing, or installed, the same
results are
returned (correctly), because these words all come from install. But
if you
don't type the entire word (say, only 'inst'), there aren't any
results.
- Am I right that the search engine does prefix-only matches?
(nstall, *nstall,
etc. does not work)
Regards,
Robert
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
open.org
--
Ann-Marie Horcher